Section 3.1 - Navidi/Monk

MEASURES OF CENTER
Section 3.1
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Objectives
1.
2.
3.
4.
5.
Compute the mean of a data set
Compute the median of a data set
Compare the properties of the mean and median
Find the mode of a data set
Approximate the mean with grouped data
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
OBJECTIVE 1
Compute the mean of a data set
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Mean of a Data Set
The mean of a data set is a measure of center. If we imagine each
data value to be a weight, then the mean is the point at which the data
set balances.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Notation - Population Versus Sample
Recall that a population consists of an entire collection of individuals
about which information is sought, and a sample consists of a smaller
group drawn from the population. The method for calculating the mean
is the same for both samples and populations, except for the notation.
Notation
Population Mean:
𝝁
Sample Mean:
𝒙
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Computing the Mean
A list of 𝑛 numbers is denoted by π‘₯1 , π‘₯2 , … , π‘₯𝑛
βˆ‘π‘₯𝑖 represents the sum of these numbers: βˆ‘π‘₯𝑖 = π‘₯1 + π‘₯2 + π‘₯3 + … + π‘₯𝑛
If π‘₯1 , π‘₯2 , … , π‘₯𝑛 is a sample, then the sample mean is given by π‘₯ =
βˆ‘π‘₯𝑖
𝑛
If π‘₯1 , π‘₯2 , … , π‘₯𝑁 is a population, then the population mean is given by πœ‡ =
βˆ‘π‘₯𝑖
𝑁
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Example - Mean
During a semester, a student took five exams. The population of exam
scores is 78, 83, 92, 68, and 85. Find the mean.
Solution:
The mean is given by πœ‡ =
βˆ‘π‘₯𝑖
𝑁
=
78 + 83 + 92 + 68 + 85 406
=
= 81.2.
5
5
Note that the mean is rounded to one more decimal place than the
original data. This is generally considered good practice.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
OBJECTIVE 2
Compute the median of a data set
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Median
The median is another measure of center. The median is a number
that splits the data set in half, so that half the data values are less than
the median and half of the data values are greater than the median.
The procedure for computing the median differs, depending on whether
the number of observations in the data set is even or odd.
If n is odd:
If n is even:
The median is the middle
number.
The median is the average of
the two middle numbers.
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Examples - Median
Example:
During a semester, a student took five exams. The population of exam scores
is 78, 83, 92, 68, and 85. Find the median of the exam scores.
Solution:
Arrange the data values in increasing order: 68
The median is the middle number, 83.
78
83
85
92
Example:
Eight patients undergo a new surgical procedure and the number of days spent
in recovery for each is as follows. Find the median number of days in recovery.
20 15 12 27 13 19 13 21
Solution:
Arrange the data values in increasing order: 12 13 13 15 19 20 21 27.
15 + 19
The median is the average of the two middle numbers:
= 17.
2
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Mean and Median on the TI-84 PLUS
The 1-Var Stats command in the TI-84 PLUS Calculator displays a list of the
most common parameters and statistics for a given data set. This command is
accessed by pressing STAT and then highlighting the CALC menu.
The 1-Var Stats command returns the following quantities:
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Mean and Median on the TI-84 PLUS
During a semester, a student took five exams. The
population of exam scores is 78, 83, 92, 68, and
85. Find the mean and median using the TI-84
PLUS.
Step 1: Enter the data in L1.
Step 2: Press STAT and highlight the CALC menu.
Step 3: Select 1-Var Stats and press ENTER.
Enter L1 in the List field and run the
command.
Mean
Median
Note: If your calculator does not support Stat Wizards,
enter L1 next to the 1-Var Stats command on the home
screen and press enter to run the command.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
OBJECTIVE 3
Compare the properties of the mean and median
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Resistant
A statistic is resistant if its value is not affected much by extreme values (large
or small) in the data set. The median is resistant, but the mean is not.
Example:
Five families have annual incomes of $25,000, $31,000, $34,000, $44,000 and
$56,000. One family, whose income is $25,000, wins a million dollar lottery, so
their income increases to $1,025,000.
Before the lottery win, the mean and median are:
Mean = $38,000
Median = $34,000
After the lottery win, the mean and median are:
Mean = $238,000
Median = $44,000
The extreme value of $1,025,000 influences the mean quite a lot; increasing it
from $38,000 to $238,000. In comparison, the median has been influenced
much less increasing from $34,000 to $44,000. That is, the median is resistant.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Mean, Median, and the Shape of a Data Set
The mean and median measure the center of a
data set in different ways. When a data set is
symmetric, the mean and median are equal.
When a data set is skewed to the right, there
are large values in the right tail. Because the
median is resistant while the mean is not, the
mean is generally more affected by these large
values. Therefore for a data set that is skewed to
the right, the mean is often greater than the
median.
Similarly, when a data set is skewed to the left,
the mean is often less than the median.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
OBJECTIVE 4
Find the mode of a data set
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Mode
Another value that is sometimes classified as a measure of center is the mode.
β€’ The mode of a data set is the value that appears most frequently.
β€’ If two or more values are tied for the most frequent, they are all considered
to be modes.
β€’ If the values all have the same frequency, we say that the data set has no
mode.
Example:
Ten students were asked how many siblings they had. The results, arranged in
order, were 0 1 1 1 1 2 2 3 3 6. Find the mode of this data set.
Solution:
The value that appears most frequently is 1. Therefore, the mode is 1.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Measure of Center?
The mode is sometimes classified as a measure of center. However, this isn’t
really accurate. The mode can be the largest value in a data set, or the
smallest, or anywhere in between.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Mode for Qualitative Data
The mean and median can be computed only for quantitative data. The mode,
on the other hand, can be computed for qualitative data as well. For qualitative
data, the mode is the most frequently appearing category.
Example:
Following is a list of the makes of all the cars rented by an automobile rental
company on a particular day. Which make of car is the mode?
Honda
Chevrolet
Honda
Chevrolet
Toyota
Nissan
Dodge
Toyota
Toyota
Ford
Ford
Toyota
Honda
Chevrolet
Ford
Toyota
Ford
Chevrolet
Toyota
Nissan
β€œToyota” appears most frequently. Therefore, the mode is β€œToyota”.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
OBJECTIVE 5
Approximate the mean using grouped data
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Approximating the Mean
Sometimes we don’t have access to the raw data in a data set, but we are
given a frequency distribution. In these cases we can approximate the mean
using the following steps.
Step 1: Compute the midpoint of each class. The midpoint of a class is found
by taking the average of the lower class limit and the lower limit of the
next larger class.
Step 2: For each class, multiply the class midpoint by the class frequency.
Step 3: Add the products (Midpoint)x(Frequency) over all classes.
Step 4: Divide the sum obtained in Step 3 by the sum of the frequencies.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Example
The following table presents the number of text messages sent via cell phone
by a sample of 50 high school students. Approximate the mean number of
messages sent.
Number of Text Messages Sent
Frequency
0 – 49
10
50 – 99
5
100 – 149
13
150 – 199
11
200 – 249
7
250 – 299
4
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Solution
Step 1: Compute the midpoint of each class.
Number of Text Messages Sent
Class
Midpoint
Frequency
0 – 49
25
10
50 – 99
75
5
100 – 149
125
13
150 – 199
175
11
200 – 249
225
7
250 – 299
275
4
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Solution
Step 2: For each class, multiply the class midpoint by the class frequency.
Number of Text Messages Sent
Class
Midpoint
Frequency
(Midpoint)x
(Frequency)
0 – 49
25
10
250
50 – 99
75
5
375
100 – 149
125
13
1625
150 – 199
175
11
1925
200 – 249
225
7
1575
250 – 299
275
4
1100
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Solution
Step 3: Add the products (Midpoint)x(Frequency) over all classes.
Frequency
(Midpoint)x
(Frequency)
10
250
5
375
13
1625
11
1925
7
1575
4
1100
Ξ£(Midpoint ×Frequency)
= 250 + 375 + 1625 + 1925 + 1575 + 1100
= 6850
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Solution
Step 4: Divide the sum obtained in Step 3 by the sum of the frequencies.
Frequency
(Midpoint)x
(Frequency)
10
250
5
375
Approximate Mean =
βˆ‘(Midpoint×Frequency)
βˆ‘Frequency
6850
50
13
1625
=
11
1925
= 137
7
1575
4
1100
50
6850
Sums
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Grouped Data on the TI-84 PLUS
To compute the mean for grouped data in a frequency distribution,
enter the midpoint for each class into L1 and the corresponding
frequencies in L2. Next, select the 1-Var Stats command and enter L1
in the List field and L2 in the FreqList field, if using Stats Wizards.
If you are not using Stats Wizards, you may run the1-Var Stats
command followed by L1, comma, L2.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
TI-84 PLUS Example
Class Midpoint
Frequency
25
10
75
5
125
13
175
11
225
7
275
4
The output for the last example on
the TI-84 PLUS Calculator is
presented below.
The value of π‘₯ represents the
approximate mean. In this example
π‘₯ = 137. Therefore the approximate
mean is 137.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
You Should Know…
β€’
β€’
β€’
β€’
β€’
β€’
β€’
β€’
How to compute and interpret the mean of a data set
The notation for a population mean and sample mean
How to compute the median
How to use the TI-84 PLUS calculator to compute the mean and
median
The definition of resistant and which measure of center is resistant
How the mean and median are related to the shape of a data set
including
β€’ Skewed to the left
β€’ Skewed to the right
β€’ Approximately symmetric
How to identify the mode of a data set
How to approximate the mean for grouped data
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.