Professional Documents
Culture Documents
Lesson 1 - Measures of Central Tendency
Lesson 1 - Measures of Central Tendency
● Ways to present data include pie charts, bar graphs, histograms, leaf-stem plots etc.
● An important distinction, is if the mean, median, or mode is stated based on the entire
population or a sample
Population or Sample?
○ 𝑁 is the size of the population
○ 𝑛 is the size of a sample
Mean
● Average of the data set
○ Add all of the values
○ Divide by , 𝑛 or 𝑁, the number of values in the set
● Good for data sets with generally evenly spread data, but can be misleading for skewed
data
Population Mean Sample Mean
Σ𝑥 𝑥1+𝑥2+...+𝑥𝑁 Σ𝑥 𝑥1+𝑥2+...+𝑥𝑛
µ= 𝑁
= 𝑁
𝑥= 𝑛
= 𝑛
● These are the same calculation, but we use different variables to inform the reader
whether the entire population or a sample was used in the calculation
Median
● The middle value of a ranked (ordered) data set
○ if there is an even number of data points, there will be 2 middle numbers.
In that case, take the average of the middle numbers
𝑛+1
○ add 1 to 𝑛 or 𝑁 and divide by 2 to find the rank of the median 𝑟 = 2
𝑚𝑒𝑑
● Good for data that is skewed or unevenly spread throughout the range
Mode
● The value that occurs most frequently (most popular)
○ May be no mode (if every value occurs once), 1 mode, or several modes
● Good for data that has many repeats of the same value
Outliers
● A data point that is significantly different than the rest of the data
○ Mean (average) may be best if there are no outliers
○ Median may be best if there are outliers
○ Mode may be best when there are few options or with categorical data
● The shape of a histogram or bar chart shows the distribution of the data.
● Skew pulls the mean toward the tail
Uniform Distribution
There is a ‘tail’ on the left side of the graph. The ‘tail’ is on the right side of the graph.
Bimodal Normal (Symmetrical)
Day 1 2 3 4 5 6 7
Temp 27 29 32 29 45 29 31
b) Is there an outlier in the data? How does it affect the measures of central tendency?
c) Which measure of central tendency would best represent the temperatures in this Mexican
location? Explain.
For histograms, data is aggregated. The mode is easy to read from the chart, but we don't need
to list out the entire data set to calculate the mean
Mean for Grouped Data
Weighted Mean
What unit grade is appropriate for a quiz score of 60% and a test score of 90%?
A math department assigns the following weights for each category in its Advanced Functions
course:
Catherine’s marks in the course so far are 84% overall on Unit Tests, 92% on her
assignments, and 88% on her RST, with the final exam still to be written.
a) Determine the weighted mean for Catherine before writing her final exam
b) Is it possible for Catherine to receive a final mark of 90% in the course? Justify your
answer.