Professional Documents
Culture Documents
1
Learning Objectives
Compute and interpret mean
Compute and interpret median
Compute and interpret mode
2
Measures of Central Tendency
Measures of Central Tendency 集中趨勢測量值
1. mean 平均數
2. median 中位數
3. mode 眾數
3
Descriptive statistics - review
統計測量值
Presenting data: 3. statistical measures
1. tables 統計表
measures of central
tendency: shows the
location of the center of
a distribution 集中量數
measures of 變異量數
2. graphs & charts 統計圖 variability: shows how
spread out a
distribution is
pie chart
clear and detailed
4 data characteristics
tables & graphs: data distribution
Descriptive statistics
Descriptive statistics often involves using a few
numbers to summarize a distribution.
Two important aspects of a distribution:
where its center is located. Measures of central
tendency are used to do so.
how spread out it is, i.e. how much the numbers in the
distribution vary from one another. Measures of
variability are used to do so.
5
分配
18
17
7 7
4
2
frequency table =
frequency distribution
6
Distribution
The distribution of all M&M's of the six different colors (left).
Since every M&M is one of the six familiar colors, the six proportions
shown in the figure add to one.
8 n = 20
Distribution - continuous variables
Grouped frequency distributions can be portrayed graphically.
Distributions for continuous variables are called continuous
distributions, also called probability density.
histogram
9
Shapes of distributions (1) -
Normal distribution
Some probability densities have particular importance in statistics. A
very important one is shaped like a bell, and called the normal
distribution. 常態分配
Many naturally-occurring phenomena can be approximated
surprisingly well by normal distribution. It will serve to
illustrate some features of all continuous distributions.
12
Shapes of distributions (4) -
bimodal distribution
The distributions shown so
far all have one distinct high
point or peak. The
distribution in Figure 10 has
two distinct peaks. A
distribution with two peaks is
called a bimodal distribution.
bimodal distribution雙峰分布
13
Shapes of
distributions (5)
– Kurtosis峰度
The top distribution has long tails.
It is called "leptokurtic."
高狹峰
14
3-1 Self review: Q1 out of 7.
1. A frequency distribution contains the
frequency of every value in the distribution.
true
false
15
3-1 Self review: Q2 out of 7.
2. A grouped frequency distribution should be
used instead of a frequency distribution
when the
distribution is bimodal
distribution is skewed.
variable is continuous
16
3-1 Self review: Q3 out of 7.
3. A symmetric distribution
has equal positive and negative skews.
has no skew.
can have either positive or negative skew,
but not both.
17
3-1 Self review: Q4 out of 7.
4. The following distribution has
a positive skew.
a negative skew.
no skew.
18
3-1 Self review: Q5 out of 7.
5. The area under the curve of a
probability distribution is
1
0
10
19
3-1 Self review: Q6 out of 7.
6. A normal or bell-shaped distribution has its
greatest probability density in its tails.
true
false
20
3-1 Self review: Q7 out of 7.
7. Which of the following distributions is/are
symmetric?
A
B
C
D
21
3-2 Central tendency
Central tendency has to do with the location of the center of
a distribution.
In statistics, averages are often referred to as ‘measures
of central tendency’.
This idea of comparing individual scores to a distribution of scores
is fundamental to statistics.
22
Central tendency
Central tendency definitions:
1. One definition of central tendency is the point at which the
distribution is in balance. (2, 3, 4, 9,16) Balance: 6.8
2. The other definition of the center of a distribution is the number
for which the sum of the absolute deviations is smallest.
3. The third definition is the target that minimizes the sum of
squared deviations.
23
Average
The Center of the Data
An average is a measure of where most of the values in the data
are located.
The center of the data is where most of the values in the data are
located. There are different types of averages. The most commonly
used are:
Mean
Median
Mode
Exercise #1
Read the text to find the definitions for Mean,
24 Median, and Mode. (3.4)
3-2 Self review: Q1 out of 3.
1. A frequency distribution contains the
frequency of every value in the distribution.
true
false
25
3-2 Self review: Q2 out of 3.
2. For the numbers 10, 12, 16, and 20, the sum of the
absolute deviations from 15 is:
14
15
16
28
平均數
ത
3-3-1 Mean (1) (μ; M /𝑋)
The mean is usually referred to as ‘the average’.
There are multiple types of mean values. The most common type
of mean is the arithmetic mean. 算術平均數
The mean is the sum of all the values in the data divided by the
total number of values in the data (the sum of the numbers divided
by the number of numbers)
The symbol "μ" ([mju]) is used for the mean of a
population. The symbol "M" is used for the mean of a
sample. (Σ=summation)
μ = ΣX/N M = ΣX/N
ΣX = the sum of all the numbers in ΣX = the sum of all the numbers in the
the population and sample and
N = the number of numbers in the N = the number of numbers in the
29 sample.
population.
Summation Notation
Many statistical formulas involve summing numbers. Fortunately
there is a convenient notation for expressing summation. This
section covers the basics of this summation notation.
= 3 + 4 + 21 = 28
31
3-3 Self review: Q1 out of 1.
1. For the data on the right, compute the following.
A. ΣX =
B. ΣY =
C. (ΣX)2=
D. (ΣXY)=
E. =
32
The Sample Mean 樣本平均數
34
Exercise #2
Calculate the mean for each data set with the function.
37
Exercise #4
Calculate the cells needed for exercise - a frequency table for
the Physics scores of some students.
1. N
2. Σfx
3. mean
4. draw a histogram for relative
frequency.
5. draw a cumulative frequency polygon.
38
Exercise #5
作業1
Calculate the mean for each data set with the function.
Following are the time spent on study per week for some students.
Please transfer the data into Excel and calculate:
17.5, 18.5, 17.0, 16.5, 18.5, 20.0, 19.0, 19.5, 15.0, 14.5,
16.0, 15.5, 16.5, 17.0, 16.5, 18.0, 17.5, 19.0, 17.0, 19.5,
15.5, 16.5, 19.0, 16.0, 17.0, 19.0, 19.5, 18.0, 18.0, 18.5,
17.5, 19.0, 21.0, 18.0, 20.5, 25.5, 24.5, 25.0, 18.5, 21.0
w = weight
41 x = value
Exercise #6
Calculate the weighted mean. Fill in all cells in gray.
Example #5. Weighted mean for a college grade card for John.
Credits Weighted
Subject Scores GP GPA
(weight) scores
Literature 3 A- 3.7
English 1 3 A- 3.7
Health & Fineness 1 A+ 4.3
Calculus 3 C+ 2.3
Economics 4 B+ 3.3
Accounting 3 B 3
Speech 1 A 4
Total
44
1-2#2中位數 (median)
Exercise
Calculate the median for each data set 2-1.
Steps:
1. Sort the data from the smallest to the largest.
2. n=10, an even number.
3, 4, 5, 5, 5, 6, 7, 8, 8, 9
3. Find the position of the median.
4. Me = [ n/2 + (n/2)+1] / 2 Me = ( 5 + 6 ) ÷ 2 = 5.5
= [(10/2) + (10/2)+1]=6 𝑋ത = 6
The median is the average of the 5th & 6th data.
45
Exercise #2 & #5
Calculate the median for each data set with the function.
46
Properties of Median 特質
1. Median is not dependent on all the data values in a dataset.
2. The median value is fixed by its position and is not reflected by the
individual value.
3. The distance between the median and the rest of the values is less than the
distance from any other point.
4. Every array has a single median.
5. Median cannot be manipulated algebraically. It cannot be weighed and
combined.
6. In a grouping procedure, the median is stable.
7. Median is not applicable to qualitative data.
8. The values must be grouped and ordered for computation.
9. Outliers and skewed data have less impact on the median.
10. If the distribution is skewed, the median is a better measure when
compared to mean.
47
Should we tell our
rival the mean or the
median of the
heights of our
players?
48
3-3-3 Mode (Mo)
The mode is the value(s) that appears most often in the data:
There can be more than one mode if multiple values appear the
same number of times in the data.
Example #7. 40, 21, 55, 21, 48, 13, 72
Here, 21 appears two times, and the other values only once. The
mode of this data is 21.
We can have more than one mode or no mode at all.
49
Mode
The mode is also used for categorical data, unlike the median
and mean. Categorical data can’t be described directly with
numbers, like names:
Alice, John, Bob, Maria, John, Julia, Carol
Here, John appears two times, and the other values only once. The
mode of this data is John.
50
Mode
The mode of continuous data is normally computed
from a grouped frequency distribution.
Table 2 shows a grouped frequency distribution for the
target response time data. Since the interval with the
highest frequency is 600-700, the mode is the middle of
that interval (650).
51
1-2#2中位數 (median)
Exercise
Calculate the mode for each data set 2-1.
Steps:
1. Sort the data from the smallest to the largest.
2. Find the number that that appears most
often.
3, 4, 5, 5, 5, 6, 7, 8, 8, 9
3. The mode is 5.
Mo = 5
Me = ( 5 + 6 ) ÷ 2 = 5.5
𝑋ത = 6
52
Exercise #2 & #5
Calculate the mode for each data set with the function.
53
Exercise #7
heights of female college students
Female college students (cm)
7-1 how many students
151 154 154 164 158 146 162 151 7-2 mean height
154 162 152 158 151 166 167 156 7-3 median of height
7-4 mode of height
160 156 161 150 155 161 159 166 7-5 max
160 162 160 155 155 143 153 159 7-6 min
7-7 frequency table +
163 157 160 157 165 156 146 157 graphs (histogram +
156 162 153 161 165 156 156 156 cumulative polygon)
158 162 155 168 154 149 160 159
156 169 163 162 148 162 151 156
154 150 160 153 169 159 151 156
160 162 159 154 158 164 157 161
54
Exercise #7
frequency table
relative cumulative
cumulative
group midpoint frequency frequency frequency
frequency
% %
141~145 143 1 1% 1 1%
146~150 148 6 8% 7 8%
151~155 153 19 24% 26 24%
156~160 158 30 38% 56 38%
161~165 163 18 23% 74 23%
166~170 168 6 8% 80 8%
100
80
74
80
56
60
40 26
20 7
11 6 19 30 18 6
55 0
141~145 146~150 151~155 156~160 161~165 166~170
56
S.No Mean Median Mode
Add up all the numbers and Place all the numbers in the mode is derived when a
2 divide by the total number of ascending or descending number has frequency
terms order occurred in a series
59
Chapter Exercises
This is the end of this week’s lesson.
Please finish the exercises and turn
them in on time.
Keep up the good work.
60