3-4. Descriptive statistics
[2] CHAP.2
[3] CHAP.3
Outline
Sampling
Graphical Summaries
Summary Statistics
10/1/2019
The basic idea
Sample vs. Population
Population Sample
Sampling
Sampling
Ex: the engineer might construct a sample simply by taking 10 blocks off
the top of the pile.
Independence
Other Sampling Methods
Types of Experiments
Types of Data
numerical or quantitative
(how much or how many)
categorical or qualitative
Controlled Experiments and Observational Studies
Exercises
Descriptive Statistics
An Illustration:
Which Group is Smarter?
Class A--IQs of 13 Students
102, 115, 128, 109, 131, 89, 98, 106, 140, 119, 93, 97, 110
Class B--IQs of 13 Students
127, 162, 131, 103, 96, 111, 80, 109, 93, 87, 120, 105, 109
Each individual may be different. If you try to understand a group by remembering the qualities of
each member, you become overwhelmed and fail to understand the group.
Data Analysis 10/1/2019
Descriptive Statistics
Mean IQ: Class A = 110.54, Class B = 110.23
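The two averages can be checked with a short Python sketch. It assumes the flattened IQ listing alternates Class B rows and Class A rows; that reading is an assumption, though it does reproduce the two values 110.54 and 110.23 shown on the slide.

```python
from statistics import mean

# IQ scores for the two classes (13 students each).
# Assumption: the flattened listing alternates Class B and Class A rows.
class_a = [102, 115, 128, 109, 131, 89, 98, 106, 140, 119, 93, 97, 110]
class_b = [127, 162, 131, 103, 96, 111, 80, 109, 93, 87, 120, 105, 109]

mean_a = mean(class_a)
mean_b = mean(class_b)

print(f"Class A mean IQ: {mean_a:.2f}")
print(f"Class B mean IQ: {mean_b:.2f}")
```

A single number per class answers the "which group is smarter" question far more easily than scanning 26 individual scores.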
Types of descriptive statistics:
Statistics
Summarize Data (Summary Statistics)
Central Tendency
Variation
Descriptive Statistics
Graphs
Bar Chart or Histogram
Stem and Leaf Plot
Frequency Polygon
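Example for one class: frequency 12 out of 62 observations, so relative frequency = 12/62 = 0.1935.
Class width = 2, so density = 0.1935/2 = 0.0968.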
Source: [1] William Navidi: Statistics for Engineers and Scientists, McGraw-Hill, 4th Edition, 2015.
Histogram
Unequal class widths
To construct a histogram:
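The slide's own list of steps did not survive in this text, so the following is only a minimal sketch of one common approach. It assumes the relative-frequency-over-class-width density used in the 12/62 and 0.1935/2 example earlier. The data values, the class boundaries, and the helper name `frequency_table` are hypothetical.

```python
# Hypothetical measurements, invented for illustration only.
data = [1.2, 2.5, 2.9, 3.1, 3.4, 4.8, 5.0, 5.5, 6.7, 9.3]

# Hypothetical class boundaries; each class is [left, right).
# The last class is wider, to show why density is used for bar heights.
edges = [1, 3, 5, 7, 11]


def frequency_table(values, boundaries):
    """Return one row per class: (left, right, frequency, relative frequency, density)."""
    n = len(values)
    rows = []
    for left, right in zip(boundaries[:-1], boundaries[1:]):
        freq = sum(1 for v in values if left <= v < right)
        rel_freq = freq / n
        density = rel_freq / (right - left)  # relative frequency per unit of width
        rows.append((left, right, freq, rel_freq, density))
    return rows


table = frequency_table(data, edges)
for left, right, freq, rel_freq, density in table:
    print(f"{left:>2} to <{right:<2}  freq={freq}  rel={rel_freq:.2f}  density={density:.3f}")
```

Drawing each bar with height equal to its density makes the total bar area equal to 1, so classes of unequal width stay comparable.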
A bimodal histogram has two clearly distinct modes.
Stem and Leaf Plot
Dotplots
Descriptive Statistics
Summarizing Data:
Mean
Mean
[Figure: a scale showing All of Us, the Mean, and an Outlier (Bill Gates).]
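A small sketch of how one extreme value pulls the mean away from everyone else. The income figures are hypothetical, invented only to illustrate the effect.

```python
from statistics import mean

# Hypothetical annual incomes in dollars, invented for illustration.
incomes = [32_000, 41_000, 45_000, 52_000, 60_000]

# The same group plus one extreme value (the outlier).
with_outlier = incomes + [10_000_000_000]

print("Mean without outlier:", mean(incomes))
print("Mean with outlier:   ", mean(with_outlier))
```

With the outlier included, the mean is larger than every ordinary income in the group, so it no longer describes a typical member.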
Median
When data are listed in order, the median is the point at which
50% of the cases are above and 50% below it.
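The definition can be sketched in Python as follows. Averaging the two middle values when the number of cases is even is a common convention assumed here, not something stated on the slide. The helper name `median_of` is hypothetical, and the class assignment of the IQ scores assumes the listing alternates Class B and Class A rows.

```python
def median_of(values):
    """Middle value of the ordered data; mean of the two middle values if n is even."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


# Assumption: the flattened IQ listing alternates Class B and Class A rows.
class_a = [102, 115, 128, 109, 131, 89, 98, 106, 140, 119, 93, 97, 110]
class_b = [127, 162, 131, 103, 96, 111, 80, 109, 93, 87, 120, 105, 109]

print("Class A median IQ:", median_of(class_a))
print("Class B median IQ:", median_of(class_b))
```

With 13 students per class, the median is the 7th value in sorted order, leaving six cases below it and six above.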
Median
Descriptive Statistics
Summarizing Data:
Range
The spread, or the distance, between the lowest and highest values of a variable.
To get the range for a variable, you subtract its lowest value from its highest value.
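As a quick illustration, the subtraction can be written as a one-line helper. The name `value_range` is hypothetical, and the class assignment of the IQ scores assumes the listing alternates Class B and Class A rows.

```python
def value_range(values):
    """Highest value minus lowest value."""
    return max(values) - min(values)


# Assumption: the flattened IQ listing alternates Class B and Class A rows.
class_a = [102, 115, 128, 109, 131, 89, 98, 106, 140, 119, 93, 97, 110]
class_b = [127, 162, 131, 103, 96, 111, 80, 109, 93, 87, 120, 105, 109]

print("Class A range:", value_range(class_a))  # 140 - 89
print("Class B range:", value_range(class_b))  # 162 - 80
```

The two classes have nearly the same mean, but under this reading Class B is spread over a noticeably wider range.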
Interquartile Range (IQR)
A quartile is the value that marks one of the divisions that breaks a series of values into four equal parts.
The 25th percentile is the quartile that separates the lowest ¼ of cases from the upper ¾.
The 75th percentile is the quartile that separates the lowest ¾ of cases from the upper ¼.
The interquartile range is the distance or range between the 25th percentile and the 75th percentile. Below, what is the
interquartile range?
[Figure: ordered cases split into four groups, each holding 25% of cases.]
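A minimal sketch of the IQR calculation using Python's standard `statistics.quantiles`. Textbooks and software use several slightly different rules for locating quartiles, so the default "exclusive" rule used here is an assumption and other rules may give slightly different values. The helper name `iqr` is hypothetical, and the class assignment of the IQ scores assumes the listing alternates Class B and Class A rows.

```python
from statistics import quantiles


def iqr(values):
    """75th percentile minus 25th percentile (Python's default 'exclusive' rule)."""
    q1, _median, q3 = quantiles(values, n=4)
    return q3 - q1


# Assumption: the flattened IQ listing alternates Class B and Class A rows.
class_a = [102, 115, 128, 109, 131, 89, 98, 106, 140, 119, 93, 97, 110]
class_b = [127, 162, 131, 103, 96, 111, 80, 109, 93, 87, 120, 105, 109]

print("Class A IQR:", iqr(class_a))
print("Class B IQR:", iqr(class_b))
```

Unlike the range, the IQR ignores the lowest and highest quarters of the data, so a single extreme score has little effect on it.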
Variance
The larger the variance, the further the individual cases are from the mean.
The smaller the variance, the closer the individual scores are to the mean.
Variance
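The formula on this slide did not survive in the text. The sketch below assumes the usual sample variance, the sum of squared deviations from the mean divided by n - 1, with the standard deviation as its square root. That matches the "sample variance" wording in the exercises, but it is an assumption. The helper names are hypothetical.

```python
from math import sqrt


def sample_variance(values):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    xbar = sum(values) / n
    return sum((x - xbar) ** 2 for x in values) / (n - 1)


def sample_std(values):
    """Square root of the sample variance."""
    return sqrt(sample_variance(values))


# Assumption: the flattened IQ listing alternates Class B and Class A rows.
class_a = [102, 115, 128, 109, 131, 89, 98, 106, 140, 119, 93, 97, 110]

print(f"Class A sample variance: {sample_variance(class_a):.2f}")
print(f"Class A sample std dev:  {sample_std(class_a):.2f}")
```

Squaring the deviations keeps positive and negative distances from cancelling out; taking the square root brings the result back to the original units.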
Coefficient of Variation
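The slide's definition is not present in this text. The sketch below assumes the usual one: sample standard deviation divided by the mean, often reported as a percentage. The helper name is hypothetical, and the class assignment of the IQ scores assumes the listing alternates Class B and Class A rows.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean (assumes a nonzero mean)."""
    return stdev(values) / mean(values)


# Assumption: the flattened IQ listing alternates Class B and Class A rows.
class_a = [102, 115, 128, 109, 131, 89, 98, 106, 140, 119, 93, 97, 110]
class_b = [127, 162, 131, 103, 96, 111, 80, 109, 93, 87, 120, 105, 109]

cv_a = coefficient_of_variation(class_a)
cv_b = coefficient_of_variation(class_b)

print(f"Class A CV: {cv_a:.1%}")
print(f"Class B CV: {cv_b:.1%}")
```

Because it is a ratio, the coefficient of variation has no units, which makes it useful for comparing spread between variables measured on different scales.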
Exercises
Q1. A sample of 100 adult women was taken, and each was asked how many children she
had. The results were as follows:
Exercises
Q2. A bowler’s scores for six games were 182, 168, 184, 190, 170, and 174. Using these
data as a sample, compute the following descriptive statistics.
a. Range
b. Variance
c. Standard deviation
d. Coefficient of variation
Q3. The Los Angeles Times regularly reports the air quality index for various areas of
Southern California. A sample of air quality index values for Pomona provided the
following data: 28, 42, 58, 48, 45, 55, 60, 49, and 50.
a. Compute the range and interquartile range.
b. Compute the sample variance and sample standard deviation.
c. A sample of air quality index readings for Anaheim provided a sample mean of
48.5, a sample variance of 136, and a sample standard deviation of 11.66. What
comparisons can you make between the air quality in Pomona and that in Anaheim
on the basis of these descriptive statistics?
Reading
[2] 4
[3] 7