Professional Documents
Culture Documents
CHAPTER 1: INTRODUCTION TO
STATISTICS
- Qualitative data and
Quantitative data
- Measures of central tendency
and dispersion
WHAT IS STATISTIC???
Statistics is the mathematical science involved in the application of quantitative
principles to the collection, analysis, and presentation of numerical data. The
practice of statistics utilizes data from some population in order to describe it
meaningfully, to draw conclusions from it, and make informed decisions.
Descriptive statistics
consists of organizing and summarizing the
information collected. Descriptive statistics
describes the information collected through
numerical measurements, charts, graphs and
tables.
Inferential Statistics
generalize results obtained from a sample to
the population and measure their reliability.
Outlier
PRACTICE PROBLEM
A random sample of 30 middle school students average 1.8 hours
spent on homework each night. It is believed that middle school
students spend 2 hours each night on homework. Identify the sample,
the population, the sample statistic, and the population parameter.
The sample is the 30 middle school students who are randomly selected. The
population is all middle school students.
The sample statistic is x = 1.8 hours, and the population parameter is = 2 hours.
Remember that sample statistics are values that represent a sample, while
population parameters are values that represent a population.
1.1.3 Variables
Qualitative variables allow for classification of
individuals based some attribute or
characteristics
Example: the gender of new born babies;
the marital status of people, types of cars.
Quantitative variables provide numerical
measures of individuals. (countable).
Example: The weight of children; the
numbers of cars owned.
PRACTICE PROBLEM
Determine whether the
following variables are
qualitative or quantitative.
1.
Postal Code
2.
Salary
3.
PTPTN allowances
4.
Gender
5.
Marital Status
1.
2.
3.
4.
Solution:
Bar graph:
Pie chart:
300
3%
Frequency
250
23%
200
9%
A
150
100
65%
C
D
50
0
C
Rating
Example:
Solution:
(a) & (b)
Solution:
(c) Histogram
Polygon
Solution:
(d)
PRACTICE PROBLEM
The following scores represent the final examination grade for Statistic
subject:
23 60 79 32 57 74 52 70 82 36 80 77 81 95 41 65 92 85 55 76 52
10 64 75 78 25 80 98 81 67 41 71 83 54 64 72 88 62 74 43 60 78
89 76 84 48 84 90 15 79 34 67 17 82 69 74 63 80 85 61
a)
b)
c)
d)
e)
f)
MARKS
1-10
11-20
21-30
31-40
41-50
51-60
61-70
71-80
81-90
91-100
Total
No. of
Student
1
2
2
3
4
7
10
16
12
3
60
Relative
frequence
0.017
0.033
0.033
0.050
0.067
0.117
0.167
0.267
0.200
0.050
1.000
Percentage
2
3
3
5
7
12
17
27
20
5
100
Class
boundaries
0.5-10.5
10.5-20.5
20.5-30.5
30.5-40.5
40.5-50.5
50.5-60.5
60.5-70.5
70.5-80.5
80.5-90.5
90.5-100.5
No. of Student
1
2
2
3
4
7
10
16
12
3
Class
midpoints
5.5
15.5
25.5
35.5
45.5
55.5
65.5
75.5
85.5
95.5
BAR GRAPH
18
16
HOW ABOUT
HISTOGRAM (NO
GAP)?
No. of Students
14
12
10
8
6
4
2
0
1-10
Measures of Dispersion
Mean
Range
Median
Variance
Mode
Standard deviation
Skewness
Skewness:
MEAN
POPULATION
Mean
=
1 +2 ++
=1
SAMPLE
Mean
=
1 +2 ++
=1
MEDIAN
Steps in Computing the Median of a Data Set
Step 1: Arrange the data in ascending order.
Step 2: Determine the number of observations, n .
Step 3: Determine the observation in the middle of the data set.
If the number of observations is odd, then the median is the data
value that is exactly in the middle of the data set. That is, the median
+1
is the observation that lies in the
position.
2
MODE
To compute the mode, tally the number of observations that occur for
each data value. The data that occurs most often is the mode. A set of
data can have no mode, one mode or more than one mode. If there
is no observation that occurs with the most frequency, we say the
data has no mode.
EXAMPLE
The following data represent the monthly phone bill for six randomly
selected months (in RM).
35.34 42.09 39.43 38.93 43.39 49.26
Calculate the mean, median and mode for the monthly phone bill.
Solutions:
POPULATION
Mean
1 + 2 + +
=
=
Variance
1
2
=
2
=1
=1
=1
Standard deviation
=
2
=1
=1
SAMPLE
Mean
=1
1 + 2 + +
=
=
Variance
1
2
=
1
2
=1
=1
Standard deviation
=
1
1
2
=1
=1
EXAMPLE
The following data represent the monthly phone bill for six randomly
selected months (in RM).
35.34 42.09 39.43 38.93 43.39 49.26
Compute the range, sample variance and sample standard deviation.
Sample Variance, s2
PRACTICE PROBLEM
An engineer is interested in testing the bias in a pH meter. Data are
collected on the meter by measuring the pH of a neutral substance
(pH=7.0). A sample of size 10 is taken with results given by
7.07 7.00 7.10 6.97 7.00 7.03 7.01 7.01 6.98 7.08
Compute the range, sample variance and sample standard deviation.
STANDARD
DEVIATION
POPULATION
SAMPLE
EXAMPLE
The following data give the monthly expenditures (in hundred RM) on
food for 30 households randomly selected from the households who
incurred such expenses.
4.57 3.95 6.95 3.80 1.50 3.99 7.84 5.05 8.00 14.75 9.33 1.05
5.08 7.00 9.60 18.99 9.15 11.32 4.75 9.95 3.63 1.99 1.39 13.09
19.31 11.15 7.73 12.00 7.58 16.35
Find the sample mean for the monthly expenditures on food for 30
households
Sample Variance
END OF CHAPTER 1