Professional Documents
Culture Documents
Statistics: Prepared By: Larry Jay B. Valero, LPT
Statistics: Prepared By: Larry Jay B. Valero, LPT
STATISTICS
Prepared by:
Larry Jay B. Valero, LPT
Statistics
Statistics is derived from the Latin word "status" meaning state.
1.Descriptive Statistics
2.Inferential Statistics
Descriptive Statistics
Example:
All people living in the Philippines.
All students in CVSU.
Sample
A sample data set contains a part or a subset of a population.
The size of the sample is always less than the size of the population from
which it is taken.
Subset of a population.
Example:
Some people living in the Philippines
Some students in CvSU
DATA COLLECTION
METHODS
Direct or Interview method
a person to person encounter between the interviewee and the
interviewer.
Interviewer- the one who gathers the information
Interviewee- the source of information.
Example:
Birth and Death Rates – National Statistics Office (NSO)
Number of Registered Cars – Land Transportation Office (LTO)
List of Registered Voters – Commission on Elections ( COMELEC)
Observation Method
a technique in which data particularly those pertaining to the
behaviours of individuals or group of individuals during the given
situation are best describe through observation.
NS - Not Significant
S – Significant
HS- Highly Significant
NS - Not Significant
S – Significant
HS- Highly Significant
BOX HEAD- portion that contains the column heads which describe
the data in each column
STUB- First column on the left of the table, which describes the data
on the given row
Table 1. Relationship Between Academic Performance
and the identified variables
Variable Correlation Coefficient Significance Remarks
GPA 0.7461 0.000 HS
MI 0.4015 0.000 S
IQ 0.9891 0.000 S
Gender 0.1452 0.084 NS
NS - Not Significant
S – Significant
HS- Highly Significant
Where Where
Example:
The items listed below represent the scores of seven BS Mathematics students during the final examination.
Compute the mean score
89, 75, 90, 85, 78, 87, 80
Suppose BS Applied Mathematics has 10 students and the height (in cm) are as follows: 170, 165, 155, 160, 150,
149, 152, 161, 163, 175. Find the mean height of the students.
B. Median
The middle score for a set of data arranged in order of magnitude.
best used when data has several extreme entries.
Median for ungrouped data
defined as the middle value when a set of observed values have been arranged in either ascending or
descending order.
denoted by Md
If n is ODD
If n is EVEN
Example:
The items listed below represent the scores of seven BS Mathematics students during the final examination.
Compute the median score
89, 75, 90, 85, 78, 87, 80
75 78 80 85 87 89 90
n=7
Md = 85
Example:
Suppose BS Applied Mathematics has 10 students and the height (in cm) are as follows: 170, 165, 155, 160, 150,
149, 152, 161, 163, 175. Find the median height of the students.
149 150 152 155 160 161 163 165 170 175
n = 10
C. Mode
The most frequent score in the data set
The most popular option
Mode for ungrouped data
The mode is a value which occurs most often or the most frequently occurring observation
Denoted by Mo
Example:
Consider the data set 1 2 2 2 8 1 4 10
Mo = 2
Since there is only one mode, then the distribution is Unimodal
Consider the data set 1 2 2 8 1 4 10
Mo = 1, 2
Bimodal
Consider the data set 1 2 3 8 6 4 10
This data has no mode
GROUPED DATA
Mean
where:
Median
where:
Mode
where:
Class Intervals Relative Frequency <CF >CF
18-26 2 22
27-35 1 31
36-44 15 40
45-53 5 49
54-62 8 58
63-71 6 67
72-80 3 76
Total 40
𝑋 =
∑ 𝑓 𝑖 𝑥𝑖
𝐺
𝑛
Class Intervals 𝑓 𝑖 𝑥 𝑖 Relative Frequency <CF >CF
18-26 2 22 44
27-35 1 31 31
36-44 15 40 600
45-53 5 49 245
54-62 8 58 464
63-71 6 67 402
72-80 3 76 228
Total 40
𝑋𝐺=
∑ 𝑓 𝑖 𝑥 𝑖 2014
¿
∑ 𝑓 𝑖 𝑥𝑖=2014
𝑛 40 50.35
Class Intervals 𝑓 𝑖 𝑥 𝑖 𝐶𝑙𝑎𝑠𝑠
𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 ¿ 𝑐𝑓
18-26 2 22 44 17.5 −26.5 2
𝑓 𝑚𝑑 ] ∑ 𝑓 𝑖 𝑥𝑖=2014
Median Class :
𝑛 40
2
¿ ¿ 20
2
Class Intervals 𝑓 𝑖 𝑥 𝑖 𝐶𝑙𝑎𝑠𝑠
𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 ¿ 𝑐𝑓
18-26 2 22 44 17.5 −26.5 2
𝑜𝐺 = 𝐿𝑚𝑜+ 𝑐
𝑓 𝑚𝑜 − 𝑓 𝑏
[
2 𝑓 𝑚𝑜 − 𝑓 𝑎 − 𝑓 𝑏 ] ∑ 𝑓 𝑖 𝑥𝑖=2014
Modal Class :
Class Intervals 𝑓 𝑖 𝑥 𝑖 Class Boundary <cf
c
𝑀𝑑 𝐺 =35.5+9
[ 2 ( 15 ) − 5 −1 ]
Modal Class : 𝑓 𝑚𝑜=15 𝑓 𝑏 =1
𝑓 𝑎=5
Thank you for listening
Frequency Distribution Table
Prepared by: Elaine C. Ricohermoso, LPT
TERMS
Array
is an arrangement of the numerical data/values according to order of
magnitude either ascending or descending order.
Frequency Distribution Table
is a condensed version of an array.
It categorizes the numerical data into intervals or classes.
Classes
are mutually exclusive categories defining the lower limit and the upper
limit with equal intervals
TERMS
Class frequency
is the number of observations in each class.
Class mark
class midpoint
Cumulative frequency
tells the sum of frequencies in a particular class of interest.
Relative frequency
tells the percentage of observations in a particular class of interest.
Steps in Constructing a Frequency Distribution Table
1. Make an array
1. Make an array
112 125 132 136 139 144 147 151 156 160
116 126 133 136 140 145 149 152 156 163
122 127 134 137 141 145 149 152 159 168
122 129 135 138 142 146 150 154 159 172
123 131 135 139 144 147 150 155 160 172
1. Make an array
112 125 132 136 139 144 147 151 156 160
116 126 133 136 140 145 149 152 156 163
122 127 134 137 141 145 149 152 159 168
122 129 135 138 142 146 150 154 159 172
123 131 135 139 144 147 150 155 160 172
2.
Determine
the Range R.
R = |172 – 112 | = 60
3. Determine the number of classes K using the Sturge’s Approximation: (Round-up)
=
5. Determine the lower limit of the first class.
112
112 125 132 136 139 144 147 151 156 160
116 126 133 136 140 145 149 152 156 163
2. R= 60
122 127 134 137 141 145 149 152 159 168 3. K = 7
122 129 135 138 142 146 150 154 159 172 4. C = 9
5. first lower limit is 112
123 131 135 139 144 147 150 155 160 172
Class Intervals Frequency Class Mark Class Boundary Relative Frequency <CF >CF
112 – 120 2
121 – 129 7
130 – 138 10
139 – 147 12
148 – 156 11
157 – 165 5
166 – 174 3
Total 50
Class Intervals Frequency Class Mark Class Boundary Relative Frequency <CF >CF
112-120 2 116
121-129 7 125
130-138 10 134
139-147 12 143
148-156 11 152
157-165 5 161
166-174 3 170
Total 50
𝐿𝑜𝑤𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐼𝑛𝑡𝑒𝑟𝑣𝑎𝑙 𝑜𝑓 𝑡h𝑒 𝑖𝑡h 𝑐𝑙𝑎𝑠𝑠+𝑈𝑝𝑝𝑝𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐼𝑛𝑡𝑒𝑟𝑣𝑎𝑙 𝑜𝑓 𝑡h𝑒 𝑖𝑡h 𝑐𝑙𝑎𝑠𝑠
𝐶𝑙𝑎𝑠𝑠 𝑀𝑎𝑟𝑘 =
2
Class Intervals Frequency Class Mark Class Boundary Relative Frequency <CF >CF
𝑠 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦= ( 𝐿𝑜𝑤𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐼𝑛𝑡𝑒𝑟𝑣𝑎𝑙 𝑜𝑓 𝑡h𝑒 𝑖𝑡h 𝑐𝑙𝑎𝑠𝑠 −0.5 ) −(𝑈𝑝𝑝𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐼𝑛𝑡𝑒𝑟𝑣𝑎𝑙 𝑜𝑓 𝑡h𝑒 𝑖𝑡h 𝑐𝑙𝑎𝑠𝑠+0.5)
Class Intervals Frequency Class Mark Class Boundary Relative Frequency <CF >CF
Relative Frequency
DATA ANALYSIS AND
INTERPRETATION
Descriptive Statistics
Three methods of describing a set of values
a. measures of central tendency
b. measures of dispersion
c. measures of skewness and kurtosis
Measures of Central Tendency
Measures of Central Tendency
single number that represents the typical score of the data.
are measures indicating the center of a set of data which are arranged in order of magnitude.
Three measures of Central Tendency
a. Mean
b. Median
c. Mode
A. Mean/Arithmetic Mean/Average
the most popular and well known measure of central tendency
The average value of all the data in the set
Mean for ungrouped data
defined as the sum of all the scores or data divided by the number of scores in the data.
denoted by a symbol “” for population mean and “” for sample mean.
Population mean Sample mean
Where Where
Example:
The items listed below represent the scores of seven BS Mathematics students during the final examination.
Compute the mean score
89, 75, 90, 85, 78, 87, 80
Suppose BS Applied Mathematics has 10 students and the height (in cm) are as follows: 170, 165, 155, 160, 150,
149, 152, 161, 163, 175. Find the mean height of the students.
B. Median
The middle score for a set of data arranged in order of magnitude.
best used when data has several extreme entries.
Median for ungrouped data
defined as the middle value when a set of observed values have been arranged in either ascending or
descending order.
denoted by Md
If n is ODD
If n is EVEN
Example:
The items listed below represent the scores of seven BS Mathematics students during the final examination.
Compute the median score
89, 75, 90, 85, 78, 87, 80
75 78 80 85 87 89 90
n=7
Md = 85
Example:
Suppose BS Applied Mathematics has 10 students and the height (in cm) are as follows: 170, 165, 155, 160, 150,
149, 152, 161, 163, 175. Find the median height of the students.
149 150 152 155 160 161 163 165 170 175
n = 10
C. Mode
The most frequent score in the data set
The most popular option
Mode for ungrouped data
The mode is a value which occurs most often or the most frequently occurring observation
Denoted by Mo
Example:
Consider the data set 1 2 2 2 8 1 4 10
Mo = 2
Since there is only one mode, then the distribution is Unimodal
Consider the data set 1 2 2 8 1 4 10
Mo = 1, 2
Bimodal
Consider the data set 1 2 3 8 6 4 10
This data has no mode
GROUPED DATA
Mean
where:
Median
where:
Mode
where:
Class Intervals Relative Frequency <CF >CF
18-26 2 22
27-35 1 31
36-44 15 40
45-53 5 49
54-62 8 58
63-71 6 67
72-80 3 76
Total 40
𝑋 =
∑ 𝑓 𝑖 𝑥𝑖
𝐺
𝑛
Class Intervals 𝑓 𝑖 𝑥 𝑖 Relative Frequency <CF >CF
18-26 2 22 44
27-35 1 31 31
36-44 15 40 600
45-53 5 49 245
54-62 8 58 464
63-71 6 67 402
72-80 3 76 228
Total 40
𝑋𝐺=
∑ 𝑓 𝑖 𝑥 𝑖 2014
¿
∑ 𝑓 𝑖 𝑥𝑖=2014
𝑛 40 50.35
Class Intervals 𝑓 𝑖 𝑥 𝑖 𝐶𝑙𝑎𝑠𝑠
𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 ¿ 𝑐𝑓
18-26 2 22 44 17.5 −26.5 2
𝑓 𝑚𝑑 ] ∑ 𝑓 𝑖 𝑥𝑖=2014
Median Class :
𝑛 40
2
¿ ¿ 20
2
Class Intervals 𝑓 𝑖 𝑥 𝑖 𝐶𝑙𝑎𝑠𝑠
𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 ¿ 𝑐𝑓
18-26 2 22 44 17.5 −26.5 2
Median Class :
] 𝐿𝐶𝐵 𝑚𝑑=44.5𝐶𝐹
n
c
𝑏 =18
𝑀𝑑 𝐺 = 44.5+9
5
𝑛 40 𝑓
2
¿ ¿ 20 𝑚𝑑 =5
2
Class Intervals 𝑓 𝑖 𝑥 𝑖 Class Boundary <cf
𝑜𝐺 = 𝐿𝐶𝐵 𝑚𝑜+ 𝑐
𝑓 𝑚𝑜 − 𝑓 𝑏
[
2 𝑓 𝑚𝑜 − 𝑓 𝑎 − 𝑓 𝑏 ] ∑ 𝑓 𝑖 𝑥𝑖=2014
Modal Class :
Class Intervals 𝑓 𝑖 𝑥 𝑖 Class Boundary <cf
c
𝑀𝑜𝐺 =35.5+ 9
[ 2 ( 15 ) −5 −1 ]
Modal Class : 𝑓 𝑚𝑜=15 𝑓 𝑏 =1
𝑓 𝑎=5
Thank you for listening
MEASURES OF
DISPERSION
Prepared by:
ELAINE C. RICOHERMOSO, LPT
MEASURES OF DISPERSION
• The items listed below represent the scores of seven BSIT students during
the final examination. Compute the range.
89, 75,90,85,78,87,80
R = |90 – 75| = 15
• Suppose BSIT has 10 students and the height (in cm) are as follows: 170,
165, 155, 160, 150, 149, 152, 161, 163, 175. Find the range of the score of
the students.
R = |175 – 149| = 26
B. VARIANCE
••
Mean absolute deviation
• Consider the position of each observation relative to the mean.
• The variance of given data set is the average of the sum of the square
deviation of the observation from the mean.
• The variance from the population is denoted by and for the sample.
VARIANCE FOR UNGROUPED DATA:
Formula
Example:
• The data below represents the score of 4 students from BSIT-3OLD.
3, 2, 2, 1.
Compute the variance.
••
Based on the deviations of all the scores in a series
• It is always computed from the mean
• defined as the positive square root of the variance
• denoted by “” for the population standard deviation and “s” for the
sample standard deviation.
• Population standard deviation
••
Theratio of the standard deviation and the mean and is usually expressed
in percent.
• Population
• Sample
Example:
• The data below represents the score of 4 students from BSIT-3OLD.
3, 2, 2, 1.
Compute the variance, standard deviation and the coefficient of variation.
----variance
-----standard deviation
Suppose BSIT has 6 students and the height (in cm) are as follows: 170, 166, 171, 160, 150, 161. Compute the
coefficient of variation.
GROUPED DATA
• • Range
• Standard deviation
• Coefficient of Variation
Class Intervals Relative Frequency <CF >CF
18-26 2 22
18-26 2
27-35 1 31
27-35 1
36-44
36-44
15
15 40
45-53 5 49
45-53 5
54-62 8 58
54-62 8
63-71 6 67
63-71 6
72-80 3 76
72-80 3
Class Intervals Relative Frequency <CF >CF
18-26 2 22
18-26 2
27-35 1 31
27-35 1
36-44
36-44
15
15 40
45-53 5 49
45-53 5
54-62 8 58
54-62 8
63-71 6 67
63-71 6
72-80 3 76
72-80 3
Total
Total
2 2
2 𝑛∑ 𝑓 𝑖 𝑥𝑖 −( ∑ 𝑓 𝑖 𝑥𝑖 )
𝑠𝐺 =
𝑛 (𝑛 − 1 )
Class Intervals 𝑓 𝑖 𝑥 𝑖 Relative Frequency <CF >CF
18-26 2 22 44
27-35 1 31 31
36-44 15 40 600
45-53 5 49 245
54-62 8 58 464
63-71 6 67 402
72-80 3 76 228
Total 40
2 2
∑ 𝑓 𝑖 𝑥𝑖=2014
𝑛∑ 𝑓 𝑖 𝑥𝑖 −( ∑ 𝑓 𝑖 𝑥𝑖 )
𝑛 (𝑛 − 1 )
Class Intervals 𝑓 𝑖 𝑥 𝑖 𝑥𝑖 2 𝑓 𝑥 2
𝑖 𝑖
2 2
∑ 𝑓 𝑖 𝑥𝑖=2014
2
∑ 𝑓 𝑖 𝑥𝑖 =109,108
𝑛 ∑ 𝑓 𝑖 𝑥𝑖 −( ∑ 𝑓 𝑖 𝑥𝑖 ) 40 ( 109,108 ) − ( 2014 )
𝑠 𝐺2 = ¿ ¿ 197.52
𝑛(𝑛 −1) 40(40− 1)
Class Intervals 𝑓 𝑖 𝑥 𝑖 𝑥𝑖 2 𝑓 𝑥 2
𝑖 𝑖
𝑠 𝐺2 =197.52
Class Intervals 𝑓 𝑖 𝑥 𝑖 𝑥𝑖 2 𝑓 𝑥 2
𝑖 𝑖
𝑓 𝑖 𝑥 𝑖 2014
∑ 𝑓 𝑖 𝑥𝑖 =109,108
a. Percentiles
b. Deciles
c. Quartiles
A. Percentiles
• Are values that divide a set of observations into 100 equal parts
• These values denoted by
Example:
• Given a random sample of size, n=12
• 4 7 8 2 7 5 8 9 10 14 3 4
2 3 4 4 5 7 7 8 8 9 10 14
𝑃 50 =7
This means that 50% of the values fall below 7.
𝑃2 0=4
This means that 20% of the values fall below 4.
𝑃82 = 9
This means that 82% of the values fall below 9.
B. Deciles
• Are values that divide a set of observations into 10 equal parts
• These values denoted by
• D
Example:
• Given a random sample of size, n=12
• 4 7 8 2 7 5 8 9 10 14 3 4
2 3 4 4 5 7 7 8 8 9 10 14
𝐷 5=7
This means that 50% of the values fall below 7.
𝐷 9=10
This means that 90% of the values fall below 10.
𝐷 1fall
This means that 10% of the values
=3below 3.
C. Quartiles
• Are values that divide a set of observations into 4 equal parts
• These values denoted by
• Q
Example:
• Given a random sample of size, n=12
• 4 7 8 2 7 5 8 9 10 14 3 4
2 3 4 4 5 7 7 8 8 9 10 14
𝑄 1= 4
This means that 25% of the values fall below 4.
𝑄 2=7
This means that 50% of the values fall below 7.
𝑄 3= fall
This means that 75% of the values
8 below 8.
𝑃 50 = 𝐷 5 =𝑄 2
Thank you for Listening!
Skewness and Kurtosis
Prepared by:
Larry Jay B. Valero, LPT
Skewness
• Is a measure or a criterion on how asymmetric the distribution of data is from
the mean.
• (Sample)