Professional Documents
Culture Documents
STATISTICS
TABLE OF CONTENTS
Chapter 1. Preliminary Concepts
1.1 Introduction and Basic Concepts
1.2 Variables and Data
1.3 Summation
∑ x =x +x +…+x
i 1 2 n
i=1
Where 1 and n are called the lower and upper
limits respectively. We note that x1, is read as “x
sub 1”
Chapter 2
2.1 Data Collection
This section aims to:
Identify, compare and contrast the different
types of data;
List and explain the various techniques of
selecting a sample; and
Enumerate and illustrate the different
sampling techniques.
Types of Data
Primary Data - data collected directly by the
researcher himself. These are first-hand or
original sources.
They can be collected through the ff:
3. Direct observation or measurement (primary
source of info).
4. By interview (questionnaires or rating scales).
5. By mail of recording or of recording forms.
6. Experimentation.
Secondary Data
Are information taken from published or
unpublished materials previously
gathered by other researchers or agencies
such as book, newspapers, magazines;
journals, published and unpublished
thesis and dissertations.
Two types of Sampling
Technique:
Probability Sampling - every unit has a
chance of being selected and that chance
can be qualified.
Non-Probability Sampling - every item in
a population does not have an equal
chance of being selected.
Sampling Technique
Procedure in selecting the numbers of samples
from the entire population.
Different Types
of Sampling
Techniques
Simple Random Sampling
It is recommended to prevent the possibility of a
bias or erroneous inference. Under the concept
of randomness, each member of the population
has an equal chance to be included in the sample
gathered.
Systematic Random Sampling
The items or individuals are arranged in
some way perhaps alphabetically or other
sort.
Stratified Random
Sampling
In this type of planning a population is first
divided into subsets based on homogenity called
Strata. The Strata are internally homogenous as
possible and at the same time each stratum is
different from one another as much as possible.
Cluster Sampling
Can be done by subdividing the population into
smaller units and then selecting only a random
some primary units where the study would then
be concentrated if sometimes referred are
sampling because it is frequently applied on a
geographical basis
2.2 Data Presentation
This section aims to:
Summarize and present data in different forms;
Arrange and organize the raw data into a n
array and construct the frequency distribution,
stem and lead diagram; and
Define, illustrate, and solve for the class limits,
class boundaries and class marks.
Methods in Presenting
Data
Textual Form - data in paragraph form.
Tabular Form - systematic arrangement of data in
rows and columns.
Graphical Form - a graph or chart is a device for
showing numerical values in pictorial form.
Semi Tabular/Semi Tabular Form - the
combination of Textual and Tabular Form.
Stem and Leaf Diagram
Raw data are data collected in an investigation
and they are not organized systematically. Raw
data that are presented in the form of a
frequency distribution are called grouped data.
59 50 52 38 80 62 77 56
60 61 58 62 51 36 54 18
71 54 44 52 26 63 58 56
41 34 61 50 60 53 62 62
53 43 63 71 65 79 45 66
I. Setting up an array from
the largest to the smallest
80 79 77 71 71 66 66 66
63 63 62 52 52 52 61 61
60 60 59 58 58 55 54 54
53 53 52 52 50 50 50 45
44 43 41 38 36 34 26 18
II. An array from the
smallest to the largest
18 26 34 36 38 41 43 44
45 50 50 51 52 52 53 53
54 54 55 58 58 59 60 60
61 61 62 62 62 62 53 53
66 66 66 71 71 77 79 80
III. Setting up into
stem-and-leaf diagram
1 8
2 6
3 4 6 8
4 1 3 4 5
5 0 0 1 2 2 3 4 4 5 8 8
6 0 0 1 1 2 2 2 3 3 6 6
7 1 1 7 9
8 0
Tally Method
14
12
10
8
Series1
6
0
14.5 24.5 34.5 44.5 54.5 64.5 74.5 84.5
Frequency Polygon
It is a line chart that is constructed by
plotting the frequencies and class mark
and connecting the plotted pointed by
means of a straight line; the polygon us
closed by considering an additional class
at each end and each end of the lines are
brought down to the horizontal axis at the
mid point of the additional classes.
Frequency Polygon
14
12
10
8
Series1
6
0
14.5 24.5 34.5 44.5 54.5 64.5 74.5 84.5
Ogive
It is a graph of a cumulative frequency
distribution and sometimes called a
cumulative frequency distribution graph.
Ogive
45
40
35
30
25 Series1
20 Series2
15
10
5
0
9.5 19.5 29.5 39.5 49.5 59.5 69.5 79.5
Pie Chart
It is a graphical presentation that uses
circle or pie.
Pie Chart
36 36
27 36
36 9
9 117 9
9
117
117
117
27
Chapter 3
3.1 The Mean
Mean = ∑X
n
= 883
11
= 80.2727
GROUPED DATA
Short Method
Mean = AM (∑fd/n) i
Long Method
Mean = ∑fx
n
= 2250
40
Mean = 56.25
3.2 Median and Mode
This section aims to:
Differentiate the three principal measurements
of central tendency;
Apply the computations of the median and
mode in various sets of data
B. Median - is the middle measure in a set
of measures arranged in order magnitude.
If the total number of measure is given by
the average of two middle measures.
Thus, in the median, half the distribution
lies above it.
Mode = 112
if in case of two mode, it is called bimodal.
if no mode, there is no pair of data.
C. Mode - is the item or measure which occurs
most often. It has the highest number of
frequency.
ASSUMED MEAN
Mean = AM+ (∑fd/n)i
= 64.5+ (-33/40)10
= 64.5-8.25
Mean = 56.25
Median
Median = LL+(n/2-<cf) i
f
= LL + (20-9/13)10
= 49.5 + (11/13) 10
= 49.5 + 8.4615
Median =57.9615
Mode
Mode= LL + (∆1/ ∆1+ ∆2) i
*where ∆1 = difference between the modal
class and the next lower score.
∆2 = difference between the modal
class and the next upper score.
3.3 Percentiles, Deciles,
and Quartiles
This section aims to:
Define, illustrate, and distinguish percentiles,
deciles, and quartiles; and
Discuss the formulas of percentiles, deciles, and
quartiles.
Measure of Location
Position/Location
QUARTILE (Q)
Q1- 25% 1/2
= 4905 + 0.7692
Q1 = 50.2692
Q3 = LL + 3n/4 - <cf i
f
= 59.5 + 30-22 10
13
= 59.5 + (80/13) 10
= 59.5 + 6.1538
Q3 = 65.6538
D2 = LL + 0.2n - <cf i
f
= 39.5 + 8-5 10
4
= 39.5 + (30/4) 10
= 39.5 + 7.5
D2 = 47
P23 = LL + 0.23n - <cf i
f
= 49.5 + 92.2 - 9 10
13
= 49.5 + (0.2/13) 10
= 49.5 + (0.0153)10
= 49.5 + 0.1538
P23 = 49.6538
Q3 = LL + 3n/4 - <cf i
f
= 59.5 + 30-22 10
13
= 59.5 + (80/13) 10
= 59.5 + 6.1538
Q3 = 65.6538
MEASURE OF VARIABILITY OR
DISPERSION
Measure of the scatteredness of a particular
data in a given data set.
Average of distance
1. Range = H.S. – L.S.
C.L. Range
80-89 8905-9.5 = 80
MAD = ∑|x-x|
n
3. Standard Deviation
- the most commonly used in measures of variability
UNGROUPED DATA
Sample SD
1. SD= ∑(x-x)2 2.
SD= ∑x2 – (x)2
n-1 n-1
Population SD
= ∑x-m)2
N
GROUPED DATA:
SD = i ∑f(d1)2 - ∑fd1 2
n n
4. Quartile Deviation
- semi- center quartile range.
- represent mid-point of middle part of a
distribution.
FORMULA:
UNGROUPED DATA:
QD = Q3 – Q1
2
Chapter 4
MEASURE OF VARIABILITY
COEFFICIENT OF VARIATION
Coefficient of Variation denoted by CV allows
the variability of scores in 2 sets of data that do
not necessarily measures the same thing.
The one who got highest scores is the one who
needs improvement.
FORMULA:
CV SD x 100%
x
Example:
10- Highest
1- Lowest
1 2 3 4 5 6 7 8 9 10
Coke 8 10 2 8 9 5 8 6 8 10
Pepsi 9 8 1 10 9 3 7 8 8 10
x= 7.4 (coke)
x x x-x |x-x| (x-x)2 (x)2
8 7.4 .6 .6 .36 64
10 7.4 2.6 2.6 6.76 100
2 7.4 5.4 5.4 29.16 4
8 7.4 .6 .6 .36 64
9 7.4 1.6 1.6 2.56 81
5 7.4 -2.4 2.4 5.76 25
8 7.4 .6 .6 .36 64
6 7.4 -1.4 1.4 1.96 36
8 7.4 .6 .6 .36 64
10 7.4 2.6 2.6 6.76 100
∑|x-x|=18.4 ∑(x-x)2=54.4 ∑(x)2=602
x= 7.3 (pepsi)
CV= 33.2229%
SD= 2.4585
Pepsi
SD= ∑(x-x)2
n-1
= 80.1
10-1
= 80.1
9
= √ 8.9
SD = 2.9833
SD2= 8.9
DECISION:
taste
A distribution of 2 different units is given to compare
in dispersion of heights versus in dispersion of
weights. The mean height is 5.70 feet with SD = 0.9 ft.
The mean weight is 72.5 kg with SD = 801 kg.
Compare the dispersion in heights and in weights.
HEIGHTS WEIGHTS
CV = SD x 100% CV = SD x 100%
x x
= 0.15789 = 0.111724
= 15.7985% = 11.1724%
MEASURE OF SKEWNESS
FORMULA:
SK5 = ∑f(x-x)3
n(SD)3
Negatively Skewed Distribution (all negative)
FORMULA:
Ungrouped Data Grouped
K = ∑(x-x)4 K= ∑f(x-x)4
n(SD)4 n(SD)4
Leptokurtic Distribution
Mesokurtic Distribution
Platykurtic Distribution
Chapter 5
5.1 PRINCIPLE OF COUNTING
This section aims to:
State and illustrate the principle of
counting;
Diagram the computations involving the
principle of counting; and
Apply the principle of counting in various
area of problem solving.
Principle of Counting
20 . 19 . 18 . 17 = 116280
1. Certain government employees are classified
into 2 categories
Sex: (male, female)
Marital Status : (single. Married, widow,
separated)
2 . 4 = 8
GENERALIZATION OF
PRINCIPLE OF COUNTING
If a choice has k steps of which the first
can be made ian N1 ways, of which each
of these 2nd can be made in n2 ways…. 3rd
of which of these kth can be made… in nk
which then the whole choice can be made
by n1 . n2 . nk(ways)
EXAMPLES:
1. How many nos, of five(5) digits each can be made from the digit 1-9 if:
a. No. must be odd
b. The last two(2) digit each no. are even number.
1 ,2 ,3 , 4 , 5
5.2 Permutations
This section aims to:
Define and illustrate permutations
Apply permutations in various situational
conditions; and
State and illustrate the circular permutation.
PERMUTATIONS
Arrangement of group of things in a definite
order that is, there is a 1st element, 2nd element,
3rd element etc. In other words, the order of
arrangement of an element is important.
EXAMPLES:
1. In how many ways can the five(5) starting position on the PBA
team with 12 mean who can play any of the position.
12P5 = 12! = 12! = 12 .11 .10 .9 . 8 . 7 . 6 . 5 .4 .3 .2 .1 = 95,040
(12-5) 7! 7!
1ST Formula:
• the number if permutation n
things taken n at a time is nPn=n!
CIRCULAR PERMUTATION
P = (n-1)!
COMBINATION
A combination also concerns arrangement but
without regards to order. This means that the
order or arrangement in which the element are
taken is not important.
nCr = n!
r!(n-1)!
The End
Thank You!
Presenters:
Mary Ann Frogosa
Mary Ann Mosquerra
BOA IV-1