Professional Documents
Culture Documents
--------------------------------------------
REVIEW NOTES/ EXERCISES
Compiled by:
Melissa E. Agulto, Ph.D.
Professor, Central Luzon State University
2015 Review Class
(a)
STATISTICS: REVIEW NOTES and
EXERCISES
I. BASIC CONSIDERATIONS
II. ELEMENTS OF SAMPLING AND
DESCRIPTIVE STATISTICS
III. SETS AND PROBABILITY
IV. RANDOM VARIABLES
V. SAMPLING DISTRIBUTIONS
VI. ESTIMATION
VII. TESTS OF HYPOTHESES
VIII. REGRESSION AND CORRELATION
I. BASIC CONSIDERATIONS
Statistics, Defined
(a) (b)
(c ) _________
(d) __________
(e) Stem-and-Leaf Plot
.
Ages of Students
18 20 21 27 29 20
19 30 32 19 34 19
24 29 18 37 38 22
30 39 32 44 33 46
54 49 18 51 21 21
Stem-and-Leaf Plot
Ages of Students
Key: 1|8 = 18
1 888999
2 0011124799 Most of the values lie between 20
3 002234789 and 39.
4 469
5 14
This graph allows us to see the
shape of the data as well as the
actual values.
Exercise. Try your own Stem and Leaf Plot with the
following temperatures (deg F)
77 80 82 68
57 50 62 69
67 70 65 76
87 82 83 79
Temperatures
Tens Ones
5 07
6 25789
7 0679
8 02237
(f) _______
Ages of Students
15 18 21 24 27 30 33 36 39 42 45 48 51 54 57
Class Frequency, f
1–4 4
Lower and 5–8 5
Upper Class
________ 9 – 12 3 Frequencies
13 – 16 4
17 – 20 2
The _____________ is the distance between lower (or upper)
limits of consecutive classes.
Class Frequency, f
1–4 4
5–1=4 5–8 5
9–5=4 9 – 12 3
13 – 9 = 4 13 – 16 4
17 – 13 = 4 17 – 20 2
Ages of Students
18 20 21 27 29 20
19 30 32 19 34 19
24 29 18 37 38 22
30 39 32 44 33 46
54 49 18 51 21 21
Continued.
Constructing a Frequency Distribution
36 = 7.2 Round up to 8.
Class width =
5
Continued.
Constructing a Frequency Distribution
3. The minimum data entry of 18 may be used for the lower limit of
the first class. To find the lower class limits of the remaining
classes, add the width (8) to each lower limit.
The lower class limits are 18, 26, 34, 42, and 50.
The upper class limits are 25, 33, 41, 49, and 57.
4. Make a tally mark for each data entry in the appropriate class.
5. The number of tally marks for a class is the frequency for that
class.
Continued.
Constructing a Frequency Distribution
Number of
Ages students
Ages of Students
Class Tally Frequency, f
18 – 25 13
26 – 33 8
34 – 41 4
42 – 49 3
Check that the
50 – 57 2 sum equals the
number in the
f 30
sample.
Midpoint
Relative
Class Frequency, f Frequency
18 – 25 13
26 – 33 8
34 – 41 4
42 – 49 3
50 – 57 2
f 30
Relative Frequency
Relative Portion of
Class Frequency, f Frequency students
18 – 25 13 0.433 f 13
26 – 33 8 0.267 n 30
34 – 41 4 0.133 0.433
42 – 49 3 0.1
50 – 57 2 0.067
f
f 30 1
n
Cumulative Frequency
The cumulative frequency of a class is the sum of the frequency
for that class and all the previous classes.
Ages of Students
Cumulative
Class Frequency, f Frequency
18 – 25 13
26 – 33 + 8
34 – 41 + 4
42 – 49 + 3
50 – 57 + 2
f 30
Cumulative Frequency
The cumulative frequency of a class is the sum of the frequency
for that class and all the previous classes.
Ages of Students
Cumulative
Class Frequency, f Frequency
18 – 25 13 13
26 – 33 +8 21
34 – 41 +4 25
42 – 49 + 3 28
Total number of
50 – 57 + 2 30 students
f 30
Drawn below is a _______________ for the “Ages of
Students” frequency distribution. The class boundaries are
used..
14 13 Ages of Students
12
10
8
8
f 6
4
4 3
2 2
0
17.5 25.5 33.5 41.5 49.5 57.5
Broken axis
Age (in years)
Frequency Histogram
14 13 Ages of Students
12
10
8
8
f 6
4
4 3
2 2
0
17.5 25.5 33.5 41.5 49.5 57.5
Broken axis
Age (in years)
A_______________ is a line graph that emphasizes the
continuous change in frequencies.
14
Ages of Students
12
10
8 Line is extended to
the x-axis.
f 6
4
2
0
13.5 21.5 29.5 37.5 45.5 53.5 61.5
Broken axis
Age (in years) Midpoints
Frequency Polygon
0.5
0.433
(portion of students)
Relative frequency
0.5
0.433
(portion of students)
Relative frequency
30 Ages of Students
Cumulative frequency
(portion of students)
24
18
The graph ends at
the upper
12 boundary of the
last class.
6
0
17.5 25.5 33.5 41.5 49.5 57.5
Age (in years)
Cumulative Frequency Graph
A cumulative frequency graph or ogive, is a line graph
that displays the cumulative frequency of each class at its
upper class boundary.
30 Ages of Students
Cumulative frequency
(portion of students)
24
18
The graph ends at
the upper
12 boundary of the
last class.
6
0
17.5 25.5 33.5 41.5 49.5 57.5
Age (in years)
II. ELEMENTS OF SAMPLING AND
DESCRIPTIVE STATISTICS
Sampling Methods
Non-probability Sampling – the elements of the
universe or population have no known
chance of being taken in the sample
Probability Sampling –assigns a known probability
of selection for all possible samples; allows
for the computation of sampling error, or the
error in inference inherent to the fact that
what was observed was only a sample
Classification of Sampling Methods
Sampling
Methods
Probability Non-
Samples probability
Simple
Cluster Judgment Quota
Random
Sampling Procedures
The population is the age of all his students in his hydrology class
A sample is any subset of that population. For example, we might
select 10 students and determine their age.
The variable is the “age” of each student in the class.
One data would be the age of a specific student.
The parameter of interest is the “average” age of all students in his
hydrology class.
The statistic is the “average” age for all students in the sample.
Some Descriptive Statistics
MEASURES OF CENTAL TENDENCY – values
computed from the data that tend to center or
cluster around
Arithmetic Mean – the arithmetic average of all the values
Ungrouped data
X = ∑ Xi / n
where n = sample size
Grouped data
X = ∑ fi Xi / ∑ fi
53 32 61 57 39 44 57
Calculate the population mean.
Mean
Exercise:
• The following are the ages of all seven employees of
a small company:
53 32 61 57 39 44 57
Calculate the population mean.
Class x f
18 – 25 21.5 13
26 – 33 29.5 8
34 – 41 37.5 4
42 – 49 45.5 3
50 – 57 53.5 2
n = 30
Class x f (x · f )
18 – 25 21.5 13 279.5
26 – 33 29.5 8 236.0
34 – 41 37.5 4 150.0
42 – 49 45.5 3 136.5
50 – 57 53.5 2 107.0
n = 30 Σ = 909.0
(x w ) 87 0.87
x
w 100
53 32 61 57 39 44 57
Exercise:
Find the mode of the ages of the seven employees.
53 32 61 57 39 44 57
The mode is 57 because it occurs the most times.
An outlier is a data entry that is far removed from the other entries
in the data set.
Exercise : Find Mode
53 32 61 57 39 44 57
53 32 61 57 39 44 57
(N+1)/2 - S
Md = LMd + w
fMd
Where:
LMd = lower limit of the median class
N = number of observations in the sample
S = sum of the frequencies in all classes
preceding the median class
fMd = frequency of the median class
w = width of the median class
Finding Quartiles
The quiz scores for 15 students is listed below. Find the first,
second and third quartiles of the scores.
28 43 48 51 43 30 55 44 48 33 45 37 37 42 38
Exercise:
The quiz scores for 15 students is listed below. Find the first,
second and third quartiles of the scores.
28 43 48 51 43 30 55 44 48 33 45 37 37 42 38
Q1 Q2 Q3
About one fourth of the students scores 37 or less; about one half score
43 or less; and about three fourths score 48 or less.
MEASURES OF DISPERSION – values used to
describe the extent of dispersion or variability
of data
Range – the difference between the largest and the
smallest measurement in the data set
R = Xmax - Xmin
∑ │Xi - X│
MD =
n
Exercise : Consider the following set of test scores for Rob in Statistics
Range = 82 – 40
= 42 i
Mean = (40+58+60+82)/4
= 240/4 = 60
Mean Deviation = 44/4
_ = 11
X / X – X/
40 20
58 2
60 0
82 22
Total 44
Population Variance and Standard Deviation
x
56
58
61
63
67
Σx = 305
Finding the Population Variance and
Standard Deviation
Find the population variance and standard deviation for the
following set of data. The population mean is 61.
Always positive!
Solution:
X (X−X¯)2
2 (2−7)2=25 X¯ = ∑X/ n = 42/6 = 7
4 9
8 1 S = √ ∑(X−X¯)2 / (n –1) = √70 / 5 = 3.74
6 1
10 9 CV = (3.74 / 7) (100) = 53.43%
12 25
∑X=42 ∑(X−X¯)2=70
Interquartile Range
The quartiles for 15 quiz scores are listed below. Find the
interquartile range.
Q1 = 37 Q2 = 43 Q3 = 48
The range is
a. 50 b. 30 c. 40 d, 10
Five-number summary
• The minimum entry 28
• Q1 37
• Q2 (median) 43
48
• Q3
55
• The maximum entry
Box and Whisker Plot
Five-number summary
• The minimum entry 28
• Q1 37
• Q2 (median) 43
48
• Q3
55
• The maximum entry
Quiz Scores
28 37 43 48 55
28 32 36 40 44 48 52 56
Exercises:
Symmetric
MEASURES OF SKEWNESS – values measuring the
extent of departure of the distribution from
symmetry
a. Pearson’s First Coefficient of Skewness
Skewness = ( X - Mo) / S
b. Pearson’s Second Coefficient of Skewness, SK
SK = 3 ( X – Md)/ S
m3 ∑ ( X - X)3 / n
α3 = ------ = -------------------
S3 S 3
= m3 / m23/2
where
m3 = ∑(x−x̅)3 / n and m2 = ∑(x−x̅)2 / n
m3 is called the third moment of the data set.
m2 is called the second moment of the data set;
also known as the variance,
or the square of the standard deviation.
c. Moment Coefficient of Skewnessr the
[n(n-1)]1/2
a3 = ----------------------- α3
(n-2)
Exercise:
Here are grouped data for heights of 100 randomly selected
male students, adapted from Spiegel and Stephens (1999, 68).
n = 5+18+42+27+8 = 100
x̅ = (61×5 + 64×18 + 67×42 + 70×27 + 73×8) ÷ 100
x̅ = 9305 + 1152 + 2814 + 1890 + 584) ÷ 100
x̅ = 6745÷100 = 67.45
Class Frequency
Mark, x
xf (x−x̅) (x−x̅)²f (x−x̅)³f
f
61 5 305 -6.45 208.01 -1341.68
64 18 1152 -3.45 214.25 -739.15
67 42 2814 -0.45 8.51 -3.83
70 27 1890 2.55 175.57 447.70
73 8 584 5.55 246.42 1367.63
∑ 6745 n/a 852.75 −269.33
x̅, m2, m3 67.45 n/a 8.5275 −2.6933
The sample skewness:
[n(n-1)]1/2
a3 = ----------------------- α3
(n-2)
m4 ∑ ( X - X)4 / n
a4 = -------- = -------------------
m22 ( S2)2
where
m4 = ∑(x−x̅)4 / n
m2 = ∑(x−x̅)2 / n = S2
If a4 = 3, distribution is normal
b. Excess Kurtosis
The excess kurtosis is generally used because
the excess kurtosis of a normal distribution is 0
For the Population Excess Kurtosis , K
κ = ( a4 - 3)
If K = 0, distribution is normal
K > 0, distribution is leptokurtic
K < 0, distribution is platykurtic
The sample excess kurtosis uses the formula below,
which comes from Joanes and Gill:
(n – 1)
k = --------------------- [(n + 1) κ + 6]
(n - 2) (n – 3)
Exercise:
Let’s continue with the example of the heights,
and compute the kurtosis of the data set.
n = 100, x̅ = 67.45 inches, and the variance m2 = 8.5275 in²
The kurtosis is
a4 = m4 / m2² = 199.3760/8.5275² = 2.7418
and the excess kurtosis is
κ = 2.7418−3 = −0.2582
And the sample excess kurtosis is:
k = [99/(98×97)] [101×(−0.2582)+6)] = −0.2091
Class Frequency
x−x̅ (x−x̅)4f
Mark, x ,f
61 5 -6.45 8653.84
This sample is
slightly platykurtic: 64 18 -3.45 2550.05
its peak is just a bit 67 42 -0.45 1.72
shallower than the 70 27 2.55 1141.63
peak of a normal
73 8 5.55 7590.35
distribution.
∑ n/a 19937.60
m4 n/a 199.3760
III. SETS AND PROBABILITY
Definition
Let P = { a, e, i, o, u }
Q = { x, y, z }
Then,
P∩Q=Ø Sets P and Q are disjoint;
that is P and Q have no
elements in common
Set Operations
The union of two sets A and B is the set of
elements that belong to A or to B, or to both.
Let A = { 1, 3, 5, 8, 10 }
B = { 3, 6, 8 }
Then,
A U B = { 1, 3, 5, 6, 8, 10 }
Set Operations
If A is a subset of the universal set U, then the
complement of A with respect to U is the set of all
elements of U that are not in A. The complement of
A is denoted by A’.
From the above definitions/operations,
it should be noted that:
A∩Ø = Ø
A U Ø = A
A ∩A’ = Ø
A U A’ = U
U’ = Ø
Ø’ = U
(A’)’ = A
Venn diagrams for
Total is 8 * 25 = 200
Exercise. Participants at a large conference are
offered 6 sightseeing tours on each of 3
days.
Solution.
a. N = (6)(6)(6) = 216 ways
b. N = (6)(5)(4) = 120 ways
Exercise
Total is 8 + 25 = 33
Permutations
The number of permutations of n distinct objects
taken n at a time is:
n!
Solution.
N = n!
= 4!
= (4)(3)(2)(1)
= 24
Exercise:
7! = 7 · 6 · 5 · 4 · 3 · 2 · 1
= 5040 surveys
Exercises:
Your little brother just broke into your Facebook account
and now you need to change your password. You want it to
be 5 characters long. The first 3 characters are to be letters
and the last 2 characters are to be numbers.
How many passwords are possible?
Answer: 26x26x26x10x10=1,757,600
What if you don’t want to repeat letters and numbers?
Answer: 26x25x24x10x9=1,404,000
Answer: 6x5x4x3x2x1=6!=720
Permutations
The number of permutations of n distinct objects
taken r at a time is:
n!
nPr = ---------
(n-r)!
8!
(8 5)! 8!
3!
= 8 7 6 5 4 3 2 1
3 2 1
6720 wa ys
Exercise
An inspector randomly selects 2 of 5
parts for inspection. In a group of 5
parts, how many permutations of 2
parts can be selected?
Exercise
An inspector randomly selects 2 of 5
parts for inspection. In a group of 5
parts, how many permutations of 2
parts can be selected?
5! 5! (5)(4)(3)(2)(1) 120
P
2
5
20
(5 2)! 3! (3)(2)(1) 6
AB BA AC CA AD DA AE EA BC CB BD DB BE EB CD DC CE EC
DE and ED
Permutations
The number of permutations of n distinct objects
arranged in a circle is
( n – 1)!
Solution.
N = (11 – 1)! = 10! = 3,628,800 ways
Permutations
The number of distinct permutations of n things of
which n1 are of one kind, n2 are of a second kind,
…, nk of a kth kind is:
n!
---------------------
n1! n2! … nk!
Exercise. How many distinct permutations can be
formed from the letters of the word
“PHILIPPINES”
Solution.
P=3 H=1 I=3 L=1 N=1 E =1 S =1
11!
P3, 3, 1, 1, 1, 1
11 or N = ------------------------- = 1,108,800 ways
3! 3! 1! 1! 1! 1!
Exercise
9! 9•8•7•6•5•4! 9•8•7•6•5
----------- = ------------------ = --------------- = 1260
4!•3!•2! 4!•3!•2! 3•2•1•2•1
Exercise
Hannah: 6!/(2!2!2!)=90
Christopherson: 14!/(2!2!2!2!)=5,448,643,2000
Permutations
Exercise:
A combination lock will open when the right
choice of three numbers (from 1 to 30,
inclusive) is selected. How many different
lock combinations are possible assuming no
number is repeated?
30! 30!
30 p3 30 * 29 * 28 24360
( 30 3)! 27!
Permutations
Exercise:
From a club of 24 members, a President,
Vice President, Secretary, Treasurer and
Historian are to be elected. In how many
ways can the offices be filled?
24! 24!
24 p5
( 24 5)! 19!
24 * 23 * 22 * 21 * 20 5,100,480
Exercise:
In some states, license plates have six characters:
three letters followed by three numbers. How many
distinct such plates are possible? (hint: with
replacement)
8 7 6 5 4 3 2 1 8 7 6
8 C3 56
(3 2 1) (5 4 3 2 1) 3 2 1
Exercise
Answer : C25=142,506
30
Combinations
Exercise:
A student must answer 3 out of 5 essay
questions on a test. In how many
different ways can the student select the
questions?
5! 5! 5 * 4
5 C3 10
3! (5 3)! 3!2! 2 * 1
Combinations
Exercise:
A basketball team consists of two centers, five
forwards, and four guards. In how many ways can
the coach select a starting line up of one center,
two forwards, and two guards?
2 C1 * 5 C 2 * 4 C 2
12
P(cruise) = 0.24
50
Probability
Exercise:
A farmer predicts that the probability he gets a good
harvest is 0.75.
Some Probability Rules
Ages Frequency, f
18 – 25 13
26 – 33 8
34 – 41 4
42 – 49 3
50 – 57 2
Probabilities with Frequency Distributions
Exercise:
The following frequency distribution represents the ages of
30 students in a statistics class. What is the probability that
a student is between 26 and 33 years old?
Ages Frequency, f
8
18 – 25 13 P (age 26 to 33)
30
26 – 33 8
0.267
34 – 41 4
42 – 49 3
50 – 57 2
f 30
Exercise. A coin is tossed 6 times in succession. What
is the probability that at least 1 head occurs?
P(E1 ∩ E2)
P(E1/E2) = --------------
P(E2)
Then,
P(AUB) = P(A) + P(B) - P(A∩B)
= 4/52 + 13/52 - 1/52
= 16/52 = 4/13
Other Exercises
Experiment:
Toss a coin three times.
Observe the number of
heads. The possible results
are:
Zero head,
One head,
Two heads, and
Three heads.
x p(x)
1 p(x=1)=1/6
2 p(x=2)=1/6
3 p(x=3)=1/6
4 p(x=4)=1/6
5 p(x=5)=1/6
6 p(x=6)=1/6
1.0
Cumulative Distribution Function
F ( x) Pr( X x)
is called the cumulative distribution
function of X.
Cumulative Distribution Function (CDF)
1.0 P(x)
5/6
2/3
1/2
1/3
1/6
1 2 3 4 5 6 x
Cumulative Distribution Function
x P(x≤A)
1 P(x≤1)=1/6
2 P(x≤2)=2/6
3 P(x≤3)=3/6
4 P(x≤4)=4/6
5 P(x≤5)=5/6
6 P(x≤6)=6/6
Exercise
12 .25
1.0
Answer (b)
x f(x)
Though this sums to 1,
you can’t have a
1 (3-1)/2=1.0 negative probability;
therefore, it’s not a
2 (3-2)/2=.5 probability function.
3 (3-3)/2=0
4 (3-4)/2=-.5
Answer (c)
x f(x)
0 1/25
1 3/25 Doesn’t sum to 1. Thus, it’s
not a probability function.
2 7/25
3 13/25
24/25
Exercise
x 10 11 12 13 14
P(x) .4 .2 .2 .1 .1
a. 1/6
b. 1/3
c. 1/2
d. 5/6
e. 1.0
Exercise
a. 1/6
b. 1/3
c. 1/2
d. 5/6
e. 1.0
Exercise
a. 1/5
b. 2/3
c. 1/2
d. 5/6
e. 1.0
Exercise
n x n x
P ( x) p q
x
Binomial Distribution Properties
Properties:
Mean = np
Variance = npq
q p
Skewness = npq
Kurtosis= 1 6 pq
3
npq
Exercise
a. 20/100
b. 20
c. 80/100
d. 80
Binomial Distribution - Exercise
20
0 20 20 !
(.
5) (.
5) (.
5)209
.5x10 7
0 20
!0!
20
1 19 20 !
(.
5) (.
5) (.
5)2020x9.5x
10 7
1.9x
105
1 19
!
1!
20
2 18 20 !
(.
5) (.
5) (.
5)20190
x9.5x
10 7
1.8x
10 4
2 18
!
2!
4
1.8x
10
Binomial Distribution
Exercise
Binomial Distribution
Exercise
Exercise
10 5 5
a. (.50) (.50)
0
b. 10 5
(.50) (.50)
5
5
c. 10 10 5
(.50) (.50)
5
d.
10 10 0
(.50) (.50)
10
Exercise
10 5 5
a. (.50) (.50)
0
b. 10 5
(.50) (.50)
5
5
c. 10 10 5
(.50) (.50)
5
d.
10 10 0
(.50) (.50)
10
Exercise
a. .5, .25
b. 1.0, 1.0
c. 1.5, .5
d. .25, .5
e. .5, .5
Exercise
a. .5, .25
b. 1.0, 1.0
c. 1.5, .5
d. .25, .5
e. .5, .5
Hypergeometric Probability Distribution
Consider an urn with N balls, M of which are white and N-M are red. Suppose
that we draw a sample of n balls at random (without replacement) from
the urn, then the probability of getting k white balls out of n (k<n) will
follow hypergeometric distribution.
pdf: Hg(N,M,n)
M N M
P(X k) k n k ; k 0 ,1, 2 ,...., min( n , M )
N
n
Hypergeometric Distribution
50 150
k 30 k
P(X k) ; k 0 ,1, 2 ,...., 30
200
30
50 150
11 19
P ( X 11 )
200
30
Exercise: Milk Container Contents
f(x)
area=1
a b x
Features:
• Has a bell-shaped curve
• Has a mean μ and variance σ2 , X ~ N (μ, σ2 )
• Normal curve is symmetric
• Mean, median, and mode coincide at the center of
the curve
• Asymptotic to the X-axis
• Has a total area equal to one
The Normal Distribution
0
1
0
Calculating Normal Probabilities
• P(–.5 < Z < 1) looks like this:
0 1.6
-2.23 0 2.23
0 1.52
0 0.9 1.9
P(0.9 < Z < 1.9) = P(0 < Z < 1.9) – P(0 < Z < 0.9)
=.4713 – .3159
= .1554
Exercise
• The return on investment is normally distributed with
a mean of 10% and a standard deviation of 5%.
What is the probability of losing money?
X 0 10
P ( X 0) P
5
P ( Z 2)
.5 P ( 0 Z 2)
.5 .4772
.0228
Finding Values of Z
• What value of z corresponds to an area under the curve
of 2.5%? That is, what is z.025 ?
μ = 10 μ=0
σ=5 σ=1
x z
μ =10 15 μ =0 1
Same area
P(x < 15) = P(z < 1) = Shaded area under the curve
= 0.8413
Probability and Normal Distributions
Exercise:
The average on a statistics test was 78 with a standard deviation of 8.
If the test scores are normally distributed, find the probability that a
student receives a test score less than 90.
μ = 78 x - μ 90 - 78
σ=8 z =
σ 8
= 1.5
P(x < 90)
x - μ 85 - 78
μ = 78 z = =
σ 8
σ=8
= 0.875 0.88
P(x > 85)
The probability that a
x student receives a test
μ =78 85 score greater than 85 is
z
μ =0 0.88
?
0.1894.
P(x > 85) = P(z > 0.88) = 1 P(z < 0.88) = 1 0.8106 = 0.1894
Probability and Normal Distributions
Exercise:
The average on a statistics test was 78 with a standard deviation of 8.
If the test scores are normally distributed, find the probability that a
student receives a test score between 60 and 80.
x - μ 60 - 78 = -2.25
z1 = =
σ 8
P(60 < x < 80) x - μ 80 - 78 = 0.25
z2 =
σ 8
μ = 78
σ=8
The probability that a
x student receives a test
60 μ =78 80 score between 60 and 80
z
2.25
? μ =0 0.25
?
is 0.5865.
P(60 < x < 80) = P(2.25 < z < 0.25) = P(z < 0.25) P(z < 2.25)
= 0.5987 0.0122 = 0.5865
Transforming a z-Score to an x-Score
x μ + zσ.
Exercise:
The monthly electric bills in a city are normally distributed with a
mean of P120 and a standard deviation of P16. Find the x-value
corresponding to a z-score of 1.60.
x μ + zσ
= 120 + 1.60(16)
= 145.6
We can conclude that an electric bill of P145.60 is 1.6 standard
deviations above the mean.
Finding a Specific Data Value
Exercise:
The weights of bags of chips for a vending machine are normally
distributed with a mean of 1.25 ounces and a standard deviation
of 0.1 ounce. Bags that have weights in the lower 8% are too
light and will not work in the machine. What is the least a bag of
chips can weigh and still work in the machine?
P(z < ?) = 0.08
? 1.25
1.25 (1.41)0.1
1.11
1.11
The least a bag can weigh and still work in the machine is 1.11 ounces.
V. SAMPLING DISTRIBUTIONS
Terminologies:
• Sampling Distribution - the probability distribution
of statistic
• Standard Error of the Statistic – the standard
deviation of the sampling distribution
The Central Limit Theorem
The sampling distribution of sample means has a mean equal to the population
mean.
μx μ Mean of the
sample means
The sampling distribution of sample means has a standard deviation equal to the
population standard deviation divided by the square root of n.
σ
σx Standard deviation of the
sample means
n
This is also called the
standard error of the mean.
The Mean and Standard Error
Exercise:
The heights of fully grown bushes have a mean height of 8 feet and a standard
deviation of 0.7 feet. 38 bushes are randomly selected from the population, and
the mean of each sample is determined. Find the mean and standard error of
the mean of the sampling distribution.
= 1 - P( Z < 2.646)
= 1 - 0.9959 = 0.0041
t- Distribution
If X and S2 are the mean and variance, respectively, of
a random sample of size n taken from a normal
population having the mean μ and unknown variance
σ2, then _
X - μ
t =
S / (n)1/2
… it is positively skewed
… it is non-negative
… it is based on degrees of
freedom
…when the degrees of freedom
change a new distribution is
created
Chi-Square Distribution
1
F 1- α (df1, df2) =
F α (df2, df1)
F-Distribution
Exercise. If S12 and S22 represent the variances of
independent random samples of size n1 = 25 and n2 = 31,
taken from normal populations with variances σ12 = 10
and σ22 = 15, respectively, find P (S12 / S22 > 1.26)
F-Distribution
Exercise. If S12 and S22 represent the variances of
independent random samples of size n1 = 25 and n2 = 31,
taken from normal populations with variances σ12 = 10
and σ22 = 15, respectively, find P (S12 / S22 > 1.26)
Solution.
σ22 S12
F =
σ12 S22
15
= (1.26) = 1.89
10
From the F-distribution table:
P (S12 / S22 > 1.26) = 0.05
F-distribution Table
Critical values of F are found in a series of tables
for different values of
For any given , tables are read as follows:
Given n1 = 5, n2 = 9, F=3.84
10 · 10 · 10 · 10 · 10 = 100,000 codes
b.) Because each digit cannot be repeated, there are 10 choices for the first
digit, 9 choices left for the second digit, 8 for the third, 7 for the fourth
and 6 for the fifth.
10 · 9 · 8 · 7 · 6 = 30,240 codes
Exercise
Say that you have 42 friends on Facebook. Any
time you access your profile, 6 of your friends will
appear in a box below your picture. Each time
these friends are randomly generated out of your
42 friends. How many different arrangements of 6
people can show up?
Answer:
42x41x40x39x38x37= 3,776,965,920
5 x4 x3
5 x 4 x 3 x 2 x1 5!
2 x1 2!
5!
( 5 3 )!
Combinations