Professional Documents
Culture Documents
1.1 Definition, Natures and Importance of Statistics
1.1 Definition, Natures and Importance of Statistics
1.1 Definition, Natures and Importance of Statistics
2009
CHAPTER 1:
INTRODUCTION
At the end of this chapter, the students are expected to:
a. Determine the nature of statistics applicable to a given situation;
b. Identify population and variable in a given situation or problem; and,
c. Classify the data according to variable type and appropriate level of
measurement.
1.1
Surveys
Consumer Preference
Experiments
Sampling
Economics
NATURES OF STATISTICS
DESCRIPTIVE STATISTICS
It deals with the methods of organizing, summarizing and presenting a mass
of data so as to yield meaningful information.
INFERENTIAL STATISTICS
2009
It deals with making generalizations about a body of data where only a part
of it is examined. This comprises methods concerned with the analysis of a
subset of data leading to predictions or inferences about the entire set of
data.
Example: Determine whether the following statements use the area of descriptive
statistics or statistical inference.
1. A bowler wants to find his bowling average for the past 12 games.
2. A manager would like to predict based on previous years sales, the sales
performance of a company for the next five years.
3. A politician would like to estimate, based on an opinion poll, his chance for
winning in the upcoming senatorial election.
4. A teacher wishes to determine the percentage of students who passed the
examination.
5. A student wishes to determine the average monthly expenditures on school
supplies for the past five months.
6. A basketball player wants to estimate his chance of winning the most
valuable player (MVP) award based on his current season averages and the
averages of his opponents.
Answer:
1.
2.
3.
4.
5.
6.
___________________
___________________
___________________
___________________
___________________
___________________
2009
Name:_________________________________
Course/Year/Section:_____________________
Teacher:_________________
Score:___________________
Exercise 1.1
Direction: Classify the following statements as belonging to the area of
descriptive statistics or statistical inference. Write DS for descriptive statistics
and write IS for inferential statistics on the space provided.
_________________1. Yesterdays records show that five (5) employees were absent
due to Dengue fever.
_________________2. If the present trend continues, architects will construct more
contemporary homes than colonials in the next 5 years.
_________________3. In a certain city, arsonists deliberately set 3% of all fires
reported last year.
_________________4. At least 30% of all new homes being built today are of a
contemporary design.
_________________5. As a result of a recent poll, most Filipinos are in favor of
finding work employment abroad.
_________________6. Philippines Gross Domestic Product (GDP) grows by 4.6% in
2002, 1.4 percentage higher than its 3.2% performance in
2001.
_________________7. The average grade of 10 students in English is 89.46%.
_________________8. Based from the present sales trend, it is expected that after
two years, this years sales will be doubled.
_________________9. All four provinces of ARMM are among the 10 poorest
provinces in the Philippines for 2002.
_________________10. During the period 1996 to 2002, unemployment rates among
women were consistently higher compared to men except in
1999 and 2000.
1.2
2009
2009
Name:________________________________
Course/Year/Section:____________________
Score:___________________
Teacher:_________________
Exercise 1.2.1
Direction: Identify the population, variable of interest, and type of variable in the
following:
1. The dean of CvSU-CBE would like to determine the average weekly
allowance of BS Computer Science students.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
2. The registrar of CvSU-CBE would like to conduct a survey on the preferred
courses of 4th year high school students in Cavite.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
3. The dean of the CBE would like to know the number of students who are
smoking.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
4. A survey by a group of students entitled Dress Code will be conducted to
first year students to determine the fashion preferences of these students.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
5. Information will be collected to new voters for 2004 election to identify their
opinion regarding politics in the Philippines.
Population:______________________________________________________________
5
2009
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
6. From all students registered this semester, the Mathematics Department
would like to know how many students like mathematics.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
7. A study to be conducted by NGO would determine the Filipinos awareness
about the war against IRAQ.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
8. A group of students taking Statistics conducted a study on the effect of boygirl relationship to the academic performance of the students.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
9. Some parents would like to determine whether Counter Strike is good or bad
to the behavior of their children.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
10.
The head librarian would like to identify the book/s commonly read by
CvSU-CBE students.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
11.
A statistics teacher, Mrs. Nacion, would like to determine whether the
number of students in a class at the start of the semester can determine the
number of failures.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
2009
Name:________________________________
Course/Year/Section:____________________
Score:___________________
Teacher:_________________
Exercise 1.2.2
Direction: Identify each statement as having discrete or continuous data. Write D
for discrete and C for continuous.
_____________1. Among the 1,500,000 microcomputer chips made by Motocolla, 2
are found to be defective.
_____________2. Yesterdays records show that 25 students were absent.
_____________3. Radar on EDSA indicated that the driver was going 150 kph when
ticketed for speeding.
_____________4. The amount of time that a taxi driver spends yielding to individual
pedestrians each year is 2.367 seconds.
_____________5. Upon completion of a diet and exercise program, Elmer weighed
12.37 lbs. less than when he started the program.
Exercise 1.2.3
Direction: Identify which of the following quantitative data would be presented by
a discrete variable or a continuous variable. Write DV for discrete variable and
write CV for continuous variable.
_____________1. Number of students
_____________2. Time (in minutes) to finish an exam
_____________3. Distance (in km.) of school from place of residence
_____________4. Length (in cm) of fish caught
_____________5. Width of the newest brand of cellular phone
_____________6. Percentage increase in enrolment this year
_____________7. Number of enrollees
_____________8. Monthly income of 100 randomly selected persons at KADIWA
Market
_____________9. Sum of points in tossing a pair of dice
_____________10. Lifetime (in years) of televisions produced by ZONY
2009
Name:________________________________
Course/Year/Section:____________________
Score:___________________
Teacher:_________________
Exercise 1.3
8
2009
At what level are the following variables measured? Write nominal, ordinal, interval or
ratio on the space provided.
___________________1. Student number
___________________2. Weights of a sample of candies
___________________3. Zip codes
___________________4. SSS number
___________________5. Final course grades of 1.0,1.25,1.50,.
____________________6. Instructors rated as superior, above average, average, below
average, or poor
____________________7. Movies listed according to their genre such as, comedy, adventure,
romance, action, suspense, or horror
___________________8. Lengths of TV commercials ( in seconds)
___________________9. The years 1896, 2000, 1776,1995
___________________10.Attitude toward gun laws such as favorable, somewhat favorable,
somewhat unfavorable..
___________________11.Zip codes
___________________12. Ideal number of children
___________________13. Family Income
___________________14. Candidate voted for in 2002 barangay elections
___________________15. Tax Identification Number
___________________16. Gender
___________________17. Average number of glasses of water consumed per day
___________________18. Blood pressure
___________________19. Height of students
___________________20. Number of clients
___________________21. Number of won cases in court
___________________22. Academic rank in High School
___________________23. Savings Account Number
___________________24. Are you a Pag-Ibig Member?(Yes/No)
___________________25. Number of books sold per day
___________________26. Weekly allowance of CvSU-CBE
___________________27. Main source of income
___________________28. Birth order in the family
___________________29. Number of organizations involved in
___________________30. Car plate number
CHAPTER 2:
2009
2009
Questions must not be vague. The question should indicate clearly the
manner how the answers must be given.
Example of a precise question:
a. In terms of mathematical ability, do you think male and female are equal?
Example of a vague question:
a. Do you think male and female are equal?
4. Good questionnaires lend themselves to easy analyses.
TWO CATEGORIES OF SURVEY QUESTIONS
1. Open question- allows free response.
Example:
What do you think can be done to reduce crime?
2. Closed question- allows only a fixed response.
Example:
Which of the following approaches would be the most effective in reducing crime?
Choose one.
A. Get parents to discipline more.
B. Correct social and economic conditions in slums.
C. Improve rehabilitation efforts in jails.
D. Give convicted criminals tougher sentences.
E. Reform courts.
TYPES OF DATA
1. Primary Data- are information collected from an original source of data, which is
first-hand in nature. Examples are data collected from interviews and surveys.
2. Secondary Data- are information collected from published or unpublished sources
like books, newspapers, and thesis.
2009
2. The method of data collection used may expedite or delay the process. Avoid a
medium that would produce low response rates.
3. Ensure that the sample size is large enough for the required purposes.
4. Ensure that the method used to collect data actually results in a sample that is
representative of the population.
METHODS OF DATA COLLECTION
1. Direct or Interview Method
The direct or interview method of data collection use at least two persons (an
interviewer and interviewee/s) exchanging information. This method will give us
precise and consistent information because clarifications can be made. Also,
questions not fully understood by the respondent, the interviewer could repeat the
question until it suits the interviewees level. However, this method is time
consuming, expensive and has limited field coverage.
2. Indirect or Questionnaire Method
This is a method where written answers are given to prepared questions. This
method requires less time and is inexpensive since the questionnaires can be
mailed or hand-carrried. Also, this will give a respondent a sense of freedom in
honestly answering the questions because of anonymity.
3. Registration Method
This is a method enforced by certain laws.
4. Observation Method
This is a method, which observes the behavior of individuals or organizations in the
study. This is also used when the respondents cannot read nor write.
5. Experiment Method
This method is used when the objective of the study is to determine the cause and
effect of certain phenomena or event.
Name:_________________________________
Course/Year/Section:_____________________
Teacher:_________________
12
Score:___________________
2009
Exercise 2.1
A. Answer the following questions:
1. What are the advantages and disadvantages of open questions?
2. What are the advantages and disadvantages of closed questions?
3. Choose the letter that corresponds to an unbiased option.
A. Anyone is free to smoke cigarettes if he wants to.
__________ Agree
_________Disagree
B. No sane person will burn money on cigarettes.
__________ Agree
_________Disagree
C. Smoking may be dangerous to your health.
__________ Agree
_________Disagree
4. Change the following questions to make it simpler and clearer.
A. What is your opinion about beauty contest being demeaning to
womanhood and against the vaunted equality of the sexes?
B. What is your mass measure in metric units?
5. Determine which of the following questions is best to ask about someones
health?
A. Are you a weakling?
B. Are you bursting over with health?
C. Can you be an advertisement for a health club?
D. Are you in good health?
6. To find out how much someone likes music, which question is best to ask?
A. Do you go for the present trend-all noise and sound?
B. How much time do you spend listening to the radio?
C. Do you prefer popular music, jazz, classics, or rock?
D. Do you listen to boring classical tunes?
B. From the situations given below, identify the possible errors or difficulties that
may be encountered during data collection.
1. To research recognition of a certain brand of deodorant, you plan to
conduct a telephone survey of 1000 consumers in the Philippines. What is
wrong with using telephone directories as the population from which the
sample is drawn?
2. A group of college students conduct a survey in an attempt to determine
the typical annual salary of the schools alumni. Would alumni with very low
salaries be likely to respond? How would this affect the result? Identify one
other factor that might affect the result.
3. Assume that you are hired by a company to collect data on the waist sizes
of college students. Why is it better to obtain direct measurements than to
ask people sizes of their waists?
C. Carrying out a survey
13
2009
1. Get a partner/group mate. Work together to write a plan for a survey using
questionnaire method.
2. Think of any topic of your interest and construct a questionnaire.
3. The plan should include:
The questionnaire
4. Submit the plan and the questionnaire to your teacher for approval. Finalize
the questionnaire.
5. Prepare the necessary survey forms and conduct the survey.
2.2.1 SAMPLING
Concept of Sampling
Sampling is the process of selecting units, like people, organizations, or
objects from a population of interest in order to study and fairly generalize the
results back to the population from which sample was chosen.
ADVANTAGES OF SAMPLING
1. Reduced Cost
14
2009
2. Greater Speed
3. Greater Scope
4. Greater Accuracy
2.2.2 Probability Sampling
A probability sampling method is any method of sampling that utilizes
some form of random selection. In order to have a random selection method,
you must set up some process or procedure that assures that the different units
in your population have equal probabilities of being chosen. Humans have long
practiced various forms of random selection, such as picking a name out of hat,
or choosing the short straw. These days we tend to use computers as the
mechanism for generating random numbers as the basis for random selection.
Random selection is performed by selecting a group subjects (a sample)
for the study from a larger group (a population). Each individual is chosen
entirely by chance and each member of the population has a known, but possibly
non-equal, chance of being included in the sample. By using a random selection,
the likelihood of bias is reduced.
1. Simple Random Sampling
The simplest form of random sampling is called the simple random sampling.
It is the basic sampling technique where a group of subjects (a sample) is selected
for a study from a larger group (a population). Each individual is chosen entirely
by chance and each member of the population has an equal chance of being
included in the sample. Every possible sample of a given size has the same
chance of selection; i.e. each member of the population is equally likely to be
chosen at any stage in the sampling process. The most common techniques for
selecting simple random sample are by using strips of paper, use of printed table
of random numbers, or use of random numbers generated by many computer
programs or scientific calculators.
2. Stratified Random Sampling
This sampling method involves dividing the population into homogeneous
subgroups and then taking a simple random sample in each group.
2
Types of Stratified Random Sampling
Equal Allocation (EA) - the sample sizes from the different strata are equal.
That is
3= .= nk
2= n
1= n
n
15
EA=
Formula:
2009
n
ni
Formula:
Where:
PA=
Ni
nk ,
from the
k th
stratum is taken.
Ni
n
N
N = population
n = sample size
Example: Given is the enrollment data of CvSU-CBE for first semester, SY 20092010.
PROGRAM
Ni
EA
PA
BSIT
420
BSCS
210
BSBM
300
BSE
40
ABPS
40
BENT
25
BSHRM
400
TOTAL(N)
1435
Select a sample of 200 students using Equal and Proportional Allocation
3. Systematic Random Sampling
Systematic sampling with a random start is a method of selecting a sample by
th
taking every k
unit from an ordered population, the first unit being selected at
16
2009
K=
1
k is the sampling
N
n
N = population
n = sample size
17
2009
when it would be the best way to elicit the views of persons who have specific
expertise.
Quota Sampling
In quota sampling, respondents are selected non-randomly according to some
fixed quota.
Heterogeneity Sampling
Heterogeneity sampling is performed when all opinions or views about a specific
topic are the primary concern and representing these views proportionately is not of
major importance.
Snowball Sampling
In snowball sampling, the process starts by identifying someone who meets the
criteria for inclusion in the study. The respondent is then asked to recommend others
whom they may know who also meet the criteria.
Name:_________________________________
Course/Year/Section:_____________________
Teacher:_________________
Score:___________________
Exercise 2.2
Identify the type of sampling used in the following statements.
1. An engineer selects every 50 th cell phone from the assembly line for careful
testing and analysis.
__________________________________________________________________________
2. A reporter writes the name of each senator on a separate card, shuffles the
cards, and then draws five names.
__________________________________________________________________________
3. The dean at CvSU-CBE surveys all students from each of the 12 randomly
selected classes.
__________________________________________________________________________
4. A reporter obtains sample data from readers who decide to mail in a
questionnaire printed in the latest issue.
__________________________________________________________________________
5. A mathematics professor selects 18 men and 18 women from each of the four
classes.
__________________________________________________________________________
18
2009
2009
Perception of
Leadership Behavior
Strongly Favorable
Favorable
Slightly Favorable
Slightly Unfavorable
Unfavorable
Strongly Unfavorable
CAPTIO
STUBS/CLAS
SES
Frequency
TABLE
10
11
12
14
22
31
TOTAL
100
BODY
TYPES OF FDT:
1. Qualitative or Categorical FDT is a frequency distribution table where the
data are grouped according to some qualitative characteristics; data are
grouped into non numerical categories.
Example:
TABLE 2: Frequency Distribution of the Gender Respondents of a Survey
Gender of Respondents
Frequency
Male
38
Female
62
TOTAL
100
Frequency
20
2009
7-9
10-12
13-15
16-18
2
8
14
19
19-21
TOTAL
50
2009
2. Class Mark (CM) midpoint of the class interval where the observations tend
to cluster about.
1
1
CM = ( +UL )CM = (LTCB+UTCB)
2
2
3. Relative Frequency (RF) the proportion of observations falling in a class
and is expressed in percentage.
frequency
frequency
RF=
%RF=
x 100
N
N
4. Cumulative Frequency (CF) accumulated frequency of the classes.
a. Less than CF (<CF) - total number of observations whose values do not
exceed the upper limit of class.
b. Greater than CF (>CF) total number of observations whose values are
not less than the lower limit of the class.
5. Relative Cumulative Frequency (RCF)
a. Less than RCF (<RCF)
b. Greater than RCF (>RCF)
Example: Construct the FDT of the given data set.
Age (in years) of 40 patients confined at a certain hospital.
5
15
23
27
33
38
44
52
10
20
25
31
35
15
24
30
33
40
45
53
42
50
57
13
21
20
25
31
34
42
45
55
26
32
36
43
51 57
22
1.
2.
3.
4.
It
It
It
It
2009
is accurate.
is clear.
is simple.
has a good appearance.
Score:___________________
Course/Year/Section:_____________________
Teacher:_________________
Exercise 2.3
1. A survey taken at a hotel in Malate indicated that 40 guests preferred the
following means of transportation:
car car bus plane train bus bus plane car plane plane
bus plane car car train train car car car car plane
plane car bus car bus car plane car plane plane car
car car bus train car bus car
Construct a categorical distribution showing the frequencies corresponding
to the different means of transportation. Interpret the results.
23
2009
104
133
104
132
146
129
124
113
95
148
Construct the FDT of the given data set and write a brief report about it.
3. The following are the number of customers a restaurant served for lunch on
60 weekdays:
50
64
55
51
60
41
71
53
63
64
49
59
66
45
61
57
65
62
58
65
55
61
60
55
53
57
58
66
53
56
64
46
59
49
64
60
58
64
42
47
59
62
56
63
61
68
57
51
61
51
60
59
67
52
52
58
64
43
60
62
Construct the FDT of the given data set and write a brief report about it.
4. Construct a graph for the given FDT and write a brief interpretation.
The 2002 Purchases by A Car Rental Agency
CAR MAKER
1. Chevrolet Cavalier
2. Ford Mustang
3. Ford Taurus
4. Pontiac Grand Am
5. Toyota Camry
CHAPTER 3:
NUMBER OF PURCHASES
45
30
60
15
30
24
2009
X i=X 1 + X 2+ X 3+ + X n
i=1
The numbers 1 and n are called the lower and upper limits of summation,
respectively.
Example: Write out the following in full, that is, without summation signs:
6
1. X i
i =1
2. X i Y i
i=1
3. X i
i=4
4. ( X i +Y 2)
i=2
Rules on Summation
Rule # 1: The summation notation is distributive over addition.
n
i=1
i=1
i=1
( X i+ Y i )= X i+ Y i
25
2009
i=1
i=1
i=1
i=1
i=1
i=1
cx i =c x i
Rule #3: If c is a constant, then
n
c =nc
i=1
Examples:
A. Use the rules on summation to write out the expansion of the given
expression:
4
(2 x i +4 )
i=1
X 1=2, X 2=3, X 3 =1
Y 1=1, Y 2=1,Y 3 =0
Find the value of the following expressions:
3
1. X i
i =1
26
2009
2. X i Y i
i=1
3
i=1
i=1
3. X i Y i
4.
i=1
X i+1
Yi
( )
Exercise 3.1
Evaluate the following:
X 1=4, X 2=3, X 3=2, X 4 =1
1. Given:
4
a . X i ( X i3)
2
i=1
b . ( X i+1)2
i=2
2. Given:
3
a . Xi Y i
i=1
b . (3 X i +Y i 4)
i=2
3. Given:
3
a . ( X i + 2 X i Y i +Y i )
2
i=1
b . X i (Y i4)
i=1
27
2009
( )
x =
Xi
/n
i=1
Examples:
1. The numbers of employees at 5 different stores are 4, 8, 10, 12, and 6. Find
the mean number of employees for the 5 stores.
Solution:
5
( )
Xi
x =
i=1
4 +8+10+12+6 40
= =8
5
5
2. Scores in Algebra for the first long quiz for a sample of 10 students are as
follows: 84, 75, 90, 98, 88, 79, 95, 86, 93, and 89.
Solution:
28
2009
If N is even:
XN+XN
d =
+1
Example: Find the median of the given data set: 75, 75, 67, 71, 72
Solution:
X 1=67, X 2=71, X 3=72, X 4 =75, X 5=75
d =X (5 +1) = X 3=72
2
29
2009
It does not always exist, and if it does, it may not be unique. A data
set is said to be unimodal if there is only one mode, bimodal if
there are two modes, trimodal if there are three modes, and so on.
It is not affected by extreme values.
It can be used for qualitative as well as quantitative data.
Examples:
Identify the mode(s) of the following data sets.
Data Set 1.
2
Red
Green
Data Set 2.
2
Data Set 3.
1
Data Set 4.
Red
Blue
Orange
Blue WhiteYellow
Blue
Name:_________________________________
Score:___________________
Course/Year/Section:_____________________
Teacher:_________________
Exercise 3.2
1. The grades of a student on seven examinations were 85, 96, 72, 89, 95, 82
and 85. Find the students average grade.
2. The salaries of 4 employees were P12,000, P10,000, P15,000 and P18,000.
What is the average salary?
3. Out of 100 numbers, 20 were 5s, 40 were 4s, 35 were 6s and 5 were 2s.
Find the mean.
4. Find the median of the set of numbers:
5
3
7
3
8
2
1
5. Find the median of the set of numbers:
30
11
25
2009
18
79
12
13
6. A student received grades of 89, 72, 92 and 35. What is the mode of the
given grades?
7. The reaction times of an individual to certain stimuli were measured by a
psychologist to be 0.23, 0.52, 0.25, 0.52, 0.26, 0.25, 0.39, and 0.22
seconds. Determine the modes of the given reaction times.
8. The numbers of incorrect answers on a true-false test for 15 students were
recorded as follows:
2, 1, 3, 0, 1, 3, 6, 0, 3, 3, 5, 2, 1, 4, 2. Find the
mean, median and mode.
9. The following are the response times in seconds of a smoke alarm after the
release of smoke from a fixed source: 12, 9, 11, 7, 9, 14, 6,10. Find the
mode.
10. A bridge is designed to carry a maximum load of 150,000 pounds. Is the
bridge overloaded if it is carrying 18 vehicles having a mean weight of 4,630
pounds?
11. For three rounds of golf Peter scores 88, 79, and 82. What fourth-round
score would he need to reduce his mean score to 81 for all rounds?
12. The average IQ of 10 students in Math is 114. If 9 of the students have IQ
scores of 101, 118, 128, 106, 115, 99, 118, 109 and 125. What must be the
other IQ?
P2
2009
P99
i(n+1) th
observationthe array
100
10
10
2
P64=
64 (10+1)
100
10
10
th
Therefore ,the 64 th percentile is 9, which isinterpreted as :64 of the scores are below 9.
Other forms of fractiles:
3.3.2 Deciles
Deciles are values that divide the array into 10 equal parts. Thus,
read as first decile, is the value below which 10% of the values fall,
as second decile, is the value which 20% of the values fall,,
ninth decile, is the value below which 90% of the values fall.
Formula:
i ( n+1 )
Di is the value of the
10
th
) observationthe array
32
D9
D2
D1
, read
, read as
2009
Example: From the given set of scores in a quiz, find the 4th decile or
3
11
12
18
D4 .
19
Solution: Since the data is already arranged from lowest to highest then we may
proceed in finding the 4th decile.
3
D4 =
4 (7 +1 )
10
11
12
th
18
19
) observation=3.2the 4 observation
th
Therefore ,the 4th decile is11, this implies that :40 of the scoresthe quiz are below 11.
3.3.3 Quartiles
Quartiles are values that divide the array into 4 equal parts. Thus,
read as first quartile, is the value below which 25% of the values fall,..,
Q1
Q3
read as third quartile, is the value which 75% of the values fall.
Example: From the given set of scores in a quiz, find the 3rd quartile or
3
11
12
18
Q3
19
Solution: Since the data is already arranged in ascending order, then we may
proceed in finding the 3rd quartile.
3
Q 3=
11
12
18
19
3 (7+1) th
observation=6 th observation
4
Therefore ,the 3rd quartileis 18, this implies that :75 of the scoresthe quiz are below 18.
33
2009
Name:_________________________________
Score:___________________
Course/Year/Section:_____________________
Teacher:_________________
Exercise 3.3
1. A psychologist obtained the IQ scores of 10 students. The IQ scores are as
follows:
110 95
85
140 132 100 95
70
85
100
P65 , D 3 , D 9 ,Q3.
Find
Interpret the values.
2. The number of absences in a semester of 25 randomly selected students
were obtained by a teacher: The number of absences were as follows:
3
5
2
6
3
8
2
3
6
2
1
3
8
5
2
1
3
1
0
2
0
1
2
0
0
Q2 , P95 ,D2 .
Find
Interpret the results.
34
2009
CHAPTER 4:
MEASURES OF DISPERSION
Objectives:
At the end of this chapter, the students are expected to:
a.
b.
c.
d.
e.
f.
g.
h.
2009
i
X
/N
i=1
2
=
And the population standard deviation is
i
X
/N
N
i=1
=
The population variance
36
s 2 , where
2009
X ix
/( n1)
n
i=1
s2=
And the population standard deviation can be estimated by the sample standard
deviation s, where
X ix
/(n1)
i=1
s=
Example:
A sample of 5 households showed the following number of household members: 3,
8, 5, 4, and 4. Find the standard deviation.
4.2 Measures of Relative Dispersion
Measures of Relative Dispersion are unitless and are used when one wishes to
compare the scatter of one distribution with another distribution.
4.2.1 Coefficient of Variation
The coefficient of variation, CV, is the ratio of the standard deviation to the
mean and is usually expressed in percentage. It is computed as
s
CV = x 100 CV = x 100
37
2009
Name:_________________________________
Score:___________________
Course/Year/Section:_____________________
Teacher:_________________
Exercise 4.1
1. Find the range of the following data sets:
Data set a: 15, 16, 7, 5, 14, 10
Data set b: 110, 125, 98, 85, 62, 100
2. Find the standard deviation of each set in #1.
3. On a final examination in Statistics, the average grade of 100 students of
DS School was 80 and the standard deviation was 8. In another school, DF
School, the average grade of 120 students was 75. And the standard
deviation was 7.9. In which school was there a greater
a. Absolute deviation
b. Relative deviation
4. Find the coefficient of variation of each set in #3.
5. On 16 days, a restaurant had the following numbers of orders for chicken
and steak:
Chicken:
46
55
43
48
54
65
36
40
51
53
64
32
41
46
53
47
Steak:
39
41
25
30
46
36
37
23
30
33
50
44
41
28
35
37
Calculate the mean, median, mode, standard deviation, variance and CV
and determine which item the number of order is relatively more variable.
38
2009
CHAPTER 5:
PROBABILITY
Objectives:
At the end of this chapter, the students are expected to:
a.
b.
c.
d.
2009
2009
n1
n2
n2
ways.
Example:
1. How many sample points are in the sample space when a pair of dice is
tossed once?
2. Miguel is going to watch a basketball game at the Araneta Coliseum. The
Coliseum has four gates where he can enter. If he cannot pass through the
same gate twice, in how ways can he enter and leave the Coliseum?
3. How many different ways are there to arrange the letters in the word
LOGIC?
5.2.2 Permutation
A permutation is an arrangement of all parts of a set of objects.
Linear Permutation
The number of permutations of n distinct objects is n! (read as n
factorial).
Example: Consider the letters a, b, c. List down the possible permutations.
Permutation of n elements taken r at a time
Theorem: The number of permutation of n distinct objects taken r at a
time is
nPr=
n!
( nr ) !
Example: Two raffle tickets are drawn from 20 for the first and second prizes. Find
the number of sample points in the sample space S.
41
2009
Circular Permutation
The number of permutations of n distinct objects arranged in a circle is (n1)!.
Example: In how many ways can 5 different plants be planted in a circle?
Permutation of Things Not All Different
The number of distinct permutations of n objects of which
n2
kind,
of second kind,
nk
, of a
k th
n1
are one of a
kind is
n!
n1 ! n 2 ! n k !
Example: How many distinct permutations can be made from the letters of the
word Mississippi?
5.2.3 Combination
A combination is a selection of r objects from n without regard to order.
Theorem: The number of combinations of n distinct objects taken r at a time is
nCr=
n!
r ! ( nr ) !
42
2009
Name:_________________________________
Score:___________________
Course/Year/Section:_____________________
Teacher:_________________
Exercise 5.2
1. How many numbers consisting of 5 digits can be made from 1, 2, 3, 4, 5, 6,
7, 8, 9?
2. In how many ways can 5 people be seated on a sofa, if there are only three
seats available?
3. In how many ways can 6 children be seated in a round table?
4. An organization sponsored a raffle where they were to sell all 500 tickets
sequentially numbered from 001 to 500. How many of the tickets
a. Has the same three digits
b. Ends in 3
c. Contain the digits 1, 2, 3 (not necessarily in that order)
5. The geographical distribution of the hometown of 80 students of CvSU-CBE
is given as: 50 from Luzon, 10 from Visayas, and 20 from Mindanao. How
many ways can three students be selected at random such that
a. All of them come from any three places
b. No student comes from Luzon and Mindanao
6. In how many ways can 10 individuals be selected from 25 individuals?
7. If a student can answer any 6 questions from an exam with 10 questions,
how many ways can he answer the exam?
8. If there are 9 horses in a race, in how many ways can they finish first,
second, and third?
43
2009
44
2009
P ( )=1
P ( ) =0
Example: A coin is tossed twice. What is the probability that at least one head
occurs?
Theorem: If an experiment can result in any one of N different equally likely
outcomes and if exactly n of these outcomes corresponds to event A, then the
probability of event A is:
P ( A )=
n
N
45
2009
Example: If a card is drawn from an ordinary deck, find the probability that it is a
heart.
Solution: Let H be the event of obtaining a heart
P (H )=
N= 52 cards
13 1
=
52 4
n= number of hearts
Therefore, the probability that the card is heart, is
1
4 .
P ( A B )=P ( A ) + P ( B )P( A B) .
Example:
1. The probability that a student passes History is
passes English is
2
3 , and the probability that he
4
9 . If the probability of passing at least one course is
2
3
4
9
46
4
5
4
5 ,
2009
P ( H E ) =P ( H ) + P ( E )P ( H E)
2 4 4
+
3 9 5
P (H E)=
14
45
2. What is the probability of drawing a red or a face card if a card is selected from an
ordinary deck of 52 cards?
Solution:
26
52
12
52
6
52
P ( R F )=
26 12 6
+
52 52 52
32 16 8
52 26 13
8
13 .
Corollary 1:
If A and B are mutually exclusive events, then
P ( A B )=P ( A ) + P ( B ) .
Corollary 2:
If
A 1, A 2 , A 3 , .. , A n
P ( A 1 A2 A 3 .. An ) =P ( A 1 ) + P ( A 2 ) + P ( A 3 ) ++ P ( A n )=P ( S )=1
47
2009
Theorem:
If A and
Ac
P ( A )+ P ( A c )=1
Exercises:
1. What is the probability of getting a total of 7 or 11 when a pair of dice is tossed?
2. What is the probability of drawing an Ace or a King, if a card is randomly chosen
from a deck of 52 cards?
3. The probability of passing History is
2
3 , what is the probability of failing the
subject?
Conditional Probability
Definition: The probability of an event B occurring when it is known that some event A
has occurred is called a conditional probability. It is defined by:
P ( B| A )=
P ( A B)
if P( A)> 0.
P( A)
13
52
4
52
P ( A|S )=
P( A S ) 1/52
1
=
=
P(S)
13 /52 13
48
2009
Definition: Two events are said to be independent if any one of the following conditions
are satisfied:
P(A
|B )=P ( A ) if P ( B ) >0
P ( B| A )=P ( B ) if P ( A ) >0
P ( A B ) =P ( A ) P ( B )
1
4 . The probability that Rose will correctly answer the same question is
4
5 . Find
the probability that both will answer the question correctly, assuming that they do not
copy from each other.
Solution:
1
4
4
5
*events are said to be independent since they do not copy from each other.
1 4 4 1
P ( J R ) =P ( J ) P ( R )= =
4 5 20 5
Therefore, the probability that Jack and Rose will both correctly answer the toughest
question is
1
5 .
Name:_________________________________
Score:___________________
Course/Year/Section:_____________________
Teacher:_________________
Exercise 5.3
1. If two dice are thrown and the sum of dots are noted, what is the probability
that the sum is:
a. 5
b. 11 or 8
49
2009
c. At least 10
d. 5 and 6
e. Even?
2. If a card is drawn at random from a deck of 52 cards, what is the probability
that the card is :
a. Red
b. Even
c. Face card
d. Red or face
e. Red and face?
3. An organization sponsored a raffle where they were to sell all 500 tickets
sequentially numbered from 001 to 500. Find the probability that the
winning number
a. Has the same three digits
b. Ends in 3
c. Contain the digits 1,2,3 (not necessarily in that order)
4. The geographical distribution of hometown of 80 students of CvSU-CBE is
given as: 50 from Luzon, 10 from Visayas, and 20 from Mindanao. Suppose
three students are selected. Find the probability that
a. They all come from any of the three places
b. No student comes from Luzon and Mindanao
5. A poker hand consists of 5 cards, the order is conventionally disregarded
from a well-shuffled deck of 52 cards. What is the probability of having
a. 3 Kings and 2 Queens
b. 4 red cards and 1 black card
c. 4 aces and a Jack
6. The probability that a student will pass Statistics is 0.50 and the probability
that he will pass English is 0.80. The probability that he will pass both is
0.60. What is the probability that the student will pass at least one of the
two subjects?
7. What is the probability that an even number or a 3 will appear when a fair
die is tossed?
8. The probability that it will rain tomorrow is 0.73. What is the probability that
it will be sunny tomorrow?
9. A box contains 6 red and 4 black balls. Two balls from the box are drawn one
at a time without replacement. What is the probability that the second ball
is red if it is known that the first is red?
10. The balls used in selecting the numbers for bingo carry the numbers
1,2,3,75. If one of the balls is selected at random, what is the probability
that it is a number less than or equal to 15?
50
2009
11. If three of twenty tires are defective and four of them are randomly
chosen for inspection, what is the probability that one of the defective tires
will be included?
12. If an airlines records show that 468 of 600 of its jets from San Francisco
to Phoenix arrived on time, what is the probability that any one of the
airlines jets from San Francisco to Phoenix will not arrive on time?
13. Harry Potter is faced with the problem of opening a safe with 10 buttons
numbered from 0 to 9 without using his magic. The safe can be opened by
pressing three buttons, not necessarily distinct, in correct order.
a. What is the probability that Harry will hit the right combination?
b. Realizing that the probability of getting the right combination is too small
Harry thought of a way of swinging the odds in his favor. He pulverized
the lead from a pencil and blew the powder onto the buttons, revealing
three buttons that have been pressed many times before. What is the
probability that Harry will be able to open the safe in a single trial?
14. A bowl contains 15 red beads, 30 white beads, 20 blue beads, and 7 black
beads. If one of the beads is drawn at random, what are the probabilities
that it will be
a. Red beads
b. White or blue beads
c. Black beads
d. Neither white nor black
15. If the probabilities are respectively, 0.92, 0.33 and 0.29 that a person
vacationing in Boracay, will visit Palawan, or both, what is the probability
that a person vacationing there will visit at least one of the two areas?
16. The probabilities that a student will get an A or a B or a C in a Math
course is 0.09, 0.15, and 0.53. What is the probability that a student will get
a grade lower than C?
17. If the probability of passing Statistics is 0.65, what is the probability of
failing the subject?
18. If a coin is tossed twice, what is the probability of getting a head on the
second toss if the outcome on the first toss is also a head?
19. A jeepney driver buys spare part X and Y for his vehicle at a bargain shop.
The probability that X is defective is 0.10 and the probability that Y is
defective is 0.05. Determine the following probabilities:
a. Both parts are defective
b. Both parts are good
c. X is good, Y is defective
20. A box contains 12 black balls and 8 white balls. Two balls are drawn in
succession.
51
2009
a. Find the following probabilities if the balls are drawn with replacement:
a.1. A black ball is drawn then another black ball is drawn.
a.2 A black, then a white ball is drawn.
a.3 Two white balls are drawn.
a.4 Both balls are of the same color.
b. Answer the questions in (a) if the balls are drawn without replacement.
CHAPTER 6:
PROBABILITY DISTRIBUTION
Objectives:
At the end of this chapter, the students are expected to:
52
2009
S= { RR , BB , RB, BR }
X ={ 0,1,2 }
3. Let W be a random variable giving the number of heads minus the number of tails
in three tosses of a coin. List the elements of a sample space for the three tosses
of the coin and to each sample point assign a value for the random variable W.
53
2009
Solution:
S = { HHH , HHT , HTH ,THH , HTT ,THT , TTH ,TTT }
W = {3,1,1,3 }
If a sample space contains a finite number of possibilities or an unending sequence
with as many elements as there are whole numbers, it is called a discrete sample space.
A random variable defined over a discrete sample space is called discrete random
variable.
If a sample space contains an infinite number of possibilities equal to the number
of points on a line segment, it is called continuous sample space. A random variable
defined over a continuous sample space is called continuous random variable.
Example: Determine the following whether discrete or continuous random variable.
1. A pair of dice is rolled and X is a random variable that represents the sum of the
spots on the two dice.
2. An economist is interested in the random variable C, the number of persons filing
for employment at the Labor Office.
3. The proprietor of a hamburger franchise is interested in the volume of Coke sold
per day at her franchise.
4. A businessman is interested in the random variable Q, the total overtime
accumulated by his employees each week.
5. An experiment consists of observing the random variable L, where L is the length
of time that it takes for IT student to finish a major examination.
Discrete and Continuous Probability Distribution
A probability distribution is a listing of all the outcomes of an experiment and the
probability associated with each outcome. It has the following characteristics:
1. The probability of a particular outcome is between 0 and 1, inclusive.
2. The sum of the probabilities of all mutually exclusive outcomes is 1.
Definition: The table or formula listing of all the possible values that a random variable
can take on, along with the associated probabilities, is called discrete probability
distribution.
Note: The probabilities associated with all possible values of a discrete random variable
must sum to 1.
Example:
54
2009
1. A coin is tossed three times. List down the elements of the sample space. List down
the possible values of a random variable Y, the number of heads that fall.
Solution:
S = { HHH , HHT , HTH ,THH , HTT ,THT , TTH ,TTT }
Note:
P (Y =0 ) =
1
8
P (Y =1 )=
3
8
P (Y =2 )=
3
8
P (Y =3 )=
1
8
1 3 3 1
+ + + =1
8 8 8 8
2. Let W be a random variable giving the number of heads minus the number of tails
in three tosses of a coin. List the elements of a sample space for the three tosses
of the coin and to each sample point assign a value for the random variable W.
Solution:
S = { HHH , HHT , HTH ,THH , HTT ,THT , TTH ,TTT }
W = {3,1,1,3 }
The discrete probability distribution of the random variable W is:
P (W =3 )=
1
8
P (W =1 )=
3
8
55
Note:
2009
P (W =1 )=
3
8
P (W =3 ) =
1
8
1 3 3 1
+ + + =1
8 8 8 8
Name:_________________________________
Score:___________________
Course/Year/Section:_____________________
Teacher:_________________
Exercise 6.1
1. Construct the probability distribution of the sum of the numbers when a pair
of dice is tossed.
DISCRETE PROBABILITY DISTRIBUTION
BINOMIAL DISTRIBUTION
An experiment often consists of repeated trials, each with two possible
outcomes, which may be labeled success or failure. This is true in flipping of a
coin 5 times, where each trial may result in a head or tail. We may choose to
define either outcome as a success. It is also true if 5 cards are drawn in
succession from an ordinary deck and each trial is labeled success or failure,
depending on whether the card is red or black. Experiments of this type are
known as binomial experiments.
A binomial experiment is one that possesses the following properties:
1. The experiment consists of n repeated trials.
2. Each trial results in an outcome that may be classified as a success or a failure.
3. The probability of a success, denoted by p, remains constant from trial to trial.
4. The repeated trials are independent.
DEFINITION:
56
2009
BINOMIAL DISTRIBUTION
If a binomial trial can result in a success with probability p and a failure with
probability q= 1-p, then the probability distribution of the binomial random
variable X, the number of successes in n independent trials is
n
b(x; n, p) x p x qn x
for x=0,1,2,.,n.
SAMPLE PROBLEM:
1. Find the probability of obtaining exactly three 2s if an ordinary die is tossed
5 times.
Solution:
1
5
From the given problem, n = 5, x = 3, p= 6 , q= 6
1
1 3 5 2
(
b(3;5; 6 ) = 5C3 6 ) ( 6 ) =
0.032
15C 10(0.4)10 (0.6)5 +15 C 11( 0.4)11 (0.6)4 +15 C 12(0.4)12 (0.6)3 +15 C 13(0.4)13( 0.6)2+15 C 14 ( 0.4
= 0.034
Therefore, the probability that at least 10 people will survive is 0.034.
57
2009
15C 3(0.4)3 (0.6)12 +15 C 4 ( 0.4 )4 (0.6)11 +15 C 5(0.4)5 ( 0.6)10 +15 C 6 (0.4)6 (0.6)9 +15 C 7( 0.
0.88
Thus, the probability that from 3 to 8 patients will survive is 0.88.
c.
POISSON DISTRIBUTION
Experiments yielding numerical values of a random variable X, the number
of outcomes occurring during a given time interval or in a specified region, are
often called Poisson experiments. The given time interval may be of any
length, such as a minute, a day, a week, a month, or even a year. Hence a Poisson
experiment might generate observations for the random variable X representing
the number of telephone calls per hour received by an office, or the number of
days school is closed due to snow during winter. The specified region could be a
line segment, an area, a volume, or perhaps a piece of material. In this case X
might represent the number of typing errors per page, or the number of bacteria
in a given culture.
A Poisson experiment is one that possesses the following properties:
1. The number of outcomes occurring in one time interval or specified region is
independent of the number that occur in any other disjoint time interval or
region of space.
2. The probability that a single outcome will occur during a very short time
interval or in a small region is proportional to the length of the time interval or
the size of the region and does not depend on the number of outcomes
occurring outside this time interval or region.
3. The probability that more than one outcome will occur in such a short time
interval or fall in such a small region is negligible.
58
2009
them by a symbol
DEFINITION:
POISSON DISTRIBUTION
The probability distribution of the Poisson random variable X, representing
the number of outcomes occurring in a given time interval or specified region, is
p x;
e x
x!
for x = 0,1,2,..,
Where is the average number of outcomes occurring in the given time interval or
specified region and e=2.71828.
EXAMPLE:
1. The average number of days school is closed due to snow during the winter
in a certain city in the eastern part of United States is 4. What is the
probability that the schools in this city will close for 6 days during a winter?
Solution:
Given: =4, X =6
P ( X=6 )=
e4 46
=0.10
6!
Thus, the probability that the schools in the United States will close in 6
days due to winter, is 0.10.
2. The average number of field mice per acre in a 5-acre wheat field is
estimated to be 10. Find the probability that a given acre contains 12 mice.
59
2009
Solution:
Given:
10
P ( X=12 )=
=10, X=12
12
10
=0.095
12!
Therefore, the probability that a 5-acre wheat field contains 12 mice is 0.095.
Name:_________________________________
Score:___________________
Course/Year/Section:_____________________
Teacher:_________________
Exercise 6.1.1
1. A baseball players batting average is 0.250. What is the probability that he
gets exactly 1 hit in his next 5 times at bat?
2. A multiple-choice quiz has 15 questions, each with 4 possible answers of
which only 1 is the correct answer. What is the probability that sheer
guesswork yields from 5 -10 correct answers?
3. The probability that a patient recovers from a delicate heart operation is
0.9. What is the probability that exactly 5 of the next 7 patients having this
operation survive?
4. A study conducted at George Washington University and the National
Institute of Health examined national attitudes about tranquilizers. The
study revealed that approximately 70% believe that tranquilizers dont
really cure anything, they, they just cover up the real trouble. According to
this study, what is the probability that at least 3 of the next 5 people
selected at random will be that tranquilizers actually do cure the problem
rather than just cover it up?
5. On the average a certain intersection results in 3 traffic accidents per
month. What is the probability that in any given month at this intersection
(a) exactly 5 accidents will occur? (b) less than 3 accidents will occur?
6. A secretary makes 2 errors per page on the average. What is the probability
that on the next page she makes (a) 4 or more errors? (b) no error?
7. A certain area of the eastern United States is, on the average, hit by 6
hurricanes a year. Find the probability that in a given year this area will be
hit by (a) fewer than 4 hurricanes; (b) anywhere from 6 to 8 hurricanes.
60
2009
6.2
-3
-2
-1
61
2009
2
1
f ( x )=
e
2
< x <
for
and
for
constants
and
where
X N ( , 2) .
62
2009
X 1 X 2
X 1
X
Z 2= 2
Therefore,
Examples:
P ( Z <1.64 )=0.0505
b.
c.
P ( Z <0.08 )=0.5319
d.
P ( Z <a ) =0.8413
a=1.00
63
b.
2009
P ( Z >a ) =0.0071
1P ( Z <a )=0.0071
10.0071=P ( Z< a )
0.9929=P ( Z< a )
a=2.45
c.
probabilities:
a.
P( X <3)
b.
P(X >2.5)
Solution:
Given:
a.
=5, 2=4
P( X <3)
=2
c.
P(7.2< X <7.5)
Z=
2009
X 35 2
=
= =1
2
2
P ( Z <1.00 )=0.1587
Therefore, the
b.
P ( X <3 )=0.1587
P ( X >2.5 )
Z=
2.55
=1.25
2
c.
P ( X >2.5 )=0.8944
P (7.2< X <7.5 )
Z 1=
7.25
7.55
=1.10Z2 =
=1.25
2
2
65
2009
P ( X 100 )
Z=
10084
=1.33
12
X
; X is unknown
X =Z +
66
2009
Z=
6084
=2.00
12
P ( Z <2.00 )=0.0228
The number of students who got a score lower than 60 is equal to the
product of the probability and the total number of students.
( 0.0228 ) ( 654 )=14.9115
Therefore , out of 654 students who took the examination ,15 students got score below 60.
67
2009
Name:_________________________________
Score:___________________
Course/Year/Section:_____________________
Teacher:_________________
Exercise 6.2
1. Let Z be a standard normal variable. Find the following probabilities:
a.
P ( Z <1.96 )
b.
P ( Z >1.23 )
c.
P ( Z >0.15 )
d.
P (1.34< Z <0.00 )
2=25,
2009
CHAPTER 7:
ESTIMATION
Objectives:
At the end of this chapter, the students are expected to:
1. Demonstrate knowledge and understanding of the basic concepts of
estimation;
2. Compute and interpret the estimates of the mean, variance and standard
deviation; and
3. Relate knowledge of estimation to real life situations.
Basic Concepts of Estimation
Definition of terms:
Estimator- any statistic whose value is used to estimate an unknown parameter.
Estimate- a realized value of an estimator.
Point Estimate- a single value used to represent the parameter of interest.
Interval Estimator- a rule that tells us how to calculate two numbers based on a
sample data, forming an interval within which the parameter is expected to lie.
The pair of numbers (a,b) is called interval estimate or confidence interval.
69
2009
( 1 ) .
Parameter
Statistic
n
Xi
Population mean ()
x = i=1
n
X ix
/(n1)
i=1
s=
known variance
<
2 n
x Z
70
x + Z
2
2009
Note:
For small samples selected from nonnormal populations, we cannot expect our
degree of confidence to be accurate. However, for small samples of size n 30 ,
regardless of the shape of most population, sampling theory guarantees good
results.
To compute a
( 1 ) x 100
is
n 30.
Example:
A survey of the delivery time of 100 orders worth P20,000 from WILLIAMS
PIZZA yielded a mean of 55 minutes with a standard deviation of 12 minutes.
Assuming that the delivery time follow a normal distribution, construct a 95%
confidence interval for the true mean.
Solution:
Given:
x =55 minutes,
<
2 n
x Z
x + Z
2
55 ( 1.96 ) (
12
12
) <
55+ ( 1.96 ) (
)
100
100
we obtained:
552.352< <55+2.352
52.648<< 57.352
71
= 5%
2009
Conclusion: The WILLIAMS PIZZA is 95% confident that the true mean delivery
time is between 52.648 minutes and 57.352 minutes.
If
( 1 ) x 100
2 n
Example:
The heights of a random sample of 50 college students showed a mean of
174.5 cm and a standard deviation of 6.9 cm. What can we assert with 98%
confidence about the possible size of our error if we estimate the mean height of
all college students to be 174.5?
Solution:
Given:
x = 174.5 cm,
= 2%
Conclusion: We can therefore conclude that we are 98% confident that the
sample mean differs from the true mean height by 2.27 cm.
72
If
2009
( 1 ) x 100
error will not exceed a specified amount e when the sample size is
n=( Z
2
2
) .
e
Example:
The monthly wage of new employees at a certain broadcasting company is
said to follow a normal distribution with a standard deviation of P1,000. How large
sample would be needed to be 99% confident that the sample mean will be within
P300 of the true mean.
Solution:
=P1,000 ,
Given:
e=P 300 ,
= 1%
( )
n= Z
2
by substitution:
2
2
1000
n= Z .01 (
) = [ ( 2.575 ) (3.33) ] =73.53 74
300
2
is unknown
sample of size
variance
2 ,
n<30
( 1 ) x 100
73
2009
s
s
< < x +t
n
2 n
x t
2
where
t
2
v =n1
degrees of freedom.
Solution:
Given:
x =3.6 milligrams ,
x t
2
s
s
< < x +t
n
2 n
with
= 1%
by substitution:
3.6 ( 3.50 ) (
0.9
0.9
) < <3.6+ ( 3.50 ) (
)
8
8
we obtained:
3.60.3182< <3.6+ 0.3182
3.2812<<3.9182
Conclusion: Therefore we can conclude that we are 99% confident that the true
average nicotine content of a certain brand of cigarette is within 3.2818
milligrams and 3.9182 milligrams.
74
2009
Name:_________________________________
Score:___________________
Course/Year/Section:_____________________
Teacher:_________________
Exercise 7.
1. An electrical firm manufactures light bulbs that have a length of life that is
approximately normally distributed, with a standard deviation of 40 hours. If
a random sample of 30 bulbs has an average life of 780 hours, find a 96%
confidence interval for the population mean of all bulbs produced by this
firm. How large a sample is needed if we wish to be 96% confident that our
sample mean will be within 10 hours of the true mean?
2. The contents of 7 similar containers of sulfuric acid are 9.8, 10.2, 10.4, 9.8,
10.0, 10.2 and 9.6 liters. Find a 95% confidence interval for the mean
content of all such containers, assuming an approximate normal distribution
for container contents.
75
2009
3. A random sample of 100 PUJ (Public utility jeep) shows that a jeepney is
driven on the average 24,500 km per year, with a standard deviation of
3,900 km.
a. Construct a 99% confidence interval for the average number of kilometer
a jeepney is driven annually.
b. What can we assert with 99% confidence about the possible size of our
error if we estimate the average number of km driven by jeepney drivers
to be 23,500 km per year?
4. Suppose that the time allotted for commercials on a primetime TV program
is known to have a normal distribution with a standard deviation of 1.5
minutes. A study of 35 showings gave an average commercial time of 10
minutes. Compute for the maximum error. Construct a 95% confidence
interval for the true mean.
5. A random sample of 12 female students in a certain dorm showed an
average weekly expenditure of P750 for snack foods, with a standard
deviation of P175. Construct a 90% confidence interval for the average
amount spent each week on snack foods by female students living in this
dormitory, assuming the expenditures to be approximately normally
distributed.
6. The mean and standard deviation for the quality grade point averages of a
random sample of 28 college seniors are calculated to be 2.6 and 0.3
respectively. Find the 95% confidence interval for the mean of the entire
senior class. How large a sample is required if we want to be 95% confident
that our estimate of is not off by more than 0.05?
7. To estimate the average serving time at a fast food restaurant, a consultant
noted the time taken by 40 counter servers to complete a standard order
(consisting of 2 burgers, 2 large fries and 2 drinks). The servers averaged
78.4 seconds with a standard deviation of 13.2 seconds to complete the
orders. What can the consultant assert with 95% confidence about the
maximum error if he uses x =78.4 seconds as an estimate of the true
average time required to complete this standard order?
8. A company surveyed 4400 college graduates about the lengths of time
required to earn their bachelors degrees. The mean is 5.15 years, and the
standard deviation is 1.68 years. Based on these sample data, construct the
99% confidence interval for the mean time required by all college
graduates.
9. In a time-use study, 20 randomly selected managers were found to spend
an average of 2.4 hours each day on paperwork. The standard deviation of
76
2009
77