Professional Documents
Culture Documents
CHAPTER 1
INTRODUCTION TO STATISTICS
LATEXVersion
1.1 Introduction
(a) In plural sense (as Statistical Data): statistics are the raw data themselves , like statis-
tics of births, statistics of deaths, statistics of plants, statistics of students, statistics
of imports and exports, etc.
(b) In singular sense (as Statistical Methods): statistics is the subject that deals with the
collection, organization, presentation, analysis and interpretation of data.
(1) Descriptive statistics: Descriptive statistics are concerned with collecting, summa-
rizing and describing the characteristics of data
– With descriptive statistics we are only concerned with the data collected and
make no effort to generalize it to any other data, such as for the population
(2) Inferential statistics In inferential statistics we select a random sample and we use
the information from it to make generalization about the population from which the
sample was taken. Inferential statistics generalizes from sample to populations, per-
forming estimations and hypothesis tests, determining relationships among variables,
and making predictions.
1
CHAPTER 1 PROBABILITY AND STATISTICS
Stages in statistical investigation: The common stages or steps in any statistical inves-
tigation are.
(1) Collection of data: the process of measuring, gathering, assembling the raw data up
on which the statistical investigation is to be based. Data can be collected in a variety
of ways; one of the most common methods is through the use of survey. Survey can
also be done in different methods, three of the most common methods are:
• Structured Questionnaire
• Telephone survey
• Mailed questionnaire
• Personal interview
(2) Organization of data: Summarization of data in some meaningful way, e.g table
form
(4) Analysis of data: The process of extracting relevant information from the summa-
rized data, mainly through the use of elementary mathematical operation.
(5) Inference: the interpretation and further observation of the various statistical mea-
sures through the analysis of the data by implementing those methods by which con-
clusions are formed and inferences made.
(1) Population: A collection of individuals or items about which we want to draw Con-
clusions. The population represents the target of an investigation, and the objective
of the investigation is to draw conclusions about the population hence we sometimes
call it target population.
Examples:
2
CHAPTER 1 PROBABILITY AND STATISTICS
(2) Census : The collection of information from the whole population or it’s a complete
enumeration of the population.
(3) Sample : A selection of information from a subset of the population. Selected using
some pre-defined sampling technique in such a way that they represent the population
very well.
(5) Statistic : A quantity calculated from data gathered from a sample. It’s Character-
istic or measure obtained from a sample. It is usually used to estimate a population
parameter.
(6) Sampling : The process or method of sample selection from the population.
(7) Sample size: The number of elements or observation to be included in the sample.
(9) Random sample : Is a sample which is randomly selected from a population where
every individuals items having the same chance of being selected.
3
CHAPTER 1 PROBABILITY AND STATISTICS
Example 1.1: When examining the mean age of all first year students in a given collage,
the mean age found would be a parameter. If we took a random sample of 300 first year
students in that collage, then the mean age would be a statistic.
Exercise 1.1: A business is considering purchasing newly produced light bulbs it will make
the purchase if no more than 1.5% of the bulbs are defective. Because of time factors in
testing all 40,000 bulbs the business decides to test a random sample of 400 for defects.
They will then use the results of this sample to estimate the percentage of defectives for the
population to be purchased.
Applications of Statistics: In this modern time, statistical information plays a very im-
portant role in a wide range of fields. Today, statistics is applied in almost all fields of human
endeavor.
For Decision Making: Statistics helps to enhance the power of decision making in
the face of uncertainty by providing sufficient information.
In Public Health and Medicine: Statistical methods are used for computation and
interpretation of birth and death rates.
4
CHAPTER 1 PROBABILITY AND STATISTICS
Uses of Statistics: The main function of statistics is to enlarge our knowledge of complex
phenomena. The following are some uses of statistics:
Data reduction.
Limitations of Statistics: As a science statistics has its own limitations. The following
are some of the limitations:
Deals with only aggregate of facts and not with individual data items.
Statistical results are true on average; i.e. Laws of statistics are not universally true
like the laws of physics, chemistry and mathematics.
5
CHAPTER 1 PROBABILITY AND STATISTICS
Discrete variables: assume only certain values, and there are usually ”gaps”
between the values, discrete variables can assign values such as 0, 1, 2, 3 and are
said to be countable. Some examples are
• Weight of a person
• Distance covered by athlete
6
CHAPTER 1 PROBABILITY AND STATISTICS
Based on Scales of Measurement: Proper knowledge about the nature and type of data
to be dealt with is essential in order to specify and apply the proper statistical method for
their analysis and inferences. Measurement scale refers to the property of value assigned to
the data based on the properties of order, distance and fixed zero.
1. Nominal
2. Ordinal
3. Interval
4. Ratio
Country name
Country code
7
CHAPTER 1 PROBABILITY AND STATISTICS
Ordinal level of measurement: Data measured at this level can be placed into categories,
and these categories can be ordered, or ranked. Precise differences between the ranks do not
exist. Some examples are
Interval scales of measurements: are measurement systems that possess the properties
of Order and existence of precise differences between units, but not the property of fixed
zero, Interval scales are numerical scales in which intervals have the same interpretation
throughout. some examples are
IQ
The difference between 30o F and 40o F represents the same temperature difference between
45o F and 55o F. This because each 10 degree interval has the same physical meaning in terms
of the kinetic energy of molecules and 0o F does not mean no heat at all.
Ratio Scales of measurements: Level of measurement which classifies data that can
be ranked, differences are meaningful, and there is a true zero. True ratios exist between
the different units of measure. All arithmetic and relational operations are applicable. Some
examples are
Weight
Blood pressure.
Length
~ Primary Data
~ Secondary Data
8
CHAPTER 1 PROBABILITY AND STATISTICS
Primary Data: are data collected for the first time either through direct observation or by
enquiring individuals. It refers to the data collected either by or under the direct supervision
and instruction of the researcher.
Secondary Data: are data obtained from published or unpublished sources like news-
papers, journals, official records, etc.