Professional Documents
Culture Documents
COMPUTER APPLICATION
L E SS O N 1 – B a s i c s o f S t a t i s t i cs
What you will learn…
Importance of Statistics
Basic Statistics Concepts
Variables
Scale of Measurement
Types of Data
What is Statistics
25%
75%
15%
55%
25%
45%
Basics of Statistics
Introduction
He asked Kate to
collect data regarding
the current soft skill
levels of the members
of the team and also
the areas where they
perceive that they need
to improve.
Introduction
Instead he chose to go
through almost 20
pages of data that Kate
had collected for the
purpose.
It’s all about making sense of data and figuring out how to
put that information to use
Field of Statistics
The study and practice of collecting and
analyzing data
Statistics
Facts about, or summaries of data
TYPES OF STATISTICS
Collect data
e.g. Survey
Present data
e.g. Tables and graphs
Characterize data
e.g. Sample mean = X i
18
INFERENTIAL STATISTICS
Population
The collection of all responses,
measurements, or counts that are of
interest.
The population must be defined explicitly before the study begins and the
research hypothesis/questions specify the population being studied
Sample
A portion or subset of the
population.
Sampling Error
reflects the fact that the result we get from our sample is not
going to be exactly equal to the result we would have got if
we have been able to measure the entire population. And
each possible sample we could take give a different result.
IMPORTANT TERMS
•Parameter
•Statistic
Examples:
Simple random sampling
Stratified random sampling
Cluster sampling
Systematic Sampling
2. Nonprobability
Consecutive sampling: commonly used in intervention studies.
Convenience sampling
Purposive sampling: commonly used in qualitative research.
Random Sampling: Each member of the population
has an equal chance of being selected.
Simple Random Sampling: All samples of the same size are
equally likely.
x x x x x x x
x x x
x x x x x x
x x
x x
x x x x x
x x x x xx x x
x x x
x
x
x x x x x x x x x xx x
x x
x x x x x xx x x x x x x x x
x x x x x
xxx xx x x x x x x x x x x x x x
x x
x xx x x x x x x
x x x x
x xx x
Sample
27
CLUSTER SAMPLING
Cluster Sample
SYSTEMATIC SAMPLING
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
29
OTHER SAMPLING TECHNIQUE
From the data collected, inferences are made about the larger
population. For example, if 80% of patients self-administering
analgesia report good pain control, whereas only 40% of
patients receiving nurse-administered analgesia report good pain
control, one would conclude that there is a difference between
the two methods and that self-administered analgesia is superior.
However, there is always a possibility that since we have only
used a sample of all possible patients, there may, in fact, be no
difference between the two but the results have just occurred due
to chance To test this formally, a statistical test would be done.
HYPOTHESIS-BASED SAMPLE SIZES
In this case the P value is 0.03. This P value means that the
probability of obtaining these results or results even more
extreme, if in truth there is no difference between the two
methods, is no more than 3%. Therefore, either self-administered
analgesia is better than nurse-administered analgesia or a very
unusual event has occurred. When there is truly no difference
between two interventions, but the results of our study suggest
there is a difference, a type 1 error has occurred. Generally,
studies will accept a 5% risk (α level) of making a type 1 error.
The calculated P value is the probability that we may have made
a type 1 error.
HYPOTHESIS-BASED SAMPLE SIZES
Dichotomous variable
• A variable that can have only two values
Qualities of Variables
• Exhaustive
– Should include all possible answerable responses
• Mutually exclusive
– No respondent should be able to have two attributes
simultaneously
e.g. Employed vs. Unemployed
- it is possible to be both if looking for a second
job while employed
Variables
Qualitative / Quantitative/
categorical variable numerical variable
Discrete Continuous
e.g. Height
Weight
Time
Independent Variables
Independent Cause or
Determine or Dependent Variables
Variables
Influence
Intervening Variables
Intervening
Variables
Independent Dependent
Variables Variables
B. LEVELS OF MEASUREMENT
Levels of Measurement
• Nominal
• Ordinal
• Interval
• Ratio
Nominal level
- Marital status
- Religion
- Type of car used
Ordinal level
e.g. Income
No. of children
Age
Weekly mobile data load spending
Properties held by each level of measurement
Property
Level of
measurement Equal True zero
Categories Ranks
intervals point
Nominal Yes No No No
Level of
Nominal Ordinal Interval/Ratio
Measurement
Mode
Measures of Mode
Mode Median
central tendency Median
Mean
Min/Max/Range
Measures of Min/Max/Range
IQR
dispersion IQR
Std. Deviation
Frequencies
Procedures Frequencies Frequencies
Descriptives
Levels of measurement guidelines
Primary Secondary
Data Collection Data Compilation
Experimentation
Types of data
Data
Categorical Numerical
(Qualitative) (Quantitative)
Discrete Continuous
(integers) (takes any value)
LESSON OVERVIEW