Professional Documents
Culture Documents
STATISTICAL ANALYSIS
Introduction
to
Statistics
Page 1
Objectives
To define statistics
To discuss the wide range of
applications of statistics
To discuss key statistical concepts
To understand the branches of
statistics
To describe the levels of
measurement of data
Page 2
What
is Statistics?
Page 3
YHJ@SOM, USM
Page - 1
Introduction to Statistics
STATISTICAL ANALYSIS
Applications of Statistics
Statistical techniques are used in a wide range
of types of scientific and social research,
including: Biostatistics, Computational
biology, Computational sociology, Network
biology, Social science, Sociology and Social
research.
YHJ@SOM, USM
Page - 2
Introduction to Statistics
STATISTICAL ANALYSIS
Population
a population is
Sample
A sample is
Page 8
Parameter
Statistic
Page 9
YHJ@SOM, USM
Page - 3
Introduction to Statistics
STATISTICAL ANALYSIS
Sample
Subset
Statistic
Parameter
Branches of Statistics
Statistics
Descriptive Statistics
Parametric Statistics
Inferential Statistics
Non-Parametric Statistics
Page 11
Descriptive Statistics
Graphical Techniques
Numerical Techniques
Page 12
YHJ@SOM, USM
Page - 4
Introduction to Statistics
STATISTICAL ANALYSIS
Inferential Statistics
Page 13
Statistical Inference
Statistical inference is the process of making an
estimate, prediction, or decision about a
population based on a sample.
Population
Sample
Inference
Statistic
Parameter
Statistical Inference
We use statistics to make inferences about
parameters.
Therefore, we can make an estimate,
prediction, or decision about a population
based on sample data.
Thus, we can apply what we know about a
sample to the larger population from which
it was drawn!
Page 15
YHJ@SOM, USM
Page - 5
Introduction to Statistics
STATISTICAL ANALYSIS
Statistical Inference
Rationale:
Large populations make investigating each
member impractical and expensive.
Easier and cheaper to take a sample and make
estimates about the population from the
sample.
However:
Such conclusions and estimates are not
always going to be correct.
For this reason, we build into the statistical
inference measures of reliability, namely
confidence level and significance level.
Page 16
to estimate
Sample
(parameter)
(statistic )
Select a
random sample
Page 18
YHJ@SOM, USM
Page - 6
Introduction to Statistics
STATISTICAL ANALYSIS
Branches of Statistics
Statistics
Descriptive Statistics
Parametric Statistics
Inferential Statistics
Non-Parametric Statistics
Page
19
Parametric Statistics
Parametric statistics is a branch of
statistics that assumes data come
from a type of probability
distribution and makes inferences
about the parameters of the
distribution. Most well-known
elementary statistical methods are
parametric.
Page
20
Non-parametric statistics
distribution free methods which do not rely on
YHJ@SOM, USM
Page - 7
Introduction to Statistics
STATISTICAL ANALYSIS
Page
22
Interval Data
Nominal Data
Ordinal Data
Page 23
Interval Data
Interval data
Real numbers, i.e. heights, weights,
prices, etc.
Also referred to as quantitative or
numerical.
Arithmetic operations can be performed on
Interval Data, thus its meaningful to talk
about 2*Height, or Price + $1, and so on.
Page 24
YHJ@SOM, USM
Page - 8
Introduction to Statistics
STATISTICAL ANALYSIS
Nominal Data
Nominal Data
The values of nominal data are categories.
E.g. responses to questions about marital status,
coded as:
Single = 1, Married = 2, Divorced = 3, Widowed = 4
Because the numbers are arbitrary, arithmetic
operations dont make any sense (e.g. does
Widowed 2 = Married?!)
Nominal data are also called qualitative or
categorical.
Page 25
Ordinal Data
Ordinal Data appear to be categorical in nature,
but their values have an order
order;; a ranking to them:
E.g. College course rating system:
poor = 1, fair = 2, good = 3, very good = 4,
excellent = 5
While its still not meaningful to do arithmetic on
this data (e.g. does 2*fair = very good?!), we can
say things like:
excellent > poor or fair < very good
That is, order is maintained no matter what
numeric values are assigned to each category.
Page 26
YHJ@SOM, USM
Page - 9
Introduction to Statistics
STATISTICAL ANALYSIS
Hierarchy of Data
Interval
Values are real numbers.
All calculations are valid.
Data may be treated as ordinal or nominal.
Ordinal
Values must represent the ranked order of the data.
Calculations based on an ordering process are valid.
Data may be treated as nominal but not as interval.
Nominal
Values are the arbitrary numbers that represent
categories.
Only calculations based on the frequencies of
occurrence are valid.
Data may not be treated as ordinal or interval.
Page 28
End of discussion
Page
29
YHJ@SOM, USM
Page - 10