Professional Documents
Culture Documents
TH em at Ics
TH em at Ics
Statistics 1
Types of Data 2
s
Normal Distribution 3
ic
Stem and Leaf Diagram 6
at
Back to Back Stem Plot 7
Box and Whisker Plots 8
m
he
at
l M
na
io
it
dd
A
Statistics
Statistics is the science of collecting, organizing, presenting and analysing
numerical facts called data.
s
Variable – The characteristic being studied
ic
Data – consist of observations of the variables being measured.
at
Raw data – unorganized data
m
Information – data that is organized and processed to make it useful
he
Population – The collection of all outcomes, measurements, or
at
possibilities that are of interest.
M
Page | 1
Types of Data
1. Qualitative (Categorical)
Values or observations that can be sorted into groups or categories.
Description of the variable.
s
E.g. eye colour, blood type.
ic
a. Nominal
at
Can be counted, but not ordered. No natural order. Cannot
be ranked.
m
E.g. Gender, hair colour
he
b. Ordinal
Can be counted and ordered. Natural order. Can be ranked.
E.g. house number, Linkert scale (high, medium, low) pain
at
severity
M
2. Quantitative (Numerical)
Values or observations that can be measured. Numbers that can be
l
a. Discrete
Countable, individual items. Finite options.
it
Page | 2
Normal Distribution
A type of continuous probability distribution that has the shape of a
bell. It describes data that is symmetrical about the mean and the width
s
of the curve is described by its standard deviation.
ic
at
m
he
at
M
Properties:
io
- Mean=mode=median
it
Page | 3
Empirical Rule (Area under the curve)
- 68.2% of data fall within 1 std. dev (standard deviation) of mean
- 95.4% of the data falls within 2 std. dev of the mean
- 99.7% of the data falls within 3 std. dev of the mean
s
ic
at
m
he
at
Skewness is the degree of asymmetry observed in the normal distribution
M
mean<median<mode
na
io
it
dd
A
Page | 4
- A right/positively skewed distribution has a long tail on the right
mean>median>mode
s
ic
at
m
he
at
l M
na
io
it
dd
A
Page | 5
Stem and Leaf Diagram
A stem-and-leaf diagram (stemplot) is a very useful way of presenting data
while still retaining the original data.
s
ic
at
m
he
at
l M
na
io
it
dd
A
Advantages:
● It is easy & quick to construct
● The order observations are easy to identify
Page | 6
● The data is arranged compactly
Disadvantages:
● There is not much flexibility in the choice of the stem
● It is not suitable for displaying a large set of data
s
● The class frequency cannot be rapidly read off
ic
Back to Back Stem Plot
at
Used to compare two sets of data. The same stem is used for the leaves of
both sets of data, and two keys are required.
m
he
at
l M
na
io
it
dd
A
Page | 7
Box and Whisker Plots
It is a statistical diagram used to
represent the:
1. Lowest value
s
2. Q1
ic
3. Q2
4. Q3
at
5. highest value
m
he
Advantages:
● Two or more sets of data can be easily compared
● Indicates the data’s symmetry and skewness
at
● Easily handles extremely large sets of data
M
Disadvantages:
● Simple appearance, not visually appealing as other graphs
l
Page | 8