
Descriptive Statistics

Measure of Central Tendency


(stats of location/describing distribution using numbers)

Dr. Lim Jac Fang


CFM, FPSK, UMS

Visual information

How do you make sense of it?
• Central Tendency – the tendency of the data
to cluster, or centre, about certain numerical
values
• Variability is the spread of the data.

Descriptive Statistics (DS)
• Involves Collecting, Presenting and
Characterizing the data.
• Purpose – To describe the data.

MEAN MEDIAN MODE
Measures of Centre
• Mean
• Median
• Mode
Measures of Spread
• Range
• Standard Deviation

MEAN (average)
• The most common statistic of location.
• It is the sum of all measurements divided by
the number of observations in the data set.
• Balance point.

• The capital Greek sigma, Σ, simply means the
sum of the items indicated.
• The i = 1 means that the items should be
summed, starting with the first one and
ending with the nth one, as indicated by the
n above the Σ.
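
For reference, the summation notation described in these bullets corresponds to the standard definition of the sample mean. The symbols x_i (the i-th observation) and x-bar (the mean) are assumed names, since the slide's own formula is not reproduced in the text:

```latex
\bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i = \frac{x_1 + x_2 + \cdots + x_n}{n}
```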

• This formula tells us, "Sum (Σ) all the items
and divide the sum by n."
• The mean of a sample is the centre of gravity
of the observations in the sample.

Mean?
• 154
• 139
• 154
• 193
• 186
• 130
• 154
• 156
• 190
• 110
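
The exercise above can be checked with a few lines of Python. This is a minimal sketch that assumes all ten listed values belong to one data set; the variable names are illustrative only.

```python
# Values listed on the slide (units are not stated in the source)
values = [154, 139, 154, 193, 186, 130, 154, 156, 190, 110]

# Mean = sum of all measurements / number of observations
total = sum(values)
n = len(values)
mean = total / n

print(total, n, mean)  # 1566 10 156.6
```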
The median
• That value of the variable (in an ordered
array) that has an equal number of items on
either side of it.
• Thus, the median divides a frequency
distribution into two halves.
• Physical middle point.

Median?
• 154
• 139
• 154
• 193
• 186
• 130
• 154
• 156
• 190
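
One way to work this exercise is to sort the values and pick the middle one. The sketch below assumes the nine listed values form the whole sample; the position rule it uses only applies when the number of observations is odd.

```python
# Values listed on the slide
values = [154, 139, 154, 193, 186, 130, 154, 156, 190]

# Step 1: put the values into an ordered array
ordered = sorted(values)

# Step 2: with an odd n, the median sits at position (n + 1) / 2 (1-based)
n = len(ordered)
position = (n + 1) // 2

# Step 3: read off the value (Python lists are 0-based, hence the -1)
median = ordered[position - 1]

print(ordered)           # [130, 139, 154, 154, 154, 156, 186, 190, 193]
print(position, median)  # 5 154
```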

• M is …… (n + 1)/2
• This position in the ordered array is easily evaluated for a sample
with an odd number of individuals; with an even number, the median
is the mean of the two middle values.
• Medians are most often used for distributions
that do not conform to the standard
probability models.

• Sometimes the median is a more
representative measure of location than the
arithmetic mean.
• Such instances almost always involve
asymmetric distributions.

• The basic advantage of the median in
describing data compared to the mean (often
simply described as the "average") is that it is
not skewed so much by a small proportion of
extremely large or small values, and so it may
give a better idea of a "typical" value.
• OUTLIERS
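
The effect of an outlier can be illustrated with a short sketch. The nine values are taken from the earlier exercise slides; the extra value 1000 is a hypothetical outlier added purely for illustration and does not appear in the source.

```python
from statistics import mean, median

# Nine values from the exercise slides
values = [154, 139, 154, 193, 186, 130, 154, 156, 190]

# Hypothetical extreme value, added only to show its effect
with_outlier = values + [1000]

mean_before, median_before = mean(values), median(values)
mean_after, median_after = mean(with_outlier), median(with_outlier)

# The mean is pulled strongly towards the outlier; the median barely moves
print(round(mean_before, 2), median_before)  # 161.78 154
print(round(mean_after, 2), median_after)    # 245.6 155.0
```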

Mode
• The mode refers to the most frequent value in
the data set.
• This is the only central tendency measure that
can be used with nominal data (gender,
nationality, ethnicity, language, genre, style,
biological species, and form), which consist of
purely qualitative categories.

Mode?
• 154
• 139
• 154
• 193
• 186
• 130
• 154
• 156
• 190
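
A frequency count answers this exercise directly. The sketch below uses Python's standard `collections.Counter`; the variable names are illustrative.

```python
from collections import Counter

# Values listed on the slide
values = [154, 139, 154, 193, 186, 130, 154, 156, 190]

# Count how often each value occurs
counts = Counter(values)

# The mode is the value with the highest frequency
mode, frequency = counts.most_common(1)[0]

print(mode, frequency)  # 154 3
```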
Range
• R = Max – Min
• 154
• 139
• 154
• 193
• 186
• 130
• 154
• 156
• 190
• R =?
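
The range for these values can be computed as a quick check; this is a minimal sketch with illustrative variable names.

```python
# Values listed on the slide
values = [154, 139, 154, 193, 186, 130, 154, 156, 190]

# R = Max - Min
largest = max(values)
smallest = min(values)
value_range = largest - smallest

print(largest, smallest, value_range)  # 193 130 63
```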

Xi      Xi - Xbar      (Xi - Xbar)²
10
12
16
19
20
SUM Σ

Xi      Xi - Xbar      (Xi - Xbar)²
10      10 – 15.4      29.16
12      12 – 15.4      11.56
16      16 – 15.4       0.36
19      19 – 15.4      12.96
20      20 – 15.4      21.16
SUM Σ                  75.2
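
The table can be reproduced step by step in Python. This sketch assumes the five Xi values are a sample, so the standard deviation uses the n - 1 divisor (which is what `statistics.stdev` computes); the variable names are illustrative.

```python
from statistics import stdev

# Xi column from the table
x = [10, 12, 16, 19, 20]

# Sample mean (Xbar)
x_bar = sum(x) / len(x)

# One row per observation: Xi, deviation from the mean, squared deviation
for xi in x:
    deviation = xi - x_bar
    print(f"{xi:>3} {deviation:>6.1f} {deviation ** 2:>7.2f}")

# Sum of the squared deviations (last column of the table)
sum_of_squares = sum((xi - x_bar) ** 2 for xi in x)

# Sample standard deviation (n - 1 divisor)
sd = stdev(x)

print(round(x_bar, 1), round(sum_of_squares, 2), round(sd, 3))  # 15.4 75.2 4.336
```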

SD = 4.336
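
The value 4.336 is consistent with the sample standard deviation formula using the n - 1 divisor, applied to the sum of squares 75.2 from the table with n = 5. The symbol s for the sample standard deviation is an assumed name:

```latex
s = \sqrt{\frac{\sum_{i=1}^{n} (X_i - \bar{X})^2}{n - 1}}
  = \sqrt{\frac{75.2}{5 - 1}}
  = \sqrt{18.8}
  \approx 4.336
```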

Interpretation?

Thank you very much!
