Professional Documents
Culture Documents
Presented by
Aung Kay Tu, MBBS, DTM&H, MCTM, PhD
Variables: Discrete and Continuous
•x 1 2 3 4 5
• LN(x) 0 0.693147 1.098612 1.386294 1.609438
e Euler's number- mathematical constant
• "Euler's number" named after Leonhard Euler
• The number e is a mathematical constant approximately equal to
2.71828 and is the base of the natural logarithm:
• The unique number whose natural logarithm is equal to 1.
e = 2.71828183….
Try it in excel
=ln(2.71828183)
Order the numbers from least to greatest.
A. 7, 4, 15, 9, 5, 2 2, 4, 5, 7, 9, 15
Find the mean, median, mode, and range of the data set.
4, 7, 8, 2, 1, 2, 4, 2
mean:
4 + 7 + 8 + 2 + 1 + 2 + 4 + 2 = 30 Add the values.
8 items sum
Divide the sum by the
30 8 = 3.75 number of items.
median:
1, 2, 2, 2, 4, 4, 7, 8 Arrange the values in order.
The median is 3.
Find the mean, median, mode, and range of the data set.
4, 7, 8, 2, 1, 2, 4, 2
mode:
1, 2, 2, 2, 4, 4, 7, 8 The value 2 occurs three times.
The mode is 2.
Find the mean, median, mode, and range of the data set.
4, 7, 8, 2, 1, 2, 4, 2
range:
1, 2, 2, 2, 4, 4, 7, 8 Subtract the least value
from the greatest value.
8– 1 = 7
The range is 7.
Find the mean, median, mode, and range of the data set.
6, 4, 3, 5, 2, 5, 1, 8
mean:
6 + 4 + 3 + 5 + 2 + 5 + 1 + 8 = 34 Add the values.
8 items sum
median:
1, 2, 3, 4, 5, 5, 6, 8 Arrange the values in order.
mode:
1, 2, 3, 4, 5, 5, 6, 8 The value 5 occurs two times.
The mode is 5.
In the data set below, the value 12 is much less than
the other values in the set. An extreme value such as
this is called an outlier.
x
x x x x x x x
10 12 14 16 18 20 22 24 26 28 30 32 34 36 38 40 42
The data shows scores for the last 5 math
tests: 88, 90, 55, 94, and 89. Identify the
outlier in the data set.
outlier 55
With the Outlier
55, 88, 89, 90, 94
outlier 55
range:
1, 2, 3, 4, 5, 5, 6, 8 Subtract the least value
from the greatest value.
8– 1 = 7
The range is 7.
Identify the outlier in the data set. Then
determine how the outlier affects the mean,
median, and mode of the data. The tell
which measure of central tendency best
describes the data with the outlier.
63, 58, 57, 61, 42
outlier 42
With the Outlier
42, 57, 58, 61, 63
outlier 42
Caution!
Since all the data values occur the same
number of times, the set has no mode.
PERCENTILE
• Because i is not an integer, round up. The position of the 85th percentile is the
next integer greater than 10.2, the 11th position
• The 85th percentile is the data value in the 11th position that is 3730.
• let us consider the calculation of the 50th percentile for the data set
• Because i is an integer, step 3(b) states that the 50th percentile is the
average of the sixth and seventh data values
• 3310 3355 3450 3480 3480 3490 3520 3540 3550 3650 3730 3925
• the 50th percentile is (3490 + 3520)/2 = 3505.
• the 50th percentile is also the median.
• 3310 3355 3450 3480 3480 3490 3520 3540 3550 3650 3730 3925
• the first quartile, or 25th percentile, is the average of the third and
fourth data values; thus, Q1 = (3450 + 3480)/2 = 3465.
• the third quartile, or 75th percentile, is the average of the ninth and
tenth data values; thus, Q3 = (3550 + 3650)/2 = 3600
Interquartile Range (IQR)
1. Find the mean, median, mode, and range of the data set. 8, 10, 46, 37, 20, 8, and 11
2. Identify the outlier in the data set, and determine how the outlier affects the mean,
median, and mode of the data. Then tell which measure of central tendency best
describes the data with and without the outlier. Justify your answer. 85, 91, 83, 78,
79, 64, 81, 97
3. Consider a sample with data values of 27, 25, 20, 15, 30, 34, 28, and 25. Compute
the 20th, 25th, 65th, and 75th percentiles