Professional Documents
Culture Documents
General Concepts
Definition of Statistics
statistics is scientific method for collecting, organizing, summarizing, presenting and analyzing data as
well as drawing valid conclusions and making reasonable decisions on the basis of such analysis.
deals with collecting, summarizing, and simplifying data to achieve conclusions can be readily drawn
from the data. It facilitates an understanding of the data and systematic reporting; and also makes them
amenable to further discussion, analysis, and interpretations.
Inferential statistics
it consists of methods that are used for drawing inferences, or making broad generalizations, about a
totality of observations on the basis of knowledge about a part of that totality. we can estimate a value
about the entire population from the sample information by using inferential statistics.
General Concepts
General Concepts
Data collection method
:Survey method
This method is based on selecting a part of the community under study, and this method is
characterized by reducing time, effort and cost.
Population : It is the total group of the study items, whether they are individuals or things
)Median(
)Variance(
Mean (average)
The mean of a set of quantitative data is the sum of the observed values divided by the
number of values
𝑿=
∑ 𝑿 𝑿 𝟏+ 𝑿 𝟐+…+𝑿 𝒏
=
𝒏 𝒏
The mean of a sample is typically denoted by x-bar, but the population mean is denoted
by the Greek symbol μ.
Statistical Measures
Measures Of Central Tendency
(Mean) :
𝑋=
∑ 𝑋 29+ 21+18 +27+25+ 30+16 166
¿ ¿ ¿ 23 . 7142
𝑛 7 7
Statistical Measures
Measures Of Central Tendency
Median
The median of a set of quantitative data is the value which is located in the middle of the
data, arranged from lowest to highest values (or vice versa), with 50% of the observations
above and 50% below.
Statistical Measures
Measures Of Central Tendency
Median
Finding the Median, M :
Arrange the n measurements (observations) from smallest to largest
if n is odd
the median is the middle value
if n is even
the median is the mean of the two middle values and
Statistical Measures
Measures Of Central Tendency
66 56 39 28 18 15
𝟑𝟗+𝟐𝟖
𝐦𝐞𝐝𝐢𝐚𝐧= =𝟑𝟑 . 𝟓
𝟐
Statistical Measures
Measures Of Central Tendency
30 29 27 25 21 18 16
Median=
Statistical Measures
Measures Of Central Tendency
)Mode( The mode is another measure of central tendency. It is the value at the point around
which the items are most heavily concentrated. (The most frequent values)
18, 10, 15, 13, 17, 15, 12, 15, 18, 16, 11
Solution
:Order data 18 ,18 ,17 ,16 ,15 ,15 ,15 ,13 ,12 ,11 ,10
𝒎𝒐𝒅𝒆 ¿ 𝟏𝟓
𝑴𝒆𝒅𝒊𝒂𝒏¿𝟏𝟓
𝟏𝟎+𝟏𝟏+𝟏𝟐+𝟏𝟑+𝟏𝟓× 𝟑+𝟏𝟔+𝟏𝟕+𝟏𝟖×𝟐
𝑴𝒆𝒂𝒏 ¿
𝟏𝟏
𝑿 ¿𝟏𝟒.𝟓𝟓
Problems
2. Calculate the mean, median and mode of the following data:
11, 10, 14, 11, 11, 14, 15, 14, 15, 11, 16, 14
Solution
:Order data 16 ,15 ,15 ,14 ,14 ,14 ,14 ,11 ,11 ,11 ,11 ,10
𝒎𝒐𝒅𝒆 ¿ { 𝟏𝟏,𝟏𝟒}
𝟏𝟒 +𝟏𝟒
𝑴𝒆𝒅𝒊𝒂𝒏¿ 𝟐 =𝟏𝟒
𝟏𝟎+𝟏𝟏×𝟒+𝟏𝟒× 𝟒+𝟏𝟓×𝟐+𝟏𝟔
𝑴𝒆𝒂𝒏 ¿
𝟏𝟐
𝑿 ¿𝟏𝟑
Statistical Measures
Measures of Dispersion
Averages are not sufficient to give a complete description of the data, as they are not
suitable for measuring how different or homogeneous the data are with each other.
A 90 80 65 60 55 4030
B 65 63 61 60 59 5755
We found that the mean and the median for each are 60
Values in group B are close to each other and are not far from the mean or median
)The Range( the difference between the maximum value and the minimum value of data
Example 6:
A 90 80 65 60 554030
𝑹𝒂𝒏𝒈𝒆=𝟗𝟎−𝟑𝟎=𝟔𝟎
B 65 63 61 60 595755
𝑹𝒂𝒏𝒈𝒆=𝟔𝟓−𝟓𝟓=𝟏𝟎
Statistical Measures
Measures of Dispersion
)Variance( variance is the mean squared difference between all elements of a group and
the mean of this group.
Greek letters are used for populations and Roman letters for samples
𝟐
𝑺 =
∑ ( 𝑿 − 𝑿 )𝟐
𝒏
s2 = sample variance
s2 = population variance
A 90 80 65 60 554030
Example 7:
B 65 63 61 60 595755
Variance 𝑺𝟐=
∑ ( 𝑿 − 𝑿 )𝟐
∑ |𝑿 − 𝑿|
𝒏
Mean deviation 𝑴𝑫=
𝒏
Group A Group B
30 -30 900 55 -5 25
40 20- 400 57 3- 9
55 5- 25 59 1- 1
60 0 0 60 0 0
65 5 25
61 1 1
80 20 400
63 3 9
90 30 900
65 5 25
∑ 0 2650 ∑ 0 70
2650 70
2
𝑆 =
𝐴
7 ¿ 378 . 571
2
𝑆 =
𝐵
7
¿ 10
Statistical Measures
Measures of Dispersion
Standard Deviation standard deviation is the mean of difference between all elements of a
𝟗𝟒
𝑿 𝑩= =𝟏𝟑 . 𝟒𝟑
20 𝟕
15
8 𝟏𝟏𝟓. 𝟕𝟏
2
𝑆= =𝟏𝟗 . 𝟐𝟖𝟓
8 𝟕
15
12
𝑺 𝑩 =𝟒 . 𝟑𝟗
16
94 0 115.71
3. In the following data, which group is more homogenous? Why?
Group A 187 284 201 151 100 154 105
Group B 20 15 8 8 15 12 16
Solution
𝑺 𝑫 𝑩=𝟒 .𝟑𝟗