Professional Documents
Culture Documents
CH. 3
ششح يجغط ثبنهغز ٍٛانؼشثٛخ ٔاإلَجهٛضٚخ
(أيضهخ يزُٕػخ ٔأعئهخ اخزجبساد عبثقخ)
إعـــــداد
أستاذ /قاسم مضوي
جٕال 0502180703
NEW
1432-1433
CHAPTER 3: DATA DESCRIPTION:
:ْٙ زكٌٕ يٍ أسثؼخ أقغبوٚٔ :بَبدٛ ٔصف انج:انجبة انضبنش
1- MEASURES OF CENRTRAL TENDENCY : خٚظ انُضػخ انًشكضٛٚيقب
Mean = X
x 2 9 4 15 5
n 3 3
EXAMPLE 3-2:
FIND THE VALUE OF X IF YOU HAVE 2, X, 9, the mean of these value equal to 5.
SOL.:
X
x 5
2 x9 11 x 15
n 3 x 15 11 4
Median: طٛانٕع:
بٛب أٔ رُبصنٚجٓب رصبػذٛبَبد ثؼذ رشرٛ رزٕعط انجًٙخ انزْٕٛ انق
EXAMPLE 3-3:
Find the median to these values: 2, 5, 9, 7, 3
SOL.: first step we should arrange the value as sending or descending order.
Arranging data 2, 3, 5, 7, 9
Then the median equal to 5 (the value in the middle of values)
EXAMPLE 3-4:
Find the median to 2, 5, 9, 7, 3 , 12
Arranging data: 2, 3, 5, 7, 9, 12
Then the median equal to:
2
0502180703 قبعى يضٕ٘ – جٕال/أعزبر
MODE: انًُٕال
Is the value which having a higher frequency. . انقُمت األكثس تكسازا أو شُىعا:تعسَفه
EXAMPLE 3-5:
Find the mode: 6, 7, 2, 6, 5, 6
SOL.: Mode = 6 (we say data having unimodal)
EXAMPLE 3-6:
Find the mode 6, 7, 2, 6, 5, 7
mode = 6, 7 (bimodal)
REMARKS:
:يالحظخ يًٓخ جذا جذا
ذح انًُٕالٛبَبد رغًٗ ٔحٛبَبد يُٕال ٔاحذ انجٛإرا كبٌ نهج
.خ انًُٕالٛ يُٕال) رغًٗ صُبئ2( ٍٛبَبد يُٕانٛإرا كبٌ نهج
ٍ رغًٗ يزؼذدح انًُٕالٛبَبد أكضش يٍ يُٕانٛإرا كبٌ نهج
one mode - - - unimodal
two mode ---- bimodal
three mode --- multimodal
EXAMPLE 3-7:
Find the midrange for 2, 9, 4
ٌ قاسم مضى/أستاذ
0502180703
3
0502180703 قبعى يضٕ٘ – جٕال/أعزبر
2- MEASURES OF VARIATIONS (Dispersion): ظ االخزالفٛٚيقب
range ٖانًذ
2
)S ّشيض نٚ ُخٍٛ انؼٚ (رجبsample variance ٍٚانزجب
)S ّشيض نٚ ُخٛ (اَحشاف انؼsample standard deviation ٘بسٛاالَحشاف انًؼ
)C.V ّشيض نٚ( coefficient of variation يؼبيم االخزالف
.ظ انزشزذٛٚظ االخزالف أٔ يقبٛٚظ أػالِ رغًٗ يقبٛٚانًقب
1- Range:
the highest value minus the lowest value.
انمدي عبازة عه أعهً قُمت – أقم قُمت
EXAMPLE 3-8:
Range = 9 – (-16) = 25
2- VARIANCE: ٍٚانزجب
x 2
x X X
2 2
S
2 n S 2
n 1 OR n 1
Where X is mean (average)
EXAMPLE 3-9:
Find the variance to 2, 4, 9
ٌ قاسم مضى/أستاذ
SOL:
0502180703
x 2
x 2
n
S
2
n 1
X X2
2 4
4 16
9 81
Sum 15 Sum 101 انمجمىع
x 2
152
x 2
n
101
3
S2 13
n 1 3 1
4
0502180703 قبعى يضٕ٘ – جٕال/أعزبر
3- STANDARD DEVIATION: ٘بسٛاالَحشاف انًؼ
x 2
x 2
n
S
n 1
EXAMPLE 3-10:
x 2
152
x 2
n
101
3
S 13 3.61
n 1 3 1
ٍ أػًبس ػًبل ٔسٔارجٓى) ٔرنك ثقغًخٍٛ يضم (انًقبسَخ ثٍٛ ظبْشرٛغزخذو نهًقبسَخ ثٚ بطٛيق
ٗبس٘ نهشٔارت ػهٛى االَحشاف انًؼٛ صى رقغ، بس٘ نهؼًش ػهٗ يزٕعط انؼًشٛاالَحشاف انًؼ
. األكضش اخزالفبْٙ ًٌٕخ األكجش ركًُٛٓب ٔانقٛيزٕعط انشٔارت ٔيٍ صى انًقبسَخ ث
S
C.V 100%
X
EXAMPLE 3-11:
5
0502180703 قبعى يضٕ٘ – جٕال/أعزبر
يالحظخ يًٓخ جذا REMARKS:
1. MEAN < MEDIAN < MODE
Then the distribution has a negative skewed or skewed to the left.
يالحظخ يًٓخ:
انؼالقخ ث ٍٛانًزٕعط انحغبثٔ ٙانٕعٛط ٔانًُٕال.
-1إذا كان انمتىسط انحسابٍ أقم قُمت مه انىسُط وانمنىال فإن انتىشَع مهتىٌ جهت انُساز.
-2إذا كان انمتىسط انحسابٍ أكبس قُمت فمعنً هرا أن انتىشَع مهتىٌ جهت انُمُه.
-3إذا تساوي انمتىسط انحسابٍ وانىسُط وانمنىال فإن انتىشَع َسمً متماثم.
6
أعزبر /قبعى يضٕ٘ – جٕال 0502180703
3- MEASURES OF POSITIONS: ظ انًٕاضغٛٚصبنضب يقب
1. STANDARD SCORE خٚبسٛانذسجخ انًؼ
2. QUARTILES بدٛاألسثبػ
3. OUTLIER ًخ انًزطشفخٛانق
EXAMPLE 3-13:
If the mean of a set of data is 19 and 23.5 has z-sore of 0.75, then the standard
deviation must be?
SOL.:
X = 23.5, and X = 19, Z = 0.75, S = ?
XX
Z
S
23.5 19
0.75
S
4.5
0.75S 4.5 , S 6
0.75
EXAMPLE 3-14:
The mean of marks in test A is 80 and the standard deviation is 5,
if student get 70 in this test.
His relative position (z-score) in this exam is:
A) 6 B) 36 C)-3 D) -2
If the student in question (EXAMPLE 3-14) has a z-score = -1 in test B, one can say
about his relative position that:
A- B better is better than A B- information are not sufficient
C- Test (A) better than test (B) D- Test (A) is similar to test (B)
2. QUARTILES بدٛاألسثبػ
Q2 َٙغ انضبٛ ٔانشثQ1 غ األٔلٛ انشثْٙٔ بَبد إنٗ صالس أقغبوٛ رقغى انجٙى انزٛقصذ ثٓب انقٚٔ
Q3 غ انضبنشٛ ٔانشث، طْٕٛٔ َفغّ انٕع
Quartile divided the data into 4 groups.
Quartiles are denoted by Q1, Q2 , and Q3.
The second quartile (Q2) is median.
7
0502180703 قبعى يضٕ٘ – جٕال/أعزبر
3. OUTLIER ًخ انشبرحٛانق
An outlier is an extremely high or an extremely how data value when compared with
the rest of the data values.
، ىٛ انقٙظ نٓب ػالقخ ثجبقًٛخ ركٌٕ نٛ قْٙٔ ًخًٛخ أٔ أقم قٛ أصغش قْٙ ًخ انشبرحٛ(انق
ًٔكٍ اعزخشاجٓب ثغٕٓنخ يٍ دٌٔ قبٌَٕ أٚٔ بَبد انًؼطبحٛٔٔاضحخ جذا ثبنُغجخ نهج
.)ًخ انشبرح أو الٕٛجذ قبٌَٕ نهزأكذ يٍ أَٓب انقٚ ٍ ٔنك، قبػذح
Inter Quartile Range = IQR = Q3 – Q1
IQR ٙؼٛانًذٖ انشث
EXAMPLE 3-15: Given the data set 5, 6, 12, 13, 15, 18, 22, 50 Find the outlier ?
ًخ انشبرح ؟ٛجبد انقٚانًطهٕة إ
First of all we should arrange data , our data is already arranged.
Arranging of data : 5, 6, 12, 13, 15, 18, 22, 50
Q2 = (13+15)/2 = 14
Q1 = (6+12)/2 = 9
Q3= (18+22)/2 = 20
IQR = Q3- Q1 = 20- 9 = 11
1.5 * IQR = 1.5 * 11 = 16.5
Q1 – 1.5 * IQR = 9 – 16.5 = -7.5
Q3 + 1.5 * IQR = 20 + 16.5 = 36.5
8
0502180703 قبعى يضٕ٘ – جٕال/أعزبر
4- EXPLORATORY DATA ANALYSIS:
Stem and leaf plot:انغبق ٔاألٔساق
EXAMPLE 3-16:
If you have the following data 20, 32, 13, 14, 43, 02, 57, 23, 36, 32, 25, 31, 33, 32,
44, 32, 52, 44, 51, 45.
بَبد انغبثقخٛقى ثإَشبء انغبق ٔاألٔساق نهج
Construct a stem and leaf for the data.
SOL. انحم
First step we must arranging data as following: ٙهٚ ب كًبٚجب رصبػذٛبَبد رشرٛالصو َشرت انج
02, 13, 14, 20, 23, 25, 31, 32, 32, 32, 32, 33, 36, 43, 44, 44, 45, 51, 52, 57
Stem Leaf
0 2
1 3 4
2 0 3 5
3 1 2 2 2 2 3 6
4 3 4 4 5
5 1 2 7
EXAMPLE 3-17:
Given the data set 5, 6, 12, 13, 15, 18, 22, 50 construct Box plot ?
First of all we should arrange data , our data is already arranged.
Arranging of data : 5, 6, 12, 13, 15, 18, 22, 50
Q2 = (13+15)/2 = 14
Q1 = (6+12)/2 = 9
Q3= (18+22)/2 = 20
Q1 = 9 Q2 = 14 Q3 = 20
Max. = 50
Min. = 5
9
0502180703 قبعى يضٕ٘ – جٕال/أعزبر
REMARKS:يالحظبد يًٓخ جذا
1. If the median is near the center of the box, the distribution is approximately
symmetric.
بَبد رؼزجش يزًبصهخٛ انًُزصف فإٌ انجٙ فQ2 طٛإرا كبٌ انٕع
2. If the lines are about the same length, the distribution is approximately
symmetric.
.غ يزًبصمٚبٌ فإٌ انزٕصٚٔ انصُذٔق يزغبٙ طشفٙإرا كبٌ انخطبٌ ف
3. if the median fall to the left of the center of the box, the distribution is
positively skewed.
.ًٍٛٛؼزجش يهزٕ جٓخ انٚ غٚ فإٌ انزٕص، غبسٛ انٙ فQ2 طٛإرا كبٌ انٕع
4. If the right line is larger than the left line, the distribution is positively
skewed.
.ًٍٛٛغ يزهٕ جٓخ انٚغبس انصُذٔق فإٌ انزٕصٚ ٍ انصُذٔق أكجش يٍ انخطًٛٚ إرا كبٌ انخط
5. if the median fall to the right of the center of the box, the distribution is
negatively skewed.
.غبسٛغ يهزٕ جٓخ انٚ فإٌ انزٕص، ًٍٛٛ انٙ فQ2 طٛإرا كبٌ انٕع
6. If the left line is larger than the right line , the distribution is negatively
skewed.
.غبسٛغ يهزٕ جٓخ انٍٚ انصُذٔق فإٌ انزٕصًٛٚ غبس انصُذٔق أكجش يٍ انخطٚ إرا كبٌ انخط
10
0502180703 قبعى يضٕ٘ – جٕال/أعزبر
أعئهخ اخزجبساد عبثقخ ػهٗ انفصم انضبنش
Prepared by: Gasim Mudawi – Mobile 0502180703
25.Which measures are affected the most by extremely small or extremely large
values?
a) Mode and midrange. c) Mean and mode.
b) Mean and midrange. d) Mode and median.
22. Referring to the sample data of patient's ages, any age value greater than ... is
consider to be an outlier.
a) 56 b) 57.25 c) 8 d) -1.25
25. Referring to the given data, the shape of the patient's age distribution is …
a) symmetrical b) left skewed c) unknown d) right skewed
26. If the value of the coefficient variation of the patient's height is 14.221%, then
which is more variable patient's age or height?
11
0502180703 قبعى يضٕ٘ – جٕال/أعزبر
a) Height is more variable than age c) Cannot tell.
b) They have the same variability d) Age is more variable than height.
27. The value of the standard deviation of the patient's age is …
a) 152.86 b) 53.74 c) 59.54 d) 12.36
28. The mean value of the patient's age is …
a) 118.75 b) 152 c) 76.00 d) 19.00
29. The IQR value for the patient's age is ……..
a) 16 b) 19.5 c) 11.5 d) 19.5
30. The range value of the patient's age is ……
a) 20 b) 36 c) 16.5 d) 13
* Reference: First term – first exam –form D- 1432-1433H.
the most appropriate measure of central tendency for the value -21, -24, -27, -23, -25,
-26, -22, -30, -29, -31, 35 is the ……..
a) mode b) median c) mean c) midrange
What is the most appropriate measure of central tendency for the following data set?
A, C, A, B, C, B, A
a) median b) mean c) midrange d) mode
When a study is conducted on all students from KAU, every measurement calculated
fro this study would be called …
a) statistic b) sample c) population d) parameter
The age of a department instructors are shown in the following stem and leaf plot:
3 1
4 2 2 2 6 6
5 0 7
6 1
15. The value of the median for the raw data for the above stem and leaf plot is…
a) 42 b) 46 c) 2 d) 6
16. the raw data set for the stem and leaf is called …
a) unimodal b) bimodal c) multimodal d) trimodal
17. The midrange of the raw data for the above stem and leaf plot is …
a) 6 b) 59 c) 9 d) 46
0 5 10 15 20 25 30 35 40
13
0502180703 قبعى يضٕ٘ – جٕال/أعزبر