Chapters 2 and 3
Introduction to Statistics
Walpole
DESCRIPTIVE STATISTICS
The branch of statistics concerned with how data are presented and summarized.
PRESENTING DATA
SUMMARIZING DATA
SUMMARIZING DATA
COLLEGE
                                 Frequency  Percent  Valid Percent  Cumulative Percent
Valid  agriculture                     143     18.8           18.8                18.8
       arts and sciences               102     13.4           13.4                32.2
       business administration         147     19.3           19.3                51.4
       education                       160     21.0           21.0                72.4
       engineering                     210     27.6           27.6               100.0
       Total                           762    100.0          100.0
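The percent and cumulative percent columns follow directly from the counts. A minimal Python sketch, using the frequencies from the COLLEGE table (the function name `frequency_table` is only illustrative):

```python
# Counts copied from the COLLEGE frequency table.
counts = {
    "agriculture": 143,
    "arts and sciences": 102,
    "business administration": 147,
    "education": 160,
    "engineering": 210,
}

total = sum(counts.values())


def frequency_table(counts):
    """Return rows of (category, count, percent, cumulative percent)."""
    n = sum(counts.values())
    rows = []
    running = 0
    for category, count in counts.items():
        running += count  # cumulative count up to and including this row
        rows.append((category, count, 100 * count / n, 100 * running / n))
    return rows


table = frequency_table(counts)
for category, count, pct, cum_pct in table:
    print(f"{category:<25}{count:>6}{pct:>8.1f}{cum_pct:>8.1f}")
```

The cumulative percent is computed from the running count rather than by adding rounded percentages, which is why the third row shows 51.4 and not 18.8 + 13.4 + 19.3 = 51.5.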
[Pie chart of the COLLEGE frequencies. Visible labels: agriculture 18.8%, business administration 19.3%, education 21.0%, engineering 27.6%]
[Figure: two pie charts comparing the distribution by college in 2000 and in 2005]
BAR CHART
CROSS-TABULATION FOR TWO CATEGORICAL VARIABLES
Smoking * Hypertension Crosstabulation
                                            Hypertension
                                              yes        no     Total
non-smoker       Count                         21        48        69
                 % within Smoking           30.4%     69.6%    100.0%
                 % within Hypertension      23.9%     58.5%     40.6%
                 % of Total                 12.4%     28.2%     40.6%
moderate smoker  Count                         36        16        52
                 % within Smoking           69.2%     30.8%    100.0%
                 % within Hypertension      40.9%     19.5%     30.6%
                 % of Total                 21.2%      9.4%     30.6%
heavy smoker     Count                         31        18        49
                 % within Smoking           63.3%     36.7%    100.0%
                 % within Hypertension      35.2%     22.0%     28.8%
                 % of Total                 18.2%     10.6%     28.8%
Total            Count                         88        82       170
                 % within Smoking           51.8%     48.2%    100.0%
                 % within Hypertension     100.0%    100.0%    100.0%
                 % of Total                 51.8%     48.2%    100.0%
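Each cell of the smoking (Rokok) by hypertension (Hipertensi) crosstabulation carries three percentages: the count divided by its row total, by its column total, and by the grand total. A minimal Python sketch using the counts from the table (the English labels and the function name `cell_percentages` are illustrative choices, not part of the SPSS output):

```python
# Cell counts from the crosstabulation: smoking status by hypertension.
counts = {
    "non-smoker": {"yes": 21, "no": 48},
    "moderate smoker": {"yes": 36, "no": 16},
    "heavy smoker": {"yes": 31, "no": 18},
}

row_totals = {row: sum(cells.values()) for row, cells in counts.items()}
col_totals = {
    col: sum(cells[col] for cells in counts.values()) for col in ("yes", "no")
}
grand_total = sum(row_totals.values())


def cell_percentages(row, col):
    """Return the three percentages reported for one cell."""
    n = counts[row][col]
    return {
        "within_row": 100 * n / row_totals[row],     # % within smoking status
        "within_column": 100 * n / col_totals[col],  # % within hypertension
        "of_total": 100 * n / grand_total,           # % of total
    }


for row in counts:
    for col in ("yes", "no"):
        p = cell_percentages(row, col)
        print(
            f"{row:<16}{col:<4}{counts[row][col]:>4}"
            f"{p['within_row']:>7.1f}{p['within_column']:>7.1f}{p['of_total']:>7.1f}"
        )
```

Reading along the rows (% within smoking status) is what allows the groups to be compared: 30.4% of non-smokers have hypertension, against 69.2% of moderate smokers and 63.3% of heavy smokers.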
PRESENTING CONTINUOUS DATA
HISTOGRAM
FREQUENCY HISTOGRAM
FREQUENCY HISTOGRAM WITH 10 INTERVALS
FREQUENCY HISTOGRAM WITH 5 INTERVALS
SAT scores: verbal score versus math score
INCOME
STEM-AND-LEAF PLOT
DATA “CRIME”
crime Stem-and-Leaf Plot
2.00 1 . 89
6.00 2 . 126999
9.00 3 . 345577899
13.00 4 . 1234466688999
18.00 5 . 000122355666667778
16.00 6 . 0022233446788899
11.00 7 . 00112333557
12.00 8 . 000111456778
3.00 9 . 227
DATA (mg/kg)
57 55 56 56 55 56 55 51 56
58 60 48 32 46 58 56 51 50
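A stem-and-leaf plot for the mg/kg data can be built by splitting each value into a tens digit (stem) and a units digit (leaf). The sketch below assumes two-digit integer data, as in this list; the function name `stem_and_leaf` is illustrative, and the output layout only imitates the "crime" plot shown earlier.

```python
from collections import defaultdict

# The 18 measurements (mg/kg) listed on the slide.
data = [57, 55, 56, 56, 55, 56, 55, 51, 56,
        58, 60, 48, 32, 46, 58, 56, 51, 50]


def stem_and_leaf(values):
    """Map each stem (tens digit) to its sorted list of leaves (units digits)."""
    stems = defaultdict(list)
    for value in sorted(values):
        stem, leaf = divmod(value, 10)
        stems[stem].append(leaf)
    return dict(stems)


plot = stem_and_leaf(data)
# Print every stem between the smallest and largest, including empty ones.
for stem in range(min(plot), max(plot) + 1):
    leaves = "".join(str(leaf) for leaf in plot.get(stem, []))
    print(f"{len(leaves):5.2f}  {stem} . {leaves}")
```

Most of the leaves fall on stem 5, while the single value 32 sits alone on stem 3, well apart from the rest of the data.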
MEASURES OF CENTRAL TENDENCY
MODE
The mode of a set of measurements is the measurement that occurs most often (with
the highest frequency).
MODE = 1005
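The data behind the value 1005 are not reproduced in these slides, so as an illustration the mode can be computed for the mg/kg data listed earlier. This is a minimal sketch using Python's standard library; `multimode` returns every value that ties for the highest frequency.

```python
from statistics import multimode

# The mg/kg measurements listed earlier in these slides.
data = [57, 55, 56, 56, 55, 56, 55, 51, 56,
        58, 60, 48, 32, 46, 58, 56, 51, 50]

# multimode returns a list because a data set can have more than one mode.
modes = multimode(data)
print(modes)
```

Here 56 occurs five times, more often than any other value, so it is the only mode.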
MEDIAN
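The median of a set of measurements is defined to be the
middle value when the measurements are arranged from
lowest to highest. When the number of measurements is even,
the median is taken as the average of the two middle values.
SAT scores: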
95 86 78 90 62 73 89 92 84 76
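The SAT scores above can be used to work through the definition. The sketch below sorts the values and picks the middle one; because there are ten scores, it averages the two middle values, which is the usual convention for an even count.

```python
def median(values):
    """Middle value of the sorted data; mean of the two middle values if n is even."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


sat = [95, 86, 78, 90, 62, 73, 89, 92, 84, 76]
print(sorted(sat))  # 62 73 76 78 84 86 89 90 92 95
sat_median = median(sat)
print(sat_median)   # (84 + 86) / 2 = 85.0
```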
ARITHMETIC MEAN OR MEAN
$\bar{X}$ is the mean of a sample: $\bar{X} = \frac{1}{n}\sum_{i=1}^{n} X_i$
Find the mean for grouped data!
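For grouped data the individual values are unknown, so a common approach is to let each class midpoint stand in for the values in that class and weight it by the class frequency. The sketch below is an illustration only: the class intervals (60-69, 70-79, 80-89, 90-99) are an assumption made here, obtained by grouping the ten SAT scores from the median slide, and are not given in the slides.

```python
# Hypothetical grouping of the ten SAT scores from the median slide.
# Each entry: (lower limit, upper limit, frequency).
classes = [
    (60, 69, 1),  # 62
    (70, 79, 3),  # 73, 76, 78
    (80, 89, 3),  # 84, 86, 89
    (90, 99, 3),  # 90, 92, 95
]


def grouped_mean(classes):
    """Approximate the mean as sum(f * midpoint) / sum(f)."""
    total_freq = sum(freq for _, _, freq in classes)
    weighted_sum = sum(freq * (low + high) / 2 for low, high, freq in classes)
    return weighted_sum / total_freq


sat = [95, 86, 78, 90, 62, 73, 89, 92, 84, 76]
raw_mean = sum(sat) / len(sat)
approx_mean = grouped_mean(classes)
print(raw_mean, approx_mean)
```

In general the grouped mean only approximates the mean of the raw data; how close it comes depends on how well the midpoints represent the values in each class.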
MEASURES OF DATA DISPERSION
There is a joke that goes, "If a statistician had her hair on fire and her
feet in a block of ice, she would say that 'on the average' she felt good."
Of course, this is a silly example, but to what is this unfortunate
statistician referring? What is she ignoring?
The National Weather Service at O'Hare Airport (Chicago, IL)
reported the following temperature information for June 15, 1996.
Knowing this information, how would you dress for the day? 71 degrees is a fairly
comfortable mean temperature, but 57 can be a little on the chilly side and 84 is a
little warm. So, can you see how it is important to know more than just the mean of
a data set?
On this day, there was a range of 27 degrees Fahrenheit. The highest point was 84
and the lowest was 57. Thus, the range is found by subtracting 57 from 84.
MEASURES OF VARIABILITY
RANGE
The range of a set of measurements is defined to be the difference
between the largest and the smallest measurements of the set.
For grouped data, because we do not know the individual
measurements, the range is taken to be the difference between the
upper limit of the last interval and the lower limit of the first interval.
17 18 21 25 31 45 50 Range = 33
17 17 17 17 17 17 50 Range = 33
17 50 50 50 50 50 50 Range = 33
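The three data sets above share the same range even though their values are distributed very differently. A short Python sketch makes the point; the sample standard deviation is added here only as a second measure for comparison.

```python
from statistics import stdev

# The three data sets from the slide.
datasets = [
    [17, 18, 21, 25, 31, 45, 50],
    [17, 17, 17, 17, 17, 17, 50],
    [17, 50, 50, 50, 50, 50, 50],
]


def value_range(values):
    """Largest measurement minus smallest measurement."""
    return max(values) - min(values)


ranges = [value_range(d) for d in datasets]
for d, r in zip(datasets, ranges):
    print(d, "range =", r, "sample std dev =", round(stdev(d), 2))
```

The range depends only on the two extreme values, so it cannot distinguish between these three sets.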
COEFFICIENT OF VARIATION
Coefficient of variation measures the variability in the
values in a population relative to the magnitude of the
population mean.
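$CV = \frac{\sigma}{|\mu|}$, where $\sigma$ is the population standard deviation and $\mu$ is the population mean.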
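As a sketch, assuming the usual definition of the coefficient of variation as the population standard deviation divided by the absolute value of the population mean, the calculation looks like this in Python. The data are the first data set from the range slide, treated here as a whole population for illustration.

```python
from statistics import fmean, pstdev


def coefficient_of_variation(values):
    """Population standard deviation divided by |population mean| (assumed definition)."""
    mu = fmean(values)
    if mu == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return pstdev(values) / abs(mu)


data = [17, 18, 21, 25, 31, 45, 50]
cv = coefficient_of_variation(data)

# Rescaling every value (for example, changing units) leaves the CV unchanged.
cv_scaled = coefficient_of_variation([10 * x for x in data])
print(f"CV = {cv:.3f}, CV after rescaling = {cv_scaled:.3f}")
```

Because the CV is a ratio, it has no units, which makes it useful for comparing the variability of data sets measured on different scales.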
INTERQUARTILE RANGE
The interquartile range (denoted $d_q$) measures the
spread of the middle part of the data:
$d_q = q_A - q_B$
where $q_A$ and $q_B$ are the upper quartile and the lower quartile.
Quartiles are the values that divide an ordered data set
into four parts containing equal numbers of observations.
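The interquartile range is the upper quartile q_A minus the lower quartile q_B. Textbooks and software use several slightly different rules for locating quartiles, and the slides do not say which one is intended. The sketch below therefore assumes one of them, the "inclusive" interpolation method of Python's `statistics.quantiles`, applied to the SAT scores from the median slide.

```python
from statistics import quantiles

sat = [95, 86, 78, 90, 62, 73, 89, 92, 84, 76]

# n=4 asks for the three cut points that split the data into four parts.
# method="inclusive" is an assumption; other quartile rules give slightly
# different values for small data sets.
q_b, q_2, q_a = quantiles(sat, n=4, method="inclusive")

d_q = q_a - q_b  # interquartile range
print(f"lower quartile = {q_b}, median = {q_2}, upper quartile = {q_a}")
print(f"interquartile range = {d_q}")
```

With this rule the lower quartile is 76.5, the upper quartile is 89.75, and the interquartile range is 13.25.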
NOTE:
If the measure of center is the mean, the measures of dispersion
used are S and the range.
If the measure of center is the median, the measures of dispersion
used are the range and the interquartile range.
EXTREME MEASUREMENTS (OUTLIERS)
Extreme measurements can be found as follows:
• Compute 1 step $= 1.5\,d_q = 1.5\,(q_A - q_B)$
• Outliers are measurements greater than $q_A$ + 1 step or
smaller than $q_B$ - 1 step
[Figure: box plot marking $q_B$, the median, and $q_A$, with one step extending from each end of the box]
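The outlier rule can be sketched in Python: compute the quartiles, take one step as 1.5 times the interquartile range, and flag anything beyond q_A + 1 step or below q_B - 1 step. The data are the mg/kg measurements listed earlier in these slides; the quartile rule (`method="inclusive"`) and the function name `find_outliers` are assumptions made for this illustration.

```python
from statistics import quantiles

# The mg/kg measurements listed earlier in these slides.
data = [57, 55, 56, 56, 55, 56, 55, 51, 56,
        58, 60, 48, 32, 46, 58, 56, 51, 50]


def find_outliers(values):
    """Return (outliers, lower fence, upper fence) using the 1.5-step rule."""
    # Quartile rule is an assumption; the slides do not specify one.
    q_b, _, q_a = quantiles(values, n=4, method="inclusive")
    step = 1.5 * (q_a - q_b)
    lower = q_b - step
    upper = q_a + step
    flagged = [v for v in values if v < lower or v > upper]
    return flagged, lower, upper


outliers, lower, upper = find_outliers(data)
print(f"fences: {lower} to {upper}; outliers: {outliers}")
```

With this quartile rule the quartiles are 51 and 56, one step is 7.5, and the only measurement outside the interval from 43.5 to 63.5 is 32.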
OPEN “BOX-PLOT” DATA
EXAMPLE PROBLEM
The data below come from a study that recorded the color of the precipitate
formed and the weight of that precipitate. The experiment was carried out
10 times.
No  Color  Weight
1   White  3.45
2   White  3.11
3   -      2.11
4   -      3.21
5   Black  2.00
6   Black  2.01
7   Black  1.52
8   Black  1.99
9   White  3.54
10  Black  1.28
Describe the precipitate color data and the precipitate weight data above,
and explain your results.
The precipitate color was not recorded for runs 3 and 4 (missing).
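One possible starting point for this exercise, offered only as a sketch and not as the intended solution: a frequency table for color (Putih = white, Hitam = black) that separates "percent" (out of all 10 runs) from "valid percent" (out of the 8 runs with a recorded color), followed by the mean and median weight. Missing colors are represented as `None`.

```python
from collections import Counter
from statistics import mean, median

# (run number, color, weight); None marks a missing color (runs 3 and 4).
runs = [
    (1, "White", 3.45), (2, "White", 3.11), (3, None, 2.11), (4, None, 3.21),
    (5, "Black", 2.00), (6, "Black", 2.01), (7, "Black", 1.52),
    (8, "Black", 1.99), (9, "White", 3.54), (10, "Black", 1.28),
]

colors = [color for _, color, _ in runs]
valid = [color for color in colors if color is not None]
color_counts = Counter(valid)
n_missing = len(colors) - len(valid)

print("Color   Count  Percent  Valid percent")
for color, count in color_counts.items():
    percent = 100 * count / len(colors)
    valid_percent = 100 * count / len(valid)
    print(f"{color:<8}{count:>5}{percent:>9.1f}{valid_percent:>15.1f}")
print(f"Missing {n_missing:>5}{100 * n_missing / len(colors):>9.1f}")

weights = [weight for _, _, weight in runs]
mean_weight = mean(weights)
median_weight = median(weights)
print(f"mean weight = {mean_weight:.3f}, median weight = {median_weight:.2f}")
```

Color is categorical, so it is described with counts and percentages; weight is continuous, so it is described with measures of center and, following the note on dispersion, a matching measure of spread.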
DESCRIPTION OF THE PRECIPITATE COLOR DATA