Statistika Week 3

Chapter 3 :
# MEASURES OF LOCATION
The Population Mean
Many studies involve all the values in a population. For example, there are 12 sales as-sociates
employed at the Reynolds Road Carpet Outlet. The mean amount of commission they earned last month
was $1,345. This is a population value because we considered the commission of all the sales associates.
Other examples of a population mean would be:
•The mean closing price for Johnson & Johnson stock for the last 5 days is $95.47.
• The mean number of hours of overtime worked last week by the six welders in the welding
department of Butts Welding Inc. is 6.45 hours.
• Caryn Tirsch began a website last month devoted to organic gardening. The mean number of hits on
her site for the 31 days in July was 84.36.
Population mean = Sum of all the values in the population / Number of values in the population
POPULATION MEAN μ = Σx / N
where:
μ represents the population mean. It is the Greek lowercase letter “mu.”
N is the number of values in the population.

x represents any particular value.
Σ is the Greek capital letter “sigma” and indicates the operation of adding.
Σx is the sum of the x values in the population.
Any measurable characteristic of a population is called a parameter. The mean of a population is an

example of a parameter.
PARAMETER A characteristic of a population.
The Sample Mean

For raw data—that is, ungrouped data—the mean is the sum of all the sampled values divided by the
total number of sampled values. To find the mean for a sample:
Sample mean = Sum of all the values in the sample / Number of values in the sample
The mean of a sample and the mean of a population are computed in the same way, but the shorthand
notation used is different. The formula for the mean of a sample is:
SAMPLE MEAN x = Σx / n
where:
x represents the sample mean. It is read “x bar.”
n is the number of values in the sample.
x represents any particular value.
Σ is the Greek capital letter “sigma” and indicates the operation of adding.
Σx is the sum of the x values in the sample.
The mean of a sample, or any other measure based on sample data, is called a statistic.
If the mean weight of a sample of 10 jars of Smucker’s orange marmalade is 11.5 ounces, this is an
example of a statistic.
STATISTIC : A characteristic of a sample.
Properties of the Arithmetic Mean
The arithmetic mean is a widely used measure of location. It has several important properties:
1. To compute a mean, the data must be measured at the interval or ratio level.
Recall from Chapter 1 that ratio-level data include such data as ages, incomes, and weights, with the
distance between numbers being constant.
2. All the values are included in computing the mean.
3. The mean is unique. That is, there is only one mean in a set of data. Later in the chapter, we will
discover a measure of location that might appear twice, or more than twice, in a set of data.
4. The sum of the deviations of each value from the mean is zero. Expressed symbolically:
Σ(x − x) = 0
As an example, the mean of 3, 8, and 4 is 5. Then:
Σ(x − x) = (3 − 5) + (8 − 5) + (4 − 5)
= −2 + 3 − 1
=0
The Median
The center for such data is better described by a measure of location called the median.
MEDIAN : The midpoint of the values after they have been ordered from the minimum to the maximum
values.
Gambar
Note that there is the same number of prices below the median of $70,000 as above it. The median is,
therefore, unaffected by extremely low or high prices. Had the highest price been $90,000, or $300,000,
or even $1 million, the median price would still be $70,000. Likewise, had the lowest price been $20,000
or $50,000, the median price would still be $70,000.
In the previous illustration, there are an odd number of observations (five). How is the median
determined for an even number of observations? As before, the observations are ordered. Then by
convention to obtain a unique value we calculate the mean of the two middle observations. So for an
even number of observations, the median may not be one of the given values.
The major properties of the median are:
1. It is not affected by extremely large or small values. Therefore, the median is a valuable measure of
location when such values do occur.
2. It can be computed for ordinal-level data or higher. Recall from Chapter 1 that ordinal-level data can
be ranked from low to high.
The Mode
MODE : The value of the observation that appears most frequently.
The Relative Positions of the Mean, Median, and Mode

Gambar
If a distribution is nonsymmetrical, or skewed, the relationship among the three measures changes. In a
positively skewed distribution, such as the distribution of weekly income in Chart 3–3, the arithmetic
mean is the largest of the three measures. Why? Because the mean is influenced more than the median
or mode by a few extremely high values. The median is generally the next largest measure in a positively
skewed frequency distribution. The mode is the smallest of the three measures.
If the distribution is highly skewed, the mean would not be a good measure to use. The median and
mode would be more representative.
Gambar
Conversely, if a distribution is negatively skewed, such as the distribution of tensile strength in Chart 3–
4, the mean is the lowest of the three measures. The mean is, of course, influenced by a few extremely
low observations. The median is greater than the arithmetic mean, and the modal value is the largest of
the three measures. Again, if the distribution is highly skewed, the mean should not be used to
represent the data.
Gambar
Software Solution
We can use a statistical software package to find many measures of location.
# THE WEIGHTED MEAN
Xw = w1x1 + w2x2 + w3x3 + … + wnxn / w1 + w2 + w3 + … + wn
This may be shortened to:
xw = Σ(wx) / Σw
Note that the denominator of a weighted mean is always the sum of the weights.
# THE GEOMETRIC MEAN
GM = √ n (x1)(x2) … (xn) liat di pdf
The geometric mean will always be less than or equal to (never more than) the arithmetic mean. Also, all
the data values must be positive.
As an example of the geometric mean, suppose you receive a 5% increase in salary this year and a 15%
increase next year. The average annual percent increase is 9.886%, not 10.0%. Why is this so? We begin
by calculating the geometric mean. Recall, for example, that a 5% increase in salary is 105%. We will
write it as 1.05.
GM = √(1.05)(1.15) = 1.09886
This can be verified by assuming that your monthly earning was $3,000 to start and you received two
increases of 5% and 15%.
Raise 1 = $3,000(.05) = $150.00

Raise 2 = $3,150(.15) = 472.50
Total. $622.50
Your total salary increase is $622.50. This is equivalent to:
$3,000.00(.09886) = $296.59
$3,296.58(.09886) = 325.91
$622.50
A second application of the geometric mean is to find an average percentage change over a period of
time. For example, if you earned $45,000 in 2004 and $100,000 in 2016, what is your annual rate of
increase over the period? It is 6.88%. The rate of increase is determined from the following formula.
RATE OF INCREASE OVER TIME
GM = √n Value at end of period / Value at start of period −1
n is the number of periods
# WHY STUDY DISPERSION?
A measure of location, such as the mean, median, or mode, only describes the center of the data. It is
valuable from that standpoint, but it does not tell us anything about the spread of the data.
Range
Range = Maximum value − Minimum value
Variance
The arithmetic mean of the squared deviations from the mean.
Variance = Σ(x − μ)² / N
Population Variance
POPULATION VARIANCE σ² = Σ(x − μ)² / N
where:
σ² is the population variance (σ is the lowercase Greek letter sigma). It is read as “sigma squared.”
x is the value of a particular observation in the population.
μ is the arithmetic mean of the population.
N is the number of observations in the population.
The process for computing the variance is implied by the formula.
1. Begin by finding the mean.
2. Find the difference between each observation and the mean, and square that difference.
3. Sum all the squared differences.
4. Divide the sum of the squared differences by the number of items in the population.
So the population variance is the mean of the squared difference between each value and the mean. For
populations whose values are near the mean, the variance will be small. For populations whose values
are dispersed from the mean, the population variance will be large.
The variance overcomes the weakness of the range by using all the values in the population, whereas
the range uses only the maximum and minimum values. We over-come the issue where Σ(x − μ) = 0 by
squaring the differences. Squaring the differences will always result in nonnegative values. The following
is another example that illustrates the calculation and interpretation of the variance.
Population Standard Deviation
σ = √ Σ(x − μ)² / N
Sample Variance and Standard Deviation
Sample variance : s² = Σ(x − x )² / n − 1
where:
s² is the sample variance.
x is the value of each observation in the sample.
x is the mean of the sample.
n is the number of observations in the sample.
SAMPLE STANDARD DEVIATION : s = √ Σ(x − x )² / n − 1
# INTERPRETATION AND USES OF THE STANDARD DEVIATION

CHEBYSHEV’S THEOREM : For any set of observations (sample or population), the proportion of the
values that lie within k standard deviations of the mean is at least 1 – 1/k2, where k is any value greater
than 1.
The EMPIRICAL RULE : For a symmetrical, bell-shaped frequency distribution, approximately 68% of the
observations will lie within plus and minus one standard deviation of the mean; about 95% of the
observations will lie within plus and minus two standard deviations of the mean; and practically all
(99.7%) will lie within plus and minus three standard deviations of the mean.
# THE MEAN AND STANDARD DEVIATION OF GROUPED DATA
ARITHMETIC MEAN OF GROUPED DATA
x = ΣfM / n
where:
x is the sample mean.
M is the midpoint of each class.
f is the frequency in each class.
fM is the frequency in each class times the midpoint of the class.
Σfm is the sum of these products.
n is the total number of frequencies.
STANDARD DEVIATION, GROUPED DATA s = √
Σf(M − x )² / n − 1 (3–12)
where:
s is the sample standard deviation.
M is the midpoint of the class.
f is the class frequency.

n is the number of observations in the sample.
x is the sample mean.
# UKURAN LOKASI
Rata-rata Populasi
Banyak penelitian yang melibatkan semua nilai dalam suatu populasi. Misalnya, ada 12 sales as-sociate
yang dipekerjakan di Outlet Karpet Jalan Reynolds. Jumlah rata-rata komisi yang mereka peroleh bulan
lalu adalah $ 1,345. Ini adalah nilai populasi karena kami mempertimbangkan komisi dari semua rekanan
penjualan. Contoh lain dari mean populasi adalah:
• Harga penutupan rata-rata untuk saham Johnson & Johnson selama 5 hari terakhir adalah $ 95,47.
• Jumlah rata-rata jam kerja lembur minggu lalu oleh enam tukang las di departemen pengelasan Butts
Welding Inc. adalah 6,45 jam.
• Caryn Tirsch memulai situs web bulan lalu yang dikhususkan untuk berkebun organik. Jumlah rata-rata
klik di situsnya selama 31 hari di bulan Juli adalah 84,36.
Populasi mean = Jumlah semua nilai dalam populasi / Jumlah nilai dalam populasi
BERARTI POPULASI μ = Σx / N
dimana:
μ mewakili rata-rata populasi. Ini adalah huruf kecil Yunani "mu."
N adalah banyaknya nilai dalam populasi.
x mewakili nilai tertentu.
Σ adalah huruf kapital Yunani "sigma" dan menunjukkan operasi penjumlahan.
Σx adalah jumlah dari nilai x dalam populasi.

Setiap karakteristik populasi yang dapat diukur disebut parameter. Rata-rata populasi adalah contoh
parameter.
PARAMETER: Karakteristik populasi.
Rata-Rata Sampel
Untuk data mentah — yaitu, data yang tidak dikelompokkan — mean adalah jumlah dari semua nilai
sampel dibagi dengan jumlah total nilai sampel. Untuk menemukan mean sampel:
Sample mean = Jumlah semua nilai dalam sampel / Jumlah nilai dalam sampel
Rata-rata sampel dan mean populasi dihitung dengan cara yang sama, tetapi notasi singkatan yang
digunakan berbeda. Rumus mean sampel adalah:
SARANA SAMPEL x = Σx / n
dimana:
x mewakili mean sampel. Bunyinya "x bar".
n adalah jumlah nilai dalam sampel.
x mewakili nilai tertentu.
Σ adalah huruf kapital Yunani "sigma" dan menunjukkan operasi penjumlahan.
Σx adalah jumlah dari nilai x dalam sampel.
Rata-rata sampel, atau ukuran lain apa pun berdasarkan data sampel, disebut statistik.
Jika berat rata-rata sampel 10 toples selai jeruk Smucker adalah 11,5 ons, ini adalah contoh statistik.
STATISTIK: Karakteristik sampel.
Sifat Rata-Rata Aritmatika

Rata-rata aritmatika adalah ukuran lokasi yang banyak digunakan. Ini memiliki beberapa sifat penting:
1. Untuk menghitung mean, data harus diukur pada interval atau level rasio.
Ingat kembali dari Bab 1 bahwa data tingkat rasio mencakup data seperti usia, pendapatan, dan bobot,
dengan jarak antar angka yang konstan.
2. Semua nilai dimasukkan dalam menghitung mean.
3. Maksudnya unik. Artinya, hanya ada satu mean dalam satu set data. Nanti di bab ini, kita akan
menemukan ukuran lokasi yang mungkin muncul dua kali, atau lebih dari dua kali, dalam satu set data.
4. Jumlah deviasi setiap nilai dari mean adalah nol. Dinyatakan secara simbolis:
Σ (x - x) = 0
Sebagai contoh mean dari 3, 8, dan 4 adalah 5. Maka:
Σ (x - x) = (3 - 5) + (8 - 5) + (4 - 5)
= −2 + 3-1
=0
Median
Pusat untuk data semacam itu lebih baik dijelaskan dengan ukuran lokasi yang disebut median.
MEDIAN: Titik tengah nilai setelah diurutkan dari nilai minimum hingga maksimum.
Gambar
Perhatikan bahwa ada jumlah harga yang sama di bawah median $ 70.000 seperti di atasnya. Oleh
karena itu, median tidak terpengaruh oleh harga yang sangat rendah atau tinggi. Seandainya harga
tertinggi adalah $ 90.000, atau $ 300.000, atau bahkan $ 1 juta, harga rata-rata tetap $ 70.000. Demikian
juga, jika harga terendah adalah $ 20.000 atau $ 50.000, harga rata-rata tetap $ 70.000.
Pada ilustrasi sebelumnya, jumlah pengamatan ganjil (lima). Bagaimana median ditentukan untuk
jumlah pengamatan genap? Seperti sebelumnya, observasi diatur. Kemudian berdasarkan kesepakatan
untuk mendapatkan nilai unik kami menghitung rata-rata dari dua pengamatan tengah. Jadi untuk
jumlah pengamatan genap, median mungkin bukan salah satu nilai yang diberikan.
Properti utama dari median adalah:

1. Itu tidak terpengaruh oleh nilai yang sangat besar atau kecil. Oleh karena itu, median adalah ukuran
lokasi yang berharga ketika nilai tersebut benar-benar terjadi.
2. Dapat dihitung untuk data tingkat ordinal atau lebih tinggi. Ingat kembali dari Bab 1 bahwa data
tingkat ordinal dapat diberi peringkat dari rendah ke tinggi.
Mode
MODE: Nilai observasi yang paling sering muncul.
Posisi Relatif dari Mean, Median, dan Mode
Gambar
Jika distribusi tidak simetris, atau miring, hubungan di antara ketiga ukuran tersebut berubah. Dalam
distribusi miring positif, seperti distribusi pendapatan mingguan pada Bagan 3–3, rata-rata aritmatika
adalah yang terbesar dari ketiga ukuran. Mengapa? Karena mean dipengaruhi lebih dari median atau
mode oleh beberapa nilai yang sangat tinggi. Median umumnya merupakan ukuran terbesar berikutnya
dalam distribusi frekuensi miring positif. Modusnya adalah yang terkecil dari tiga ukuran.
Jika distribusinya sangat miring, mean tidak akan menjadi ukuran yang baik untuk digunakan. Median
dan mode akan lebih representatif.
Gambar
Sebaliknya, jika distribusi miring negatif, seperti distribusi kekuatan tarik pada Bagan 3–4, mean adalah
yang terendah dari ketiga ukuran tersebut. Artinya, tentu saja, dipengaruhi oleh beberapa observasi
yang sangat rendah. Median lebih besar dari rata-rata aritmatika, dan nilai modal adalah yang terbesar
dari ketiga ukuran. Sekali lagi, jika distribusinya sangat miring, mean tidak boleh digunakan untuk
merepresentasikan data.
Gambar
Solusi Perangkat Lunak

Kita dapat menggunakan paket perangkat lunak statistik untuk menemukan banyak ukuran lokasi.
# SARANA BERBATAS
Xw = w1x1 + w2x2 + w3x3 +… + wnxn / w1 + w2 + w3 +… + wn
Ini dapat disingkat menjadi:
xw = Σ (wx) / Σw
Perhatikan bahwa penyebut dari rata-rata tertimbang selalu merupakan jumlah dari bobot.
# SARANA GEOMETRIK
GM = √ n (x1) (x2)… (xn) liat di pdf
Rata-rata geometrik akan selalu kurang dari atau sama dengan (tidak pernah lebih dari) rata-rata
aritmatika. Juga, semua nilai data harus positif.
Sebagai contoh rata-rata geometris, misalkan Anda menerima kenaikan gaji 5% tahun ini dan kenaikan
15% tahun depan. Peningkatan persen tahunan rata-rata adalah 9,886%, bukan 10,0%. Mengapa
demikian? Kami mulai dengan menghitung rata-rata geometris. Ingatlah, misalnya, kenaikan gaji 5%
adalah 105%. Kami akan menuliskannya sebagai 1.05.
GM = √ (1,05) (1,15) = 1,09886
Ini dapat diverifikasi dengan mengasumsikan bahwa penghasilan bulanan Anda adalah $ 3.000 untuk
memulai dan Anda menerima dua kenaikan sebesar 5% dan 15%.
Naikkan 1 = $ 3.000 (.05) = $ 150.00
Naikkan 2 = $ 3.150 (.15) = 472,50
Total. $ 622,50
Kenaikan gaji total Anda adalah $ 622.50. Ini sama dengan:
$ 3.000,00 (.09886) = $ 296,59
$ 3.296,58 (.09886) = 325,91
$ 622,50
Penerapan kedua dari rata-rata geometris adalah menemukan persentase perubahan rata-rata selama
periode waktu tertentu. Misalnya, jika Anda memperoleh $ 45.000 pada tahun 2004 dan $ 100.000 pada
tahun 2016, berapa tingkat kenaikan tahunan Anda selama periode tersebut? Itu 6,88%. Tingkat
kenaikan ditentukan dari rumus berikut.
TINGKAT PENINGKATAN SELAMA WAKTU
GM = √n Nilai di akhir periode / Nilai di awal periode −1
n adalah jumlah periode
# MENGAPA STUDI DISPERSI?
Ukuran lokasi, seperti mean, median, atau mode, hanya menjelaskan pusat data. Ini berharga dari sudut
pandang itu, tetapi tidak memberi tahu kami apa pun tentang penyebaran data.
Jarak
Range = Nilai maksimum - Nilai minimum
Perbedaan
Rata-rata aritmatika dari deviasi kuadrat dari rata-rata.

Varians = Σ (x - μ) ² / N
Varians Populasi
VARIANSI POPULASI σ² = Σ (x - μ) ² / N
dimana:
σ² adalah varian populasi (σ adalah huruf kecil Yunani sigma). Dibaca sebagai "sigma kuadrat".
x adalah nilai observasi tertentu dalam populasi.
μ adalah rata-rata aritmatika dari populasi.
N adalah jumlah observasi dalam populasi.
Proses untuk menghitung varians disiratkan oleh rumus.
1. Mulailah dengan mencari mean.
2. Temukan perbedaan antara setiap observasi dan mean, dan kuadratkan perbedaan itu.
3. Jumlahkan semua selisih kuadrat.
4. Bagilah jumlah selisih kuadrat dengan jumlah item dalam populasi.
Jadi varians populasi adalah mean dari selisih kuadrat antara setiap nilai dan mean. Untuk populasi yang
nilainya mendekati mean, variansnya akan kecil. Untuk populasi yang nilainya tersebar dari mean,
varians populasi akan besar.
Varians mengatasi kelemahan range dengan menggunakan semua nilai dalam populasi, sedangkan range
hanya menggunakan nilai maksimum dan minimum. Kami mengatasi masalah di mana Σ (x - μ) = 0
dengan mengkuadratkan perbedaan. Menguadratkan perbedaan akan selalu menghasilkan nilai non-
negatif. Berikut ini adalah contoh lain yang menggambarkan perhitungan dan interpretasi varians.
Deviasi Standar Populasi
σ = √ Σ (x - μ) ² / N
Varians Sampel dan Deviasi Standar
Varians sampel: s² = Σ (x - x) ² / n - 1
dimana:
s² adalah varian sampel.
x adalah nilai setiap observasi dalam sampel.
x adalah mean sampel.
n adalah jumlah observasi dalam sampel.
CONTOH DEVIASI STANDAR: s = √ Σ (x - x) ² / n - 1
# INTERPRETASI DAN PENGGUNAAN DEVIASI STANDAR
Teorema CHEBYSHEV: Untuk kumpulan pengamatan apa pun (sampel atau populasi), proporsi nilai yang
berada dalam k deviasi standar rata-rata minimal 1 - 1 / k2, di mana k adalah nilai yang lebih besar dari
1.
ATURAN EMPIRIS: Untuk distribusi frekuensi berbentuk lonceng yang simetris, sekitar 68% pengamatan
akan berada dalam plus dan minus satu standar deviasi dari mean; sekitar 95% pengamatan akan berada
dalam plus dan minus dua deviasi standar dari mean; dan secara praktis semua (99,7%) akan berada
dalam plus dan minus tiga deviasi standar dari mean.
# SARANA DAN DEVIASI STANDAR DATA YANG DIKELOMPOKKAN

ARITIMETIS ARTI DATA YANG DIKELOMPOKKAN
x = ΣfM / n
dimana:
M adalah titik tengah setiap kelas.
f adalah frekuensi di setiap kelas.
fM adalah frekuensi di setiap kelas dikalikan titik tengah kelas.
Σfm adalah jumlah dari produk ini.
n adalah jumlah frekuensi.
DEVIASI STANDAR, DATA YANG DIKELOMPOKKAN s = √
Σf (M - x) ² / n - 1 (3–12)
dimana:
s adalah deviasi standar sampel.
M adalah titik tengah kelas.
f adalah frekuensi kelas.
n adalah jumlah observasi dalam sampel.

Statistika Week 3

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Statistika Week 3

Uploaded by

Copyright:

Available Formats

Chapter 3 :

The Population Mean

N is the number of values in the population.

Any measurable characteristic of a population is called a parameter. The mean of a population is an

The Sample Mean

Sample mean = Sum of all the values in the sample / Number of values in the sample

STATISTIC : A characteristic of a sample.

Properties of the Arithmetic Mean

2. All the values are included in computing the mean.

The major properties of the median are:

MODE : The value of the observation that appears most frequently.

The Relative Positions of the Mean, Median, and Mode

We can use a statistical software package to find many measures of location.

# THE WEIGHTED MEAN

Xw = w1x1 + w2x2 + w3x3 + … + wnxn / w1 + w2 + w3 + … + wn

This may be shortened to:

# THE GEOMETRIC MEAN

GM = √ n (x1)(x2) … (xn) liat di pdf

Raise 1 = $3,000(.05) = $150.00

Your total salary increase is $622.50. This is equivalent to:

RATE OF INCREASE OVER TIME

# WHY STUDY DISPERSION?

Range = Maximum value − Minimum value

The arithmetic mean of the squared deviations from the mean.

Variance = Σ(x − μ)² / N

POPULATION VARIANCE σ² = Σ(x − μ)² / N

x is the value of a particular observation in the population.

μ is the arithmetic mean of the population.

N is the number of observations in the population.

The process for computing the variance is implied by the formula.

1. Begin by finding the mean.

Population Standard Deviation

Sample Variance and Standard Deviation

Sample variance : s² = Σ(x − x )² / n − 1

s² is the sample variance.

x is the value of each observation in the sample.

x is the mean of the sample.

n is the number of observations in the sample.

SAMPLE STANDARD DEVIATION : s = √ Σ(x − x )² / n − 1

# INTERPRETATION AND USES OF THE STANDARD DEVIATION

# THE MEAN AND STANDARD DEVIATION OF GROUPED DATA

ARITHMETIC MEAN OF GROUPED DATA

x is the sample mean.

M is the midpoint of each class.

f is the frequency in each class.

fM is the frequency in each class times the midpoint of the class.

Σfm is the sum of these products.

n is the total number of frequencies.

STANDARD DEVIATION, GROUPED DATA s = √

s is the sample standard deviation.

M is the midpoint of the class.

f is the class frequency.

x is the sample mean.

μ mewakili rata-rata populasi. Ini adalah huruf kecil Yunani "mu."

N adalah banyaknya nilai dalam populasi.

x mewakili nilai tertentu.

Σ adalah huruf kapital Yunani "sigma" dan menunjukkan operasi penjumlahan.

Σx adalah jumlah dari nilai x dalam populasi.

PARAMETER: Karakteristik populasi.

x mewakili mean sampel. Bunyinya "x bar".

n adalah jumlah nilai dalam sampel.

x mewakili nilai tertentu.

Σ adalah huruf kapital Yunani "sigma" dan menunjukkan operasi penjumlahan.

Σx adalah jumlah dari nilai x dalam sampel.