
QUANTITATIVE RESEARCH METHODS

Part VI

VALIDITY AND RELIABILITY
OF MEASUREMENT

Validity and Reliability


METHODOLOGICAL GOODNESS / QUALITY CRITERIA

Constructivism
• Trustworthiness
• Authenticity

Critical
• Historical situatedness
• Wholeness

Classical
• Objectivity
• External validity
• Internal validity
  - Measurement validity and reliability
  - Design and analysis validity

MEASUREMENT RESULTS:
variances / score differences

SOURCES OF VARIANCE / SCORE DIFFERENCES
(Singleton, 1988, p. 112; Selltiz et al., pp. 164-169)

TRUE DIFFERENCES
Differences in the concept the measure is intended to measure (Singleton, 1988); differences in the characteristics we are attempting to measure (Selltiz, 1976).

RANDOM ERRORS
Measurement errors due to random or chance factors (Singleton, 1988); or transient aspects of the person, of the situation of measurement, or of the measurement procedures that are likely to vary from one act of measurement to the next, even though the characteristic we are trying to measure has not changed (Selltiz, 1976).

SYSTEMATIC ERRORS
Biases inherent in the method or operational definition (Singleton, 1988); an error introduced into the measurement by some factor that systematically affects the characteristics being measured or the process of measurement (Selltiz et al., 1976).

MEASUREMENT ERRORS

RANDOM ERRORS
• Errors that occur at random, owing to the conditions, the process, or procedural variations of the measurement carried out.
• They bias the result randomly, in any of several possible directions.

RELIABILITY
The degree to which a measurement yields consistent results (over time, across observers, across indicators, etc.).
The consistency of a measure (Bailey, 1987)

SYSTEMATIC ERRORS
• Errors that occur systematically, arising among other things from factors inherent in the measuring instrument or in the operational definition of the concept being measured.
• They bias the result in one particular direction.

VALIDITY
The degree to which a measurement really measures the concept it was originally intended to measure.
The degree to which a test measures what it purports to measure (Borg & Gall, 1971)
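The contrast between the two error types can be sketched with a small simulation. Everything in it is an assumption made for illustration: the true scores, the size of the random noise, and the constant bias of 3 points are hypothetical and do not come from the slides. Random error lowers the agreement between two administrations (a reliability problem) while leaving the average roughly unbiased; a constant systematic error leaves the two administrations perfectly consistent but shifts every score in one direction (a validity problem).

```python
import random
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation computed from raw sums."""
    n = len(x)
    sx, sy = sum(x), sum(y)
    sxx = sum(a * a for a in x)
    syy = sum(b * b for b in y)
    sxy = sum(a * b for a, b in zip(x, y))
    return (n * sxy - sx * sy) / sqrt((n * sxx - sx ** 2) * (n * syy - sy ** 2))

def mean(values):
    return sum(values) / len(values)

rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Hypothetical true scores for 10,000 respondents (assumption)
true_scores = [rng.gauss(50, 10) for _ in range(10_000)]

# Random error only: two administrations with independent noise
noisy_1 = [t + rng.gauss(0, 5) for t in true_scores]
noisy_2 = [t + rng.gauss(0, 5) for t in true_scores]

# Systematic error only: the instrument always reads 3 points too high
biased_1 = [t + 3 for t in true_scores]
biased_2 = [t + 3 for t in true_scores]

random_retest_r = pearson_r(noisy_1, noisy_2)    # consistency suffers
biased_retest_r = pearson_r(biased_1, biased_2)  # perfectly consistent
random_mean_error = mean(noisy_1) - mean(true_scores)   # close to zero
biased_mean_error = mean(biased_1) - mean(true_scores)  # close to +3

print(f"random error:     retest r = {random_retest_r:.2f}, mean error = {random_mean_error:+.2f}")
print(f"systematic error: retest r = {biased_retest_r:.2f}, mean error = {biased_mean_error:+.2f}")
```

The biased instrument in this sketch is an instance of "reliable but invalid": it agrees with itself perfectly and is still wrong by the same amount every time.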



RELATIONSHIP BETWEEN RELIABILITY AND VALIDITY
• A MEASURE CAN BE RELIABLE BUT INVALID
• RELIABILITY IS A NECESSARY BUT NOT SUFFICIENT CONDITION FOR VALIDITY

[Figure: four scatter patterns of measures around a target: LOW RELIABILITY / LOW VALIDITY; HIGH RELIABILITY / LOW VALIDITY; LOW RELIABILITY / HIGH VALIDITY???; HIGH RELIABILITY / HIGH VALIDITY]

RELIABILITY AND VALIDITY

[Figure: 2 x 2 grid of scatter patterns, with reliability (high, low) across the columns and validity (high, low) down the rows; ⊗ = actual parameter, • = measure]

VALIDITY AND RELIABILITY

MEASUREMENT QUALITY

VALIDITY
• CONTENT (pre-data validity)
  - FACE: in the judgment of others
  - SAMPLING: capture the entire dimensions
• CRITERION (data-based validity)
  - CONCURRENT: preexisting criterion
  - PREDICTIVE: predicted criterion
• CONSTRUCT (data-based validity)
  - CONVERGENT: similar construct
  - DIVERGENT: opposing construct

RELIABILITY
• STABILITY: over time
• EQUIVALENCE: over alternate / rater
• HOMOGENEITY: over indicators
  - internal consistency
  - unidimensionality

Sources: Allen and Yen (1979); Selltiz et al. (1986); Sekaran (1992)


VALIDITY ASSESSMENTS

CONTENT / APPARENT VALIDITY

FACE VALIDITY
. . . in the judgment of others:
Expert agreement on the extent to which the operational definition or indicators used by an instrument really measure the concept to be measured.

LOGICAL / SAMPLING VALIDITY
. . . capture the entire dimensions:
Expert agreement on the extent to which the operational definition or indicators of an instrument represent all dimensions of the concept being measured.

CRITERION-RELATED VALIDITY (pragmatic validity)

CONCURRENT VALIDITY
Correlated with a preexisting criterion; distinguishes objects that differ in their present status.
The extent to which the measurement results correlate with the measurement of another concept, or with a particular condition, that is assumed as the criterion (e.g., the ability of an Index of Democracy to place the United States and North Korea in different categories).

PREDICTIVE VALIDITY
Correlated with a future/predicted condition; distinguishes objects that will differ in the future.
The extent to which the measurement of a concept is able to predict a future state (example: the 2003 Tes Potensi Akademik, an academic aptitude test, and the 2005 cumulative grade point average).

CONSTRUCT VALIDITY (theory-related validity)

CONVERGENT VALIDITY
Positively correlated with similar constructs.
The extent to which the measurement of a concept is positively related to the measurement of another concept that, theoretically, should be the same (e.g., the Human Freedom Index and the Civil Liberties Index).

DISCRIMINANT VALIDITY
Uncorrelated or negatively correlated with an opposing construct.
The extent to which the measurement of a concept differs from (is uncorrelated or negatively correlated with) the measurement of another concept that, theoretically, should be different (example: a Job Satisfaction Index and a Labor Turnover Index).


RELIABILITY
The consistency of a measure (Bailey, 1987)

STABILITY: OVER-TIME RELIABILITY
Consistency of measurement results from one point in time to another.

EQUIVALENCE: OVER-RATERS / ALTERNATE RELIABILITY
Consistency between the results of a measurement and those of another, similar measurement, or those of a measurement made by another observer (using the same instrument).

HOMOGENEITY: OVER-INDICATORS RELIABILITY
Consistency of measurement results across the indicators within a measuring instrument.
• internal consistency (summative rating scales)
• unidimensionality (cumulative rating scales)


STABILITY ESTIMATES:
OVER-TIME RELIABILITY

Pearson's r Correlation Coefficient

SUBJECTS   TEST-1 (X)   TEST-2 (Y)   X²       Y²        XY
A          4.0          3.75         16.00    14.0625   15.000
B          3.0          3.00          9.00     9.0000    9.000
C          3.5          3.25         12.25    10.5625   11.375
D          0.5          1.75          0.25     3.0625    0.875
E          1.0          2.00          1.00     4.0000    2.000
F          1.5          2.25          2.25     5.0625    3.375
G          2.5          3.00          6.25     9.0000    7.500

N = 7      ∑X = 16.0    ∑Y = 19.00   ∑X² = 47.00   ∑Y² = 54.75   ∑XY = 49.125

Pearson's r Correlation Coefficient

$$ r_{xy} = \frac{N\sum xy - (\sum x)(\sum y)}{\sqrt{[\,N\sum x^2 - (\sum x)^2\,]\,[\,N\sum y^2 - (\sum y)^2\,]}} \approx 0.99 $$

Statistical significance

α = 0.05; df = n - 2 = 5; critical value r_c(0.05; 5) = 0.754 (two-tailed)
r_xy ≈ 0.99 > r_c(0.05; 5): significant (reject H0)
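The test-retest computation can be checked with a few lines of Python. This is a minimal sketch that takes the X and Y scores as listed in the rows for subjects A to G. The critical t value of 2.571 (two-tailed, α = 0.05, df = 5) is an assumption taken from a standard t table rather than from the slide; it is converted into a critical r with r_c = t_c / sqrt(t_c² + df).

```python
from math import sqrt

# Test and retest scores for subjects A-G, as listed in the table rows
test_1 = [4.0, 3.0, 3.5, 0.5, 1.0, 1.5, 2.5]         # X
test_2 = [3.75, 3.00, 3.25, 1.75, 2.00, 2.25, 3.00]  # Y

n = len(test_1)
sum_x, sum_y = sum(test_1), sum(test_2)
sum_x2 = sum(v * v for v in test_1)
sum_y2 = sum(v * v for v in test_2)
sum_xy = sum(a * b for a, b in zip(test_1, test_2))

# Raw-score formula for Pearson's r, with the square root in the denominator
r_xy = (n * sum_xy - sum_x * sum_y) / sqrt(
    (n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2)
)

# Critical r derived from an assumed two-tailed critical t (alpha = 0.05, df = 5)
df = n - 2
t_crit = 2.571
r_crit = t_crit / sqrt(t_crit ** 2 + df)

print(f"sums: X={sum_x}, Y={sum_y}, X2={sum_x2}, Y2={sum_y2}, XY={sum_xy}")
print(f"r = {r_xy:.3f}, critical r = {r_crit:.3f}, significant = {r_xy > r_crit}")
```

Because r does not change when a variable is multiplied by a constant, rescaling the TEST-1 scores (for example recording them on a 0-40 scale) would change the sums but not the coefficient.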

HOMOGENEITY ESTIMATES:
INTERNAL CONSISTENCY
(summative rating scales)

• Spearman-Brown coefficient
Split-half reliability coefficient

$$ r_{X_1X_2} = \frac{2\,r_{Y_1Y_2}}{1 + r_{Y_1Y_2}} $$

r_{X_1X_2} = reliability of the whole measure
r_{Y_1Y_2} = correlation between the scores on halves 1 and 2

• Cronbach's α (for two halves)

$$ r_{X_1X_2} = \frac{2\,[\,s^2 - (s^2_{Y_1} + s^2_{Y_2})\,]}{s^2} $$

r_{X_1X_2} = reliability of the whole measure
s^2 = variance of the total scores (both halves combined)
s^2_{Y_1} = variance of the scores on half 1
s^2_{Y_2} = variance of the scores on half 2
HOMOGENEITY ESTIMATES:
INTERNAL CONSISTENCY

Pearson's r Correlation Coefficient

$$ r_{x_1x_2} = \frac{N\sum x_1x_2 - (\sum x_1)(\sum x_2)}{\sqrt{[\,N\sum x_1^2 - (\sum x_1)^2\,]\,[\,N\sum x_2^2 - (\sum x_2)^2\,]}} $$

Spearman-Brown coefficient

$$ r_{xy} = \frac{2\,r_{x_1x_2}}{1 + r_{x_1x_2}} $$

COUNTRY   HFI score, half 1 (X1)   HFI score, half 2 (X2)   X1²   X2²   X1X2
A         14                       14
B         15                       14
C          4                        5
D         13                       12
E         12                       14
F          3                        4
G         14                       14
H         15                       14
I          4                        5
J         13                       12
K         12                       14
L          3                        4
                                                            ∑X1²  ∑X2²  ∑X1X2
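The empty columns of the split-half table can be filled in by machine. The sketch below takes the twelve pairs of half scores for countries A to L from the table, computes the correlation between the halves, and steps it up with the Spearman-Brown formula. It also computes the variance-based two-half coefficient, on the assumption that the s² in that formula is the variance of the total (half 1 + half 2) scores; population variances are used, which does not affect the ratio.

```python
from math import sqrt
from statistics import pvariance

# HFI half scores for countries A-L, copied from the table
half_1 = [14, 15, 4, 13, 12, 3, 14, 15, 4, 13, 12, 3]  # X1
half_2 = [14, 14, 5, 12, 14, 4, 14, 14, 5, 12, 14, 4]  # X2

n = len(half_1)
sum_x1, sum_x2 = sum(half_1), sum(half_2)
sum_x1_sq = sum(v * v for v in half_1)
sum_x2_sq = sum(v * v for v in half_2)
sum_x1x2 = sum(a * b for a, b in zip(half_1, half_2))

# Pearson correlation between the two halves (raw-score formula)
r_halves = (n * sum_x1x2 - sum_x1 * sum_x2) / sqrt(
    (n * sum_x1_sq - sum_x1 ** 2) * (n * sum_x2_sq - sum_x2 ** 2)
)

# Spearman-Brown step-up: reliability of the full-length scale
spearman_brown = 2 * r_halves / (1 + r_halves)

# Variance-based two-half coefficient (s^2 assumed to be total-score variance)
totals = [a + b for a, b in zip(half_1, half_2)]
s2_total = pvariance(totals)
alpha_two_halves = 2 * (s2_total - (pvariance(half_1) + pvariance(half_2))) / s2_total

print(f"sum X1^2 = {sum_x1_sq}, sum X2^2 = {sum_x2_sq}, sum X1X2 = {sum_x1x2}")
print(f"r between halves = {r_halves:.4f}")
print(f"Spearman-Brown   = {spearman_brown:.4f}")
print(f"two-half alpha   = {alpha_two_halves:.4f}")
```

The two full-scale estimates are close but not identical, which is expected: Spearman-Brown assumes the halves have equal variances, while the variance-based coefficient does not.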



HOMOGENEITY ESTIMATES:
UNIDIMENSIONALITY
(cumulative rating scales)

COEFFICIENT OF REPRODUCIBILITY:
. . . the percentage of original responses that could be reproduced by knowing the scale scores to summarize them (Babbie, 1992, p. 186).

Example: calculating the coefficient of reproducibility for a Social Distance scale

INDICATORS (YES = 1, NO = 0):
1. Willing to accept Asian immigrants as citizens
2. Willing to accept Asian immigrants as co-workers
3. Willing to marry an Asian immigrant

              Observation pattern   No. of        Index    Scale    Total scale
              1    2    3           respondents   scores   scores   errors
Scale types   1    0    0           10            1        1        0
              1    1    0            7            2        2        0
              1    1    1            2            3        3        0
              0    0    0            8            0        0        0
Mixed types   0    1    1            1            2        3        1
              1    0    1            4            2        3        4
              0    0    1            5            1        2        5
              1    0    1            7            2        3        7
Total                               44                              17

$$ \mathrm{CoR} = 1 - \frac{\text{number of errors}}{\text{number of cases} \times \text{number of items}} $$
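Applying the formula to the Social Distance table is a one-line calculation. The sketch below is only an illustration: it takes the respondent counts and total scale errors per response pattern from the table as given, and the variable names are made up for this example.

```python
# (response pattern on items 1-3, number of respondents, total scale errors)
rows = [
    ((1, 0, 0), 10, 0),  # scale types
    ((1, 1, 0), 7, 0),
    ((1, 1, 1), 2, 0),
    ((0, 0, 0), 8, 0),
    ((0, 1, 1), 1, 1),   # mixed types
    ((1, 0, 1), 4, 4),
    ((0, 0, 1), 5, 5),
    ((1, 0, 1), 7, 7),
]

n_items = 3
n_cases = sum(respondents for _, respondents, _ in rows)
n_errors = sum(errors for _, _, errors in rows)

# Coefficient of reproducibility: share of responses recoverable from scale scores
cor = 1 - n_errors / (n_cases * n_items)

print(f"{n_cases} cases, {n_errors} errors, CoR = {cor:.3f}")
# prints: 44 cases, 17 errors, CoR = 0.871
```

A common rule of thumb treats values of about 0.90 or higher as evidence that the items form a Guttman scale; by that convention, which is not stated in the slides, this example would fall slightly short.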



16 Sep 98 SPSS for MS WINDOWS Release 6.0 Page 6

RANKING INDONESIA: CIVIL LIBERTIES - POLITICAL RIGHTS - POLYARCHY - HUMAN FREEDOM
CNTRY RCIVLIB8 RPOLRIG8 RPOLY85 RHDI87 RHFI85
Mexico 43.0 41.0 18.0 29.0 36.0
SKorea 33.5 31.5 18.0 24.0 38.5
Thailand 33.5 41.0 8.5 38.0 38.5
India 33.5 31.5 8.5 61.0 38.5
Singapore 52.0 45.5 18.0 25.5 45.0
Egypt 43.0 50.5 18.0 55.0 45.0
Philippine 33.5 31.5 18.0 45.0 49.0
Malaysia 52.0 45.5 18.0 34.0 na
Chile 43.0 50.5 25.0 22.0 53.5
Bangladesh 52.0 45.5 25.0 67.0 58.0
Saudi 69.0 61.5 43.5 46.0 60.0
Indonesia 52.0 50.5 25.0 52.0 62.5
Pakistan 33.5 41.0 28.5 62.0 62.5
China 62.0 61.5 43.5 44.0 66.5
Libya 62.0 61.5 43.5 43.0 68.0

N = 71.0 71.0 45.0 71.0 71.0


Number of cases read: 71 Number of cases listed: 71

HUMAN FREEDOM INDEX

The HFI was constructed by Charles Humana (1986) as a Summative Rating Scale (Likert-type Scale) based on 40 indicators, among others the freedom to express opinions, to engage in opposition, and to choose a partner, freedom from torture, and so on.

RATING SCALE
4 = Respect for rights or freedoms
3 = Some violations or infringement
2 = Substantial violation
1 = Continuous or total denial

SAMPLE INDICATORS

Rights to:
03. Assembly

Freedom from:
07. Unlawful detention

Freedom for:
19. Political opposition

Personal rights:
40. Homosexuality

HUMAN FREEDOM INDEX (HFI): UNDP's annual report, the Human Development Report 1991, uses the Human Rights Rating data compiled by Charles Humana (World Human Rights Guide, 1986).


POLYARCHY INDEX
The Polyarchy Index was developed as a Cumulative Rating Scale (Guttman-type Scale). It measures political pluralism on the basis of the existence of a set of institutional arrangements that allows and guarantees public opposition and the right to participate in political processes:
. . . the set of institutional arrangements that permits public opposition and establishes the right to participate in politics (Coppedge and Reinicke, 1993, p. 47).

CONCEPT: POLYARCHY

DIMENSIONS AND INDICATORS

FREE & FAIR ELECTION
1. No meaningful elections are held
2. Marred by fraud and coercion
3. Meaningful fair elections

FREEDOM OF ORGANIZATION
1. All organizations are banned or controlled
2. Only nonpolitical organizations are allowed
3. Some independent political organizations are banned
4. Full freedom for political organization

FREEDOM OF EXPRESSION
1. All public dissent is suppressed
2. Some public dissent is suppressed
3. Full freedom of expression

AVAILABILITY OF ALTERNATIVE INFO SOURCES
1. No public alternative to official information
2. Alternative sources exist only for nonpolitical issues
3. There is preferential presentation of official views in the media
4. No preferential presentation of official views in the media

Compiled from: Coppedge and Reinicke (1993), "Measuring Polyarchy". In Inkeles, Alex (Ed.), On Measuring Democracy: Its Consequences and Concomitants. New Brunswick, London: Transaction Publishers; pp. 47-68.

POLITICAL RIGHTS - CIVIL LIBERTIES

Two indices that, in combination, are intended to determine the level of democratic life. A Summative Rating Scale (Likert-type Scale) developed by Raymond Gastil, based on 11 Political Rights indicators and 14 Civil Liberties indicators (see "The Comparative Survey of Freedom: Experiences and Suggestions". In Inkeles, Alex (Ed.), On Measuring Democracy: Its Consequences and Concomitants. New Brunswick, London: Transaction Publishers; pp. 21-46).

Sample Checklist for Political Rights


1. Chief authority recently elected by a meaningful
process
2. Legislature recently elected by a meaningful
process
Alternatives for 1 and 2:
a. No choice and possibility of rejection
b. No choice but some possibility of rejection
c. Government or single-party selected candidates
d. Choice possible only among government-
approved candidates
e. Relatively open choices possible only in local
elections
f. Open choice possible within a restricted range
g. Relatively open choices possible in all elections

Sample Checklist for Civil Liberties


17. Free from unjustified political terror or
imprisonment
18. Free trade unions, peasant organizations, or
equivalent
19. Free businesses or cooperatives
- - Correlation Coefficients - -

            HFI85     HDI87     CIVLIB88  POLRIG88  POLY85    GNP6588

HFI85       -

HDI87       .6130     -
            (  68)
            P= .000

CIVLIB88    .8142     .7174     -
            (  68)    (  71)
            P= .000   P= .000

POLRIG88    .7612     .6706     .9284     -
            (  68)    (  71)    (  71)
            P= .000   P= .000   P= .000

POLY85      .7102     .4652     .8295     .7830     -
            (  42)    (  45)    (  45)    (  45)
            P= .000   P= .001   P= .000   P= .000

GNP6588     .1118     .3144     .1674     .2306     .2514     -
            (  68)    (  71)    (  71)    (  71)    (  45)
            P= .364   P= .008   P= .163   P= .053   P= .096
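One way to read the matrix for convergent validity is to look at how strongly HFI85 correlates with indices of similar constructs, compared with GNP6588. The sketch below converts each reported r into a t statistic with n - 2 degrees of freedom. It assumes the figures in parentheses are the numbers of cases (N); the helper name `t_from_r` is made up for this example.

```python
from math import sqrt

def t_from_r(r, n):
    """t statistic for H0: rho = 0, with n - 2 degrees of freedom."""
    return r * sqrt((n - 2) / (1 - r ** 2))

# Correlations of HFI85 with the other indices, copied from the matrix: (r, N)
hfi_correlations = {
    "HDI87": (0.6130, 68),
    "CIVLIB88": (0.8142, 68),
    "POLRIG88": (0.7612, 68),
    "POLY85": (0.7102, 42),
    "GNP6588": (0.1118, 68),
}

for name, (r, n) in hfi_correlations.items():
    print(f"HFI85 x {name}: r = {r:.4f}, N = {n}, t = {t_from_r(r, n):.2f}")
```

The correlations with the civil liberties, political rights, and polyarchy indices yield large t values, while the correlation with GNP6588 gives a t below 1, in line with the P = .364 printed in the output.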
R E L I A B I L I T Y A N A L Y S I S - S C A L E (S P L I T)
N of Cases = 86.0 N of Items = 40
Guttman Split-half = .9791 Unequal-length Spearman-Brown = .9820
20 Items in part 1 20 Items in part 2

FACTORIAL VALIDITY
CONFIRMATORY FACTOR ANALYSIS

LIBERALISM - CONSERVATISM: A DUALISM CONCEPT

[Figure: liberalism and conservatism drawn as a DUALITY and as a DUALISM]

         FACTOR I   FACTOR II
CON1     .03513     .94766
CON2     .09007     .94082
CON3     .05609     .85410
LIB1     .81091     .50260
LIB2     .94348     .05173
LIB3     .94584     .10260

FACTOR
. . . a construct, a hypothetical entity, a latent variable that is assumed to underlie tests, scales, items, and indeed, measures of almost any kind (Kerlinger, 1986, p. 569).

PRINCIPLES TO INCREASE THE RELIABILITY OF MEASURES
(Neuman, 1997, pp. 147-148)

• Clearly conceptualize all constructs
• Increase the level of measurement
• Use multiple indicators of a variable
• Use pretests, pilot studies, and replication
ITEM ANALYSIS TO IMPROVE RELIABILITY
JOB SATISFACTION SCALE
(Adopted from Schuessler's Job Satisfaction Scale. In Miller, 1991, p. 453)

Response categories and scores: SDA = 5, DA = 4, N = 3, A = 2, SA = 1

Statements
1. There is too little variety in my job
2. I tend to get bored on the job
3. There must be better places to work
4. I would like more freedom on the job
5. I have too small a share in deciding matters that affect my work
6. My job means more to me than just money*
7. I am not satisfied with the work I do
8. My job gives me a chance to do what I do best*
9. People feel like they belong where I work*
*Positive statements
12 Oct 98 SPSS for MS WINDOWS Release 6.0 Page 1

                        Mean      Variance   Std Dev   N of Variables
Statistics for SCALE    31.4733   4.2743     2.0674    9

N of Cases = 129.0

JOB SATISFACTION SCALE

R E L I A B I L I T Y   A N A L Y S I S  -  S C A L E (A L P H A)

Reliability Coefficients

N of Cases = 129.0    N of Items = 9

Alpha = .6198

Item-total Statistics

       Scale Mean   Scale Variance   Corrected      Alpha
       if Item      if Item          Item-Total     if Item
       Deleted      Deleted          Correlation    Deleted

Q1     28.0275      3.5993            .1503         .6419
Q2     27.9570      3.3573            .5014         .5462
Q3     27.9612      3.2213            .5466         .5301
Q4     27.9756      2.4755            .7012         .4419
Q5     27.9054      3.2929            .3719         .5719
Q6     27.9399      3.2890            .5468         .5347
Q7     28.1163      3.2524            .5300         .5350
Q8     27.9008      4.5462           -.3015         .6782
Q9     28.0023      4.8057           -.4374         .7132
12 Oct 98 SPSS for MS WINDOWS Release 6.0 Page 5
13 Oct 98 SPSS for MS WINDOWS Release 6.0 Page 2

JOB SATISFACTION SCALE

R E L I A B I L I T Y   A N A L Y S I S  -  S C A L E (A L P H A)

Item-total Statistics

       Scale Mean   Scale Variance   Corrected      Alpha
       if Item      if Item          Item-Total     if Item
       Deleted      Deleted          Correlation    Deleted

Q2     17.4627      3.6783            .3779         .8343
Q3     17.4658      3.5845            .3960         .8326
Q4     17.4873      2.4132            .8262         .7372
Q5     17.4181      2.9834            .6503         .7829
Q6     17.4485      3.1811            .7433         .7700
Q7     17.6254      3.2917            .6072         .7938

Reliability Coefficients

N of Cases = 130.0    N of Items = 6

Alpha = .8241
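The logic behind the "Alpha if Item Deleted" column can be sketched in a few lines. The raw job satisfaction responses are not reproduced in the slides, so the data below are hypothetical: five respondents, four items, with item 4 deliberately scored in the opposite direction to the other three. The sketch also shows reverse-scoring as an alternative to dropping an item; whether the negative item-total correlations in the output come from unreversed positive statements is an assumption, not something the output states.

```python
from statistics import pvariance

def cronbach_alpha(rows):
    """Cronbach's alpha; rows holds one list of item scores per respondent."""
    k = len(rows[0])
    item_variances = [pvariance([r[i] for r in rows]) for i in range(k)]
    total_variance = pvariance([sum(r) for r in rows])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)

def alpha_if_deleted(rows):
    """Alpha recomputed with each item left out in turn."""
    k = len(rows[0])
    return [cronbach_alpha([r[:i] + r[i + 1:] for r in rows]) for i in range(k)]

# Hypothetical responses on a 1-5 scale (NOT the survey data from the slides)
responses = [
    [1, 1, 1, 5],
    [2, 2, 2, 4],
    [3, 3, 3, 3],
    [4, 4, 4, 2],
    [5, 5, 5, 1],
]

alpha_all = cronbach_alpha(responses)
alpha_without = alpha_if_deleted(responses)

# Reverse-score item 4 on the 1-5 scale (6 - score) instead of dropping it
reversed_item_4 = [r[:3] + [6 - r[3]] for r in responses]
alpha_reversed = cronbach_alpha(reversed_item_4)

print(f"alpha, all items          = {alpha_all:.2f}")
print(f"alpha if item 4 deleted   = {alpha_without[3]:.2f}")
print(f"alpha, item 4 reverse-scored = {alpha_reversed:.2f}")
```

In this artificial example the scale is useless with item 4 scored as is, and perfectly consistent once that item is either dropped or reverse-scored. Real data will show a less extreme version of the same pattern.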
13 Oct 98 SPSS for MS WINDOWS Release 6.0 Page 3

WORDING EFFECTS IN REFERENDUM

Common Market: % difference between cons and pros, by question wording

1. "Should the United Kingdom come out of the Common Market?"
   + 10.8

2. "The Government recommends the acceptance of the renegotiated terms of British membership of the Common Market. Should the United Kingdom stay in the Common Market?"
   - 11.2

3. "Her Majesty's Government believes that the nation's best interests would be served by accepting the favourably negotiated terms of our continued membership of the Common Market. Should the United Kingdom stay in the Common Market?"
   - 16.2

Source: Butler and Kitzinger (1976, p. 60)
SOME PRINCIPLES FOR CONSTRUCTING QUESTIONNAIRE ITEMS
(Source: Neuman, 1997, pp. 223-237)

• Avoid jargon, slang, and abbreviations
"Does your husband show symptoms of an Oedipus complex?"

• Avoid ambiguity, confusion, and vagueness
"Do you exercise regularly?"

• Avoid emotional language and prestige bias
"Educated people generally agree that television films with themes of violence and sex should be banned. Do you agree with banning such television films?"

• Avoid double-barreled questions
"Do you agree with the plan to hold the MPR Session in October 1998 and the General Election in May 1999?"

• Avoid leading questions
"Should the government pour even more funds into securing the streets of the capital, which are increasingly prone to crime?"

• Avoid asking questions that are beyond respondents' capabilities
"On average, how many minutes have you watched television over the past three months?"

• Avoid false premises
"The regional government (Pemda) has provided too many services to the public. Do you agree that these services should be reduced in order to save costs?"

• Avoid overlapping or unbalanced response categories
"How do you rate your supervisor's performance? Outstanding, exceptional, very good, good, or satisfactory?"
