Professional Documents
Culture Documents
Concept of Reliability
Target analogy
http://ccnmtl.columbia.edu/projects/qmss/measurement/validity_and_reliability.html
2
The smaller the error variance, se ,the greater
proportion that is due to true score variance and
the higher the reliability
If proportion is 1.0, then no error variance – perfect
reliability
If proportion is 0.0, then all error variance – no
reliability and all noise
True
Reliability =
True + Error
st2
Rxx = 2 2
st + se
st2
= 2
so
Note: your text uses Rxx as the symbol for reliability but most texts use ρxx (rho) or rxx
Newsom, Fall 2017, Psy 521/621 Univariate Quantitative Methods
7
Attenuation
rxy =
∑ ( X − X )(Y − Y ) cxy
∑ ( X − X ) ∑ (Y − Y ) sx s y
2 2
f
Variability around the average
Average
weight
Newsom, Fall 2017, Psy 521/621 Univariate Quantitative Methods
13
Means
Comparing means
X1 − X 2
t=
s12 s22
+
n1 n2
Comparing means
X o1 − X o 2
d xo =
so21 + so22
2
Also seen in the estimate of the effect size, which gives the
magnitude of the group difference (where o1 and o2 subscripts
indicate observed values for group one and two)
Test-retest reliability
Repeat the test two or more times to see how similar the
measurements are
Calculate the correlation between the measurement occasions
Problem is that in the interval between the measurement
occasions the attribute may have changed
Small time interval needed in between measurements without
contamination from recall
Parallel tests
Two tests are parallel if their true scores are the same and they
have the same standard deviation
Theoretical notion, because it is not possible to know with
absolute certainty that two tests are exactly parallel
Split-half reliability
Can develop a larger test and correlate two
halves
Problem is how best to split up the test
e.g., what if the first half and second half differ?
Internal reliability
The domain sampling idea allows us to use the correlations
among items to gauge the reliability of a measure
This is the basis of internal reliability, such as the type of
reliability assessed by Cronbach’s alpha
Preliminary steps
• Generate descriptive statistics, including means, standard
deviations (and/or variances, skewness and kurtosis)
• Obtain frequency tables and histograms
• Check for errors in entry, coding, etc.
• Variables do not need to be normally distributed, but when they
are highly skewed or kurtotic or they respondents have not used
the full range of values, you may want to consider the wording of
that item.
• Check correlations to confirm scoring direction is correct and
potentially eliminate items that are supposed to correlate that do
not
Newsom, Fall 2017, Psy 521/621 Univariate Quantitative Methods
23
Cronbach’s Alpha
α= 1 − 2
k −1 sX
k = number of items, si2 is the variance for each item, and sx2 is the variance for
the composite scale score (as a sum of the items)
Newsom, Fall 2017, Psy 521/621 Univariate Quantitative Methods
25
Cronbach’s Alpha
rii ′ is the average correlation among all pairs of items, and k is the
number of items
Nunnally, J. C. (1978). Psychometric theory (2nd ed.). New York: McGraw-Hill. Quote also given by: Lance, C. E., Butts, M. M., &
Michels, L. C. (2006). The sources of four commonly reported cutoff criteria what did they really say?. Organizational research
methods, 9(2), 202-220.
Newsom, Fall 2017, Psy 521/621 Univariate Quantitative Methods
30
Kuder-Richardson 20 (KR20)
nRxx − original
Rxx − revised =
1 + ( n − 1) Rxx − original
nRxx − original
Rxx − revised =
1 + ( n − 1) Rxx − original
2 (.6 ) 1.2
= = = .75
1 + ( 2 − 1) .6 1.6