Professional Documents
Culture Documents
Outcome 2
process, research methods and statistics
used in test development and standardization
Your Team
QUESTION 1
Scale of Measurement
Scale of Measurement
CATEGORICAL CONTINUOUS
Ex. SES (“low income”,”middle income”,”high income”) Kelvin scale : Income, height, weight, annual sales, market share, product
education level (“high school”,”BS”,”MS”,”PhD”) defect rates, time to repurchase, unemployment rate, and crime rate
income level (“less than 50K”, “50K-100K”, “over 100K”)
QUESTION 1
Phi Coefficient
Correlates two dichotomous data; at least one true dichotomy
Ex. Gender and passing or failing a test
Tetrachoric Correlation
Correlates two dichotomous data; both are artificial dichotomy
Ex. Passing or failing a test and being highly aggressive or not.
QUESTION 1
Which of the
A greater percentage of cases following
B distributed about the mean
describes a
normal curve?
The values that trail off sharply
C on one side than the other
C Below average
Poor
D
A T-score of 45
STANDARD SCORES
T – Score
Mean = 50; SD = 10
Created by McCall in honor of
his professor Thorndike
Stanine
Mean = 5; SD = 2
Used by US Airforce
Z-SCORES - Mean of 0 ; SD of 1
- zero plus or minus one scale Assessment
scale
- When determined, can be used to translate one
to another.
Takes whole numbers 1 – 9; no
Example:
decimals
Score- 65
Mean- 50
Sd= 15
STANDARD SCORES
Sten
Standard ten
Mean = 5.5; SD = 2
A 61
51
Find range
B distribution of 72, 25,
81, 63, 30, 20, 53.
C 52
20
D
MEASURES OF DISPERSION
MEASURES OF DISPERSION
MEASURES OF DISPERSION
QUESTION 5
A Pearson – r
You are calculating the
reliability of your newly
Spearman-Brown develop Interest
B Questionnaire, answerable
by YES/NO.
D KR 20
TEST STATISTICS FOR CORRELATION
Spearman Rho
Also called as rank-ordered correlation or
Spearman Correlation
Correlates 2 variables in ordinal scale
CRONBACH
➔COEFFICIENT ALPHA
➔Non-dichotomous items
➔Preferred statistic for obtaining an estimate
of internal consistency reliability
Provide an indication of the likelihood that a test taker will score within some interval of
scores on a criterion measure – an interval that can be categorized as “passing”,
➔COEFFICIENT ALPHA
➔Non-dichotomous items
➔Preferred statistic for obtaining an estimate
of internal consistency reliability
Provide an indication of the likelihood that a test taker will score within some interval of
scores on a criterion measure – an interval that can be categorized as “passing”,
A Factor analysis
Sarah would like to predict
college achievement from a
variable of his High School
B Meta-analysis
Grade Point Average,
Scholastic Achievement Test
(SAT), SAT reading score,
SAT Math score, and SAT
C Multiple Regression
writing score. What statistical
treatment is most applicable?
D Multi-variate analysis
REGRESSION ANALYSIS
MULTIPLE REGRESSION ANALYSIS
Independent Dependent
Variable Variable
attitude
Diet
beliefs adherence
Social
norms
Factor Analysis Meta-analysis
D No caffeine at all.
CONTROL GROUP IN AN EXPERIMENT
A Criterion-referenced
D Cultural norm
NORMS
A Construct underrepresentation
C Criterion contamination
D Poor divergence
Construct underrepresentation
Failure to capture important components of a construct (e.g. An English
test which only contains vocabulary items but no grammar items will
have a poor content validity.)
Construct-irrelevant variance
Happens when scores are influenced by factors irrelevant to the
construct (e.g. test anxiety, reading speed, reading comprehension,
illness)
A Predictive validity
Pre-board examinations
B Content validity should be concerned
primarily with _____.
C Construct validity
D Convergent validity
QUESTION 14
A Local validation
Cultural Check
D
EVIDENCE OF CONSTRUCT VALIDITY
Two psychologists
B Split-half reliability evaluated the "Autism"
symptoms of their
patients. If both of their
judgment yields
C Inter-rater reliability identical rating, then it
has _______.
Test-retest reliability
D
QUESTION 17
A Inter-rater reliability
D Standardization
RELIABILITY
RELIABILITY ESTIMATES
RELIABILITY
• Dependability or • Inter-scorer reliability – the degree
consistency in of agreement or consistency
measurement. between two or more scorers (or
judges or raters) with regard to a
particular measure.
RELIABILITY ESTIMATES
RELIABILITY
• Dependability or • Test-retest reliability – an estimate of
reliability obtained by correlating pairs of
consistency in
scores from the same people on two different
measurement. administrations of the same test.
RELIABILITY ESTIMATES
RELIABILITY
• Dependability or • Split-half reliability – obtained by
consistency in correlating two pairs of scores
measurement. obtained from equivalent halves of a
single test administered once.
RELIABILITY ESTIMATES
RELIABILITY
• Dependability or • Inter-item consistency – the degree
consistency in of relatedness of items on a test.
measurement. Able to gauge the homogeneity of a
test.
A Good item
C Not discriminating
D Strongly discriminating
ITEM – DISCRIMINATORY INDEX
A Time differences
Test forms
D
Thank You