Professional Documents
Culture Documents
2. Detailed information regarding how a particular test was developed is typically found
in
a. the current test catalogue distributed by the test’s publisher
b. the Standards for Educational and Psychological Tests
c. a review of the test published in a journal
d. the test manual
12-14. A researcher was interested in whether or not jazz vocals and opera influence
men’s and women’s emotional states. She hypothesized that these types of music
influence men and women differently. In a study investigating this hypothesis, 40 men
and 40 women heard a jazz piece, and 4 men and 4 women heard an operatic piece.
The jazz piece was sung by a man, and the operatic piece was sung by a woman.
Afterward, participants rated themselves on an inventory measuring emotional state.
Higher scores on the inventory indicate positive mood. Results of this study are
presented in the graph below:
12. Which of the following describes the pattern of findings displayed in the graph?
a. Women scored higher than women on the mood inventory regardless of the type of
music they heard.
b. Men who heard the jazz piece and women who heard the operatic piece scored
higher on the mood inventory than those in the other two groups
c. women who heard the jazz piece and the men who heard the operatic piece scored
higher on the mood inventory than those in the other two groups.
d. men scored higher than women on the mood inventory regardless of the type of
music they heard.
13. The researcher concludes from her study that jazz music positively changes men’s
mood and operatic music positively changes women’s moods. Which of the following
invalidates the conclusion?
a. men’s and women’s moods were not measured before exposure to the two types of
music.
b. previous studies have shown that men are less emotional than women
c. men and women were randomly assigned to the groups
d. only one scale was used to measure mood
14. Which of the following is the most serious problem with the methodology of this
research?
a. The sample size was too small to draw a valid conclusion
b. men and women did not listen to both types of music
c. only one type of music should have been used
d. the singers were not the same gender
15.A recent article in an educational journal described a university at which the average
age is 26. This article also mentioned that 38 percent of the students are over 25 years
of age. What can be concluded from this information?
a. the distribution must be skewed
b. the standard deviation must be relatively small
c. the median age must be greater than the mean age.
d. the distribution must be bimodal
16. Conducting a study by analyzing Philippine census data from previous years is an
example of using which of the following research approaches?
a. descriptive
b. surveys
c. case history
d. archival analysis
17-18. Depression is more common among people with insomnia than among those
with satisfactory sleep. To determine the reasons for this relationship, investigators
identified 40 people suffering from both depression and insomnia. For each of these 40,
they paired two other people of the same gender and age who were neither depressed
nor suffering from any other sleep other. One of these was designated the “normal-
sleep-control”, and the other was designated the “yoked-control”. All participants slept in
a laboratory for one week. The normal-sleep control person slept without restrictions.
During that same time, yoked control was permitted to sleep when the depressed-
insomniac person slept but was required to awaken whenever the depressed-insomniac
was awakened.
A valid questionnaire for measuring depression was administered at the end of the one-
week study. Assume that higher scores on the questionnaire reflect greater depressive
symptomatology.
17. What pattern of results on the depression questionnaire would justify the conclusion
that sleeplessness leads to depression?
a. normal sleep control = yoked control < depressed
b. normal sleep control < yoked control = depressed
c. yoked control < normal sleep control = depressed
d. yoked control < normal sleep control < depressed
18. Supposed that the results were consistent with the hypothesis that sleeplessness
does NOT lead to depression. Of the following which would be the most serious
criticism of the study and its conclusion?
a. One week of sleep deprivation may have been adequate to produce depression
b. the study failed to examine other factors that might also contribute to depression
c. the yoked-control group was unnecessary
d. the normal sleep control group was unnecessary
19. A researcher conducted a study to determine the effects of gender and status on the
perceived credibility of an eyewitness testifying in a trial. Participants watched one of
four video recordings depicting the eyewitness and rated the credibility of the
eyewitness.
In order to determine whether gender, as a specific variable, had an effect on perceived
credibility of the eyewitness, which of the following must be significant?
a. the interaction between gender and status
b. the main effect of status
c. the main effect of gender
d. post-hoc analysis of gender
20. Melody exclaims, “I got a C- on the statistics exam, and I was miserable until I
thought how terrible it must be for those who got F’s.” Melody’s attitude is an example of
which of the following?
a. social comparison
b. social anxiety
c. social validation
d. social learning
22. Which of the following tests measures ability, intellect, and knowledge?
a. Wechsler Adult Intelligence Scale -Fourth Edition (WAIS-IV)
b. Minnesota Multiphasic Personality Inventory-2-RF (MMPI-2-RF)
c. Myers-Briggs Type Indicator (MBTI)
d. Strong Interest Inventory
23. Research by Solomon Asch supports which of the following?
a. conformity increases as group size increases from two people to four or five people
b. higher levels of conformity are found in individualistic societies than in collectivistic
societies
c. individual will follow orders to shock innocent strangers
d. the presence of one dissenter in a group is not strong enough to reduce conformity.
24. A group of researchers was interested in learning whether a newly developed exam
would be useful in determining whether a student will be successful in college. The
researchers designed a study in which a students took the new exam prior to entering
college, the student took another exam, which was designed to measure how much
information they had learning during their first year. The score on this exam was then
correlated with the student’s score on the newly developed exam. What type of validity
was being evaluated in the study?
A. predictive
b. discriminant
c. divergent
d. concurrent
26. Which of the following effects is the most serious limitation of this study?
a. ceiling
b. carryover
c. cohort
d. selection
28. Dr. Chen is interested in feminist is interested in feminist attitudes of young adult
women in the United States. Consequently, she administered a feminist attitude
questionnaire to a total of 100 young adult women from three universities. The 100
women tested and the number of young adult women in the United States are which of
the, respectively?
A. effect size and population
b. random assignment and random selection
c. independent and dependent variables
d. sample and population
29. In the language of psychological testing and assessment, scoring refers to assigning
evaluative numbers, codes, or statements to performance on:
a. interviews
b. tests
c. all of these
d. tasks
30. If the results of an examination are negatively skewed, the exam questions were
likely:
a. easy
b. biased
c. difficult
d. quite novel in many respects
34. The higher the item-difficulty index, the _____ the item
a. easier
b. less robust
c. more robust
d. harder
38. As part of the test developmental process; a test revision may entail:
a. Development of a new edition of a test
b. rewording, deletion, or development of new items
c. rewording, deletion, or development of new items; and development of a new edition
of a test
d. the reprinting of a test
39. which is NOT a typical question that is raised and answered during the test
conceptualization stage of test development?
a. what is the objective if the test?
b. how valid are the items on the test?
c. what types of responses will be required to the test-taker?
d. is there a need for the test?
40. A test developer designs a test for the sole purpose of identifying the most highly
skilled individuals among those tested. During the test revision stage of test
development, the test developer will be particularly interested in:
a. item reliability
b. item discrimination
c. item validity
d. item bias
41. You are interested in developing a test for social adjustment in a college fraternity or
sorority. You begin by interviewing persons who had graduated from college after
having been a member of a fraternity or sorority for at least 2 years. Which stage of test
development best describes the one that you are in?
a. the test-tryout stage
b. the test revision stage
c. the pilot work stage
d. the test construction stage
43. The statistical tool that is ideally suited for making selection decisions within the
framework of a compensatory model is:
a. expectancy data
b. multiple regression
c. the Brodgen-Cronbach-Glesser formula
d. utility analysis
44. When a cut score is set based on norm-related considerations rather than on the
relationship of test score to a criterion, it is known as:
a. a referential cut score
b. a relative cut score
c. an absolute cut score
d. a fixed cut score
45. If an instructor assigns a grade of “A” to all students who earn 900 or more points
out of a total 1000 points during the semester, 900 represents:
a. the base rate of A-level students
b. the selection ratio
c. the cut score for an A
d. the success rate
47. The term used to describe the proportion of people in a population who posses a
given characteristic is:
a. sensitivity
b. selection ratio
c. success rate
d. base rate
50. Which of the following represents a problem unique to self-report personality tests?
a. all of these
b. respondents may be too “low” on the construct being measured to register on the test
c. the reading ability of respondents may prevent them from responding accurately to
items
d. respondents might be unwilling to reveal something negative about themselves
51. A key difference between concurrent and predictive validity has to do with
a. the magnitude of the reliability coefficient that will be considered significant at the .05
level
b. the magnitude of the validity coefficient that will be considered significant at the .05
level
c. the time frame during which data on the criterion measure is collected
d. none of these
52. To ensure that a test developed for national use is indeed suitable for national use,
test developers:
a. all of these
b. post sample items on the Web to gauge response of different groups
c. have a culturally representative panel of experts review test items
d. employ a culturally representative group of examiners
53. If a time limit is long enough to allow test-takers to attempt all items, and if some
items are so difficult that no test-takers is able to obtain a perfect score, then the test is
referred to as a __________ test
a. reliable
b. valid
c. speed
d. power
56. You wish to determine if the student you are evaluating scored higher on a
mathematics test than on reading test. What is statistic(s) would you calculate?
a. the mean of each distribution and index of test difficulty for each test
b. the raw score on each test as well as the mean of each distribution
c. the standard error of the difference between two scores
d. the standard error of measurement for each test score
57. A significant, positive relationship exists between scored on a new test of
intelligence and scores on the fourth edition of the Stanford-Binet intelligence scale.
These data may be viewed as supportive of which type of validity evidence for the new
test?
a. discriminant evidence of construct validity
b. criterion-related validity
c. content validity
d. convergent evidence of construct validity