Professional Documents
Culture Documents
CHAPTER 1: PSYCHOLOGICAL TESTING AND PSYCHOLOGICAL ASSESSMENT regard for the how or mechanics of such
Testing and Assessment content”
1905, France - Alfred Binet and a colleague published a test designed to help place Paris Role of Evaluator
schoolchildren in appropriate classes. The tester is not key to the process; The assessor is key to the process of
practically speaking, one tester may be selecting tests and/or other tools of
World War II, the military would depend even more on psychological tests to screen recruits substituted for another tester without evaluation as well as in drawing
for service. appreciably affecting the evaluation. conclusions from the entire evaluation.
Skill of Evaluator
Psychological Testing and Assessment Defined Testing typically requires technician-like Assessment typically requires an educated
Testing was the term used to refer to everything from the administration of a test (as in skills in terms of administering and selection of tools of evaluation, skill in
“Testing in progress”) to the interpretation of a test score (“The testing indicated that ...”) scoring a test as well as in interpreting a evaluation, and thoughtful organization
test result. and integration of data.
The OSS model—using an innovative variety of evaluative tools along with data from the Outcome
evaluations of highly trained assessors—would later inspire what is now referred to as the Typically, testing yields a test score or Typically, assessment entails a logical
assessment center approach to personnel evaluation series of test scores. problem-solving approach that brings to
bear many sources of data designed to
Assessment acknowledges that tests are only one type of tool used by professional assessors shed light on a referral question.
(along with other tools, such as the interview), and that the value of a test, or of any other
tool of assessment, is intimately linked to the knowledge, skill, and experience of the
assessor. Varieties of assessment
1. educational assessment refers to, broadly speaking, the use of tests and other tools
Psychological assessment as the gathering and integration of psychology-related data for the to evaluate abilities and skills relevant to success or failure in a school or pre-school
purpose of making a psychological evaluation that is accomplished through the use of tools context.
such as tests, interviews, case studies, behavioral observation, and specially designed 2. retrospective assessment may be defined as the use of evaluative tools to draw
apparatuses and measurement procedures. conclusions about psychological aspects of a person as they existed at some point in
time prior to the assessment. There are unique challenges and hurdles to be
Psychological testing as the process of measuring psychology-related variables by means of overcome when conducting retrospective assessments regardless if the subject of
devices or procedures designed to obtain a sample of behavior. the evaluation is deceased
3. Remote assessment refers to the use of tools of psychological evaluation to gather
TESTING ASSESSMENT data and draw conclusions about a subject who is not in physical proximity to the
Objective person or people conducting the evaluation.
Typically, to obtain some gauge, usually Typically, to answer a referral question, 4. ecological momentary assessment (EMA). EMA refers to the “in the moment”
numerical in nature, with regard to an solve a problem, or arrive at a decision evaluation of specific problems and related cognitive and behavioral variables at the
ability or attribute. through the use of tools of evaluation. very time and place that they occur.
protocol refers to diplomatic etiquette. A less common use of the word is a synonym for the Continuous Innovation
first copy or rough draft of a treaty or other official document before its ratification. With - Psychological testing continually evolves, with over 20,000 new tests developed annually.
reference to testing and assessment Innovations include the use of video, virtual reality, and a variety of tools adapted from
protocol typically refers to the form or sheet or booklet on which a testtaker’s responses are medical health for psychological assessment.
entered.
Reasons for Assessment of People with Disabilities:
Test Site Preparation Employment, professional credentialing, psychopathology screening, etc.
1. Materials
PSYCHOLOGICAL ASSESSMENT – COMPETENCY APPRAISAL
Laws mandate "alternate assessment" programs for those unable to participate due - Perhaps one of the most readily accessible sources of information is a catalogue distributed
to disabilities. by the publisher of the test. Because most test publishers make available catalogues of their
offerings, this source of test information can be tapped by a simple telephone call, e-mail, or
note.
Alternate Assessment and Accommodations:
Alternate assessment involves accommodating individuals with exceptional needs. Test manuals
Accommodation is the adaptation of a test, procedure, or situation to suit the needs - Detailed information concerning the development of a particular test and technical
of the assessee. information relating to it should be found in the test manual, which usually can be purchased
Examples include large-print versions, special lighting, sign language, extended from the test publisher.
evaluation time, etc.
Alternate assessment is an evaluative or diagnostic procedure or process that varies from the Professional books
usual, customary, or standardized way a measurement is derived, either by virtue of some - Many books written for an audience of assessment professionals are available to supplement,
special accommodation made to the assessee or by means of alternative methods designed to re-organize, or enhance the information typically found in the manual of a very widely used
measure the same variable(s). psychological test.
SOME ISSUES REGARDING CULTURE AND ASSESSMENT LEGAL AND ETHICAL CONSIDERATIONS
--- Verbal Communication - Language, the means by which information is communicated, is --- Laws - rules that individuals must obey for the good of the society as a whole—or rules
a thought to be for the good of society as a whole.
key yet sometimes overlooked variable in the assessment process. Most obviously, the --- Ethics - body of principles of right, proper, or good conduct.
examiner and the examinee must speak the same language. --- Standard of Care - level at which the average, reasonable, and prudent professional would
provide diagnostic or therapeutic services under the same or similar conditions.
--- Nonverbal Communication and Behavior - Humans communicate not only through --- code of professional ethics is recognized and accepted by members of a profession, it
verbal means but also through nonverbal means. Facial expressions, finger and hand signs, and defines the standard of care expected of members of that profession. In this context, we may
shifts in one’s position in space may all convey messages. define standard of care as the level at which the average, reasonable, and prudent
Psychoanalysis - ymbolic significance is assigned to many nonverbal acts. professional would provide diagnostic or therapeutic services under the same or similar
conditions.
--- Standards of Evaluation
-- Individualist Culture - (typically associated with the dominant culture in countries such as THE CONCERNS OF THE PUBLIC
the United States and Great Britain) is characterized by value being placed on traits such as --- Truth-in-Testing Legislation - primary objective of these laws was to give testtakers a
self-reliance, autonomy, independence, uniqueness, and competitiveness. way to learn the criteria by which they are being judged. To meet that objective, some laws
-- Collectivist Culture - (typically associated with the dominant culture in many countries mandate the disclosure of answers to postsecondary and professional school admissions tests
throughout Asia, Latin America and Africa), value is placed on traits such as conformity, within 30 days of the publication of test scores.
cooperation, interdependence, and striving toward group goals. --- Quota System - selection procedure whereby a fixed number or percentage of applicants
from certain backgrounds were selected
--- Test and Group Membership --- Discrimination - practice of making distinctions in hiring, promotion, or other selection
- Psychological assessments in various settings reveal individual and group differences. decisions that tend to systematically favor members of a majority group regardless of actual
- Systematic group score differences on tests can lead to conflicts and debates about fairness. qualifications for positions.
- Questions about fairness arise in situations like admissions to schools or job placements. -- Reverse Discrimination - practice of making distinctions in hiring, promotion, or other
- Legal and ethical considerations are crucial in vocational assessment for hiring decisions. selection decisions that systematically tend to favor members of a minority group regardless of
- Conflicts can emerge from disagreements over criteria for performing specific jobs. actual qualifications for positions.
- Height requirements or other criteria may unintentionally exclude certain cultural groups, -- Disparate Treatment - consequence of an employer’s hiring or promotion practice that was
leading to discrimination claims. intentionally devised to yield some discriminatory result or outcome. Possible motivations for
- Group differences extend to psychological attributes, such as measured intelligence, raising disparate treatment include racial prejudice and a desire to maintain the status quo.
concerns about bias and discrimination. -- Disparate Impact - consequence of an employer’s hiring or promotion practice that
- Some argue that professionally developed tests accurately reflect real-world performance unintentionally resulted in a discriminatory result or outcome. Because disparate impact is
differences among groups. presumed to occur unintentionally, it is not viewed as the product of motivation or planning.
PSYCHOLOGICAL ASSESSMENT – COMPETENCY APPRAISAL
--- Litigation - court-mediated resolution of legal matters of a civil, criminal or - The American Psychological Association (APA) periodically releases guidelines for
administrative nature professionals working with specific populations, providing assistance in offering informed and
developmentally appropriate services.
THE CONCERNS OF THE PROFESSION - A distinction exists between APA guidelines and standards, with standards being mandatory
--- Test User Qualifications for all psychologists, while guidelines are aspirational.
1. Level A: Tests or aids that can adequately be administered, scored, and interpreted with the - An example is the 2015 Guidelines for Psychological Practice with Transgender and Gender
aid of the manual and a general orientation to the kind of institution or organization in which Nonconforming (TGNC) People, consisting of 16 guidelines addressing various aspects of
one is working (for instance, achievement or proficiency tests). working with TGNC individuals.
2. Level B: Tests or aids that require some technical knowledge of test construction and use - Several groups and professional organizations publish "good practices," "best practices," or
and of supporting psychological and educational fields such as statistics, individual "practices" for the assessment and treatment of specific populations.
differences, psychology of adjustment, personnel psychology, and guidance (e.g., aptitude - Resources such as special journal issues and publications focusing on topics from an
tests and adjustment inventories applicable to normal populations). assessment perspective are valuable for professionals interested in specific populations.
3. Level C: Tests and aids that require substantial understanding of testing and supporting - Various publications provide insights into professional practices, including issues related to
psychological fields together with supervised experience in the use of these devices (for assessment.
instance, projective tests, individual mental tests).
--- Treating People with Disabilities THE RIGHTS OF TESTTAKERS
Challenges analogous to those concerning testtakers from linguistic and cultural minorities are --- The Right of Informed Consent – Test takers have a right to know why they are being
present when testing people with disabling conditions. Specifically, these challenges may evaluated, how the test data will be used, and what (if any) information will be released to
include (1) transforming the test into a form that can be taken by the testtaker, (2) transformingwhom
the responses of the testtaker so that they are scorable, and (3) meaningfully interpreting the --- The Right to be Informed of Test Findings - Testtakers have a right to be informed, in
test data. language they can understand, of the nature of the findings with respect to a test they have
taken. They are also entitled to know what recommendations are being made as a consequence
--- Computerized Test Administration, Scoring and Interpretation of the test data. If the test results, findings, or recommendations made on the basis of test data
- Access to test administration, scoring, and interpretation software. Despite purchase are voided for any reason (such as irregularities in the test administration), testtakers have a
restrictions on software and technological safeguards to guard against unauthorized right to know that as well.
copying, software may still be copied. Unlike test kits, which may contain --- The Right to Privacy and Confidentiality - the privacy right “recognizes the freedom of
manipulatable objects, manuals, and other tangible items, a computer-administered the individual to pick and choose for himself the time, circumstances, and particularly the
test may be easily copied and duplicated. extent to which he wishes to share or withhold from others his attitudes, beliefs, behavior, and
- Comparability of pencil-and-paper and computerized versions of tests. Many tests opinions”
once available only in a paper-and-pencil format are now available in computerized -- Confidentiality - may be distinguished from privilege in that, whereas “confidentiality
form as well. In many instances the comparability of the traditional and the concerns matter of communication outside the courtroom, privilege protects clients from
computerized forms of the test has not been researched or has only insufficiently been disclosure in judicial proceedings”
researched. --- The Right to the Least Stigmatizing Label - The Standards advise that the least
- The value of computerized test interpretations. Many tests available for stigmatizing
computerized administration also come with computerized scoring and interpretation labels should always be assigned when reporting test results
procedures. Thousands of words are spewed out every day in the form of test
interpretation results, but the value of these words in many cases is questionable. CHAPTER 3: STATISTIC REFRESHER
- Unprofessional, unregulated “psychological testing” online. A growing number of measurement as the act of assigning numbers or symbols to characteristics of things (people,
Internet sites purport to provide, usually for a fee, online psychological tests. Yet the events, whatever) according to rules.
vast majority of the tests offered would not meet a psychologist’s standards.
Assessment professionals wonder about the long-term effect of these largely scale is a set of numbers (or other symbols) whose properties model empirical properties of
unprofessional and unregulated “psychological testing” sites. the objects to which the numbers are assigned
Normal Curve – Karl A nonlinear transformation may be required when the data under consideration are not
Pearson normally distributed yet comparisons with normal distributions need to be made. In a
Gaussian Curve - Karl nonlinear transformation, the resulting standard score does not necessarily have a direct
Friedrich Gauss numerical relationship to the original, raw score. As the result of a nonlinear
transformation, the original distribution is said to have been normalized.
A normal curve has
two tails. The area on Normalized standard scores - Many test developers hope that the test they are working
the normal curve on
between 2 and 3 standard will yield a normal distribution of scores. Yet even after very large samples have been tested
deviations above the mean is referred to as a tail. The area between −2 and −3 standard with the instrument under development, skewed distributions result.
deviations below the mean is also referred to as a tail. Let’s digress here momentarily for a
“real-life” tale of the tails to consider along with our rather abstract discussion of statistical
One alternative available to the test developer is to normalize the distribution. Conceptually,
concepts. normalizing a distribution involves “stretching” the skewed curve into the shape of a normal
curve and creating a corresponding scale of standard scores, a scale that is technically referred
a standard score is a raw score that has been converted from one scale to another scale, where to as a normalized standard score scale.
the latter scale has some arbitrarily set mean and standard deviation.
Raw scores may be converted to standard scores because standard scores are more easily 1. Definition of Correlation:
interpretable than raw scores. Correlation is an expression of the degree and direction of correspondence
between two variables, often traits, abilities, or interests.
z score results from the conversion of a raw score into a number indicating how many standard A coefficient of correlation (r) is
deviation units the raw score is below or above the mean of the distribution. a numerical index indicating the
strength of the relationship
T Scores between two continuous
If the scale used in the computation of z scores is called a zero plus or minus one scale, then variables.
the scale used in the computation of T scores can be called a fifty plus or minus ten scale;
that is, a scale with a mean set at 50 and a standard deviation set at 10. Devised by W. A. 2. Interpretation of Correlation Coefficient:
McCall (1922, 1939) and named a T score in honor of his professor E. L. Thorndike, this
standard score system is composed of a scale that ranges from 5 standard deviations below the
mean to 5 standard deviations above the mean.
PSYCHOLOGICAL ASSESSMENT – COMPETENCY APPRAISAL
The sign (plus or minus) of the correlation coefficient indicates the direction from it what is called a coefficient of determination, or r2. It is an indication of how much
of the relationship. Plus for a positive correlation, minus for a negative variance is shared by the X- and the Y-variables.
correlation, and none for no correlation.
The magnitude of the correlation coefficient, ranging from -1 to +1, The Spearman Rho
expresses the strength of the relationship. The Pearson r enjoys such widespread use and acceptance as an index of correlation that if
3. Perfect Correlation: for some reason it is not used to compute a correlation coefficient, mention is made of the
A correlation coefficient of +1 or -1 indicates a perfect correlation between statistic that was used. There are many alternative ways to derive a coefficient of correlation.
two variables—meaning a flawless relationship without statistical error. One commonly used alternative statistic is variously called a rank-order correlation
Perfect correlations are rare in psychological work. coefficient, a rank-difference correlation coefficient, or simply Spearman’s rho.
4. Positive and Negative Correlation: Developed by Charles Spearman, a British psychologist (Figure 3–12), this coefficient of
Positively correlated variables increase or decrease simultaneously. For correlation is frequently used when the sample size is small (fewer than 30 pairs of
example, the height and weight of children tend to be positively correlated. measurements) and especially when both sets of measurements are in ordinal (or rank-order)
Negatively correlated variables show an inverse relationship—one increases form. Special tables are used to determine whether an obtained rho coefficient is or is not
while the other decreases. For instance, there's a negative correlation significant.
between mileage on a car's odometer and trade-in allowance.
5. Example: Cell Phone Use and Test Grades: Graphic Representations of Correlation
An example is given regarding the estimated correlation between daily, in- One type of graphic representation of correlation is referred to by many names, including a
class cell phone use by students and their test grades. bivariate distribution, a scatter diagram, a scattergram, or—our favorite—a scatterplot.
A scatterplot is a simple graphing of the coordinate points for values of the X-variable (placed
The correlation suggests a potential relationship between distraction caused
along the graph’s horizontal axis) and the Y-variable (placed along the graph’s vertical axis).
by cell phone use and academic performance.
Scatterplots are useful because they provide a quick indication of the direction and magnitude
of the relationship, if any, between the two variables. Figures 3–13 and 3–14 offer a quick
The most widely used of all is the Pearson r, also known as the Pearson correlation
course in eyeballing the nature and degree of correlation by means of scatterplots.
coefficient and the Pearson product moment coefficient of correlation. Devised by Karl
Pearson (Figure 3–11), r can be the statistical tool of choice when the relationship between the
To distinguish positive from negative correlations, note the direction of the curve. And to
variables is linear and when the two variables being correlated are continuous (or, they can
estimate the strength of magnitude of the correlation, note the degree to which the points
theoretically take any value).
form a straight line. Scatterplots are useful in revealing the presence of curvilinearity in a
relationship. As you may have guessed, curvilinearity in this context refers to an “eyeball
gauge” of how curved a graph is. Remember that a Pearson r should be used only if the
relationship between the variables is linear. If the graph does not appear to take the form of a
straight line, the chances are good that the relationship is not linear (Figure 3–15). When the
relationship is nonlinear, other statistical tools and techniques may be employed.
Meta-Analysis
Generally, the best estimate of the correlation between two variables is most likely to come not
from a single study alone but from analysis of the data from several studies