Professional Documents
Culture Documents
1991.44
This article discusses the major inferential issues arising in the devel-
opment of behavioral classification systems. Subsequently, we discuss
the implications of these inferential issues for evaluating the construct
validity of systems designed to assess the requirements of human task
performance. Fleishman’s (1975b, 1982)ability requirement taxonomy
and its associated job analysis system, the Manualfor theAbility Require-
ment Scales (UARS), were then evaluated with respect to these crite-
ria. In particular, a variety of criteria relevant to internal and external
validity was reviewed. It was found that the ability requirement tax-
onomy and the associated measurement system provides a meaningful
description of job activities with respect to these criteria. It was argued
that application of these construct validity principles might contribute
much to our understanding of human performance.
We would like to thank Lee Friedman, Sigrid Gustafson, Al Hartman, and Bernard
Nickels for their comments concerning earlier drafts of this manuscript. Correspondence
and requests for reprints should be addressed to Dr. Edwin A. Fleishman, Department of
Psychology, George Mason University, Fairfax, Virginia 22030-4444.
523
524 PERSONNEL PSYCHOLOGY
ClassiJcationIssues
Evaluation Principles
1957), among others, have argued that meaning arises from specification
of functional relationships permitting inferences concerning the likely
status of an object given certain known conditions. In essence, then,
meaning arises from a set of propositional relational statements that can
be shown to hold in reality. Classification systems, of course, involve a
set of propositional relationships concerned with the relative similarity
of category members and their characteristic properties. The proposi-
tional relationships implied by category status can, of course, be articu-
lated, elaborated, and formally specified. If these relationships are sub-
sequently substantiated in empirical tests, evidence for the meaningful-
ness of a classification will have been obtained.
Evaluation issues. This hypothesis testing framework indicates that
a well-articulated nomothetic net contributes to the evaluation of clas-
sifications. This nomothetic net, reflecting relationships implied by our
understanding of the phenomenon, provides a necessary basis for hy-
pothesis generation and the subsequent accumulation of confirmatory
evidence. Viable hypotheses, however, do not arise in a vacuum. Rather,
they require some understanding of the phenomenon at hand. Psycho-
logical theory, therefore, provides a basis for establishing the construct
validity of classification systems. As a result, the acquisition of validation
evidence is likely to progress more rapidly in areas where viable theories
providing well-specified relational networks are available.
Hypothesis generation, however, represents a minimum condition
for establishing the meaningfulnessof a classification. Multiple hypothe-
ses must be generated, and these hypotheses must be tested to provide
the requisite confirmatory evidence. Under these conditions, greater
confidence can be placed in the meaningfulness of a classification as the
number and diversity of the confirmed hypotheses increases (James, Mu-
liak, & Brett, 1984). In searching for evidence of a system’s meaningful-
ness, the sheer number of confirmed relationships is not the only issue
to consider. Messick (1989) points out that stronger validation evidence
is provided by studies that provide confirmatory evidence for central,
theoretically important relationships. Greater confidence can also be
placed in the meaningfulness of a classification when this confirmatory
evidence has been accrued using a number of alternative methods (Cron-
bach, 1971). Similarly, Cook and Campbell’s (1979) notions of conver-
gent and divergent validity suggest that relational tests which serve to
rule out competing theoretical explanations will tend to permit stronger
inferences to be drawn concerning the classification’sconstruct validity.
In evaluating evidence provided by these hypothesis tests, two other
considerations should be attended to. First, only rarely will it prove pos-
sible to test all relationships implied by the nature of the classification
FLEISHMAN AND MUMFORD 529
Overview
IdentifiingAbility Categories
Clearly, the ability categories used to summarize job tasks play a cen-
tral role in applying this classification. To define these categories, Fleish-
man (1972a, 1975a, 1982) attempted to identify the fewest, most useful
independent ability categories for describing performance. The method-
ology used to identify these ability categories is illustrated in Fleishman’s
(1964, 1972b) research on psychomotor and physical abilities. This re-
search program involved a series of interlocking experimental and fac-
tor analytical studies. Here, tasks were explicitly designed or selected
for inclusion in task batteries to test certain hypotheses about the or-
ganization of abilities over a wide range of tasks. These task batteries
were administered to several hundred subjects. The resulting correla-
tions among the performances of these subjects on these tasks were then
factor-analyzed to define ability categories. Experimental studies were
subsequently designed to introduce variations in the conditions of task
performance aimed at sharpening, limiting, or broadening initial factor
definitions.
Without reviewing the list of resulting abilities, we can say that a lim-
ited number of categories (9 or 10 in the psychomotor area and 9 in the
physical performance area) seemed to account for most of the variance
in several hundred tasks investigated over many years. It became appar-
ent in these studies, furthermore, that these abilities convey a great deal
of information about task performance. For example, multilimb coordi-
nation summarized tasks involving two hands and hands and feet seen in
operating equipment. This summarization, however, did not extend to
tasks involving coordination when the whole body was in motion. It was
also possible to say that there was an ability common to simple reaction
tasks, both auditory and visual, although complicating the response or
stimulus brought another ability into play, termed response orientation.
These investigations, along with a variety of other factor analysis
studies, provided well-documented evidence indicating the kind of abil-
ity categories that might be used to generate summary descriptions of
human task performance. Consequently, specification of the relevant
summarization categories began with a comprehensive literature review
intended to identify abilities that had been empirically established in
earlier research concerned with human task performance (Theologus &
Fleishman, 1973; Theologus, Romashko, & Fleishman, 1973). The pri-
mary sources of evidence for the utility of potential cognitive constructs
was the work of Guilford (1967) and Guilford and Hoepfner (1966) on
the structure of intellect, along with the work of French, Ekstrom, and
Price (1963). From these sources, initial ability categories were selected
according to the criterion that each ability had been identified in at least
534 PERSONNEL PSYCHOLOGY
ul
ul
w
536 PERSONNEL PSYCHOLOGY
this article. For the present, we will confine our discussion to the evalua-
tion of the abilities taxonomy described in Xible 1, which are the abilities
included in the Manual for the Ability Requirement Scales (MARS) eval-
uated in this article.
WRI'ITEN COMPREHENSION
This is the ability to understand written sentences and paragraphs.
How Written Comprehension is Different from Other Abilities:
THIS ABILITY OTHER ABILITIES
Understand written English words, sen- Oral Comprehension(1): Listen and un-
tences, and paragraphs. derstand spoken Englishwords and sen-
tences.
VS. Oral Expression (3): and Written Ex-
pression (4): Speak or write English
words and sentences so otherswill un-
derstand.
F
ing unusual words and phrases and in- - Understand an instruction book
volving fine distinctions in meaning a- on repairing a missileguidance sys-
mong words. 6 tem.
I J I
t5
t: - Understand an apartment lease.
Operational Evaluations
STATIC STRENGTH
This is the ability to use muscle force to lift, push, pull, or carxy objects. It is
the maximum muscle force that one can exert for a brief period. This ability can
involve arms, back, shoulders, or legs.
, t:
rying a 50 lb. back pack.
and supervisors are used as subject matter experts (SMEs). These stud-
ies with Army officers (Wallis, Korotkin, Yarkin-Levin, & Schemmer,
1985), heavy-equipment operators (Fine, 1988; Olson, Fine, Myers, &
Jennings, 1979),Federal Bureau of Investigation special agents (Cooper,
Schemmer, Jennings, & Korotkin, 1983), and automotive mechanics (Is-
raelski, 1988) indicate that this definition permits subject matter experts
to reach agreement in more than 90% of the cases as to whether a task
is, indeed, part of their job. Evidence of this sort is not unambiguous.
Nonetheless, it suggests that this domain definition reflects experts’ im-
plicit understanding of the nature of performance on their jobs.
Descriptive categories. The second step in constructing classification
systems is specification of the categories used to summarize the objects
specified by the domain definition. Thus, there is a need for evidence
indicating that a meaningful set of categories has been identified (Horn
& Knapp, 1973). We have already considered some evidence that points
to the meaningfulness of the ability categories included in the ability re-
quirement (Fleishman, 1972a, 1975b, 1982) taxonomy. First, the ability
constructs included in this taxonomy were expresslychosen because they
had been identified in multiple factor analytic investigationsfocusing on
certain subdomains of task performance. Second, these abilities were
reviewed for comprehensiveness and significance by panels of psycholo-
gists and psychometricians, who found that the proposed categories pro-
vided a reasonably comprehensive listing of the more significant ability
constructs found in the literature. Third, each of these ability constructs
could be linked to an extensive literature indicative of the construct’s
ability to account for performance differences.
Other ongoing research has also provided evidence for the meaning-
fulness of these categories. For instance, early factor analytic studies of
psychomotor tasks indicated that a common ability was related to per-
formance on both pursuit and compensatory tracking tasks (Fleishman,
1954, 1958; Fleishman & Hempel, 1954, 1955, 1956). To further estab-
lish the meaning of the construct underlying performance, other tasks
were developed that required the timing of control manipulations in re-
sponse to stimuli moving at different rates (Fleishman, 1966, 1967). In
accordance with initial interpretations of rate control, it was found that
this factor was related to performance on this new set of tasks. Later
studies involved the use of tasks intended to reveal whether this ability
stressed judgments of stimulus rate as opposed to response rate con-
trol. This was accomplished using motion picture tasks and other tasks
that required only a button press in response to the location of a moving
stimulus. These tasks were found to be unrelated to rate control. Thus,
definition of the rate control category was extended beyond tracking and
FLEISHMAN AND MUMFORD 541
pursuit tasks, but restricted to tasks requiring the actual timing of adjus-
tive motor movements to a continuously changing stimulus.
Fleishman (1966, 1967, 1972a) describes a number of other stud-
ies intended to elucidate the character and meaning of the various psy-
chomotor constructs included in the taxonomy. Thus, reaction time abil-
ity was found to extend to simple reaction time responses to either visual
and auditory stimuli, but when two or more stimuli must be discrimi-
nated, or a choice must be made about which control to use, reaction
time is no longer measured. With such tasks, a different ability, called
response orientation, is measured. The physical capacities included in
the ability requirement taxonomy have been appraised in a similar re-
search program (Fleishman, 1964), while Fleishman and Quaintance
(1984) and Guilford and Hoepfner (1971) describe a series of studies
intended to establish the meaningfulness of the interpretations applied
to the various cognitive abilities. Because the results obtained in the ma-
jority of these investigations tend to support the substantive interpreta-
tions applied to the ability constructs, these studies provide evidence for
the meaningfulness of these categories and, therefore, the classification
system as a whole.
Similarity assessment and category assignment. The third and fourth
steps involved in classification efforts entail (a) assessments of object
similarity, and (b) specification of decision rules for assigning objects
to categories based on the observed degree of similarity. As Cronbach
and Gleser (1953) and Gregson (1975) point out, similarity may be as-
sessed in a number of ways, including correlation coefficients as well as
distance metrics. Further, a variety of decision rules might be applied
in determining when an object displays sufficient similarity to other cat-
egory members to permit assignment and common interpretation (An-
derberg, 1973). Fleishman and Quaintance (1984) and Mumford et al.
(1990) note that variations in the methodological procedures employed
in these operations may lead to marked differences in the content and
character of the resulting summary descriptions. This point has been un-
derscored by Hamer and Cunningham (1981), who have shown that the
use of different similarity metrics (e.g., correlation coefficientsvs. gener-
alized distance metrics) can lead to different classifications. This leads to
another set of questions pertinent to the system's meaningfulness: Were
appropriate indices of similarity and viable decision rules applied in con-
structing this classification scheme?
In the case of the ability requirement taxonomy, a judgmental pro-
cedure is used in similarity assessment and category assignment. Judges
are asked to apply the category definition and associated rating scale to
reach a decision as to the level of that ability required for performance
on a task. 'Qpically, abilities receiving relatively high ratings (4 or above)
542 PERSONNEL PSYCHOLOGY
TABLE 2
InterraterAgreement Between Incumbents, Supervisors, and Job Analysts
on the Relative Rank Orders of the 19 Abilitiesfor Each Job Category
1 2 3 4 5 6
Organizing Analysis Reading
and and. technical Customer
planning Installation repair material Sales relations
Ability M SD M SD M SD M SD M SD M SD
1. Oral comprehension 4.1 1.5 4.3 1.4 4.6 1.2 4.8 1.5 3.5 1.5 3.9 1.6
2. Written comprehension 3.7 1.4 4.1 1.3 4.1 1.3 5.0 1.4 3.0 1.5 3.0 1.5
3. Oral expression 3.8 1.5 3.8 1.5 4.3 1.3 4.2 1.6 4.1 1.7 4.3 1.7
4. Written expression 3.2 1.6 3.4 1.5 3.7 1.5 3.7 1.7 2.8 1.5 3.1 1.5
5. Memorization 4.2 1.3 4.4 1.5 4.5 1.3 4.5 1.6 3.3 1.4 3.2 1.5
6. Problem sensitivity 3.9 1.6 4.3 1.5 4.8 1.5 3.7 1.5 3.1 1.4 3.5 1.6
7. Math reasoning 2.8 1.5 3.1 1.5 3.3 1.5 3.2 1.5 2.7 1.5 2.2 1.2
8. Number facility 2.6 1.3 2.8 1.3 2.8 1.4 2.8 1.5 2.6 1.4 2.2 1.3
9. Deductive reasoning 3.9 1.5 4.0 1.3 4.5 1.4 4.1 1.6 2.9 1.4 3.1 1.4
10. Inductive reasoning 3.4 1.3 3.6 1.2 4.1 1.3 3.7 1.4 2.6 1.4 2.1 1.4
11. Information ordering 3.9 1.4 4.0 1.3 4.1 1.3 3.8 1.4 2.8 1.4 2.8 1.4
12. Category flexibility 3.3 1.4 3.4 1.2 3.6 1.4 3.4 1.5 2.5 1.3 2.6 1.3
13. Electrical knowledge 3.1 5.1 3.7 1.3 4.0 1.3 3.9 1.6 2.0 1.2 2.0 1.2
14. Mechanical knowledge 3.3 1.8 3.6 1.6 3.6 1.8 3.1 1.8 2.1 1.4 2.0 1.3
15. Knowledge of tools and their use 4.3 1.6 4.5 1.3 4.3 1.5 3.0 1.6 1.9 1.3 2.0 1.4
16. Map reading 3.4 1.7 2.9 1.7 3.1 1.8 3.0 1.8 2.0 1.4 2.1 1.4
17. Drafting 2.5 1.4 2.5 1.4 2.5 1.4 2.6 1.6 1.7 1.0 1.7 1.o
18. Reading plans 3.1 1.5 3.2 1.5 3.3 1.5 3.7 1.6 1.8 1.3 1.8 1.2
19. Selective attention 3.8 1.5 4.2 1.4 4.3 1.5 4.3 1.6 2.9 1.6 3.1 1.6
20. Time sharing 3.5 1.3 3.7 1.2 3.8 1.3 3.7 1.5 2.6 1.4 2.7 1.4
21. Spatial orientation 3.3 1.5 3.6 1.2 3.6 1.4 3.3 1.6 2.3 1.4 2.4 1.4
22. Visualization 3.9 1.6 4.3 1.4 3.8 1.4 3.3 1.6 2.6 1.5 2.5 1.5
23. Persuasion 3.0 1.6 3.3 1.6 3.2 1.6 2.6 1.5 4.4 1.6 3.9 1.7
24. Social sensitivity 3.0 1.7 3.2 1.7 3.2 1.7 2.4 1.6 4.1 1.6 4.7 1.7
25. Fact-Findingability 3.7 1.7 3.9 1.6 4.4 1.4 3.3 1.8 3.9 1.7 4.1 1.6
26. Flexibility of closure 3.7 1.5 3.9 1.4 4.2 1.5 4.0 1.6 2.7 1.5 3.0 1.5
(From Reilly & Zink, 1980)
546 PERSOMVEL PSYCHOLOGY
Internal klidity
The methods used to define categories and assign tasks to these cat-
egories provide a basis for more complex relational inferences bearing
on the classification’sconstruct validity. In this section, we will consider
evidence derived from the relationships embedded in the classification.
Here we are concerned with the classification’s internal validity. Two
general kinds of relationships might be used to marshal evidence for a
classification’sinternal validity: (a) the relationships observed between
categories and the objects to be classified, and (b) the relationships ob-
served between categories. We will consider the construct validity of the
ability requirement taxonomy with respect to these two kinds of relation-
ships.
Behavior-Cafegoty relationships. The general goals and functions of
taxonomic efforts lead to certain paramount concerns, when one con-
siders behavior-category relationships. For instance, one might ask the
question as to whether the proposed categories are sufficient to account
for the behaviors under consideration, or alternatively,whether most be-
haviors can be accounted for by assignment to one or more categories.
In addressing this question, however, the issue of the system’sparsimony
should be borne in mind (Horn & Knapp, 1973). More specifically, re-
dundant categories should not be proposed, and the proposed categories
should be just sufficient to result in the assignment of most behaviors to
one or more categories.
With regard to the ability requirements approach, the procedures
used in category definition might be said to provide some evidence in-
dicative of the taxonomy’scomprehensiveness (Fleishman & Quaintance,
1984). More direct evidence bearing on this question, however, has been
provided by certain empirical investigations. In a series of panel meet-
ings, Hogan, Ogden, and Fleishman (1979) found that 80% of the tasks
FLEISHMAN AND MUMFORD 547
TABLE 4
Interclass Reliability Coefficientsfor the Physical Ability Scales-
Scale Reliabilities by Study
Physical ability scales s1- Sa" SQ" S4" saa ssa S7"
TABLE 5
Assignments of Tasks to Selected Ability Categories
In one investigation along these lines, Inn et al. (1982) obtained de-
scriptions of 830 tasks likely to be performed on 17physically demanding
jobs in the telecommunications industry. Ratings of each task's physical
ability requirements were then obtained. Correlations among the ability
dimensions were then generated in a series of analyses, where tasks were
treated as the unit of analysis.
With regard to convergent validity, sizable positive correlations were
obtained among the physical ability categories. This pattern of relation-
ships was anticipated because, within this limited range of physical jobs,
tasks that demand one physical ability (dynamic strength) often demand
a number of other physical abilities (e.g., static strength and extent flex-
ibility). Highest correlations were found among the physical ability re-
quirement scales. These positive, task-level correlations argue for the
552 PERSONNEL PSYCHOLOGY
and parking meter patrol officers were assigned to another cluster. The
coherent clusters of jobs derived from these categories and their interre-
lationships provide an additional piece of evidence pointing to the mean-
ingfulness of the ability requirement taxonomy.
The ability requirement taxonomy was, of course, expressly designed
to maximize meaningfulness of the proposed categories (Fleishman &
Quaintance, 1984). It should, however, be recognized that a number
of other investigators have proposed taxonomies containing abilities in-
tended to summarize task performance. These alternative ability clas-
sifications are illustrated in the work of Drauden (1988), Lopez (1988),
and Primoff and Eyde (1988). The significant point, with regard to the
current discussion, is that these taxonomies also stress the importance
of abilities such as strength, stamina, visual acuity, hearing, memory,
oral expression, written expression, numerical ability, and perceptual
speed. Because these alternative worker-oriented classification schemes
were constructed under different assumptions and used different proce-
dures in category definition, this convergence in category content pro-
vides some additional evidence for the meaningfulness of the ability cat-
egories proposed by Fleishman and his colleagues (Fleishman, 1972a,
1975b, 1982; Fleishman & Mumford, 1988; Fleishman & Quaintance,
1984). With regard to the Position Analysis Questionnaire (PAQ) (Mc-
Cormick, Jeanneret, & Mecham, 1972), McCormick (1976) explicitly ac-
knowledged that the ability concepts in the PAQ methodology drew on
Fleishman’s (1975b) taxonomies.
External Validity
the results obtained in these inferential tests provide a basis for theory
development and refinement (Cronbach, 1971; Landy, 1986), thereby
contributing description, prediction, and understanding, as well as vali-
dation.
Performance requirements. Given the nature of this taxonomy, abil-
ity categories should map onto empirically derived dimensions of task
performance. In an initial evaluative effort, Theologus and Fleishman
(1973) attempted to confirm this hypothesis. In this investigation, they
had a panel of 79 judges rate descriptions of 38 tasks using 37 cognitive,
psychomotor, and physical abilities. Some 200 subjects were asked to
perform the same 38 tasks. To identify empirical dimensions summariz-
ing task performance, the correlations among the performance scores on
the 38 tasks were factor analyzed. Of the eight dimensions identified in
this factor analysis, the tasks yielding high loadings on seven dimensions
also produced high ratings on the level of the ability required on these
same dimensions. Hence, ability ratings were found to yield descriptions
of task performance similar to those obtained in quantitative analyses of
observed performance differences.
If, as Theologus and Fleishman’s (1973) data suggest, ability ratings
are related to empirically derived dimensions capable of summarizing
task performance, then another question might be posed. This observa-
tion suggests that ability requirement ratings might predict absolute dif-
ferences in task performance requirements. Fleishman, Gebhardt, and
Hogan (1986) initiated a series of investigations intended to address this
issue.
In several studies, Hogan and Fleishman (1979) and Hogan, Ogden,
Gebhardt, and Fleishman (1980) obtained ability requirement ratings
for a number of job and recreational tasks whose metabolic requirements
were known. High positive correlations ( 3 5 ) were observed between the
tasks’ known metabolic requirements and independent ratings of their
physical ability requirements.
In another study (Hogan, Ogden, Gebhardt, & Fleishman, 1979), in-
dividuals were asked to perform various material-handling tasks where
boxes of identical size, but different weights, were to be moved differ-
ent distances to establish foot-pounds of work. These individuals were
also asked to rate each task on physical abilities. When the actual foot-
pounds of work required by each task was correlated with task ability
ratings, a coefficient of .88 was obtained. This confirms that ability rat-
ings are indicative of objective performance demands.
The relationship between abilities and performance requirements
suggests that various derivative features of job activities, such as knowl-
edge requirements, should also be related to ability requirements. This
issue has been addressed in a recent study by Landy (1988), who obtained
558 PERSONNELPSYCHOLOGY
646 job knowledge items from a battery of seven tests used to assess
candidates for potential promotion to fire captain. A group of indus-
trial psychologists was then asked to rate each item in terms of the abil-
ities in the Manual for the Ability Requirement Scales (Fleishman, 1975b,
1991) found to be relevant to task performance in an earlier job analysis.
Landy’s findings indicated that roughly half of the job knowledge items
reflected abilities held to summarize task performance.
Peformance prediction. As Landy (1988) points out, one common
application of ability requirement data is to allow us to draw inferences
concerning the kinds of tests likely to predict job performance. One im-
plication of our foregoing observations is that ability requirement eval-
uations can be used to identify tasks that tap certain abilities. Individual
differences observed on these tasks might then be used to draw infer-
ences concerning the individual’s performance on other tasks calling for
this ability. But can actual performance be predicted from ability rat-
ings? In an early study, Theologus and Fleishman (1973) used six ability
requirement scales to obtain judges’ ratings of 27 laboratory tasks to pre-
dict actual performance or levels of its 400 subjects on these tasks. The
common performance metric for all these tasks was “number of units
produced per unit time.” A multiple correlation of .64 was obtained be-
tween ability ratings and this performance metric, indicating that ability
scale ratings were, indeed, correlates of task performance.
Later, Myers, Gebhardt, Price, and Fleishman (1981) identified the
physical abilities required to perform various Army tasks. Subsequently,
job sample tasks, such as grenade throwing (upper-body explosive
strength) and loading of 55-pound cartons onto a truck (upper-body
static strength) were developed to measure these constructs. When per-
formance on the job sample tasks was correlated with performance on
the physical ability marker tests of these abilities, coefficients on the or-
der of S O were obtained, confirming the expected relationship between
abilities and performance. Furthermore, the performance tests identi-
fied using this approach evidenced some convergent and discriminant
validity across job sample tasks. Another study, by Hogan et al. (1978),
confirmed these results using a job sample developed to simulate the or-
der selection and loading operations in a large warehouse. Generic abil-
ity tests, selected on the basis of the ability requirement scales, yielded a
multiple R of .45 in predicting performance on the job sample. Finally,
in a recent study, Gebhardt and Schemmer (1985) found validities in the
30s for generic tests of abilities in the taxonomy against job samples of
tasks performed by dockworkers.
Other investigations of performance prediction, based on the ability
taxonomy and rating methodology, have involved a broader set of crite-
ria and occupations. In these studies, the ability requirement taxonomy
FLEISHMAN AND MUMFORD 559
Conclusions
Some time ago, Paul Fitts (1962) stated the requirements for a tax-
onomy of human performance:
1986). This cumulative theoretical progress may itself dictate future re-
visions in the ability requirement taxonomy to further enhance the mean-
ingfulness and comprehensiveness of the resulting descriptive informa-
tion. Current efforts are being made to extend this taxonomy to cap-
ture dimensions of interpersonal skills, personality, and generic knowl-
edges that might contribute to the development of skilled performance
(Fleishman & Friedman, 1990; Mumford, Weeks, Harding, & Fleish-
man, 1988; Mumford & Nickels, 1990). The current Manual for the Abil-
ity Requirement Scales (Fleishman, 1975a, 1991) is being supplemented
with such additional scales to measure these requirements. Examples
of task-anchored scales for skillfinowledge requirements developed are
mechanical knowledge, elecfricallelectronicknowledge, knowledge of tools
and uses, map reading, drafting, reading plans, driving, typing, keypunch-
ing, telefypewriting, shorthand, spelling, and grammar. Examples of per-
sonality requirements and interpersonal skill requirements are social
sensitivity,persuasion,persistence, behavior flexibility, dependability,emo-
tional stability, and self-confidence. This progressive refinement should
be viewed as a desirable characteristic of systematic construct validation
efforts.
This article has focused solely on inferences derived from the ability
requirement taxonomy. The evidence obtained for the meaningfulness
of this approach in no way speaks to its merits relative to alternative
systems for generating summary descriptions of job behavior. There is,
however, a need for this kind of comparative evaluation research. Hope-
fully, the evaluation principles described will facilitate such comparative
research by demarking certain crucial validation issues. Construct val-
idation focusing on the meaningfulness of the descriptive information
provided by a particular classification system represents a necessary first
step for conducting more complex comparative studies.
The need for further efforts along these lines becomes especially im-
portant when it is recognized that these schemes for generating sum-
mary descriptions of job behavior provide a foundation for many prac-
tical and theoretical efforts in the field of industriallorganizational psy-
chology (see Fleishman & Quaintance, 1984; Primoff & Fine, 1988). If
the classification schemes in use do not permit valid inferences about
job behavior, then much of this work will be based on an insecure foun-
dation. When we lack adequate construct validation evidence, the fail-
ure of an intervention might be plausibly attributed to poor description,
rather than to some inherent deficiency in the intervention technique
being evaluated. It would seem that substantially more effort should be
devoted to establishing the construct validity of other job analysis sys-
tems. We hope that the present effort will serve as an impetus to further
568 PERSONNEL PSYCHOLOGY
research along these lines, because it is only through the systematic ap-
plication of scientific principles that we can construct truly meaningful
descriptions of job behavior.
REFERENCES
Fleishman EA. (1988). Some new frontiers in personnel selection research. PERSONNEL
PSYCHOLOGY,41 679-701.
Fleishman EA. (1991). Manual for the Ability Requirement Scales (MARS, revised). Palo
Alto, CA: Consulting PsychologistsPress.
Fleishman EA, Cobb AT, Spendolini MJ. (1976). Development of ability requirement scales
for the analysis ofyellowpage salesjobs in the Bell System (Final Report). Bethesda,
MD: Management Research Institute.
Fleishman EA, Elkin EH, Baker WJ. (1983). Effects of drugs on human peqormance: The
effects of scopolomine on representative tests of human peformance. Bethesda, MD:
Advanced Research Resources Organization.
Fleishman EA, Ellison GD. (1969). Prediction of transfer and other learning phenomena
from ability and personality measures. Journal ofEducationa1Psychology, 60,300-
314.
Fleishman EA,Friedman L. (1990). Cognitive competencies related to management per-
formance requirements in R&D organizations (Tech. Rep.). Fairfax, VA Center for
Behavioral and Cognitive Studies, George Mason University.
Fleishman EA, Fruchter B. (1960). Factor structure and predictability of successive stages
of learning Morse code. Journal ofApplied Psychology, 44,96101.
Fleishman EA, Fruchter B. (1965). Component and total task relations at different stages
of learning a complex tracking task. Perceptual and Motor Skills, 20,1305-1311.
Fleishman EA, Gebhardt DL, Hogan JC. (1986). The perception of physical effort in job
tasks (pp. 225-242). In Borg G (Ed.), Perception of aertion in physical exercise.
London: MacMillan Press.
Fleishman EA, Hempel WE Jr. (1954). Changes in factor structure of a complex psy-
chomotor test as a function of practice. Psychometrika, 18,239-252.
Fleishman EA, Hempel WE Jr. (1955). The relation between abilities and improvement
with practice in a visual discrimination reaction task. Journal of Experimental Psy-
chology, 49,301-312.
Fleishman EA, Hempel WE Jr. (1956). Factorial analysis of complex psychomotor per-
formance and related skills. Journal of Applied Psychology, 40,96104.
Fleishman EA, Hogan JC. (1978). Taronomicmethod for assessing the physical requirements
of jobs: Thephysical abilities analysis approach (Tech. Rep. 3012lR78-6). Bethesda,
MD: Advanced Research Resources Organization.
Fleishman EA, Mumford MD. (1988). Ability requirement scales. In Gael S (Ed.), Thejob
analysis handbook for business, government, and industry (pp. 917-935). New York:
Wiley.
Fleishman EA, Mumford MD. (1989a). Individual attributes and training performance. In
Goldstein IL (Ed.), Training and development in organizations (pp. 183-255). San
Francisco: Jossey-Bass.
Fleishman EA, Mumford MD. (1989b). Abilities as causes of individual differences at
different stages of skill acquisition. Human Peformance, 2,201-222.
Fleishman EA, Quaintance MK. (1984). Taronomies of human performance: The descrip-
tion ofhuman tasks. Orlando, F L Academic Press.
Fleishman EA, Reilly ME. (1991). Human abilities: Their definition, measurement, and job
task requirements. Palo Alto, C A Consulting Psychologists Press.
Fleishman EA, Rich S. (1963). Role of kinestheticand spatial-visualabilities in perceptual-
motor learning. Journal of Experimental Psychology, 66,611.
Fleishman EA, Stephenson RW. (1972). Development of a taxonomy of human perfor-
mance: A review of the third year’s progress. JSAS Catalog of Selected Documents
in Psychology,2,39-40 (Ms. No. 112).
Fogli L. (1988). Supermarket cashier. In Gael S (Ed.), The job analysis handbook for
business, government, and industty (pp. 1215-1228). New York: Wiley.
FLEBHMAN AND MUMFORD 571
French JW, Ekstrom RB, Price LA. (1963). f i t of reference testsfor cognitivefactors. Prince-
ton, NJ: Educational Testing Service.
G a p e RM. (1962). Human functions in systems. In Gagne RM (Ed.), Psychologicalprin-
ciples in system development (pp. 35-74). New York Holt, Rinehardt, & Winston.
Gebhardt DL, Cooper M, Jennings MC, Crump C, Sample RA. (1983). Development
and validation of selection tests for a natuml gas company (Final Report 30789).
Bethesda, MD: Advanced Research Resources Organization.
Gebhardt DL, Crump CE. (1983). Development of physical performance selection tests
for paramedics in the city of Los Angeles (Tech. Rep.). Bethesda, M D Advanced
Research Resources Organization.
Gebhardt DL, Jennings MC, Fleishman FA. (1981). Factors affecting the reliability of
physical ability and effort ratings of Navy tasks (Tech. rep. 3034). Bethesda, MD:
Advanced Research Resources Organization.
Gebhardt DL, Schemmer FM. (1985). Development and validation of selection testsfor long-
shoreman and marine clerks (Tech. Rep. 3113). Bethesda, MD: Advanced Research
Resources Organization.
Gebhardt DL, Weldon LJ.(1982). Development and validation ofphysicalperformance tests
for correctional officers (Final Report 3080). Bethesda, MD: Advanced Research
Resources Organization.
Goldstein IL. (1986). Training in organizations: Nee& assessment, development, and evalu-
ation. Monterey, C A Brooks/Cole.
Gregson RA. (1975). Psychometrics of similariy. New York: Academic Press.
Guilford JP. (1967). Nature of human intelligence. New York McGraw-Hill.
Guilford JP, Hoepfner R. (1966). St~~cture of intellect factors and their tests. Los Angeles:
Psychological Laboratory, University of Southern California.
Guilford JP, Hoepfner R. (1971). The anatysis of intelligence. New York McGraw-Hill.
Guion RM. (1978). “Content validity” in moderation. PERSONNEL PSYCHOLOGY,32,205-
213.
Guion RM. (1980). On trinitarian doctrines of validity. Professional Psychology, 12, 385-
398.
Hackman JR. (1968). Tasks and task performance in research on stress. In McGrath JE,
Social and psychologicalfactors in s!ress. New York Holt, Rinehart, and Winston.
Hamer RM, Cunningham JW. (1981). Cluster analyzing profile data confounded with in-
terrater differences: A comparison of profile association measures. Applied Psy-
chological Measurement, 5,63-73.
Harvey RJ, Lozada-Larson SR. (1988). Influence of amount of job descriptive information
on job analysis rating accuracy. Journal ofApplied Psychology, 73,457-461.
Henderson RI. (1988). Job evaluation, classification, and pay. In Gael S (Ed.), The job
analysis handbook for business, government, and industry (pp. 90-118). New York
Wiley.
Hogan JC, Fleishman EA. (1979). An index of the physical effort required in human task
performance. Journal of Applied Psychology, 64,197-204.
Hogan JC, Jennings MC, Ogden GD, Fleishman EA. (1980). Determining the physical
ability requirements of Emon apprenticejobs (Final Report 3044). Bethesda, MD:
Advanced Research Resources Organization.
Hogan JC, Ogden GD, Fleishman EA. (1978). Assessing the physical requirements in se-
lected benchmark jobs (Final Report 3012). Bethesda, MD: Advanced Research
Resources Organization.
Hogan JC, Ogden GD, Fleishman EA. (1979). The development and validation of tests
for the order selector job at Certijied Grocers of California, Ltd. (Tech. Rep. 3029).
Bethesda, MD: Advanced Research Resources Organization.
572 PERSONNEL PSYCHOLOGY
Hogan JC, Ogden GD, Gebhardt DL, Fleishman EA. (1979). An index of physical effort
required in human task performance. Journal ofApplied Psychology, 65,672-679.
Hogan JC, Ogden GD, Gebhardt DL, Fleishman EA. (1980). Reliability and validity of
methods for evaluating perceived physical effort. Journal ofApplied Psychology: 65,
672-679.
Horn GL, Knapp JR. (1973). On the subjective character of the empirical base for Guil-
ford’s structure of intellect model. PsychologicalBulletin, 80,3043.
Horn JL. (1976). Human abilities: A review of research and theory in the early 1970s.
In Rosennueig MR, Porter LW (Eds.), Annual review ofpsychofogy(Vol. 27). Palo
Alto, C A Annual Review.
Imhoff DL, Levine JM. (1980). Development of apercept%al-motorancognitiveperformance
task battery for pilot selection (Tech. Rep.). Bethesda, MD: Advanced Research
Resources Organization.
Inn A, Schulman DR, Ogden GD, Sample RA. (1982). Physical ability requirements of
Bell System jobs (Final Report 3057/R82-1). Bethesda, MD: Advanced Research
Resources Organization.
Israelski EW. (1988). Automobile mechanic. In Gael S (Ed.), The job analysis handbook
for business, government, and industry (pp. 1311-1328). New York Wiley.
James LR, Muliak SA, Brett JM. (1984). Causal analysis: Assumptions, models, and data.
Beverly Hills, C A Sage.
Jaques E. (1978). Levels of abstracfionin logic and human action. Exeter, NH: Heinemann
Educational Books.
Joreskog CK, Sorbom DS. (1979). Structural equations analysis. Reading, M A : ABT
Books.
Konz SA. (1988). Designing the work environment. In Gael S (Ed.), The job analysis
handbook for business, government, and industry (pp. 731-748). New York Wiley.
Kulik Cr,Oldham GR. (1988). Job diagnostic survey. In Gael S (Ed.), The job analysis
handbook for business, government, and industiy (pp. 936-959). New York Wiley.
Landy FJ. (1986). Stamp collection versus science: Validation as hypothesis testing. Amer-
ican Psychologist, 41,1183-1192.
Landy FJ. (1988). Selection procedure development and usage. In Gael S (Ed.), The job
analysis handbook for business, government, and industry (pp. 271-287). New York:
Wiley.
Latham GP, Wexley KN. (1980). Increasing productivity through performance appraisal.
Reading, MA: Addison-Wesley.
Levine EL, Ash RA, Bennett N. (1980). Exploratorycomparativestudy of four job analysis
methods. Journal of Applipd Psychology, 65,524-535.
Levine JM, Kramer GG, Levine EN. (1975). Effects of alcohol on human performance:
An integration of research findings based on an abilities classification. Journal of
Applied Psychology,60,285-293.
Levine JM, Romashko T, Fleishman EA. (1973). Evaluations of an abilities classification
system for integrating and generalizing findings about human performance: The
vigilance area. Journal of Applied Psychology,58, 147-149.
Lopez FM. (1988). Threshold traits analysis system. In Gael S (Ed.), The job analysis
handbook for business, government, and industry (pp. 880-901). New York Wiley.
Madden JM. (1962). What makes work difficult? Personnel Journal, 41,341-344.
Malamud SM, Levine JM, Fleishman EA. (1980). Identifying ability requirements by
decision flow diagrams. Human Factors, 22,5748.
Mayr E. (1969). Principles of systematic biology. New York: McGraw-Hill.
McCormick ET. (1976). Job and task analysis. In Dunnette MD (Ed.), Handbook of
industrial and organizationalpsychology(pp. 651-696). Chicago: Rand-McNally.
McCormick El. (1979). Job analysis: Methods and applications. New York: Amacom.
FLEISHMAN AND MUMFORD 573
McCormick El, Jeanneret PR, Mecham RC. (1972). A study of job characteristics and
job dimensions as based on the Position Analysis Questionnaire (PAQ). Journal of
Applied Psychology, 56,347-368.
Messick S. (1975). The standard problem: Meaning and values in measurement and
evaluation. American Psychologist, 30,955-966.
Messick S. (1980). R s t validity and the ethics of assessment. American Psychologist, 35,
1012-1027.
Messick S. (1989). Validity. In Linn RL (Ed.), Educational measurement (pp. 13-103).
New York Macmillan.
Miller EE. (1969). A tuwnomy of response processes (Tech. Rep. 69-16). Fort b o x , Ky:
Human Resources Research Organization.
Miller RB. (1962). Task description and analysis. In Gagnt RM (Ed.), Psychologicalprin-
cipks in system development (pp. 187-230). New York Holt, Reinhart, & Winston.
Miller RB. (1967). Task taxonomy: Science or technology? In Singleton W,Easterly RS,
Whitfield DC (Eds.), The human operator in complex systems. London: Taylor &
Francis.
Miller RB. (1973). Development of a taxonomy of human performance: Design of a
systemstaskvocabulary.JSAS Catalogof Selected Documents in Psychology,3,29-30
(Ms. No. 327).
Muliak SA.(1986). Toward a synthesis of deterministic and probabilistic formulation of
causal relations by the functional relation concept. Philosophy of Science, 53,313-
337.
Mumford MD, Nickels BJ.(1990). Making sense of people’s lives: Applying principles of
content and construct validity to background data. Forensic Reports, 3,143-168.
Mumford MD, Stokes GS, Owens WA. (1990). Patterns of life history: The ecology of human
individuality. Hillsdale, NJ: Lawrence Erlbaum.
Mumford MD, Weeks JL, Harding FD, Fleishman EA. (1987). Measuring occupational
difficulty: A construct validation against training criteria. Journal of Applied Psy-
chology, 72,578-587.
Mumford MD, Weeks JL, Harding FD, Fleishman EA. (1988). Relations between student
characteristics, course content, and training outcomes: An integrative modeling
effort. Journal ofApplied Psychology, 73,443-456.
Mumford MD, Yarkin-Levin K,Korotkin AC, Wallis MR, Marshall-MiesJ. (1985). Char-
acteristics relevant to performance as an Army leader: Knowledges, skilk, abilities,
other characteristics, and generic skillr (Tech. Rep.). Alexandria, VA: U.S. Army
Research Institute for the Behavioral and Social Sciences.
Murphy KR. (1989). Is the relationship between cognitive ability and job performance
stable over time? Human Performance, 2,183-200.
Myers DC, Gebhardt DL, Crump CE, Fleishman EA. (1984). Factor analysis of strength,
cardio-vascular endurance, flexibility, and body composition measures (Tech. Rep.
R83-9). Bethesda, MD: Advanced Research Resources Organization.
Myers DC, Gebhardt DL, Price SJ, Fleishman EA. (1979). Development ofphysicalperfor-
mance standardsforAmy jobs: Thejob ana&sismethodology (Report 3045/R79-10).
Bethesda, MD: Advanced Research Resources Organization.
Myers DC, Gebhardt DL, Price SJ, Fleishman EA. (1981). Development ofphysicalperfor-
mance standards for Army jobs: Validation of the physical abilities analysis methodol-
ogy (Final Report 3045). Bethesda, MD: Advanced Research Resources Organiza-
tion.
Myers DC, Jennings MC, Fleishman EA. (1981). Development of job-related medical stan-
dards andphysical testsfor court securify officerjobs (Final Report 3062). Bethesda,
MD: Advanced Research Resources Organization.
574 PERSONNEL PSYCHOLOGY
Olson HC, Fine SA,Myers DC, Jennings MC. (1979). The use of functional job analysis
in establishing performance standards for heavy equipment operators. PERSONNEL
PSYCHOLOGY, 34,351-364.
Owens WA, Schoenfeldt LE (1979). Toward a classificationof persons. Journal ofApplied
Psychology,68,570-607.
Parker JR Jr, Fleishman EA. (1960). Ability factors and component performance mea-
sures as predictors of complex tracking behavior. Psychological Monographs, 74
(No. 503).
Pelz DC. (1952). Influence: A key to effective leadership in first-line managers. Personnel,
29,205-217.
Peterson NG, Bownas DA. (1982). Skill, task structure, and performance acquisition. In
Dunnette MD, Fleishman EA (Eds.), Humanpeflomance andproductivity: Human
capability assessment. Hillsdale, N J Lawrence Erlbaum.
Primoff ES, Eyde LD. (1988). Job element analysis. In Gael S (Ed.), The job analysis
handbook for business, indushy, and government (pp. 807-824). New York: Wiley.
Primoff ES, Fine SA. (1988). A history of job analysis. In Gael S (Ed.), The job anafysis
handbook for business, indushy, and government (pp. 14-29). New York: Wiley.
Reilly RR, Zedeck S, Tenopyr ML. (1979). Validity and fairness of physical ability tests for
predicting performance in craft jobs. Journal of Applied Psychology, 64,262-274.
Reilly RR, Zink DL. (1980).Analysis of three outside crafijobs (AT&T Research Report).
New York American Telephone and 'Megraph Co.
Reilly RR, Zink DL. (1981). Analysis of four inside crafl jobs (AT&T Research Report).
New York American Telephone and Telegraph Co.
Romashko T, Brumbach GB, Fleishman EA, Hahn CF! (1974). Development of aprocedure
to validate physical tests (Tech. Rep.). Washington, D C American Institutes for
Research.
Romashko T, Hahn CP, Brumback GB. (1976). The prototype development of job-related
physical testingfor Philadelphia policeman selection (Tech. Rep.). Washington, DC:
American Institutes for Research.
Rose A, Fingerman P,Wheaton G, Eisner E, Kramer G. (1974). Methodsforpredictingjob
ability requirements. II: Ability requirements as a function of changes in the charac-
teristics of an electronicfault-finding task. Washington,D C American Institutes for
Research.
Rupe JC. (1956). Research into basic methods and techniques of Air Force job analysis
Tv (ALPTRC-TN-56-51). Chanute AFB, I L Air Force Personnel and ltaining
Research Center, Air Research and Development Command.
Schemmer FM. (1982).Development of rating scales for selected visual, auditory, and speech
abilities (Final Report 3064). Bethesda, MD: Advanced Research Resources Orga-
nization.
Schemmer FM, Cooper MA. (1986). Test transportability study for technician jobs in the
telephone industry. Bethesda, MD: Advanced Research Resources Organization.
Simon HA. (1953). Causal ordering and identifiability. In Hood WC, Koopmans TC
(Eds.), Studies in econometric methods (pp. 49-74). New York Wiley.
Simon HA. (1957). Making management decisions. The Academy of Management Execu-
tive, 1(2), 57-64.
Simpson GG. (1961). Principles of animal taxonomy. New York Columbia University
Press.
Snow RE, Lohman DF. (1984). Toward a theory of cognitive aptitude for learning from
instruction. Journal of Educational Psychology, 76,347-376.
Snow RE, Lohman DE (1989). Implications of cognitive psychology for educational mea-
surement. In Linn RL (Ed.), Educational measurement (pp. 263-331). New York:
Macmillan.
FLEISHMAN AND MUMFORD 575
Sokal RR. (1974). Classification: Purposes, principles, progress, prospects. Science, 185,
1115-1 123.
Sokal RR, Sneith PHA. (1963). Princ@lesof numerical taronomy. San Francisco: Freeman.
Sternberg RJ. (1985). Implicit theories of intelligence, creativity, and wisdom. Journal of
Pemomli@and Social Psychology, 44,607-627.
Sternberg RJ.(1986). Synopsis of a triarchic theory of human intelligence. In Irvine SH,
Newstead SE (Eds.), Intelligence and cognition. Boston: Nijhoff.
Theologus GC, Fleishman EA. (1973). Development of a taxonomy of human perfor-
mance: Validation study of ability scales for classifying human tasks. JSAS Catahg
of Selected Documents in Psychology,3,29 (Ms. No. 326).
Theologus GC, Romashko T, Fleishman EA. (1973). Development of a taxonomy of
human performance: A feasibility study of ability dimensions for classifying human
tasks. JSAS Catalog of Selected Documents in Psychology,3,25-26 (Ms. No. 321).
'Qler TA. (1984). Police officerjob description. Flomnoor, I L Merit Employment Assess-
ment Services.
Wagner M. (1985, August). On the use of maintenance reports in job analysis. Paper pre-
sented at the 93rd annual convention of the American Psychological Association,
Los Angeles, CA.
Wallis MR, Korotkin AL, Yarkin-Levin K, Schemmer FM. (1985). Leadership job dimen-
sions and competencyrequirementsfor commissioned and non-commissioned oficers:
Remediation of inadequacies in existing databases (Report 3084, Vol. 2). Bethesda,
MD: Advanced Research Resources Organization.
Weldon L. (1983). Recommendationsfor physical ability testing and medical guidelines for
the city of Pittsburgh (Final Report 3075). Bethesda, MD: Advanced Research
Resources Organization.
Wexley KN. (1989). Contributions to the practice of training. In Goldstein IL (Ed.),
Training and a'evebpment in organizations (pp. 487-500). San Francisco: Jossey-
Bass.
Wheaton GR. (1973). Development of a taxonomy of human performance: A review
of classificatorysystems relating tasks and performance. JSAS Catalog of Selected
Documents in Psychology,3,22-23 (Ms. No. 317).
Wheaton GR, Eisner E, Mirabella G, Fleishman EA. (1976). Ability requirements as a
function of changes in the characteristics of an auditory signal identification task.
Journal ofAppied Pqchology,61,663-676.
Wunder SR. (1981). Predictive validity of a physical abilities testing program for process
apprentices. Baton Rouge, LA: Exxon C o p , Employee Relations Department.
Zedeck S. (1975). klidation of physical abilities tests for AT&T craffpositions: Program
report with special emphasis on detailed job analyses (Tech. Rep. 5 ) . New York
American Telephone and Telegraph Co.