Professional Documents
Culture Documents
Levels of Processing
Levels of Processing
Assuming a specialised language faculty, one can speculate about its internal structure. As
in any scientific discipline, it is useful to divide the overall problem into a number of
subproblems (Garnham 1985: 4), in the case of the language faculty into a number of
subsystems and processing levels. Although this unit focuses on high-level or cognitive
processes, we will include low-level processes such as auditory and visual analysis. The
treatment of these stages of language processing is necessary to get an idea of the early
steps of language processing. Fig 1 provides an overview of the sub-processes involved in
language processing.
Figure 1. Sub-processes of language processing
Model
Lexical Processing
Construction
1
Pragmatic interpretation
Parsing
Comprehension vs Production
A central question in cognitive science concerns the relationship between language
comprehension and language production. Psycholinguists have for a long time
concentrated on studies in language comprehension, primarily for experimental reasons:
language production is much more difficult to control in the laboratory than
comprehension. However, three main observational techniques, the study of disfluencies,
the study of speech errors, and the study of aphasia, have shed some light on the process
of language production.
Meanwhile, enough results have been obtained to outline the differences and
similarities between the two modes, one of the most important tasks being to establish to
what extent language production and language comprehension share processing
mechanisms and rules. From an efficiency point of view, it seems plausible to postulate
each processing mechanism and each rule system only once. For example, it would be a
wasteful duplication of information if the language processor made use of two mental
lexicons, one for production and one for comprehension. However, there is strong
experimental evidence that this is not the case (Garnham 1985: 221). Likewise, we can
assume one general knowledge base, which is made use of in production as well as in
comprehension. Despite these similarities, there are differences between production and
comprehension. One difference concerns the sequence of the activation of the various
mechanisms. Assuming a sequential model of language processing, language production
starts with a thought or an idea, to eventually produce a spoken or written output; language
comprehension, by contrast, first performs an acoustic analysis of the incoming signal,
before a message is generated.
1. Low-level Processes
Not all processes involved in language processing are cognitive (Garnham 1985: 4). In
speech production the articulators have to be set into motion and in the production of
written language the motion of the hand has to be precisely controlled. Both processes are
physiological rather than linguistic. They involve subconscious movements of bodily
organs controlled by the central nervous system. The comprehension of language also
involves low level processes. Prior to cognition an analysis of the sensory signal has to take
place and make the results available to the understanding system. There are two central
input channels to the human language processing system, one for speech and one for
written input. Both input signals are enormously complex and require highly specialized
analysis mechanisms.
In summary, low-level processes extract various properties from an input signal, which
conveys an enormously large amount of information in an extremely short time interval
and is normally intermingled with a good deal of background information. To cope with
these aspects, the processor applies complicated strategies to work out those properties of
the sensory signal that are necessary for higher levels of language processing.
2
2. Lexical Processing
The goal of lexical processing is to retrieve the stored knowledge associated with a
word in order to generate a meaningful interpretation of an utterance. Two fundamental
concepts of lexical processing deserve our specific attention: lexical access and word
recognition. According to Tyler/Frauenfelder (1987, p.6) lexical access refers to the point
at which the various facets of the lexical entry contacted become available, whereas word
recognition defines the end-point of the selection phase, that is the point at which a listener
is able to decide what lexical entry he identified.
While the process of 'word recognition' is relatively unproblematic, the term 'lexical
access' is confusing in many ways. First, it is used fairly inconsistently throughout the
relevant literature. In contrast to Tyler/Frauenfelder, Garnham (1985: 43) defines 'lexical
access' as the retrieval of a word form from the lexicon on the basis of perceptual and
contextual information, and 'word recognition' as the identification of one remaining word
candidate. Aitchison (1992: 53, 95) views 'word recognition' as a two-stage process: at the
first stage, the stage of 'lexical access', the input is matched against possible words, and at
the second, the multiple possibilities are narrowed down to one candidate. This view is
paralleled by Zwitserlood's (1989) approach, where stage one of the word recognition
process is defined as 'lexical access' and stage two as 'lexical selection'. Thus, we are
confronted with various different interpretations of the term 'lexical access'. On the one
hand, it is viewed as the initial stage of the more general process of word recognition
(Aitchison, Garnham, Zwitserlood), on the other hand, it is defined as the retrieval of
lexical information (Tyler/Frauenfelder). In both cases, the term 'access' seems to be
interpreted too narrowly. To 'access something' means to 'reach' or 'make use of something'.
Thus, the process of 'lexical access' should rather be interpreted as 'making use of the
lexicon' or 'preparing the lexicon for use' just as we open a dictionary before we actually
start reading in it. Such a generalized interpretation is much more plausible if we consider
the computational interpretation of 'access' in the sense of 'file access', which is normally
read as 'making use of a file' or 'preparing a file for read/write operations'. Thus, it seems
reasonable to extend the term 'lexical access' and consider it synonymous with lexical
processing henceforth. The process of making available lexical information, i.e. 'lexical
access' in Tyler/Frauenfelder's sense, will be referred to as lexical retrieval instead.
The relationship between word recognition and lexical retrieval is assumed to be
sequential. Most theories of lexical processing claim that word recognition precedes lexical
retrieval. This in turn raises the question whether either of the two stages can be by-passed.
Put differently, is it possible to retrieve information from the mental lexicon without
recognizing what was heard, or, vice versa, is it possible to perceive a sensory input, i.e.
hear or read a word, without understanding it? These are debatable issues among
psycholinguists.
3
3 Parsing
The parsing process can be subdivided into two sub processes: the identification of the
internal structure of words (morphological analysis) and the analysis of sentence structure
(syntactic analysis).
(a)morphological analysis
Morphological analysis is concerned with the internal make-up of words. But what can be
considered to be a word? Morphologically, a word is the actual realisation of a lexeme, the
fundamental unit of the lexicon of a language (Matthews 1972: 22).
For example, dies, died, dying, and die are forms or 'words' of the lexeme DIE.
According to Quirk et al. (1985: 67ff), words can be subdivided into two general classes:
- open-class words
- closed-class words
While open-class items (nouns, full verbs, adjectives, adverbs) allow the creation of new
members and thus constitute classes which are unlimited in number, closed-class items
(prepositions, pronouns, determiners, conjunctions, auxiliary verbs, interjections) are
closed in the sense that they are highly resistant to the addition of new members and can
only exceptionally be extended by processes of alternation. In other words, closed-class
words are limited in number (Huddleston 1984: 120ff). Consequently, the number of words
in a language depends on the productivity of the processes capable of extending the open-
class items.
The task of morphological analysis, then, is to find out the basic building blocks of which
open-class words are constructed. These building blocks, or morphemes, constitute the
smallest units of grammatical analysis (Matthews 1974: 13). By convention, morphemes
are presented in curly brackets. They can be free, for example table, or bound, e.g. -s. Free
morphemes serve as the basis for further morphological processes. They are generally
referred to as roots.
The terminology in this area is very fluid. Some linguists draw a distinction between root,
the fundamental morphological unit, and stem, basis for inflectional processes. According
to this distinction, the following root and stem relationships can be postulated:
farm -»· root, also a stem
farms —» root/stem + inflectional affix
farmer —root/stem + derivational affix = new stem
farmers stem + inflectional affix
4
In English this distinction seems quite trivial. In languages, however, where reduced forms
can be generated, it makes sense to postulate such a specification. For example, in German
derivatives such as Röschen /'roesçan/ ("little rose"), the diminutive -chen does not attach
to the nominative singular form Rose /'ro:z3/ but to a lesser form, in this case Ros-. This
lesser form would then be the root and the resulting derivative, Röschen, which serves as
the basis for further inflectional processes, the stem.
For reasons of generalisation across languages, we have good reason to make use of this
differentiation henceforth. The actual realisation of a morpheme is referred to as morph.
The examples in (5) illustrate the different phonological realisations of the plural
morpheme in English.
(5) a. {cat} + w • /'kaets/
b. {dog} + w • /'dDgz/
c. {rose} + {s} /'rauziz/
Bound morphemes can also be referred to as affixes. Affixes in turn can be classified into
prefixes (affixes that precede the root), infixes (affixes that are inserted into the root), an
suffixes (affixes that follow the root). Somewhat natural antitheses to infixes are
circumfixes, which attach discontinuously around a stem (Sproat 1992: 50). The process
of combining an affix with a root is called affixation.
(b) syntactic analysis
The syntactic structure of a sentence determines the relationship between the words in a
sentence. It indicates how words can be grouped together to form phrases, to what extent
words modify other words, and what words are syntactically most important. The parsing
process extracts the structural properties of a sentence and eventually produces a
representation which contains the general syntactic aspects of the sentence, such as:
A) The man gave the book to john
A possible functional syntactic representation of sentence (A) could look like this:
[(Sentence-Features:
(S-Type: Declarative)
(Tense: Past)
(Voice: Active)
(Aspect: Perfective, Simple)
(Mood: Indicative))
(Syntactic Functions:
(Subject:
(Direct Object:
(Indirect Object:
5
(MAN, Definite)
(BOOK, Indefinite)
(JOHN, Proper))]
4. Semantic Interpretation
Generating the functional structure of a sentence is just one step towards building an
understanding of a sentence. Over and above the structural properties of a sentence we need
to determine its meaning. The process of generating and representing a sentence's meaning
is called semantic interpretation. Recently, a distinction has been drawn between aspects
6
that primarily have intra-linguistic relevance and aspects that relate to external domains
(Schwarz 1992: 49). Such a two-level model suggests that the process of semantic
interpretation can be subdivided into two stages: the generation of a linguistic-semantic
form which has no contact with external domains, and the generation of a structure which
has access to the outside world and incorporates general knowledge. In computational
linguistics these two levels are referred to as logical form and conceptual representation
(Allen 1987: 193ff).
7
sense. This second level of semantic interpretation, then, builds a conceptual structure
which allows the drawing of inferences and conclusions.
That human experience is often a key factor in the semantic interpretation of a sentence
can be illustrated using the following examples:
(9) a. He read a book about music in the last two hours,
b. He read a book about music in the last century.
The central problem of these two sentences concerns the relationship between the act of
reading and the adverbial of time "in the last” While in sentence (9a) the temporal adverbial
is external to "a book about music" and describes the length of the reading process, (9b)
exhibits a relationship of the opposite kind. Here, the adverbial "in the last century" relates
to "a book about music". How do we arrive at such a conclusion? Clearly, our knowledge
about the relationship between a human action such as reading, human lifetime, and time
in general, guides our interpretation process. Whatever the internal stages of semantic
interpretation, the main task of semantic analysis is to give a precise account of the meaning
of a sentence.
In summary, the process of semantic interpretation has to fulfil the following tasks: it has
to specify the meaning of the words in a sentence, it has to define the meaning relations
between the words and phrases in a sentence, and it has to couple linguistic interpretation
techniques with general knowledge in order to generate a conceptual structure.
2. Higher Levels
Beyond the syntactic analysis of sentences and their semantic interpretation, two further
levels are involved in natural language processing. One such level is concerned with the
8
building of a model of the discourse and the situation which the actual sentence describes.
The second of these higher levels determines what to do with that model; expressed
differently, it defines what message is conveyed.
2.1 model construction
Speakers and listeners keep track of what is being talked about. They introduce and
reintroduce referents (persons, objects) and make assumptions about them, they change
topics; in short, they build mental models. Levelt (1989: 114ff) distinguishes four types of
knowledge structure on the basis of a two-person interaction. The first kind of knowledge
is the knowledge which the speaker believes he shares with his addressee. It is called
common ground. The second kind of knowledge is a collection of knowledge structures
which the speaker believes he has successfully transmitted to his interlocutor. These own
contributions are mixed with the interlocutor's contributions, the third knowledge structure.
The remaining knowledge structure is the information yet to be conveyed or the
communicative goal. All four knowledge structures constitute the speaker's discourse
model, which is defined by the speaker's plus the interlocutor's contributions. In addition
to the discourse model, humans build general performance models of the interlocutor which
contain information about the interlocutor's preferred interaction modes, a rough
characterisation of his linguistic competence, an assessment of the interlocutor's memory
ability, and an indication of what his goals seem to be for the remaining dialogue. In other
words, we build a model of what the interlocutor knows, how he thinks, what he
memorises, and how he learns. The process of inferring a person's cognitive state from his
performance can be called cognitive diagnosis (Ohlsson 1987: 204). Research into the
modelling of these aspects has primarily been carried out in the area of ICAI (see section
1.1.3., above), where computer systems used for teaching are equipped with a user-
modelling component that represents the student's understanding of the material to be
taught (Bumbaca 1988: 228).
9
create an ironical effect. A performative verb such as promise, then, can be used as a
warning, provided that the general circumstances permit such an interpretation. The
pragmatic theory of speech acts, which goes back to Austin (1962), approaches phenomena
of this kind, but constructing an adequate theory in this area is very difficult since many
contextual influences play a role in the interpretation of natural language.
Another aspect of pragmatic interpretation deals with the fact that speakers adhere to a
general principle of co operativeness. This principle, which was first formulated by Grice
(1975), defines a general framework of conversation where speakers mutually assume that
their contributions are purposeful, well-conducted, or, more generally, co-operative. The
co-operative principle is supplemented by four maxims that Grice considers to follow from
it:21
- the principle of quality
- the principle of quantity
- the principle of relation
- the principle of manner
These maxims are not scientific laws that determine the operation of natural language
processing; rather they serve as defaults or norms that can be violated, or, to use Grice's
terminology, 'flouted'.
(12) a. Speaker A: "Can you pass me the salt?"
Speaker B: "It's a nice day."
In (12) we are confronted with two examples of a violation of the maxim of relation, which
essentially says: be relevant. In both cases, speaker B's contribution is superficially
irrelevant as an answer to the question asked by speaker A. However, such a violation does
not necessarily lead to a failure of the conversation. Assuming that any contribution is
purposeful or cooperative, the language processor tries to establish a relationship between
question and answer, to work out what was meant. Such a relationship is called
conversational implicature. In example (12b), this relationship is obvious: "on broken
watches one can't read the time; speaker Β has such a watch and can thus not answer
speaker A's question". Speaker B's answer in (12a), by contrast, is really irrelevant, unless
we construct something like "under normal circumstances speaker Β never passes the salt
to A; however, since it is a nice day he will make an exception". Both levels, model
construction and pragmatic interpretation, are processes which are well beyond the scope
10
of the central linguistic levels: phonetics, grammar and semantics. They are enormously
difficult to define, and, despite more or less precise theoretical underpinnings, these
extremely complex high-level processes are very hard to capture.
Knowledge Base
• Situation knowledge
• Model construction
• Pragmatic
interpretation
Linguistic Processing
Lexicon
Comprehension System
• Lemma lexicon
• Semantic interp
• Form Lexicon
• Parsing
11