You are on page 1of 11

Annals of Library Science and Documentation 1988,35(2),58-68

LANGUAGE AND INDEXING LANGUAGE


NALIMOV AND GARDIN REVISED
F J DEV ADASON
Documentation Research Training ~ntre
Indian Statistical Institute
&ngalore-560059
MRKUMBHAR
Librarian
Manipur Univel'lity Library
Canchipur
Imphal-795003
A study of the salient features of ordinary difficult to imagine any satisfactory definition
language is necessary for the design of infor- of the term 'language' that did not incorporate
mation storage and retrieval systems in general some reference to the notion of cornmunica-
and indexing systems in particular. This paper, tion" [5]. Language is a specific social means
based on the works of Nalimov, and Gardin, of information storage and transfer as well as
attempts to discuss some of these salient [ea- of controlling human behaviour. It is a purely
tures. Only the relevant characteristics of ordi- human and non-instinctive method of com-
nary language are mentioned, which are re- municating ideas, emotions and desires by
fleeted in indexing languages. The important means of a system of voluntarily produced
features of indexing language and its meta- symbols. Only the communicative aspect of
language character are also mentioned. Com- language plays an important part in in form a-
morl analogical features of both subject ana- tion systems, "the essence of which is storage
lysis and linguistic analysis are presented along of information for use and manipulation by an
with a guage. Indicaters that the Postulate based individual" [6].
Permuted Subject Indexing (POPSI) language, One of the features of modem investigation
together with its vocabulary control tool Clas- of language is the recognition that 'ordinary
saurus, has also the important components of language' - be it written or spoken - is only
an indexing language. a member of a class of coherent symbolic com-
munication systems. All conceivable sym bol
INTRODUCTION systems have been considered as languages. For
instance : the language of music, the language
Language is a means of communication and of dance, the languages used for computer
thinking - for dissemination of information programming etc. But fulfilment of the com-
and to at' extent the creation of it. In indexing, munication function alone cannot be con-
language plays a very special role in information sidered as a necessary and sufficient require-
organisation and documentation. However, it ment for elevating a symbol system to the rank
is not language which is the object of study of 'ordinary language'. Exchange of informa-
here, but rather the use of language to formu- tion may be effectuated not only with 'words'
late and communicate ideas and opinions [1]. but also with other symbols. Also exchange of
A study of the salient features of language is information may take place between a human
helpful in the design of information storage being and a computer and also between inani-
and retrieval systems ir; general and indexing mate mechanism - between two computers.
systems in particular [2]. This paper based on Many phenomena of the physical world can be
the works of Nalimov [3] and Bardin [4] pre- regarded in terms of receipt and transfer of
sents some of the salient features of langugage information. Even photoelectric effect can be
and indexing language. regarded as information transfer. But these do
not become 'ordinary languages'. The essential
LANGUAGE functional characteristic of ordinary language
(referred to as just 'language' hereafter) is its
Language is system of symbols serving the means role in information reduction or synthesis,
of human communication, thinking and ex- abstraction, generalisation; in elaboration or
pression. The primary function of language is expansion; and particularisation or concreti-
communication between individuals. "It is sation - in short, thinking.

58 Ann Lib Sci Docu


LANGUAGE AND INDEXING LANGUAGE

STRUCTURE OF LANGUAGE 1. subelementary linguistic symbols: the letters


or alphabets, morphemes (a morpheme is a
Signs/Symbols meaningful part of word - the root and the
affix, prefixes, suffixes etc.):
First, a language is composed of a plurality of
signs - articulate sounds as signs/symbols of 2. the elementary symbols: the words (a word
ideas. is a symbol for one of the smallest com-
Second, in a language each sign/symbol has pletely satisfying bits of isolated 'meaning' -
a signification common to a number of inter- a fragment of a text between spaces);
preters.
Third, signs constituting a language must be 3. phrases (a phrase is a fragment of a text
'common signs', that is, producible by the between two punctuation marks);
members of the interpreter-family and have the
same signification to the producers which they 4. clauses and sentences (a sentence is a frag-
have to other interpreters. ment of a text between full stops); para-
Fourth, the signs which constitute a langu- graphs and SCi on.
age are plurisituational signs, that is, signs with
a relative constancy of signification in every The complex symbols constructed from simpler
situation in which a sign of a sign-family in ones fcrm a hierarchical symbol system of
question appears. A sign in a language is a sign- several levels. Phenomenologically, thinking is
family and not merely an un i-situational vehicle a proc.ess of constructing complicated symbols
[7). A sign and its meaning do not completely from simpler ones which is outwardly reflected
cover each other. Their boundaries do not in the hierarchical structure of language (3). It
coincide in all points. One and the same sign is reflected even in the sequence of sentences,
has several functior:s. One and the same meaning paragraphs and even larger portions of text. A
is expressed by several signs. "If signs were text does not consist of a sequence of unrelated
fixed and _each of them fulfilled only one sentences. If it did, it would not convey the
function, language would become a mere collec- intended message properly. In fact, every pre-
tion of labels" [3). It is not enough for the ceding sentence thrives to fix the context for
perfection of language that sounds can be made the next sentence, helping the precise under-
signs of ideas, unless those signs can be so made standing of the following sentences, infusing a
use of as to comprehend several particular kind of hierarchy among them. "It should be
things: for, the multiplication of signs would acknowledged that logical hierarchy of state-
have perplexed their use, had every particular ments exists in language, but it is so concealed
thing needed a distinct name to be signified that in practice, it cannot be directly ob-
by. To remedy this inconvenience, language had served" [3, p.31). The narration of events in
yet a further improvement in the use of general chronological sequence in text is an example
terrns, whereby one word was made to make a of this kind of 'hierarchy'.
multitude of particular existences.
Fifth, the differen t signs in a language Grammar
should fit into a semantic hierarchic structure. In order tc make operative the rules for con-
"I see two things: a chair and a furniture" is structing complex logical structures using the
not an acceptable sentence. Semantic hierarchy simpler symbols, the symbols are viewed as
among the different signs may be viewed as belonging to grammatical categories or parts-of-
one of the conditions necessary for regarding a speech such as noun, verb, adverb etc. The
system of signs/symbols as a language. The signs rules of syntax (grammar) are defmed with
in a language must constitute a system of inter- respect to these categories. But Chinese makes
connected signs combinable ir. some ways and no use of the formal categories devised for the
not in others in order to form a variety of Indo-European languages. So also the poly-
complex symbols of different levels [7] . synthetic languages of American Indians. Sym-
bols and grammar are, of course structural ele-
Symbol system ments of language; they are clearly seen in the
majority of the symbol systems perceived as
The structure of language starts with languages [3, p.28).

Vol 35 No 2 June 1988 59


DEVADASON

Characteristics of Language so rich that every thing stated in terms of the


object language could be said in the metal-
In terpretability language; particularly, it should have the means
for constructing names of the object-language.
A sign system has a right to be called a language "The metalanguage of everyday speech uses
if it can be interpreted into another language. the same guage [3, p.39] .
Speaking to a foreigner we interpret our mother
tongue in the system of another language, and LANGUAGE OF SCIENCE
actually it is not a translation but a mere inter-
pretation. This is because of the metaphors, Components of the Language of Science
special phrases, idoms etc., linked to culture
and historical experiences present in language. Science is a linguistic or symbolic representa-
tion of experience. Scientific development is
Polymorphism reflected in the development of the language
of science. In the language of science the sym-
Consider the following three phrases : bols and grammar of ordinary language are used.
New terms (or new combination of old words)
1 This a good mango, and new meanings ascribed to old words bor-
2 This a good lemon, rowed from everyday vocabulary are introduced.
3 This is a good carving knife. This gives an esoteric character to specific langu-
ages of sciences: they prove comprehensible
It is readily seen that the word 'good' has quite only to the initiated. Terms in science are
different meanings in these phrases. A good closely connected with the respective theoreti-
lemon Should be sour, a good mango should cal concepts, though on the surface, many
not be sour; and a good carving knife should terms seem to be more than names of some
be sharp which has nothing to do either with object or phenomena. For example the term
the quality of lemon or with that of mango. 'Raman effect' would seem to be a name of
Both in every-day language and in many other some physically observed phenomenon. In
languages every special symbol is connected in fact, the meaning of this term becomes clear
a probabilistic manner with a variety of mean- only through the understanding of the theory
ings. One of the characteristics of polymorphism of this phenomenon. No doubt, in the language
is that of synonymy. This diversity of everyday of science there are some terms which may be
language is considered its most essential charac- clearly defined. But the meaning changes with
teristic and is no longer regarded as an index time, together with the development of scienti-
of its deficiency. It is due to its polymorphism fic concepts. The meaning ascribed to the word
that natural language is richer than any arti- 'atom' now differs considerably both from that
ficially created one. ascribed by the ancient Greeks and from that
used at the beginning of this century. The
METALANGUAGE meaning ascribed to terms change in different
theories though they may be closely related.
Languages with highly developed logic emerge Both 'Classical mechanics' and 'Relativity
up to constitute metalanguages whereby one theory' make use of such terms as 'mass' and
may judge the correctness of statements made 'length' but they are interpreted differently.
in the object-languages. Our everyday language
is a metalanguage in relation to the. 'language' Polymorphism of the LangUageof Science
of things surrounding us. In terms of every-day
language we operate not with things but with Scientific terms have a more polymorphic
their names. Making judgements about the character than the words of ordinary language,
things of the outer world, we try to arrange The term 'model' has been studied by Cho
them in some consistent structures which is Yuan-Ren [8]. He has given a list of several
equivalent to searching for logical foundations synonyms which is characteristic of 'model'
of the world of things. Generally the metal- and non-synonyms which are notions contrasted
anguage is formalised to a greater extent than to 'model'. Scientists express new notions with
the object language. A metalanguage should be the help of rather unusual combinations of old,

60 Ann Lib Sei Docu


LANGUAGE AND INDEXING LANGUAGE

well known and familiar expressions. Scientists 2. Information analysis (analytic and synthetic
have permitted rather differen t senses for old processing of information/document surro-
terms with the emergence of new theoretical gates/ documen ts themselves), categorisation
concepts. In science, theories are continuously and systematisation of the incoming flow
changing, but the change does not cuase a of information; and
water fall of new words. The new phenomena
are interpreted, through the old familiar ones, 3. Storage and retrieval of information/docu-
through old words whose meaning is slightly, ments and the development and use of
continuously changed. For instance, every body information retrieval procedures with the
knows the common meaning of the word use of modern means for achieving the de-
'isolate'. In Colon Classification its meaning sired results.
is completely changed (that which in isolation
is not fit to form the name of subject and has of the above mentioned three stages, the second
to have a Basic Subject component along with one namely, information analysis or document
it to represent a subject). In the language of analysis or subject analysis or content analysis
science, new concepts emerge, and old notions decides the efficiency of the system. It deals
are often assigned new meaning. Because of its with systematising or organizing the informa-
continuously changing nature, specific languages tion/documents accompanied by classification
of science are accessible only to those working and indexing of the information content of the
in the field and thus constantly interacting with documents; the creation of new secondary
the informational flow in science. It can be documents or document surrogates on the basis
said that, the emergence of a new independent of certain rules and procedures depending on
discipline must be accompanied by the emer- specific tasks of information practice to form
gence of a new specific language or a dialect of the index me or enquiry me, and the retrieval
the discipline. "In the realm of professional procedures constituting the interface in the
activities too there evolves a restricted and information retrieval process. A model Biblio-
accommodated language, a so called professional graphic Information System is given in figure-I.
language. A professional language is developed Documents are selected, received and their
and used by people for some professional pur- information content analysed to determine the
pose, that is, performing professional tasks in subject categories into which they are to be
a specific field and communicating about them. classified as well as the index terms associated
A professional language consists of vocabulary with the documents, according to a specified
(terminology and concepts) and types of com- language having grammar, called information/
municative acts (including typical intentions). documentation/indexing language. After the
A professional language is not only used for completion of this analysis, there is a branching
talking about an activity field (a universe of dis- which allows for more than one method of
course); it is also a part constituting it" 19]. 'organising' the 'files'. The documents them-
of course Library and Information Processing selves are stored physically in a pre-defined
profession has its own professional language and sequence constituting the, 'storage file'. Since
dialects too. only one physical arrangement of documents
is possible without expensive duplication of the
BIBLIOGRAPHIC INFORMATION SYSTEM documents, alternative paths of access to them
are provided through the 'index me' which con-
The functions of a bibliographic Information tains the index terms determined at the analysis
System are: collecting, organising, storing, step .
citing and disseminating of documents or· the
information/ideas recorded in the documents. The user requests information. She/He inter-
Such information processing activity can be acts with the index and/or storage file. To do so,
generally divided into the following stages: she/he must convert his/her request for informa-
tion into a well defined search question to which
1. Collection of information/documents em- the system can respond. The formulation of
bodying information with the purpose of the search question again has to be according
su pplying most fully, the necessary infor- to or in the information/indexing language.
mation/ documen ts in accordance with the When the user is satisfied with the documents
request of the users; which have been located in response to this

Vol 35 No 2 June 1988 61


DEVADASON

r-----------~STORAGq_------~

I
I
I
I
I
I
SELECTION ANALYSIS _- - - - - -- - -1- - - - --
I
I
I
I I
I I i
I I
I I
I I
I
I I
I INDEX 1
I I
I
1 I
I I
: I
I I
I I I
I -
1. ....1.I -II

Fig. 1

request, she/he reads them to obtain new facts formalised structure they are regarded as meta-
and ideas. Presumably, she/he will now become languages for information organisation and re-
~ gene~ator of new documents which will in turn trieval. They are used in
find their way into the system.
The nucleus of the Bibliographic Informa- 1 analysing the information content of docu-
tion System is the Information/Indexing langu- ments;
age. According to it the documents are ana-
lysed. The creation and organisation of the 2 formulating names for the information con-
'index file' and the setting up of the retrieval tent or names for the subject of the docu-
procedures are all based on the analysis of the ments;
documents according to the prescriptions of
the information/indexing language. 3 verifying the correctness and completeness
Natural language is a tool for thinking and of the statements of the names of subjects
a means of communication. In such a language of documents; and
there are synonymous correspondences bet-
ween words and meanings. The meaning of 4 constructing other information handling
words, sentences and even speech may change tools.
with time. Hence, the use of a natura1language
as such in information retrieval systems for the INFORMATION/INDEXING LANGUAGE AS
description of semantic contents of documents METALANGUAGE
is associated with difficult problems. in order
to overcome these problems, special artificial In information/indexing languages, the meaning
languages called information languages/indexing extracted from the text of a document (denot-
languages/documentation languages are used. ing the information content), is designated by
These artificial languages are formalised auxili- concepts and their inter-relations (which are not
ary languages, created by information pro- necessarily found in the text) by 'adhoc' sym-
fessionals for use in bibliographic information bols (descriptors, role indicators, facet indica-
storage and retrieval systems. Because of their tors, links etc.), in a formalised way. Because

62 Ann Lib Sci Docu


LANGUAGE AND INDEXING LANGUAGE

these set of symbols are external to the object- necessary step by which to initiate the descrip-
language in which the documents under analysis tion or generation of natural language text and
are written and intended to facilitate the mani- to the understanding of language. The standard
pulation of these documents in various ways unit of analysis is the sentence, as it is more or
and for various purposes, they can be called less intuitively defined in traditional grammar;
'metalanguages' [41. They are not only used in any larger unit is felt to exceed the scope of
analysing 'what the information content of syn tactical analysis. The basic language units
documents are' but also in constructing state- for syntactical analysis are grammatical cate-
ments of the names of subjects of documents gories (nouns, verbs, adjectives etc.) and gram-
and also for verifying the correctness and com- matical functions (subject, object etc.) of
pleteness of these statements. In the formalised former times. But they are being further refined,
statements of these metalanguages "some of In accordance with the priority given to syn-
the 'logical semantic' relations, specifically tactical analysis, semantic analysis is confined
those of implication are specified, but not in to a subsidiary position as illustrated by Katz
the surface-structure of the object-language, and Fodor [12], which still seems to form the
that is, natural language" [10]. Moreover, these basis of the idea that the properties of 'surface
information/indexing languages, though de- structures' (in general terms) play a primary
veloped under independent conditions for the role in the determination of meaning [13] .
analysis of documents of many different kinds - But linguists themselves have felt the need
not only scientific texts, but also sociological to broaden the basis of language analysis. Grimes
records, scripture, folklore etc., - have striking [14] has pointed out that it is unwise to go on
similarities [4]. An account of these meta- ignoring the findings of other disciplines also
languages has been given by Gardin [11] . concerned with the analysis of text such as,
rhetoric, criticism etc., and information science
FEATURES OF INFORMATION/SUBJECT itself, especially since some of the better de-
AND LINGUISTIC ANALYSIS fined procedures recently setforth in these
areas occasionally unveil facts that linguists
Information/document analysis in the context would be ill-advised to neglect. A number of
of a particular science, obviously implies some linguists have proposed the broader concept of
knowledge 'about' the universe of discourse. 'paraphrase' as a basis upon which to decide the
There is a need to have a knowledge to write proper level where sentences should be assigned
summarising statements about the content of the same 'deep structures' [15]. Also a notion
the document/information being analysed. It is of syntactic analysis as a less discriminating form
left to be decided whether language analysis in of semantic analysis [16], and the notion that
a more general, less specialised field of speech "syntactic structures are derived from semantic
or writing, does not also imply some knowledge graphs" [17] point to the fact that syntax is
about that field. no more considered to be the first step of langu-
Information/indexing languages do not feel age analysis. The derivation of syntactic struc-
it an obligation to consider the natural language tures on 'semantic graphs' is exactly the strategy
sentence as the proper unit of analysis, as a adopted in information/indexing languages in
large number of linguists would have it. For one connection with subject/document analysis.
thing, the definitions of what a sentence is, in An investigation of 'prelexical' structures,
any given natural language are not so stable as equivalent to representations in information/
to provide with a firm analytical framework for indexing languages, prior to the development of
the kind of semantic analysis required in infor- grammatical transformations has also been
mation/indexing languages. Conversely, infra- advocated by linguists [18]. Again the adop-
sentential units often provide a more con- tion of ...•.
'seman tic categories (that) can guide
venient basis on which to conduct the deriva- the interpretation of sentences, independently
tion of information/indexing language units, of and in parallel with perceptual processing of
which are later chained to one another through the syntactic structure" [19] has also been
procedures that again overstep the boundaries advocated. Though some linguists have favoured
of natural language sentence. units smaller than sentence [16, p. 101-102],
In language analysis the priority is given to others on the contrary, have demonstrated that
syntactical analysis, which is considered as the larger units are necessary to provide a proper

Vol 35 No 2 June 1988


DEVADASON

understanding of 'coordination', 'pronominali- suggested. According to a well known linguist


sation', 'subordination' etc. [20-22]. More Langendoen : "eventually it will be necessary
generally there have been many signs of a revival for linguistics theory to come up with a uni-
of interest for the analysis of 'discourse' over versal inventory of roles" [20, P: 79). Vermier
and above the analysis of sentences, as done in [26] too has come up with such conclusions.
document/subject analysis. Such universal roles, then would have to be
It has also been observed by linguists [20, of a rather abstract nature (such as Agent,
P: 32; 199p 297; 14, p.141] that the 'proposi- Instrument, Object, Process, Property etc).
tion' defined in more or less logical terms pro- They would thus tend to converge with logical
vides a more convenient analytical framework operators of the kind used in 'syntactical' ana-
than the grammatically defined sentence. In the lysis, even though the initial purpose was to
presentation of the basic structures that are account for 'semantic' structures. Specific field-
supposed to underlie the formation of speech, bound roles have been used in document/
linguists have tended to use logical concepts subject analysis for a long time. For example
often very close to those which underlie docu- the various faceted classification schemes for
ment/subject analysis. From the modest use of different special fields. The process of abstrac-
'markers' [12] came the view of semantic re- tion leading to the more general 'role opera-
presentations as 'trees' from which has come the tors' or 'categories' too is a well known path
broader concept of relation of combination used in theory of document analysis to relate
information 'about the world' that have to be semantics and syntax. For example the cate-
taken into account for a proper understanding gories of Kaiser's systematic indexing, the
of language. The function of such semantic net- Fundamental Categories Personality, Matter,
works is to provide for the enunciation and Energy, Space and Time of Ranganathan's
application of rules of 'well-forme dness , that Colon Classification and his search for 'absolute
were presented as a necessary extension of syntax' of facets [27, 28]. Moreover the sug-
syntactical theory. The 'presuppositions' have gestion that "deep structures should be stated
been given more and more importance later in terms of role relationships rather than syn-
[23]. Though the notion of 'presupposition' thetic relationships" [19, p. 279; 32, P.62] sug-
needs "some sort of conceptual straightening gested that broad conceptual framework infer-
up" [24] it can still be observed that it is being red from the practice of document/subject
used in the same way, for theoretical purposes analysis should bear some (at least analogical)
just as the notion of semantic or 'paradigmatic' relation to linguistic theory and language ana-
organisation has been used for applied pur- lysis. Linguists also have come up with similar
poses in information/subject analysis. proposals. For instance, Grimes' categories:
Another parallelism with the procedures of States, Processes, Actions etc., [14, p. 167-8]
document/subject analysis could be seen if the and Bach's new category 'Contentive' to cover
kind of 'categories' used in the fomulation of nouns, verbs and adjectives [29] .
presuppositions is considered. In brief, the pre- Further, meaning representation systems
suppositions associated with a given word con-
such as Sager's automatic conversion of texts to
sists in a statement of some relational/combi-
a structured information base [30], Schank's
national properties of the given word with res-
computerised paraphrase and inference system
pect to other words or classes of other words.
[31, 32] , wilk's intelligent analyser and under-
The basic idea is that 'role structures' can be
stander of English [33], question answering
attached to particular words [22, p. 66] in
systems of Grishman and Hirschman [34 J ,
order to express the range of their possible re-
and of Lehnert [35] and so on, all depend on
lations with other words, from a syntactico-
some sort of underlying semantic categorisation
semantic viewpoint. The 'case frames' of Fill-
and deep syntactic relations.
more [25], the 'basic relation structures' of
Bever [13, p.279-352], the 'mental templates' Formally stated, the common thread bet-
etc.) naturally differ according to the parti- ween the theories of document/subject analysis
cular word under consideration. However, and that of language analysis is, the role assigned
identical role structures have been assigned to to n-place predicates (the term 'predicate' in this
classes of words. By carrying the abstraction context designates any relation holding between
process further, a set of 'universal roles' can be two, or more entities or any property of any

64 Ann Lib Sci Docu


LANGUAGE AND INDEXING LANGUAGE

entity) [10, p. 210], to express semantic as 3 To take into account the immensely diverse
well as syn tactic represen tations with: relational data observed in documents, a
set of rules of syntax, constituting the
1 the need to categorise the symbols of the syntagmatic structure of the metalanguage,
vocabulary (words, descriptors) in such a which is contrasted to the paradigmatic
way that formation rules equivalent to the structure, not in essence but in use.
phrase-structure rules of grammar can be
stated adequately with no regard to the POSTULATE BASED PERMUTED SUBJECT
grammatical status customarily assigned to INDEXING LANGUAGE
the words concerned; and
The Postulate-based Permuted Subject Indexing
2 the need to account for the derivation of language of Bhattacharyya [36-42] has all the
propositions from one another, as a necess- essential features mentioned above. It is based
ary component, in the understanding of on
language behaviour.
1 a set of postulated Elementary Categories
of the elements fit to form components of
COMPONENTS OF INFORMATION/INDEX- names of subjects;
ING LANGUAGE
2 a set of syntax rules with reference to the
In documentation/information/indexing langu- Elementary Categories for formulating ad-
ages, there is a focus on the concept of 'sum- missible names of subjects; and
mary' or 'aboutness' which may not be consi-
dered important in 'other' languages. Infor- 3 a faceted hierarchic scheme of terms for
mation/indexing languages are used to formu- vocabulary control called Classaurus.
late statements as answers to questions of
the type 'what the information content of Each of the terms occurring in POPSI language
particular documen ts are about'. These state- statements have their appropriate broader terms
ments stand as names of the topics or subjects of prefixed to them. Hence the POPSI language
the documents. In short they are languages for has been shown as a source language for pro-
naming subjects for the special purpose of ducing and organising associated indexes [43].
storage and retrievaL The minimum essential It has also been shown as a metalanguage for
components of any such metalanguage are [4, computer aided generation of information re-
p.147] . trieval thesaurus [44], as well as Classaurus [45,
46]. POPSI language is amenable for com-
1 The 'lexicon', a list of content terms, either puterisation of index generation, especially in
extracted as they occur in a given natural producing different types of indexes including
language (keywords) or redefined for the chain index and PRECIS-format index entries
purpose of the analysis (descriptors). If the [47. 48] . It is also possible to use POPSI langu-
list of content terms carry no relational age in computer based online information
information of any kind, then the metalan- retrieval. In such an online information retrieval
guage is said to be 'unorganised' (uniterm system the searcher need not know any query
lists) . language and the system would have built-in
vocabulary control mechanism [48,49] .
2 A set of relational data provided 'a periori'
CONCLUSIONS
with the lexicon, irrespective of the way in
which they are expressed (cross references,
A study of the salient features of language and
hierarchy, tree, factoring etc.), and whether
language analysis is necessary for the design of
they be regarded as semantic data as in
information retrieval systems in general and
taxonomies or as syntactic templates as in
indexing system in particular [2] . It would help
some faceted classification schemes. This
in incorporating the required features though
forms the paradigmatic structure of the
not exactly but atleast analogically in indexing
metalanguage .
systems. POPSI has incorporated all the essential

Vol 35 No 2 June 1988 65


DEVADASON

features necessary for making it an universal 11. Gardin, J C. Semantic analysis procedures in the
indexing system. Its resilience and amenability science of man. Soc Sc Inf. 1969,9, 1342.
to computerisation has paved the way for re-
search in this direction, the ultimate aim of 12. Katz, J and Fodor, J .A. The structure of seman-
which is to develop an online information re- tic theory. Language 1963,39,170-210.
trieval system which would be user friendly.
13. Chomsky, N. Deep structure, surface structure
and semantic interpretation. In Steinberg, K and
BIBLIOG RAPHlCAL RE FERENCES Jakobovits, L., Ed. Semantics: An interdiscipli-
nary reader in philosophy, linguistics and psycho-
1. Beling, G. The use of EDP in terminological logy. Cambridge University Press, Cambridge.
work. In Overcoming the language barrier: Pro- 1971. p 214.
ceedings of the language barrier: Proceedings of
the Third European Congress on Information 14. Grimes, J E. The thread of discourse. Cornell
Systems and Networks. Luxembourg, 3-6 May, University, Dept. of Modern Languages and Lin-
1977. K.G. Saur, London. 1979. v.1, P 102-103. guistics, Ithaca, New York. (Technical Report
No.1). (NSF Grant. GS-3180). 1972. P 13-34.
2. Mitchell, Gillian. The natural language founda-
tions of indexing language relations. CanJI of Inf
15. Partee, B H. On the requirement that transfonna-
Sc 1979,4,99-104.
tions preserve meaning. In Fillmore, C.J. and
Langendden, D.T., Ed. Studies in linguistic
3. Nalimov, V.V. In the labyrinths of language: A
semantics. Holt, Rinehart and Winston, New
mathematician's journey. ISI Press, Philadelphia.
York. 1971, p 3.
1981. p 3.

16. Wilks, Y A. Grammar, meaning and the machine


4. Gardin, J .C. Document analysis and linguistic
analysis of language. Routledge and Kegan Paul,
theory.Jl of Doc 1973,29(2),144.
London. 1972, p 94.

5. Lyons, J. Semantics. Cambridge University Press.


17. Hutchins, W J. The generation of syntactic struc-
Cambridge. 1977. p.32.
tures from a semantic base. North Holland,
Amsterdam. 1971.p 2.
6. Sloman, A. The primacy of non-communicative
language. In The analysis of meaning: Proceed-
18. Postal P M. On the surface verbs 'remind'. In
ings of a conference held by the Aslib Informa-
above cited ref. 14; p 249-50.
tics Group and the BCS Information Retrieval
Specialist Group. (Informatics 5). Oxford, 26-28
19. Bever, T G. The cognitive basis for linguistic
March 1979. Aslib, London. 1979. p 1.
structures. In Hayes, J.R., Ed. Cognition and the
development of language. John Wiley, New York.
7. Morris, C. Signs, language and behaviour. Pren-
1970. p. 297.
tice Hall, New York. 1946.

20. Langendden, D T. Essentials of English Gram-


8. Chao Yuan-Ren, Models in linguistics and models
mar. Holt, Rinehart and Winston, New York.
in general. In Logic, methodology and philoso-
1970. p. 141.
phy of science: Proceedings of the International
Congress. Stanford University, Palo Alto, Califor-
21. Lakoff, R. If's, and's and But's about conjunc-
nia. 1962. p 558-66.
tion. In above cited ref. 14; p. 55-70.

9. Israel, J. The language of dialectics and the dia-


lectics of language. Munksgaard, Copenhagen. 22. Thompson, S A. The deep structure of relative
1979. clauses. In above cited ref. 14; p 79-96.

10. Montgomery, C.A. Linguistics and information 23. Fillmore, C J. Verbs of judging: An exercise in
science. Jl of Amer Soc for Inf Sc 1972, 23(3), semantic description. In above cited ref. 14;
214-215. p 273-89.

66 Ann Lib Sei Docu


LANGUAGE AND INDEXING LANGUAGE

24. Fillmore, C J and Langendden, D.T., Ed: Studies 36. Bhattacharyya, G and Neelameghan, A: Postu-
in linguistic semantics. Holt, Rinehart and Wins- late-based subject headings for dictionary catalo-
ton, New York. 1971. gue system. (DRTC Annual Seminar. 7; 1969;
paper CA).
25. Fillmore, C J. The case for case. In Bach, E and
Harms, R. Ed. Universals in linguistic theory. 37. Bhattacharyya, G: A general theory of SIL,
Holt, Rinehart and Winston, New York. 1968, POPSI and Classaurus: Results of current classifi-
p 1090. cation research in India. (Paper presented at the
International Classification Research Forum,
26. Vermier, D. Semantic hierarchies and abstrac- organised by SIG/CR of ASIS, Minneapolis, Oct
tions in conceptual schemata. Inf Systems 1983, 1979).
8(2),117-24.
38. Bhattacharyya, G: POPSI: Its fundamentals and
27. Ranganathan, S R. Hidden roots of classification. procedure based on a general theory of subject
Inf Stor and Retr 1967,3,407-9. indexing languages. Lib Sc with a slant to Doc
1979,16(1),142.
28. Neelameghan, A: Absolute syntax and structure
39. Bhattacharyya, G: Some significant results of
of an indexing and switching language. In Order-
ing systems for global information networks: current classification research in India. Intl

Proceedings of the Third International Study Forum on Inf and Doc 1981,6(1),11-21,
Conference on Classification Research. Bombay,
6-11 Jan, 1975. (FID/CR Pub No. 553). DRTC, 40. Bhattacharyya, G: Subject indexing language: Its
Bangalore, 1979, p 170. theory and practice. (DRTC Refresher Seminar,
13; 1981;paper BA).
29. Bach, E: Nouns and noun phrases. In above cited
ref. 24; p 91. 41. Bhattacharyya, G: Elements of POPS!. In Rajan,
T.N.' Ed. Indexing systems: Concepts, models
30. Sager, N: Natural language information format- and techniques. IASLIC, Calcutta. 1981. p 82.
ting: The automatic conversion of texts to a
structured data base. Advances in Computers 42. Bhattacharyya, G. Classaurus: Its fundamentals,
1979,17,89-162. design and use. In Dahlberg, I., Ed. Proceedings
of the 4th International Study Conference on
31. Schank, R C. Inference and paraphrase by com- Classification Research. Augusburg, 28 June-
puter. Jl of Asso for Comput Machinery 1975, 2 July, 1982. Indeks Verlaag, Frankfurt. v 1.
22,309-28. 1982. p 13948.

32. Schank, R C and Abelson, R P: Scripts, plans and 43. Bhattacharyya, G: POPSI: A source language for
knowledge. In Proceedings of the Fourth Inter- organising and associative classifications. Lib Sc
national Joint Conference on Artificial Intelli- with a slant to Doc 1982, 19, 243-52.
gence. Tbilisi. 1975, p 151-157.

44. Devadason, F J: Postulate based Permuted Sub-


33. Wilks, Y A: An intelligent analyser and under- ject Indexing language as a metalanguage for
stander of English. Comm of the Asso forComput computer aided generation of information retrie-
Machinery 1975, 18,264-74. val thesaurus. IntI Forum on Inf and Doc 1983,
8(1),22-9.

34. Grishman, R. and Hirschman, L: Question ans-


wering for natural language medical data bases. 45. Devadason, F.J. and Kothanda Ramanujam, M:
Computer aided construction of alphabetic class-
Artif Intell 1978,11,2543.
aurus. In (above cited ref. 41 ; p 173 -82.

35. Lehnert, W A: The process of question answer- 46. Devadason, F J: Online construction of alphabe-
ing. Dept of Computer Science, Yale University. tic classaurus: A vocabulary control and indexing
Research Report. 88. 1977. tool. Inf Process and Mgt 1985,21(1),14.

Vol 35 No 2 June 1988 67


DEVADASON

47. Devadason, F J: Computerisation of deep struc- (Guide. M.R. Kumbhar]. Kamatak University.
ture based indexes, IntI Classif 1985; -12(2). P 87. Dharwad. 1985.

48_ Devadason, F J: Computer-based system for


generating different types of subject indexes and 49.- Devadason, F.J. Computerised Deep Structure
alphabetical classaurus based on 'deep structure' Indexing System. FID/CR Report. 21. Indeks
of subject indexing languages. Ph.D Thesis. Verlaag, Frankfurt. 1986.42 p.

68 Ann Lib Sci Docu

You might also like