Professional Documents
Culture Documents
and
Information Science
"This page is Intentionally Left Blank"
Advancement
•
In
Library &Information Science
Punit Ralhan
© Reserved
Typeset by:
Shivangi Computers
267, IO-B-Scheme, Opp. Narayan Niwas,
Gopalpura By Pass Road, Jaipur-302018
All Rights are Reserved. No part of this publication may be reproduced, stored in a
retrieval system, or transmitted, in any form or by any means, electronic,
mechanical, photocopying, recording, scanning or otherwise, without the prior
written permission of the copyright owner. Responsibility for the facts stated,
opinions expressed, conclusions reached and plagiarism, if any. in this volume is
entirely that of the Author. according to whom the matter encompassed in this
book has been originally created/edited and resemblance with any such
publication may be incidental. The Publisher bears no responsibility for them,
whatsoever.
Preface
Punit Ralhan
"This page is Intentionally Left Blank"
Contents
Preface v
1. Automation of Library Services 1
2. Digitisation of Library Services 35
3. ICT Applications in Libraries 69
4. Features of Digital Library 101
5. Digital Information Resources 131
6. Multimedia Systems in Libraries 157
7. Digital Information Preservation 175
8. Classification in Digital Libraries 209
9. Trends in Information Archiving 253
10. Information Retrieval in Modem Libraries 275
11. Information Access in Digital Libraries 291
Bibliography 315
Index 317
"This page is Intentionally Left Blank"
1
Automation of Library Services
Vocabulary Management
This variable is closely related to specificity. Although there
is no necessary connection between type of analysis and
indexing on the one hand and vocabulary control or
management on the other, nevertheless, the provision of
references linking synonymous or equivalent terms,
pointing to related terms, and distinguishing among
ambiguous homographs tends to accompany human
analysis and indexing more commonly than automatic
analysis and indexing.
However, this type of vocabulary management is
increasingly common in automatic experimental systems
and more advanced publicly available systems. Closely
related to several of these key variables is the amount,
nature, and style of information provided to the user about
documentary units. For browsable displayed indexes, this
will be connected to the amount and style of information
provided in index headings, but also to subsequent
documentary unit records that are linked to index headings.
For machine matching systems, this variable relates to
the size and style of the documentary unit records provided
to the user for evaluation, ranging from very brief to very
lengthy. Newer methods of using visual displays to
characterise retrieved or relevant sets of messages has been
more closely tied to automatic analysis and indexing
techniques, but there is no inherent reason why they could
not also be used with human indexing in the context of
electronic IR database displays.
Because variables such as these have typically not been
separately analysed, it has been difficult, if not impossible,
to determine whether the results of particular IR systems
are due to automatic versus human analysis and indexing,
or to different documentary units, different levels of
index able matter and exhaustivity, different types of
interface options provided, different levels of vocabulary
Automation of Library Services 33
Beagrie, N. and Greenstein, D., A strategic policy framework for creating and
preserving digital collections. JISC/NPO Studies on the Preservation
of Electronic Materials. eLib Supporting Study P3. London: South
Bank University, Library Information Technology Centre, 1998.
Bearman, D., "Optical media: their implications for archives and
museums", Archives and Museum Informatics Technical Report, 1 (1).
Pittsburgh, Pa.: Archives and Museum Informatics, 1987.
Conway, P., Digitizing preservation. Library Journal, 1 February 1994,42-
45.
2
Digitisation of Library Services
Beagrie, N. and Greenstein, D., A strategic policy framework for creating and
preserving digitnl collections. JISC/NPO Studies on the Preservation
of Electronic Materials. eLib Supporting Study P3. London: South
Bank University, Library Information Technology Centre, 1998.
Bearman, D., "Optical media: their implications for archives and
museums", Archives and Museum Informatics Technical Report, 1 (1).
Pittsburgh, Pa.: Archives and Museum Informatics, 1987.
Conway, P., Digitizing preservation. Library Tournai, 1 February 1994,42-
45.
Feeney, M. (ed.), Digitnl culture: maximising the nation's investment: a
synthesis of TISCINPO studies on the preservation of electronic
materials. London: National Preservation Office, 1999.
3
leT Applications in Libraries
Users of leT
ICT Competence
Complementary Strategies
There are two main strategies taken by libraries: one, a skill
based or vertical strategy, the .other a use based or
horizontal strategy. The skill based vertical strategy was
aimed at moving people up the technical competence
hierarchy through a deliberate programme of instruction
and development. Success was measured by progression
from novice to basic to intermediate to advanced levels, and
often reflected in numbers completing certificated courses.
Developing leT capability was very much its own end, or
94 Advancement in Library and Information Science
Civic Literacy
Digital Citizenship
The different ways in which libraries are enabling citizens
to access networked information resources and to exploit
the communication potential of the new technologies.
These r~late to:
Helping users discover and retrieve electronic
information relevant to their needs and interests.
Organising, presenting and creating new online
content, especially local content.
Facilitating interactive learning environments and
creating virtualleaming communities.
leT Applications in Libraries 97
1& g Repositories
F1l n
Users
the market will have settled down sufficiently for the legal
rules to be clarified. Until then, economic and legal
uncertainties are annoying, though they have not proved
to be serious barriers to progress. Overall, there appear to
be no barriers to digital libraries and electronic publishing.
Technical, economic, social, and legal challenges abound,
but they are being overcome steadily.
FUTURE OF DIGITAL LIBRARIES
Electronic Publishing
Most libraries are also pursuing a proactive strategy to
increase user access to quality information resources on the
Web by digitising materials and collaborating with others
Features of Digital Library 121
Online CQtalogues
It is clear that the role of the online catalog has changed.
That catalog has become one of many databases available
to users; libraries are linking to and from the catalog to
integrate all of these resources. In recognition of this
broader role for the library catalog, some libraries are
considering modifying cataloguing to favour timely access
to a wider variety of formats. One proposal called for
reallocating funds that are devoted to describing books and
journals to materials that are proportionately underrep-
resented in today's catalogs, such as films, music,
photographs, and digital objects. Libraries are also
reconsidering their current efforts to collect and catalog free
Web resources, concluding that these labor-intensive
activities can be avoided by perfecting nascent machine
harvesting and cataloging techniques.
/
A small number of libraries and library organisations
are participating in experiments f{mded by The Mellon
Foundation to test the application of harvesting and search
engine technologies. Using the recently developed OAI
metadata harvesting protocol, these libraries are delivering
information from the "hidden Web" not normally found by
Internet search engines and from databases with retrieval
formats that present special processing or presentation
problems.
Several libraries are worl~jng with vendors to adapt
existing portal software into multifunctional products with
features and services desired by users in research
communities. Other examples of library involvement in
124 Advancement in Library and Information Science
Dienst Server
This component models the behavior of a Dienst server that
receives a user query, searches its own repository of
technical reports, and simultaneously sends the query to all
other Dienst and MIS servers in its region. If some server
does not respond within the timeout limit, the Dienst server
then sends the query to its region's backup server, if any.
When all contacted servers respond with query
results, the Dienst server returns the merged results to the
user. This component belongs to the VSE class DienstServer
and exhibits the following behavior (methods).
replicationStarted: sends the reference (identity) of the
Dienst server to the region it is contained in at
simulation start-up. The region collects all the Dienst
server references within it and sends the list to each
server.
146 Advancement in Library and Information Science
Lite Server
This component models all of the Dienst server's behavior
except that it does not perform a search on its own since
it does not possess a repository of technical reports. It
belongs to the VSE class LiteSite and exhibits the following
behavior (methods).
replicationStarted: sends the reference (identity) of the
Lite server to the region it is contained in. The region
collects all the Lite server references within itself and
sends them to each server.
serversWithinRegionAre: returns a list of references of all
Dienst, MIS, Backup (if any), and CIS (if any) servers
within a particular region.
dynObjArrived: indicates the arrival of a query
submitted by the user. The server checks if it is
operational by generating a uniform random number
between 0 and 1 and comparing it with the probability
of its going down, and if so, it then sends a "searchFor"
message to all Dienst servers, the MIS server within the
region and the system CIS server. It also sets the
number of replies expected to the query. It schedules
a "timeOut" event for that query, whereupon it checks
if all expected responses for that query have been
received.
timeOut: checks if all the expected responses to a
particular query have been received. If not, it accesses
the backup server of the region (if any). It schedules
another "timeOut", whereupon a check is made to see
if the backup server has responded within the time
limit.
backupServerTimeOut: checks if the backup server of the
region has responded to the query. If not, the Lite
server sends all the ret'ponses it has been able to collect •
so far to the user and informs the user of the servers
that failed to respond within the time limit.
148 Advancement in library and Information Science
Abid, A., Memory of the World: pres('wing our documentary heritage. Paris:
UNESCO, Information and Informatics Division, July 1997.
208 Advancement in Library and Information Science
computers;
astronomy, nature, animals, environment;
first love, star signs, being young today;
horses;
excitement, humor;
fantasy, science fiction; and
books that are easy to read.
From a semantic or disciplinary point of view, the
separation of subjects like animals and horses would
appear to be "incorrect" or "illogical." For the children,
however, this classification worked very well. Category 2
(astronomy, nature, animals, environment) was intended
for a broad group of interests, ,including fact literature,
whereas category 4 (horses) was intended, in particular, for
girls interested in novels about horses. There is, in
Denmark, a special research tradition within children's
librarianship, based on Wanting's research on how children
ask questions in libTaries, that advocates mediating
literature according to the different user interests of
children. Pejtersen has also studied children's use of
libraries in Denmark and their communication with
librarians.
In her development of the Book House system in the
1980s, Pejtersen used a collaborative prototyping approach,
engaging librarians, information scientists, and users in
Danish public and school libraries, and subsequently
designed a special interface of subject icons for browsing
of the Book House system by children. Database 2001 took
advantage of both of these research approaches to
children's information searching. The Book House is a
retrieval system for fiction and is based on a general
conceptual model that seeks to surround users with an
adequate resource space within which to situate their own
search spaces.
Classification in Digital Libraries 221
- ordered
- unordered
The first category has been explored thoroughly by
numerical taxonomists. An early statement of the
distinction between monothetic and polythetic is given by
Beckner: 'A class is ordinarily defined by reference to a set
of properties which are both necessary and sufficient (by
stipulation) for membership in the class. It is possible,
however, to define a group K in terms of a set G of
properties f1' f2/" •• , fn in a different manner. Suppose we
have an aggregate of individuals such that
each one possesses a large (but unspecified) number of
the properties in G;
each f in G is possessed by large number of these
individuals; and
no f in G is possessed by every individual in the
aggregate.
The first sentence of Beckner's statement refers to the
classical Aristotelian definition of a class, which is now
termed monothetic. The second part defines polythetic. To
illustrate the basic distinction consider the following
example of 8 individuals (1-8) and 8 properties (A-H). The
possession of a property is indicated by a plus sign. The
individuals 1-4 constitute a polythetic group each
individual possessing three out of four of the properties
A,B,C,D. The other 4 individuals can be split into two
monothetic classes {5,6} and {7,81. The distinction between
monothetic and polythetic is a particularly easy one to
make providing the properties are of a simple kind, e.g.
binary-state attributes. When the properties are more
complex the definitions are rather more difficult to apply,
and in any case are rather arbitrary.
The distinction between overlapping and exclusive is
important both from a theoretical and practical point of
view. Many classification methods can be viewed as
226 Advancement in Library and Infonnation Science
WWLib
Oassifier
Previous experimentation with automatic classification was
232 Advancement in Library and Information Science
Space
- location
Time
- time period
There are also facets that are common to all the classes.
These are called common isolates. Examples include form
and language. The same facet can be used more than once.
Notations, such as numbers and letters, are used to
represent the facets, while punctuation marks are used to
indicate the nature and type of the following facets. The
classifier's job, therefore, is to combine the available terms
that are appropriate in describing the information package
in hand.
FIVE LAWS OF LIBRARY SCIENCE
I
9
Trends in Information Archiving
may exist specifying the file formats and the general subject
matter the site will accept.
Within the Submission Agreement, one or more Data
Submission Sessions are specified. There may be significant
time gaps between the Data Submission Sessions. A Data
Submission Session will contain one or more SIPs ~nd may
be a delivered set of media or a single telecommunications
session. The Data Submission Session content is based on
a data model negotiated between the OAIS and the
Producer in the Submission Agreement. This data model
identifies the logical components of the SIP (e.g., the
Content Information, PDI, Packaging Information, and
Descriptive Information) that are to be provided and how
(and whether) they are represented in each Data
Submission Session.
All data deliveries within a Submission Agreement are
recognised as belonging to that Submission Agreement and
will generally have a consistent data model, which is
specified in the Submission Agreement. A Data Submission
Session may consist of a set of Content Information
corresponding to a set of observations, which are carried by
a set of files on a CD-ROM. The Preservation Description
Information is split between two other files. '
All of these files need Representation Information
which must be provided in some way. The CD-ROM and
its directory/file structure are the Packaging Information,
which provides encapsulation and identification of the
Content Information and PDI in the Data Submission
Session. The Submission Agreement indicates how the
Representation Information for each file is to be provided,
how the CD-ROM is to be recognised, how the Packaging
Information will be used to identify and encapsulate the SIP
Content Information and PDI, and how frequently Data
Submission Sessions will occur. It also gives other needed
information such as access restrictions to the data.
262 Advancement in Ubrary and Information Science
Beagrie, N. and Greenstein, D., A strategic policy framework for creating and
preserving digital collections. JISC/NPO Studies on the Preservation
of Electronic Materials. eLib Supporting Study P3. London: South
Bank University, Library Information Technology Centre, 1998.
Bearman, D., "Optical media: their implications for archives and
museums", Archives and Museum Informatics Techllical Report, 1 (1).
Pittsburgh, Pa.: Archives and Museum Informatics, 1987.
Conway, P., Digitizing preservation. Library Journal, 1 February 1994,42-
45.
McKemmish, S. and Duff, W., Metadata and ISO 9000 compliance.
Information Management Journal, 34 (1), January 2000.
Roberts, D., The disposal and appraisal of machine-readable records
from the literature. Archives and Manuscripts, 13, 1985, 30-38.
Waters, D.J., Electronic technologies and preservation: [... J based on a
presentation to the Annual Meeting of the Research Libraries Group,
lune 25, 1992.
10
Information Retrieval in Modern
Libraries
<DC:creator> Shakespeare</DC:creator>
<DC:type> play</DC:type>
•
282 Advancement in Library and Information Science
<RDF:RDF>
<RDF:description RDF:about = "http:/ /
hamlet.org/ ">
<OC:creator> Shakespeare</DC:creator>
<OC:type> play</OC:type>
</RDF:descri ption>
</RDF:RDF>
would look in the inverted file and find that the word.
appears in documents 2, 19, and 29. A simple reference to
an inverted file is typically a fast operation for a
computer.
Most inverted lists contain the location of the word
within the document. This is important for displaying the
result -of searches, particularly with long documents. The
section of the document can be displayed prominently
with the search terms highlighted. Since inverted files
contain every word in a set of documents, except stop
words, they are large.
For typical digital library materials, the inverted file
may approach half the total size of all the documents,
even after compression. Thus, at the cost of storage space,
an inverted file provides a fast way to find every
occurrence of a single word in a collection of documents.
Most methods of information retrieval use inverted files.
VIRTUAL LIBRARY AND INFORMATION RETRIEVAL
can not only communicate with other users but also with
digital agents.
REFERENCES
Abid, A., Memory of the W9rld: preserving our documentary heritage. Paris:
UNESCO, Information and Informatics Division, July 1997.
Beagrie, N. and Greenstein, D., A strategic policy framework for creating and
preserving digital collections. JISC/NPO Studies on the Preservation
of Electronic Materials. eLib Supporting Study P3. London: South
Bank University, Library Information Technology Centre, 1998.
Bearman, D., "Optical media: their implications for archives and
museums", Archives,and Museum Informatics Technical Report, 1 (1).
Pittsburgh, Pa.: Archives and Museum Informatics, 1987.
Blake, R, Electronic Records from Office Systems (EROS). In: Electronic
access: archives in the new millennium: proceedings, 3-4 June 1999.
London: Public Record Office, 1999, pp. 52-58.
Conway, P., Digitizing preservation. Library Journal, 1 February 1994, 42-
45.
Cook, T., The impact of David Bearman on modem archival thinking:
an essay of personal reflection and critique. Archives and
Museum Informatics, 11, 1997.
Cox, RJ., Electronic information technology and the archivist: bright.
lights, lingering concerns. A~can Archivist, 55, 1992.
Day, M.W., Preservation problems of electronic text and data. East
Midlands Branch of the Library Association, Occasional papers,
~. 3. Loughborough: EMBLA Publications, 1990.
Feeney, M. (ed.), Digital culture: maximising the nation's investment: a
synthesis of JISC/NPO studies on the preservation of electronic
materials. London: National Preservation Office, 1999.
Fishbein, M.H., Appraising information in machine language form.
American Archivist, 35, 1972, 35-43.
Hills,S., Electronically published material and the archival library.
Electronic Publishing Review, 5 (1), 1985, 63-72
316 Advancement in Library and Information Sdence