Professional Documents
Culture Documents
COLLEGE OF EDUCATION
BACHELOR OF LIBRARY AND INFORMATION SCIENCE PROGRAM
ABSTRACT 3. SUMMARY
- Brief by accurate and clear representations of the - A brief statement expressing the general
contents of a document substance or overall idea of a work (or portion of
- Pieces of text created by the abstractor rather it), recapitulating its main points, findings and
than a direct quotation from the author conclusions, usually given at the end
- Originally – ready by humans to keep up with - A brief reinstatement within a document (usually
advance in the literature of a field at the end) or its salient findings and conclusion
- Now – provide a representation of the document intended to complete the orientation of a reader
that can be searched by computer who has studied the preceding text.
- It can be:
a. Informative 4. ABRIDGEMENT
b. Indicative - A shortened version or edition of a written work
c. Critical that preserves the overall meaning and manner
d. Slanted (written from a particular point of of presentation of the original but omits the less
view) important passages of the text and usually any
illustrations, notes and appendices
USE OF ABSTRACT - Often prepared by a person other than the
A well prepared abstract enables the reader to: original author or editor, an abridged edition is
a. Quickly identify the basic content of the generally intended for readers unlikely to
document purchase the unabridged version because of its
b. Determine its relevance to their interests, and length, complexity or price
c. Decide whether it is worth their time to read the
entire document 5. SYNOPSIS
- A concise written description of the plot of a long
VARIOUS KINDS OF SURROGATES narrative work (novel, play, opera, epic poem,
etc.) giving a quick orderly, overview of the
DOCUMENT SURROGATES whole, usually prepared by a person other than
- Substitute of the original the author.
- Abstracts is known as surrogates of the original
documents 6. TERSE LITERATURE
- It is a condensation of the original
1. ANNOTATION - This is done by using statements which are
- Brief explanation of a document or its contents, highly abbreviated to encapsulate the major
usually added as a note to clarify title points.
- A brief note, usually no longer than two or three
sentences, added after a citation in a INDEXING
bibliography to describe the content or message - It is the process that produces entries in an index
of the work cited or to comment on it. - The procedure includes a number of basic
activities:
2. EXTRACT 1. Document Analysis
- One or more length quotations from a book or 2. Subject Analysis
other work set within the main text of another 3. Evaluation and representation of subjects
work, usually indented and sometimes printed in 4. Description
distinguishing type 5. Review
- When printed in the same type size as the text - It is similar to classifying and subject cataloging
and without indention, an extract is enclosed in - It describes activity involving the assignment of
quotation marks and preceded and followed by a subject identifiers for purpose of information
blank line. retrieval
- In a more general sense, any piece taken from - Refers to the activity of assigning terms from an
one work and used in another, sometimes to indexing language to documents or parts of
a. Title b. Sentences
b. Table of contents - Derived indexing may use whole sentences, but
c. Introduction or preface rarely done – used in some web documents and
d. Author’s purpose or foreword for derived abstracts
e. Abstract of Summary or Conclusion - Abstracts, summaries or annotations*
f. Index
g. Illustrations, diagrams c. Taxonomic categories
h. Containers - Such as the type used in the Yahoo directory
CHAPTER III
1. TERM SELECTION
BUILDING THESAURUS - Select sources for the collection of terms
LS22: Indexing and Abstracting II - Assign codes to each source
- Selection of terms
THESAURUS - Use worksheets for the entries
The word "thesaurus" is derived from 16th century - Enter terms into database with all information
New Latin, literally "treasure store", generally
meaning a collection of things which are of big SOURCE IN COLLECTING TERMS
importance or value (synonymous with Your documents and databases
treasurer). Departmental terminology
It is a controlled vocabulary arranged in a known Text books and their indexes (indices)
order and structured so that equivalence, Book tables of contents and indexes
homographic, hierarchical, and associative Journal quarterly indexes
relationships among terms are displayed clearly Encyclopedias
and identified by standardized relationship Dictionaries, glossaries on the topic
indicators that are employed reciprocally (NISO). Web resources
Users and experts
PRIMARY PURPOSES OF THESAURUS Search logs
1. To facilitate retrieval of documents
2. To achieve consistency in the indexing of written SELECTION OF SOURCES
or recorded documents and other items PREARRANGED SOURCES require less
3. To establish concept relationship effort in gathering the material, and may
already indicate some relationships between
WHY DEVELOP A THESAURUS? terms and concepts and relationships
- To provide vocabulary or terminological control among terms
device OPEN-ENDED SOURCES can reflect
When there are several possible terms current terminology and may provide more
designating a single concept, the thesaurus complete coverage
should lead the indexer or searcher to the Choose a set of sources that are current, as
appropriate concept. complete as possible, and considered
- To provide a conceptual structure of information authoritative
To adequately describe the topical content Each selected source is assigned an ID for
of information resources at an appropriate tracking its use in the development of the
level of generality or specificity thesaurus
To enhanced search capabilities and to Useful when making decisions about
improve the effectiveness of searching (i.e., which terms to prefer
to retrieve most of the relevant material) Useful for backtracking when questions
arise (where did this come from?)
PRELIMINARY CONSIDERATIONS
- Survey existing thesaurus resources for your SELECTION OF TERMS
domain - Terms can be transferred directly from
Continue using an existing thesaurus? prearranged sources to the recording
Modification or update of existing medium (cards, worksheets or database)
thesaurus? Have to decide which terms and
Develop a new well-structured thesaurus? references to include, or to take the
- What is the scope and complexity of the subject whole source
field? - In open-ended sources you read through the
source and pick out terms (i.e. words and
STAGES IN THE DEVELOPMENT OF A THESAURUS phrases) that might be useful in retrieval or
1. Term selection as references to other terms
2. Merging and development of concept classes - Use keyword and phrase extraction
3. Definition of broad subject fields and subfields software to create lists of terms and select
4. Development of classificatory structure from those
5. Review, testing, application, revision - Transfer selected terms to the recording
medium (cards or database)
UNIVERSITY OF SANTO TOMAS
College of Education
Bachelor Of Library & Information Science
8|B U I L D N G T H E S A U R U S
LS22: INDEXING AND ABSTRACTING II
ORGANIZE TERMS
- Sort terms into several major categories
- Logical groups of similar concepts as Preferred
Terms
- Identify core areas and peripheral topics
- Consider moving proper names to authority files
LABELLING A CONCEPT
1. GENERIC (GENUS – SPECIES)
- Most-used labels are ranges from general to
RELATIONSHIP
specific
SEMANTIC RELATIONSHIPS
2. WHOLE-PART RELATIONSHIP
- Also known as meronymy or
partonomy
- Four types allowed in thesaurus
standards
a. Body systems and organs
• Ear Middle ear
A. HIERARCHAL RELATIONSHIPS
- Broader Term represents the main category b. Geographical locations
- Narrower Term represents the specific • Philippines Manila
- Three types of Hierarchical relationships:
1. Generic relationship (BTG/NTG) c. Fields of study
2. Whole-part relationship (BTP/NTP) • Geology Physical geology
3. Instance relationship (BTI/NTI)
d. Hierarchical
organizational/corporate/social/pol
itical structures
FACET CLASSIFICATION
POLYHIERARCHAL RELATIONSHIP
Term can logically fit under more
than one Broader Term – can
have Multiple Broader Terms
(MBT)
6. FINAL STAGES
- REVIEW
Discuss classified index with users and
experts THESAURUS EXAMPLES
o Select descriptors and checklist The ERIC Thesaurus of Descriptors
descriptors The Medical Subject Headings (MESH) of the
Assign notational symbols National Library of Medicine
Produce main thesaurus and indexes
The Art and Architecture Thesaurus
Check cross references and insert where
Thesaurus on Philippine environment
needed
Test by indexing Thesaurus on Philippine literature
Trade Publications
UNIVERSITY OF SANTO TOMAS
College of Education
Bachelor Of Library & Information Science
13 | B U I L D N G T H E S A U R U S
LS22: INDEXING AND ABSTRACTING II
then state it in their own words, or in a. Author of the article in an inverted form
words borrowed from the author b. Title of the article
c. Periodical title and underscored
d. Volume number and/or issue number
e. Pages where the article appears
4. CONVERSION TO THE INDEXING LANGUAGE f. Date of the publication preferably – month, day
- Converting the concept list into a list of and year
acceptable terms. If a controlled vocabulary is
used, this implies using an authority list to do the Procedure 3
conversion. - Mark the article indexed with any colored pencil
- The terms in the concept list are matched against - Indexed periodicals are marked and indexed on
the thesaurus or other authority lists with the goal the upper right hand corner
of choosing final descriptors within the accepted - Slips are set aside for typing
framework of the prescribed indexing language - Issues of periodicals indexed are recorded
5. REVIEW
- Reviewing the indexing accomplishments with
the following questions in mind:
o Do the major terms represent the chief
emphasis of the document?
o Are the major terms reflected in the
abstract so that there is no question
about their relevance?
o Do the descriptors reflect all of the
important concepts in the document?
PRINCIPLES OF INDEXING
a. Exhaustivity
- Concerned with the number of index terms to use
o Selective indexing – the use of fewer
words to cover only the central subject
matter of a document
o Exhaustive indexing – the use of
enough terms to cover the subject
matter of a document more completely
b. Specificity
- Concerned only with the assignment of the most
specific term that entirely covers a topic
c. Consistency
- Refers to the extent to which agreement exists
on the terms to be used to index some document
Procedure 1
1. Use a 3 x 5 slip or card for easy handling and
location
2. Read through the articles before assigning
subject heading(s)
3. Provide cross references for subject heading(s)
that cannot be used
Procedure 2
On the slip, the following data are written with the proper
indentions:
CHAPTER V
CITATION INDEXING
LS22: Indexing and Abstracting II CITATION INDEX
HISTORY
D. WEB OF KNOWLEDGE
- ISI Web of Knowledge®, a dynamic, integrated,
Web-based environment
- ISI Web of Science® provides access to
Science Citation Index (over 3,200 journals )
Social Sciences Citation Index (1400
journals)
- Arts & Humanities Citation Index
- Updated weekly
- Journals from 1986 is available
- Previous years of each index are available in
PRINT at the Libraries.
E. WEB OF SCIENCE
- Search current and retrospective multidisciplinary
information from nearly 8,500 research journals
in the world.
- Users can navigate forward, backward, and
through the literature, searching all disciplines
and time spans to uncover lot of information
relevant to their research.
CHAPTER VI
UNIVERSITY OF SANTO TOMAS
College of Education
Bachelor Of Library & Information Science
18 | B U I L D N G T H E S A U R U S
LS22: INDEXING AND ABSTRACTING II
CHAPTER VII
Indexing
- It refers to the activity of assigning index terms derived from the indexing languages or extracted from the documents
for the purpose of information retrieval.
Retrieval
- It is identifying content of interest or getting the actual document.
MULTIMEDIA INDEXING
- It is an activity to analyse, annotate, and query vast amounts of multimedia data
1. IMAGES
- Shapes
- Textures
- Color of pixels
- Line segments and edges
- Images described by text
o Figure captions
o Keywords
o Associated paragraphs
2. SOUND
- Rhythm
- Music
- Pitch
- Pattern
3. VIDEO
- Motion
- Segments
- Sequence
- Scene
SOUND INDEXING
- It is the process of assigning and retrieving audio information from the available resources.
- Text based searching for audio information is most common.
- It is a process of analysing audio to extract its features
a. Loudness
b. Pitch
c. Bandwidth
d. Harmony
b. Music – Centered
- Instrumental, vocal
2. Scripted stories
a. Books on tape
b. Poetry reading
c. Theatre
3. Spontaneous Storytelling
a. Oral history
UNIVERSITY OF SANTO TOMAS
College of Education
Bachelor Of Library & Information Science
23 | B U I L D N G T H E S A U R U S
LS22: INDEXING AND ABSTRACTING II
b. Folklore
4. Incidental recording
a. Speeches
b. Oral arguments
c. Meetings
d. Phone calls
DESCRIPTION STRATEGIES
1. Indexing
- Manually assign descriptors to points in a recording
2. Transcription
- Manual transcription (with optional post-editing)
3. Associated Materials
- Interviewer’s notes, speech scripts, producer’s logs
4. Automatic
- Create access points with automatic speech processing
VIDEO INDEXING
- Segments, scenes and basic frames
- Transitions
- Motion
o Motion of objects
o Camera
- Compression Standards
o MP4
o WMA
o AVI
o MOV
MUSIC INDEXING
1. Music indexing is chiefly bibliographic
2. How to index music?
o Pitch, temporal, harmonic, timbral
o Textual and bibliographic
o Visual (musical scores, manuscripts)
o Aural (digital music)
SNAPSHOTS!
- The most important aspects of sound are those qualities which convey emotions
o The Anstending Institute. (1988). The Importance of sound in our lives
CHAPTER VIII
COMPUTER-ASSISTED INDEXING
If you're using a computer, you're indexing with computer assistance.
Computer-assisted indexing includes such indexing techniques as Dedicated, Embedded, and Web indexing
Index generation software also falls into this category.
COMPUTER-GENERATED INDEXING
Indexing that do not involve the human decision making process.
The software creates the index based on certain algorithms designed by programmers.
The software inspects the text, make certain determinations regarding the importance of text elements (based on
placement within sentences, format, the number of times the term is repeated), and writes the index from scratch.
INDEXING SOFTWARE
It refers to write or create an index using a computer program.
Text editor could be considered indexing software, since indexes can be typed from scratch.
Two specific types:
1. Indexing-dedicated software
2. Indexing-enabled software.
INDEXING-DEDICATED SOFTWARE
Software with a purpose of writing an index or creating an index database.
It is designed with a data-entry interface for typing keywords and locators.
The software manipulate the input data and generates a final product: a text index (which can be converted into other
formats) with a desired layout.
These programs are stand-alone tools that assist indexers with the indexing process.
Different brands of indexing-dedicated software packages have different features or options, and can run on different
platforms.
DEDICATED INDEXING
It is a process by reading the material and retyping the entries, subheads, and locators into a specially designed
indexing module or software.
With a stand-alone index, you create the index as a separate text file using Indexing dedicated software.
Indexing-dedicated software
o used in the book publishing industry
o packages provide many functions that facilitate the indexing process.
Notes:
If you are modifying 30% or more of a document,
it’s more economical to create a fresh index than to revise an existing one.
Revising and restructuring an old index can be difficult and time consuming.
When you anticipate making substantial revisions or additions to a document on the next release, creating a stand-
alone index for each new release will minimize costs both in the short– and long– term.
INDEXING-ENABLED SOFTWARE
Indexing-enabled software is software that allows the user to generate indexes.
This generation process is called embedded indexing.
Indexing-enabled software is often better known for its more general purpose:
o page layout software
o book creation software
o word processing software
o editing software.
EMBEDDED INDEXING
It is the process of creating index entries electronically in a document’s files
DESKTOP PUBLISHING OR WORD PROCESSING PACKAGES can be used to create effective embedded indexes
This form of indexing uses software to insert indexing information (known as "tags" or "markers") into the documents
that are being indexed.
Indexing a web document is also embedded indexing. This is because index information is still embedded in the
document.
Web indexing is a subset of embedded indexing.
EMBEDDED INDEX
Each location has a collection of headings to which the location is relevant.
These collections of headings are not usually visible in the document.
One specific example of paperless indexing is embedded indexing.
sites, rather than the multiple pages of a single o Servers establish a huge inverted
site. indexing and vector indexing database
o Also known as Automatic indexing.
AUTOMATIC INDEXING
Selection of key words from a document by
computer for use as index entries. INVERTED INDEX
The procedure for determining the orientation Consists of an ordered list of indexing terms,
and position of a work piece manipulates by a each indexing term is associated with some
robot. document identification numbers.
It is faster and cheaper. Retrieval is done by first searching in the ordered
list to find the indexing term, then using the
METHODS FOR AUTOMATIC INDEXING document identification numbers to locate
Indexing by semantic components documents.
o Marks up segments of text in a
document with labels
o Index terms are extracted on the basis
of the identified components
4. PROXIMITY SEARCHING
It looks for documents where two or more
separately matching term occurrences are
within a specified distance, where distance
2. PHRASE SEARCHING is the number of intermediate words or
It narrows your search down by searching characters.
for an exact phrase or sentence. For example
It is particularly useful when searching for a o Term A NEAR Term B
title or a quotation. o Term A ADJ Term B
Usually quotation marks are used to connect
the words together. 5. FOCUSING / LIMITING A SEARCH
For example: “Bachelor of Library and There are many ways to focus your search
Information Science” and all search tools offer different ways of
doing this. Some of the ways of limiting your
3. TRUNCATION / WILDCARD search are as follows:
These search techniques retrieve o Date
information on similar words by replacing o Language
part of the word with a symbol usually a * o Place
or ?. However, different databases use o Publication type
different symbols, so check what is used. o Age groups
o In truncation the end of the word is o Type of material e.g. you could
replaced. – For example biblio* will just need to find case studies
CHAPTER X
INTRODUCTION
Researchers used to go to the library to manually look for journal articles of their interest are long gone.
Looking through dozens of physical journals or consulting literature overviews in book format became obsolete with the
dawn of the electronic era and the availability of searchable online listings of all that journal information.
Nowadays more than 400 online Abstracting and Indexing (A&I) services provide these search capabilities for the
various disciplines and research areas.
ARRANGEMENT
Printed abstracting and indexing service information accessible by an alphabetical arrangement of subject headings.
Appear the bibliographic references to the documents relating to that subject.
Author and other indexes provide additional access points.
Abstracting services extend the bibliographic information by providing a synopsis or abstract of the cited document.
Printed abstracting services differ in their arrangement that the documents are usually grouped in classified order on
which a sequential numbering system for identification via a subject index.
PURPOSE
The main purpose of abstracting and indexing services is to help researchers overcome the difficulties of tracing
potentially useful articles scattered over periodical and other literature.
Abstracts are of especial benefit as they provide an overview of the article and thus aid researchers in their selection of
what they consider worth reading.
It is a service that is particularly appropriate for systems specialists. An enhanced version became available through
DIALOG and Silver Platter in 2001.
HOW DOES A JOURNAL BENEFIT FROM BEING ABSTRACTED AND INDEXED BY THE A&I SERVICES?
To be represented in the relevant online
A&I services is an essential factor for the success of a journal.
Today all searching is done online, so it is vital that a journal is represented in the relevant online search systems.
Authors rely on finding articles through A&I services and boost their usage when reading them.
When they come across high-quality articles of a certain journal in their search, they consider the journal a good outlet
to publish in themselves.
ACADEMIC ONEFILE
It is the premier source for peer-reviewed, full-text articles from the world's leading journals and reference sources.
With extensive coverage of the physical sciences, technology, medicine, social sciences, the arts, theology, literature
and other subjects.
It is both authoritative and comprehensive.
With millions of articles available in both PDF and HTML full text with no restrictions, researchers are able to find
accurate information quickly.
o Humanities
o Science
o Medicine
o Law
o Social Sciences
COMPENDEX
It is a comprehensive bibliographic database of engineering research, containing over 10 million records taken from
over 5,000 engineering journals, conferences, an technical reports.
o Science
INSPEC
It is the leading English language bibliographic information service providing access to the World's scientific & technical
literature in physics, electrical engineering, electronics, communications, control engineering, computing, information
technology, manufacturing, production and mechanical engineering.
o Science
EconLit
Economics: General, Teaching, History of Thought, Methodology, Mathematical and Quantitative Methods,
Microeconomics, Macroeconomics, Monetary economics International economics, Financial economics, Public
economics, Health, Education, Welfare, Labor, Demographic economics, Law and economics, Industrial Organization,
Business Administration, Business economics, Marketing, Accounting, Economic History, Economic Development,
Technical Change, Growth, Economic Systems, Agricultural economics, Natural Resource economics, Urban
economics, Regional economics, Rural economics, Cultural economics, Economic Anthropology, Economic Sociology
o Social Sciences
EBSCOHOST
SocINDEX
with Full Text is the world’s most comprehensive and highest quality sociology research database. The database
features more than 2.1 million records with subject headings from a 20,000+ term sociological thesaurus designed by
subject experts and expert lexicographers. SocINDEX with Full Text contains full text for more than 860 journals dating
back to 1908. This database also includes full text for more than 830 books and monographs, and full text for over
16,800 conference papers
ENVIRONMENT COMPLETE
offers deep coverage in applicable areas of agriculture, ecosystem ecology, energy, renewable energy sources, natural
resources, marine & freshwater science, geography, pollution & waste management, environmental technology,
environmental law, public policy, social impacts, urban planning, and more. Environment Complete contains more than
1,957,000 records from more than 1,700 domestic and international titles going back to the 1940s (including 1,125
active core titles). The database also contains full text for more than 680 journals and 120 monographs
Full text
ABI/INFORM
Academic Source Premier
Business Source Premier
General Onefile
SocINDEX with Full Text
- Philippines Journals OnLine (PhilJOL) is a service to provide access to Philippine published research, and
increase worldwide knowledge of indigenous scholarship