You are on page 1of 6

ISSN 2229-5984

The Concept of Semantic Web in Library Services


Saikat Goswami*, Payel Biswas**

*Assistant Librarian ABSTRACT


Eastern Institute for Integrated This article explores the applicability primary library functions (collection
Learning in Management,
Kolkata development, cataloging, reference, and circulation) to the Semantic Web.
The article defines the Semantic Web, identifies similarities between the
library institution and the Semantic Web and presents research questions
guiding the inquiry. The article addresses each library function and
** Semi Professional Assistant, demonstrates the applicability of each function's polices to Semantic Web
CIET, NCERT,
development. Results indicate that library functions are applicable to
New Delhi
Semantic Web with "collection development" translating to "Semantic
Web selection;" "cataloging" translating to "Semantic Web 'semantic'
representation;" "reference" translating to "Semantic Web service," and
circulation translating to "Semantic Web resource use." The last part of this
article includes a discussion about the lack of embrace between the library
and the Semantic Web community's recommendations for reducing this
gap and conclusions.

Correspondence Author: KEY WORDS: Semantic Web, Library Functions, Collection Development,
Saikat Goswami Cataloging, Reference, Library Outreach, Circulation, Planning and
goswami0408saikat@gmail.com Policies.

Received : 20.06.2011; Revised : 27.07.2011; Accepted : 18.08.2011

INTRODUCTION readable information and automated services that extend far


The World Wide Web contains huge amount of information beyond current capabilities. [12]
which can be easily accessed by specifying URI (Uniform SEMANTIC WEB CONCEPTS
Resource Identifier) addresses, search engines and following The word semantic' stands for the meaning of 'or, word web
links to find other related resources. This simplicity of usage defines it as of or relating to meaning or the study of meaning.
has made the web so popular but such simplicity comes with a In the term semantic web also indicates that the meaning of
price. The enormous amount of data has made it increasingly data on the web can be discovered not just by people also by
difficult to find, access, present and maintain the information computers. According to Tim Berners-Lee (1998) [2] the word
required by a wide variety of users. It is very easy to get lost or semantic means 'machine-possible'. Tim Berners-Lee et al.
discover irrelevant and unrelated information because (2001) describe the semantic web as: “an extension of the
information content is presented in natural language. Search current web in which information is given well-defined
engines cannot promise precision since the indexing is based meaning, better enabling computers and people to work in
on quantity with no clue to the context in which the terms cooperation.” The key enabler of the semantic web is the need
occur. [13] For this problem, a support is essential for bringing of many communities to put machine - understandable data on
the web to its full potential. Tim Berners-Lee, the inventor of the web which can be shared and processed by automated tools
the World Wide Web, put forward the concept of meaningful as well as by people. Tim Berners-Lee [2] has clearly stated the
Web or semantic web. He referred to the future of the current main goal of the semantic web in his statement, if html and the
web as the semantic web an extended web of machine- web made all the online documents look like one huge book,

165 International Journal of Information Dissemination and Technology | July - September 2011 | Vol. - 1 | Issue - 3
RDF; schema and inference language will make all the data in his/her own tags hidden labels such as or that explain web
the world look like one huge database. Semantic web aims to pages of sections of text on a page, but it says nothing about
develop such technologies that make the information more what the structures mean. XML is particularly suitable for
meaningful for the machines to process which in turn makes sending documents across the web. [8], [1]
search and retrieval of information more effective. In the RDF (Resource Description Framework)
semantic web data itself becomes a part of the web and is able
RDF is a framework for describing resources on the web.
to be processed independently of application, platform, or
It describes a resource the resources properties and the values
domain. The semantic web on the other hand is about having
of those properties, often referred to as triples that consist of a
data as well as documents on the web so that machines can
subject, predicate, and object. It is particularly intended for
process, transform, assemble and even act on the data in more
representing metadata about the resources, such as title,
useful and meaningful ways.[3],[14]
author and modification date of a web page, copyright and
IMPORTANT FEATURES OF SEMANTIC licensing information about a web document (or information
WEB resource published on the web), or the availability schedule
•The semantic web comprises the standards and tools of for some shared resources. RDF presents the syntactic
XML (Extensible Markup Language), XML Schema, structure whereby we can express information about the
RDF (Resource Description Framework), RDF Schema resource in form of XML statements. For example we want to
and OWL (Web Ontology Language). say that the title of the webpage http://www.caluniv.ac.in is
•Two important technologies for developing the semantic University of Calcutta Homepage. This will be expressed in
web are already in place: Extensible Markup Language RDF as: http://www.caluniv.ac.in has title whose value is
(XML) and the Resource Description Framework (RDF). University of Calcutta Homepage.

•XML lets everyone create their own tags hidden labels The RDF statement comprises of three parts:
such as <zip code> or <alma mater> that annotates web •Subject
pages or sections of text on a page. •Predicate
•Meaning is expressed by RDF which encodes it in sets of •Object
triples each triple being rather like the subject verb and
Subject
object of an elementary sentence. These triples can be
written using XML tags. By subject we mean the web resource that has described.
In the above example the subject of the RDF statement is http:
•In RDF, a document makes assertions that particular
//www.caluniv.ac.in. In RDF the subject has to be always
things (people, web pages or whatever) have properties
represented in the form of URI (Uniform Resource Identifier)
(such as "is a sister of," "is the author of") with certain
which is the identifier by which the web resource can be
values (another person, another web page). This structure
identified in the web. URIs is of two types: URL (Uniform
turns out to be a natural way to describe the vast majority
Resource Locator) and URN (Uniform Resource Number).
of the data processed by machines.
URLs can locate and retrieve the resource on the web, whereas
•Subject and object are each identified by a Universal URN works to identify the web resources which may not be
Resource Identifier (URI), just as used in a link on a web retrievable. For example: the ISBN of the book is 0-223-
page. (URLs, Uniform Resource Locators are the most 59871-7. This will be represented in URN as urn: ISBN: 0-
common type of URI.) The verbs are also identified by 223-59871-7. Hence the subject of the RDF statement is
URIs which enables anyone to define a new concept, a always presented as an URI. [22], [25]
new verb, just by defining a URI for it somewhere on the
Predicate
web. [24], [1]
This denotes the specific property of the resource. For
XML (Extensible Markup Language)
example, title is the property of the webpage. Predicate can be
The basic building block of semantic technology is the any other property of the resource such as creator, language,
use of Extensible Markup Language (XML) which permits the date of creation, etc.
use of tags to describe objects [24]. It is a markup language much
Object
like html and was designed to carry data not to display data.
XML tags are not predefined so one can define or create This is the actual value of the property. For example, in the

International Journal of Information Dissemination and Technology | July - September 2011 | Vol. - 1 | Issue - 3 166
above case University of Calcutta Homepage is the value of computer scientists by facilitating the sharing and reuse of
the predicate (property) of the resource. [12], [1] digital information. [4], [23]
Ontology and Taxonomies in the Semantic Web: How LIS
and Computer Science need to join forces to make this
Subject Predicate Object happen
Figure 1 : An Example of an RDF Directed Graph LIS and cataloguing professionals are not only familiar
with these concepts, as they often form the core of their work
ONTOLOGY WEB LANGUAGES (OWL) and part of the educational curricula. The traditional skills of
librarianship - thesaurus construction, metadata design and
The term ontology has its origin in philosophy, and has
been applied in many different ways. Ontology is a formal information organization - are deeply important in the creation
representation of knowledge as a set of concepts within a of this next stage of web development. Ontology Web
domain and the relationships between those concepts. It is Languages (OWL) facilitates greater machine interoperability
used to reason about the entities within that domain and may be of web content than that supported by XML, RDF and RDFS
used to describe the domain ontology are the structural by providing additional vocabulary along with a formal
frameworks for organizing information and are used in semantics. It can be used to explicitly represent the meaning of
artificial intelligence, the semantic web, systems engineering, terms in vocabularies and the relationships between those
software engineering, library science, enterprise book terms. OWL adds more vocabulary for describing properties
marking and information architecture as a form of knowledge and classes: among others, relations between classes (e.g.,
representation about the world or some part of it. The creation disjointness), cardinality (e.g., exactly one), equality, richer
of domain anthologies is also basic to the definition and use of typing of properties and characteristics of properties (e.g.
an enterprise architecture framework. [17], [1] symmetry), and enumerated classes. Thus semantic web
ontology consists of taxonomy and a set of inference rules
Libraries and Information Science (LIS) and Computer
from which machines can make logical conclusions. [16], [23]
Science (CS) Join Forces How Librarians and Computer
Scientists Complement and Contrast: SIMILARITIES BETWEEN THE
LIBRARY AND THE SEMANTIC WEB
Some of the librarians work with knowledge that computer
scientists are looking for when trying to envision the semantic The “Internet [Web] has been described as a library with all the
web the two disciplines does have unique differences. books tossed on the floor” [21] or “the Web is like a virtual
library” the latter statement marshals little support when
•While computer science is concerned with how software
considering the full scope and anarchy of the web. The
and associated machines interact with ontology, LIS is
semantic web part of the larger web is however quite similar to
more concerned with how their users retrieve information
the library for the following reasons:
and as a way to facilitate certain types of information-
seeking behavior with the aid of taxonomies. •The library and the semantic web have mission statements
grounded in service, information access, and knowledge
•As computer science professionals perceive hierarchies
discovery.
as logical structures that help machines make decisions,
while LIS professionals view these information •The library and the semantic web have advanced as result
structures in terms of mapping out a topic for the benefit of international and national standards.
of patrons. Missions grounded in service, information access and
•Computer Science is concerned with how software and knowledge discovery
associated machines interact with ontology; in contrast, The library's definitive goal is to support knowledge discovery
librarians are concerned with how patrons retrieve for advancement of citizens and society. The semantic web's
information with the aid of taxonomies. Despite these homepage provides a succinct definition of the semantic web
apparent differences, they are essentially different sides that is characteristic of a mission statement. The semantic web
of the same coin. [17], [4] provides a common framework that allows data to be shared
Nonetheless, there is collaboration to be made between LIS and reused across application, enterprise, and community
and Computer science, particularly when mapping concepts, boundaries. [19] The statement highlights such components as a
skills and jargon between computer scientists and librarians common framework, shared data (information) and
encourages collaboration. Computer Science and LIS are collaboration and it parallels the library's standardization and
working to solve problems of information retrieval and the sharing of bibliographic data, resource circulation and
exchange of knowledge between user groups. Whether they collaborative activities. The semantic web's overriding goal to
are doing it together or doing it separately, ontology and imbue computer and human agents with intelligence which is
taxonomies will be extremely important to a number of very similar to the library's goal of advancing knowledge.

167 International Journal of Information Dissemination and Technology | July - September 2011 | Vol. - 1 | Issue - 3
Advancement via International and National Standards Based on knowledge about the library community's
The library community's response to the increased amount of experience developing library collection development
information has also led to development of cataloging codes policies, it is likely that semantic web selection policy
formalized classificatory and verbal systems; and encoding development will require time and patience, particularly given
communication standards (International Bibliographic the absence of examples specific to the semantic web. The
Description (ISBD) and Machine Readable Cataloging wide availability of library collection development guidelines
(MARC). The web and digital library growth has also and resources, such as Guidelines for Writing Collection
motivated rethinking and revision of cataloging standards, Development Policies provide a useful framework for
models, and codes, as evidenced by the development of the developing Semantic Web selection polices. [8]
many metadata schemes, Functional Requirements for Library cataloging
Bibliographic Records (1998) [11], and Resource Description The purpose of cataloging is to make library collection
and Access (RDA). The Semantic web has followed a similar materials findable and discoverable so they can be used.
path as evidenced by a collection of information standards: Charles A. Cutter's (1904) [7] objectives for a library catalog,
extensible Markup Language (XML), RDF, OWL, Friend of A printed in the 4th edition of his Rules for a Dictionary Catalog,
Friend (FOAF) and Simple Knowledge Organizations System are among the most influential statements impacting
(SKOS). [18] cataloging. Cutter's objectives state that a library should:
APPLICABILITY OF LIBRARY FUNCTIONS 1. Enable a person to find a book when the author, title, or
TO THE SEMANTIC WEB subject is known;
This section discusses the goals and objectives of the four 2. Show what the library has by author, subject, and
primary functions underlying the modern library. The literature genre; and
discussion also explores the applicability of each function to
Written a century before the development of the web, Cutter's
the semantic web based on the above analysis of library and
objectives are still applicable to library operations today, and
semantic web similarities.
thus influence current cataloging activities. [15]
Collection development in the library
Jumping a century beyond Cutter to today, digital resource
The collection development is it to build and maintain a cataloging (metadata creation) is being guided by principles
various collection that services a designated constant patron and objectives documented in a variety of metadata schemes
population. The activities of collection development policy [12]
. Under development is the Rules for Description and Access
that viewed as a contract between the library and users. (RDA), which includes a draft statement of objectives (RDA,
Collection development policies document the library's intent 2005). For example, descriptive data (metadata) created using
to grow the collection, identify collection strengths and RDA should enable a user to “identify the resource described”
limitations, and guide library staff, particularly and select appropriate resources “with respect to content,
bibliographers, in their collection development work. format, etc.” Additional objectives address access points,
Guidelines also include selection criteria about preferred representation of entities identified in Functional
subjects and formats. Collection development policies are not Requirements for Bibliographic Records (FRBR) (1998) [9],
permanent, rather they need to be reviewed and revised, as and cataloging quality criteria (e.g. data flexibility,
user populations' change and present new demands. Finally, sufficiency, and accuracy). [20], [11]
collection development can help libraries with administrative
Semantic Web “semantic” representation
activities by including procedures for acquisitions, gifts,
weeding, replacing lost items and collection evaluation. [12], [13] Similarities between library cataloging and producing
metadata for the semantic web, both are deal with
Semantic Web selection
representation. In fact, the boundary between the employ of
The semantic web initiative, as a whole, does not identify representation standards in these two environments (libraries
a specific type of user, although semantic web selection and the semantic web) is artificial. Rather the representation
policies will require review and revision for the following key activity takes place along a continuum, with simple
reasons: bibliographic representation for search and retrieval on one
•the development of new and related projects—some of end, and the implementation of formal ontology and machine
which may be competitors; supported deductive reasoning on the other. [17] Similar to the
•the identification of new user agents (computer and library's community extensive MARC documentation the
human); and semantic web provides comprehensive documentation for
working with enabling technologies, such as XML, RDF, and
•the development of new technologies and machine
OWL. However, the semantic web community falls short,
capabilities.
currently, in providing documentation to guide the use of

International Journal of Information Dissemination and Technology | July - September 2011 | Vol. - 1 | Issue - 3 168
metadata standards and ontology. Plans, guidelines and semantic web and work with semantic web enabling
policies are needed stating principles and objectives for technologies and standards in comparison to the way in which
semantic web representation to ensure good quality computer scientists, engineers, and oncologists. Now a day's
“semantics” (e.g., coherent, consistent, accurate semantic real time and instant processes like blogs, email, websites and
representation). A semantic representation policy would help instant massaging are placed to disseminating of information
secure a robust framework for effective semantic web via web in library. [12], [13]
operations. [5], [10] CONCLUSION
Reference and outreach The library and the semantic web are cultures devoted to
The goal of reference is to provide the library community increasing information access and knowledge discovery, it
with effective information services. Reference services makes sense to explore the foundations of the library and
include personal interaction and dissemination of information. consider what primary functions may help advance the
The library has a compulsion “to provide information service semantic web initiative. The library has been society's chief
to support the educational, recreational, personal and information custodian for the last several hundred years; and if
economic endeavors of the members of their respective the semantic web is to evolve into a chief and trusted
communities” (RUSA Access to Information Committee, information network, affording services and performing tasks
2000). [20] for both humans and machines and it need to examine the
An extension of reference service is outreach. Libraries plan applicability of the library's primary functions to the semantic
services that are of value to their users. Whereas conducting web.
outreach to highlight collection resources that help with This paper explored the applicability primary library functions
finding a first professional job. Outreach extends to to the semantic web. The inquiry was a discussion based on
community outreach, generally in public through the offering rudimentary deduction and was supported by an analysis of
of classes and other services (e.g., English as a second various library guidelines and policies. An exploration of
language classes, story time for youngsters, reader advisory similarities between the library institution and the semantic
services, even cooking and art classes). These items extend web served as a base. All four of the primary library functions
beyond reference but deal with overall access and use of the proved applicable to the semantic web. Each library function
library facility and often promote collection use. translates to a semantic web function.
Semantic Web service REFERENCES
The attachment is with “reference service”— the central 1. Berners-Lee, T. (1999). Weaving the Web: the original
pillar of semantic web. The semantic web depends on design and ultimate destiny of the World Wide Web by its
standardized structured metadata and semantic web inventor. Harper: San Francisco.
algorithms capable of reading and manipulating such data but 2. Berners-Lee, T., Hendler, J. & Lassila, O. (2001). The
the overriding goal is to provide service to free humans from semantic web. Scientific American, 284, 34-
routine tasks that computers can perform—and can perform 43.(http://www.sciam.com/ article.cfm?article
effectively. Current semantic web services facilitate ID=00048144-10D2-1C70-84A9809EC588EF21).
knowledge and service discovery and more sophisticated (Accessed March 28, 2011).
forecasted activities include automatic purchasing of an airline
3. Brooks, T. A. (2002). The Semantic Web, Universalist
ticket—even an airline ticket from your preferred. [17]
Ambition and Some Lessons from Librarianship.
A GAP BETWEEN SEMANTIC WEB AND Information Research, 7,78-85.
LIBRARY 4. Burke, M. (2009). The semantic web and the digital
The semantic web and library communities are far from being library. Aslib Proceeding New Information Perspectives,
healthfully integrated. On one side of this gap, the members of 61, 316-322.
the semantic web community are not fully aware of the skills,
5. Cutter, C. A. (1904). Rules for a Dictionary Catalog (4th
talent, and knowledge that librarians (primarily catalogers)
Ed.) Washington, D.C.: Government Printing Office.
have, and which can help advance the semantic web. This is
evident by the absence of a metadata representation working 6. Dutta, B. (2006). Semantic Web technology:Towards
group within the World Wide Web Consortium (W3C), and the meaningful retrieval. SRELS Journal of Information
severely limited participation of professional librarians on Management, 52, 149-154.
various W3C working groups. Granted, the W3C's semantic 7. München, K. G. S. (1998). Functional Requirements for
web activity has focused more on the development of enabling Bibliographic Records: Final Report.
technologies, rather than processes or activities. On the library (http://www.ifla.org/VII/s13/frbr/frbr.htm). (Accessed
side of the gap, librarians have been slow to embrace the on March 28, 2011).

169 International Journal of Information Dissemination and Technology | July - September 2011 | Vol. - 1 | Issue - 3
8. Graves, M., Constabaris, A. & Brickley, D. (2007). 15. RUSA Access to Information Committee. Information
FOAF: Connecting people on the Semantic Web. Services for Information Consumers: Guidelines for
Cataloging & Classification Quarterly, 43, 191–202. Providers. Prepared by the Standards and Guidelines
9. München, K. G. S. (1998) Functional Requirements for Committee, Reference and User Services
Bibliographic Records: Final Report. (http://www. Association200. Approved by the RUSA Board of
ifla.org/VII/s13/frbr/frbr.htm). (Accessed on March 28, Directors. (http://www.ala.org/ala/rusa/rusaprotools
2011). /referenceguide/guidelinesinformation.htm). (Accessed
March 28, 2011).
10. Malmsten, M. (2008, September). Making a library
catalogue part of the Semantic Web. In the Proceedings of 16. Sadeh, T. & Walker, F. (2003). Library portals: toward the
the International Conference on Dublin Core and semantic web. New Library World, 104, 11-19.
Metadata Applications: Metadata for semantic and social 17. Warren, P. & Alsmeyer, D. (2005). Applying semantic
applications, Berlin, pp.45-74. technology to a digital library: a case study, Library
11. McGuinness, D. L. (2003). Ontologies Come of Age. In Management, 26, 196–205.
Dieter Fensel, Jim Hendler, Henry Lieberman, and 18. http://en.wikipedia.org/wiki/Ontology
Wolfgang Wahlster, (Ed.). Spinning the Semantic Web: (Information_Science) (Accessed on March 28, 2011).
Bringing the World Wide Web to its Full Potential. 19. http://www.suite101.com/content/ontologies-and-
Cambridge: MIT Press, pp. 171-194. taxonomies-in-the-semantic-web-a122095 (Accessed
12. McGuinness, D. L. & Harmelen, F.V. (2004) OWL Web March 28, 2011).
Ontology Language Overview, W3C Recommendation. 20. http://resources.metapress.com/pdf-preview.axd?
(http://www.w3.org/TR/2004/REC-owlfeatures- code=h4u1531242678175&size=large (Accessed March
20040210/, retrieved on 20/03/2008). 28, 2011).
13. Prasad, A.R.D. (2007). Semantic Web technologies for 21. The FOAF project. (http://www.foaf-project.org/)
meaningful information mapping and retrieval, IASLIC (Accessed March 28, 2011).
Bulletin, 52, 25-34.
14. Radovanonic, D. (2003). Semantic Web and electronics
information resources. Infotheca Journal, 4, 157 – 163.

International Journal of Information Dissemination and Technology | July - September 2011 | Vol. - 1 | Issue - 3 170

You might also like