You are on page 1of 3

Integrated Intelligent Research (IIR) International Journal of Web Technology

Volume: 04 Issue: 02 December 2015 Page No.74-76


ISSN: 2278-2389

Semantic Information Retrieval Based on Domain


Ontology
A. Sudha Ramkumar1, B. Poorna2
1
Research Scholar, Bharathiar University , Coimbatore, India,
2
Principal, SSS Jain College,Chennai, India.
Email: sudharam99@gmail.compoornasundar@yahoo.com

Abstract - Most of the current search engines are based on the information retrieval systems, ontology languages and DL
keyword search and it returns a large amount of irrelevant queries. Section 3 describes our framework of semantic
information in response to the user query. This is because of information retrieval system based on ontology. Sections 4
the keyword matching does not consider the semantics of the provides an overview of search algorithm and section 5
keyword term. Ontology has been increasingly used to describes the future work and conclusion.
efficiently capture the semantics of the information which
overcomes the problems associated with traditional keyword
II.BACKGROUND
search. The importance of ontology is identified in the area of
building semantic information retrieval systems for the specific The semantic web aims to extend the current web standards
domain and gains greater significance over the tradition and technology so that the semantics of the web is machine
keyword search.Knowledge representation formalism such as processible[12] . The use of domain ontology in the semantic
description logic is used to obtain the knowledge of the domain information retrieval system enablesusers torepresent a
of interest and is used for the development of intelligent meaningful knowledge of a particular domain of interest which
information retrieval system.Protégé is the widely used tool for makes the machine more intelligent to understand the
developing and editing ontology with GUI which enables the semantics of the information.Domain Ontology describes a list
ontology developers to describe the conceptual terms and of terms to represent important concepts, such as classes of
relationship between them by creating taxonomies. This paper objects and the relationship between them in order to represent
describes the steps for creating operating system ontology and the knowledge of domain[13]. Gruber defines ontology as an
the various aspects like subclassof, superclassof relationships explicit specification of conceptualization. Ontologies play an
and an overview of search algorithm have been described. important role in representing the terms unambiguously which
serve as a background for machine processible.Ontology
Keywords - Semantic Information retrieval; Search algorithm;
languages are formal languages and are used to construct
Ontology.
ontologies. They allow the encoding of knowledge about the
specific domain and often include reasoning rules that support
I. INTRODUCTION the processing of the available knowledge. Ontology languages
are commonly based on either first order logic or on
Traditional keyword search relies on the occurrence of the
description logic. Among the several ontology languages such
keywords in the document and this search returns a large
as Resource Description Framework(RDF), RDF-Schema,Web
amount of irrelevant information.Knowledge management is
ontology language OWL, OWL is the most acceptable and
the process of capturing, structuring and reusing the
recommended ontology language by W3C to construct a
information to develop an understanding of how a particular
domain ontology[5].Sir Jorge Cardoso[2] surveyed on most
system works, and subsequently to share this information
widely used ontology editors and found that protégé tool had a
meaningfully to other information system. The adequate
market share of 68.2% followed by swoop, ontoedit,
management of knowledge is becoming more important for
texteditor, altova semantic works and hence ontologies were
academicians in educational institutions. Ontology is used to
mostly developed in the educational field 31%.Semantic
represent the knowledge of the domain of interest which
information retrieval using ontology in university domain[10]
significantly improves the search results. According to Gruber
developed and implemented a search engine confined to the
ontology is an explicit specification of conceptualizations.
university domain where the query is analysed semantically to
Ontologies are developed with the help of ontology languages
obtain the accurate result. Semantic search using ontology and
such as web ontology language OWL, Resource Description
RDBMS for cricket[9] uses reference to ontology data directly
Framework RDF, and RDF-schema. Ontologies serve to
from SQL using semantic match operators and focuses on
provide conceptualization in the domain of interest and thus
semantic search based on ontology. Developing an university
guarantee to provide semantics of the information. Thus
ontology in education domain using protégé for Semantic
ontologies are an integral part in every knowledge management
web[6]by Sanjay Kumar Maliket al., demonstrated the
initiative. The protégé tool developed by the Stanford
indraprastha university ontology with aspects like super class,
university is widely used tool for constructing domain ontology
subclass, query retrieval process and TGVIZ graph
and to develop semantic information retrieval system.This
visualization.Developing university ontology using protégé owl
paper provides an overview of development of ontology and
tool: Process and Reasoning[7] by Naveen Malviya et al., is
the use of DL query. Finally we discuss how a DL queries are
similar to Sanjay Kumar malik et al., .The Jena based ontology
used for the semantic search. This paper is organized as
model inference and retrieval application[14] by Luo Zhong et
follows. Section 2 provides the overview of semantic
al., proposed computer graphics ontology model and stored the
74
Integrated Intelligent Research (IIR) International Journal of Web Technology
Volume: 04 Issue: 02 December 2015 Page No.74-76
ISSN: 2278-2389
owl documents in MySQL, sets the inference rule and achieve concepts and their relationship are built into domain ontology
search queries on the ontology database. Semantic Search : and stored in the knowledge repository with sufficient
Document ranking and clustering using computer science annotation properties and instances. Finally, this ontology is
ontology and N-grams[1] by Thanyaporn Boonyoung called through the user interface by matching the keyword term
presented a new document ranking process that proposes a new with the concepts present in the class hierarchy. The search
weight of query term in the document based on the computer algorithm is to be used to search the concepts of this domain
science ontology. ontology and to obtain tha knowledge of that concept.

III. FRAMEWORK OF PROPOSED SYSTEM


The operating system ontology has been developed using
protégé 4.2 with the related classes as conc epts. All the
concepts related to the operating system ontology has been
defined in the hierarchy with data properties, object properties
and annotation properties. The object property defines the
relationship between the concepts of operating system
ontology and the data properties used to define the relationship
between the concept individual and the data literal. The
annotation properties such as comment, seealso, isdefinedby
etc are used to annotate the concepts of the domain ontology of
interest. Definition of each class, subclass concept and list of
Fig 3: Visualization of ontology
references to each class and subclass are attached to the node
of class hierarchy. The ontoGraf tab is used to visualize the
ontology concepts like graph. The DL query tab of protégé is a
powerful and easy to use feature for searching a classified
ontology. This query language is based on the OWL syntax
that is fundamentally based on collecting all the information
about a class, property or instances into a single frame.

Fig 4: DL query

IV. OVERVIEW OF SEARCH ALGORITHM


Hildebrand[4] examined 35 search systems and according to
him, the search system consists of 3 main steps. They areQuery
construction, Core search process and presentation and
Fig 1: Framework of Proposed system organization of results.The core search process contains the
search algorithm. All keyword based search involves some
form of syntactic matching of the query against textual content
or metadata. The semantic matching extends the syntactic
matching by exploiting the linked data in the semantic graph.
To index the textual content of the ontology, a search engine
such as Lucene can be used along with a triple store. When
writing search algorithm, the easiest way to optimize code is to
create inverted index which is a structure which maps each
searchable keyword to a list of document that occurs in. Along
with each document, store a corresponding score for that
keyword in that document. To calculate score, there are many
search algorithms[4] available like weighted graph search
algorithm may constrain the possible path structures and path
length, e-culture algorithm assigns weights manually to RDF
relations, SemRank automatically computes weights based on
Fig 2: Class Hierarchy of OS ontology statistics derived from graph structures, and spreading
activation algorithm[11] incorporates weights as well as the
Our proposed system first identifies the concepts and their number of incoming links. The vector space model uses the
relationship for the operating system ontology. The identified spreading activation algorithm by exploiting the relationship
75
Integrated Intelligent Research (IIR) International Journal of Web Technology
Volume: 04 Issue: 02 December 2015 Page No.74-76
ISSN: 2278-2389
between the concepts will give the required appropriate [11] Schumacher, Kinga, Michael Sintek, and Leo Sauermann. Combining fact
and document retrieval with spreading activation for semantic desktop
information to the user in response to their query.The spreading search. Springer Berlin Heidelberg, 2008.
activation model[3] is widely used in the information retrieval [12] Shadbolt, Nigel, Wendy Hall, and Tim Berners-Lee. "The semantic web
which consists of a network structure of concepts of a domain revisited." Intelligent Systems, IEEE 21.3 (2006): 96-101.
and processing techniques applied on this structure to find the [13] Swartout, Bill, et al. "Toward distributed use of large-scale
ontologies." Proc. of the Tenth Workshop on Knowledge Acquisition for
concepts related to the query term. The relations between Knowledge-Based Systems. 1996.
concepts of a domain are usually labelled and weighted. The [14] Zhong, Luo, et al. "The Jena-Based Ontology Model Inference
spreading activation algorithm will create a set of concepts andRetrieval Application." Intelligent Information Management 4.04
after receiving the contents of the query. With this set of (2012):157
concept, it will find the nearest concepts in the network
structure. This process is an iterative one. This spreading
activationalgorithm is like waves activates from one concept to
all other conceptsconncected to the network. When this
spreading activation[8] is used with the query expansion
technique will yield very good search results.

V. CONCLUSION AND FUTURE WORK


The ontology becomes an integral part of every semantic
applications. The use of ontology in the search process will
make the machine understandable about the information and
helps to produce intelligent search results. The use of ontology
in the spreading activation algorithm increases the coverage of
relationship between the concepts. This paper developed a
operating system ontology in protégé along with relationships,
check for consistency of classes, visualizates the ontology and
finally describes the usage of DL query on ontology concepts.
The scope of this developed operating system is very small. Our
future work is to call the ontology concepts as network structure
consists of the nodes and arcs, here the concepts are represented
as nodes and properties are represented as arcs. This network
structure is to be applied to the spreading activation algortithm
to find relevant information based on the result of traditional
keyword search based on the document index.

REFERENCES
[1] Boonyoung, Thanyaporn, and Anirach Mingkhwan. "Semantic Search
Using Computer Science Ontology Based on Edge Counting and N-
Grams." Recent Advances in Information and Communication
Technology. Springer International Publishing, 2014. 283-291.
[2] Cardoso, Jorge. "The semantic web vision: Where are we?." Intelligent
Systems, IEEE 22.5 (2007): 84-88.
[3] Crestani, Fabio. "Application of spreading activation techniques in
information retrieval." Artificial Intelligence Review 11.6 (1997): 453-
482.
[4] Hildebrand, Michiel, Jacco Van Ossenbruggen, and Lynda Hardman. "An
analysis of search-based user interaction on the semantic web." (2007).
[5] Horridge, Matthew, et al. "A Practical Guide To Building OWL
Ontologies Using The Protégé-OWL Plugin and CO-ODE Tools Edition
1.0." University of Manchester (2004).
[6] Malik, Sanjay Kumar, Nupur Prakash, and S. A. M. Rizvi. "Developing an
university ontology in education domain using protégé for semantic
web."International Journal of Science and Technology 2.9 (2010): 4673-
4681.
[7] Malviya, Naveen, Nishchol Mishra, and Santosh Sahu. "Developing
university ontology using protégé owl tool: Process and
reasoning." International Journal of Scientific & Engineering Research 2.9
(2011): 1-8.
[8] Ngo, Vuong M. "Discovering Latent Informaion By Spreading Activation
Algorithm For Document Retrieval." International Journal of Artificial
Intelligence & Applications 5.1 (2014).
[9] Patil, S. M., and D. M. Jadhav. "Semantic Search using Ontology and
RDBMS for Cricket." International Journal of Computer Applications 46
(2012).
[10] Rajasurya, Swathi, et al. "Semantic information retrieval using ontology in
university domain." arXiv preprint arXiv:1207.5745 (2012).

76

You might also like