Professional Documents
Culture Documents
Semantic Web: Reference: Peter Mike, Social Networks and The Semantic Web
Semantic Web: Reference: Peter Mike, Social Networks and The Semantic Web
Introduction
Reference: Peter Mike, Social
Networks and the Semantic Web
Introduction
• Most of the web’s content is designed for humans to
read and not for computer programs to process
meaningfully.
• Computer programs can
- parse the web pages
- perform routine processing
• In general, they have no reliable method to understand
and process the semantics.
• The Semantic Web brings structure to the meaningful
content of the web pages, creating an environment
where software agents roaming from page to page
carry out sophisticated tasks for users.
Introduction (cont’d)
• The Semantic Web is a major research initiative
of the World Wide Web Consortium (W3C) to
create a metadata-rich Web of resources that
can describe themselves not only by how they
should be displayed (HTML) or syntactically (XML),
but also by the meaning of the metadata.
• “The Semantic Web is an extension of the current
web in which information is given well-defined
meaning, better enabling computers and people
to work in cooperation.”
– Tim Berners-Lee, James Hendler, Ora Lassila,
Introduction (cont’d)
• In the traditional web difficulties to find,
present, access, or maintain available
electronic information on the web
• Need for a data representation to enable
software products (agents) to provide
intelligent access to heterogeneous and
distributed information.
The Semantic Web: why?
• Difficulty in searching on the Web
– due to the way in which information is stored on the
Web
• Problem 1: Web documents do not distinguish
between information content and presentation
(“solved” by XML)
• Problem 2: Different web documents may
represent in different ways semantically related
pieces of information
• This leads to hard problems for “intelligent”
information search on the Web
Separating content and presentation
• Problem 1: web documents do not distinguish
between information content and
presentation
– problem due to the HTML language
– problem “solved” by technologies like
• stylesheets (HTML, XML)
• XML
– Stylesheets allow for separating formatting attributes from
the information presented
XML
• XML: eXtensible Mark-up Language
• XML documents are written through a user defined set
of tags
• XML lets everyone to create their own tags.
• These tags can be used by the scripts in sophisticated
ways to perform various tasks, but the script writer has
to know what the page writer uses each tag for.
– XML allows to add arbitrary structure to the documents
but says nothing about what the structures mean.
• It has no built mechanism to convey the meaning of
the user’s new tags to other users.
XML: example
• HTML:
<H1>Seminar on Data Analytics </H1>
<UL>
<LI>Teacher: Max Plank
<LI>Room: 7
<LI>Prerequisites: none
</UL>
• XML:
<course>
<title> Seminar on Data Analytics </title>
<teacher> Max Plank </teacher>
<room> 7 </room>
<prereq> none</prereq>
</course>
Limitations of XML
• XML does not solve all the problems:
– different XML documents may express
information with the same meaning using
different tags
The need for a “Semantic” Web
• Problem 2: Different web documents may
represent in different ways semantically related
pieces of information
– different XML documents do not share the semantics
of information
• Idea: annotate (mark-up) pieces of information to
express the “meaning” of such a piece of
information
- the meaning of such tags is shared
shared semantics
The Semantic Web initiative
• The Semantic Web provides a common
framework that allows data to be shared and
reused across application, enterprise and
community boundaries.
• Published using languages specifically
designed for data:
– Resource Description Framework (RDF)
– Web Ontology Language (OWL)
– Extensible Markup Language (XML)
Example
• An example of a tag that would be used in a
non-semantic web page:
<item> blog </item>
• Encoding similar information in a semantic web
page might look like this:
• OWL Lite is a subset of OWL DL, which in turn is a subset of OWL Full.
The proof/rule layer
Beyond OWL:
• Rule: informal notion
• Rules are used to perform inference over
ontologies
• Rules -> a tool for capturing further
knowledge
(not expressible in OWL ontologies )
The Trust layer
• SW top layer:
– where does the information come from?
– how is this information obtained?
– can I trust this information?
Ontologies: example
Evolution of Semantic Web
The Semantic Web Tower