An Introduction to XML
Lecturer : Dr. Pavle Mogin
COMP 442
Issues in Databases and
Information Systems
A General Plan for the XML Topic
• First:
– Why XML at all
• Next:
– What is XML and related meta languages - DTD and Schema
– XPath and XQuery XML query languages
– Because without precise knowledge of the XML data model, you
cannot consider storage, retrieval, and update techniques for
XML databases
• Finally:
– How to store, constrain, query, and update XML databases
COMP 442 Issues in Databases and Information Systems 2008 Intro to XML 1
Plan for Intro into XML Databases
Motives for Databases on the Internet
• Internet and corporate intranets offer services like:
– Purchasing books online,
– Online auctions,
– Online submission of bids,
– Distance learning
• These pose new challenges for DBMSs:
1. Large number of concurrent users (scalability),
2. Storing and handling unstructured and semistructured
documents,
3. Ranked keyword search
• The first generation of Internet sites were collections
of HTML files, which proved to be inadequate
• Modern electronic commerce sites rely on database
systems
New Requirements
• On-Line Enterprise Reporting (OLER) systems, also
known as Enterprise Information Portals (EIP), provide
a single point of entry to integrated corporate data for
employees, clients, and partners
• These users place a number of new requirements
on database systems, such as:
– Categorization,
– Personalization,
– Publishing,
– Collaboration, and
– Notification
• For all these new requirements, XML and its related
technologies offer a suitable answer
A Classification of e-Businesses
• Business to Customer:
– online shops,
– online banking,…
• Business to Business:
– online bidding,
– online ordering,…
• Business to Administration:
– online tax payment,
– online assurance,…
• Customer to Administration:
– online libraries,
– online car registration,…
Business to Business
So, Why is XML Better
Open and Free Standard
Easy to Understand
An HTML Expression
a) A rubber gasket?
b) A rock artist?
c) A marine mammal?
d) An official stamp?
e) An elite group of the US Navy command?
• Style languages
– Cascading Style Sheets (CSS),
– Extensible Stylesheet Language (XSL)
• Supplemental Technologies:
– XLinks,
– XPointers,
– Namespaces, and
– Resource Description Framework
• We shall briefly discuss each of them
Style Languages
• XML markup only specifies what is in a document
• Unlike HTML it does not say anything about the
presentation
• Information about an XML document's appearance when
printed or viewed in a Web browser is stored in a
style sheet document
• Different style sheet documents may accompany the
same XML document
• So, you can change the appearance of an XML
document by choosing another style sheet
• The two most widely used style sheet languages are:
– Cascading Style Sheets (CSS), and
– Extensible Stylesheet Language (XSL)
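
The separation of content from presentation described on this slide can be sketched in a few lines of Python. This is not CSS or XSL: the two render functions are hypothetical stand-ins for two style sheets, and the sample book document and its element names are assumptions made for illustration. The point is only that one unchanged XML document can be given two different appearances.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document: the markup says what the data is,
# and says nothing about how it should look.
BOOK_XML = """<book>
  <title>An Introduction to XML</title>
  <author>A. Author</author>
  <price currency="NZD">25.00</price>
</book>"""


def render_plain(book):
    """Stand-in for one style sheet: a one-line plain-text view."""
    return f"{book.findtext('title')} by {book.findtext('author')}"


def render_html(book):
    """Stand-in for a second style sheet: an HTML view of the same data."""
    price = book.find("price")
    return (
        f"<h1>{book.findtext('title')}</h1>"
        f"<p>{price.get('currency')} {price.text}</p>"
    )


book = ET.fromstring(BOOK_XML)
plain_view = render_plain(book)   # same document, first presentation
html_view = render_html(book)     # same document, second presentation
print(plain_view)
print(html_view)
```

Swapping one render function for the other plays the role of choosing another style sheet: the XML document itself is never edited.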
Cascading Style Sheets (CSS)
eXtensible Stylesheet Language (XSL)
Supplemental Technologies
• Supplemental technologies are XML-based languages that
layer on top of basic XML
• These are:
– XLinks, which provide multi-directional hypertext links that
are more powerful than the simple HTML <A> tag
– XPointers, which introduce a new syntax you can attach to
the end of a URL to link to a particular part of a particular
document
– Namespaces, which use prefixes and URIs to disambiguate
conflicting XML markup
– Resource Description Framework (RDF), which is an XML
application used to embed meta-data in XML and HTML
documents
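
As a minimal sketch of how namespaces disambiguate conflicting markup, the Python fragment below parses a document in which two vocabularies both define a title element. The document, the prefixes bk and job, and the example.org URIs are all hypothetical; only the standard library's xml.etree.ElementTree is used.

```python
import xml.etree.ElementTree as ET

# Hypothetical document: two vocabularies both define a <title> element.
DOC = """<catalog xmlns:bk="http://example.org/book"
         xmlns:job="http://example.org/person">
  <bk:title>XML Basics</bk:title>
  <job:title>Lecturer</job:title>
</catalog>"""

# Prefix-to-URI map used for lookups; the URI, not the prefix,
# is what actually identifies each vocabulary.
NS = {
    "bk": "http://example.org/book",
    "job": "http://example.org/person",
}

root = ET.fromstring(DOC)
book_title = root.find("bk:title", NS).text    # title of a book
job_title = root.find("job:title", NS).text    # title of a person's job

# ElementTree stores each element name in expanded form: {URI}localname
expanded_tag = root.find("bk:title", NS).tag

print(book_title, "/", job_title)
print(expanded_tag)
```

Without the prefixes, both elements would simply be named title and a program could not tell them apart.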
Summary