Professional Documents
Culture Documents
A Functional Approach
Sandra Payette
Digital Library Research Group
Cornell University
payette@cs.cornell.edu
Metadata
CREATOR: Plato
TITLE: The Republic
• Descriptive
• Structural
• Terms and conditions
• Administrative
• Content ratings
• Provenance
• Relationship
Basic Functions We Must Support
• Resource Discovery
• Access and Use
• Preservation and Administration
Resource Discovery:
• Catalogs
– OPAC / MARC Records
• Indexes
– Structured descriptive records (e.g., Dublin Core)
– Abstracts
– Full-text surrogates (e.g, via OCR)
Challenges
• Easy to create
• Applicable across domains
• Applicable for different genre of objects
• Allows interoperability among robots,
indexers, and search clients
Dublin Core Element Set
Source: http://www.purl.org/Metadata/dublin_core/
Dublin Core : 15 Elements
• Title name given to the work by the author • Resource Type category of the
resource
• Author or Creator person(s)
responsible for the intellectual content
• Format Data representation of the
resource
• Subject and Keywords the topic • Resource Identifier Unique
of the work, keywords, or formal Identification string (e.g. URL, URN,
classification schemes ISBN...)
• Description textual description of the • Source object from which this object is
content (abstract, prose describing an image, derived (if applicable)
etc.)
• Publisher • Language language of the intellectual
the organization making the
content of the object
work available in its present form
• Other Contributor person(s) other
• Relation relationship of the object to
than the author who have made significant other objects or collections
contributions to the intellectual content • Coverage spatial locations and temporal
• Date the date the work was made available duration characteristics
• Rights Management a pointer to a
copyright notice, a rights management
statement, or a rights server.
Dublin Core in HTML META Tags
<html>
<head>
<title>Cornell Digital Library Research Group</title>
<META name="DC.subject" content=”digital library research">
<META name="DC.subject" content="networked object description">
<META name="DC.publisher" content=”Cornell University">
<META name="DC.creator" content=”Lagoze, Carl, lagoze@cs.cornell.edu.">
<META name="DC.creator" content=”Payette, Sandra, payette@cs.cornell.edu.">
<META name="DC.title" content=”Cornell Digital Library Research Group">
<META name="DC.date” content="1998-05-15">
<META name="DC.form" scheme="IMT" content="text/html">
<META name="DC.language" scheme="ISO639" content="en">
<META name="DC.identifier" scheme="URL"
content="http://www2.cs.cornell.edu/NCSTRL/CDLRG/cdlrg.htm">
</head>
<IMG SRC="/mydir/mysubdir/mypicture.gif" WIDTH=208 HEIGHT=216>
</html>
Source: http://www.w3.org/TR/REC-html40/
Warwick Framework
Package
Dublin Core
Package
Other Descriptive
Package
Reference to MARC
Package
URI MARC Record
WWW Infrastructure
Evolving in this Direction
DC:Creator
www2.cs.cornell.edu/CDLRG/doc1
DC:Publisher
QCSchema:Rating www.xxx.org/rate
MyRating YourRating
A B
RDF Expressed in XML
<?xml:namespace name=
“http://www.purl.org/Metadata/dublin_core/” as=“DC”>
<?xml:namespace name=
“http://www.w3.org/Schemas/RDF/” as=“RDF”>
<RDF:Serialization>
<RDF:Assertions
href=“http://www2.cs.cornell.edu/CDLRG/doc1”>
<DC:Creator>Sandy Payette</DC:Creator>
<DC:Publisher>Cornell DLRG </DC:Publisher>
</RDF:Assertions>
</RDF:Serialization>
Dublin
Core
Element
Set
RDF: Why is it important?
0:1 Table
Front 1:N
Contents
0:1
1:N
Chapter Page
1:N 1:N
0:1
Index 1:N
Source: http://sunsite.berkeley.edu/Ebind/
Finding Aids in SGML
Source: http://www.loc.gov/rr/ead/eadhome.html
Preservation and Administration
Unique cnri.dlib/april97-payette
Identifier:
Naming Item
Authority Name
URL: http://www.somewebserver.org/somedirectory/somefile
Identifiers: Current Initiatives