

Published by Aishwarya Vasudeven

More info:

Categories:Types, Research
Published by: Aishwarya Vasudeven on Jul 26, 2013
Copyright:Attribution Non-commercial







Metadata

1. Metadata – DATA ABOUT DATA.
2. Metadata is stored in a repository or a data dictionary.
3. Used to identify the contents and location of data in any system.
4. When structured hierarchically, metadata is called an ontology or schema.
5. Speeds up the searching process.
6. Helps to bridge semantic gaps.
7. Shares resources across users and various tools.
8. Types -> classified by the following factors -> contents, mutability, logical functions.
9. Types of BI & DW metadata -> DW metadata, OLAP metadata, reporting metadata, data mining metadata.
10. DW metadata types -> back room, front room, source system, data staging, DBMS.
11. Back room metadata -> helps bring the OLTP metadata into the DW; used in ETL.
12. Front room metadata -> used to label screens and to create reports; defines anything that acts on the data present in the DW. Ex: business names and descriptions for DW elements, report definitions and queries.
13. Source system metadata -> metadata acquired during the analysis of the source system to help in extraction.
14. Source system -> repositories, source system data model, data dictionary.
15. Source system data model -> logical data model and physical data model.
16. Data staging metadata -> data acquisition metadata, dimension table management, transformation and aggregation.
17. RDBMS metadata:
    a) Referred to as the CATALOG -> contains system tables, which are themselves tables among the master tables.
    b) Provides information about the structure and contents of the DB.
    c) Relations present in the DB.
    d) Relation or referential attributes.
    e) Attribute representation and storage -> partitioning.
    f) Other information -> indexes.
18. BI metadata describes how data is queried, filtered and analyzed, and how data is displayed in BI tools.
19. BI metadata types:
    a) OLAP metadata
    b) Reporting metadata
    c) Data mining metadata
20. OLAP metadata:
    a) Describes various OLAP elements like facts, measures and dimensions.
    b) Describes structures of various OLAP elements like cubes, levels, hierarchies and drill paths.
    c) Depends on RDBMS metadata and DW metadata.
21. Reporting metadata:
    a) Description of various reporting elements like charts, reports and datasets.
    b) Report attributes like queries, variables, expressions, metadata extensions.
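The RDBMS catalog described in item 17 can be inspected directly. The following is a minimal sketch, assuming SQLite as the database (the notes do not name a specific RDBMS); the table name sales, the index name idx_sales_region and the helper describe_catalog are hypothetical and chosen only for illustration. It reads the structure, attributes and indexes of a table from SQLite's own system table.

```python
import sqlite3


def describe_catalog(conn):
    """Collect column and index metadata for every table from SQLite's catalog."""
    catalog = {}
    # sqlite_master is SQLite's system table listing tables, indexes, etc.
    tables = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    for (table,) in tables:
        # PRAGMA table_info rows are (cid, name, type, notnull, dflt_value, pk).
        columns = [
            (row[1], row[2])
            for row in conn.execute(f'PRAGMA table_info("{table}")')
        ]
        indexes = [
            row[0]
            for row in conn.execute(
                "SELECT name FROM sqlite_master "
                "WHERE type = 'index' AND tbl_name = ? ORDER BY name",
                (table,),
            )
        ]
        catalog[table] = {"columns": columns, "indexes": indexes}
    return catalog


# Hypothetical example schema, held in memory only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (id INTEGER PRIMARY KEY, region TEXT, amount REAL)")
conn.execute("CREATE INDEX idx_sales_region ON sales (region)")

catalog = describe_catalog(conn)
print(catalog)
```

Other database systems expose the same kind of information through their own catalog tables or views, so the queries would differ while the idea stays the same.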

22. Data mining metadata:
    a) Describes datasets, algorithms and queries.
23. Metadata is created by technical staff or business executives, or is automated.
24. Creating metadata: the most efficient way is to employ information professionals to create and maintain metadata.
25. Creation tools:
    a) Templates -> the user enters data into fields that are set up previously.
    b) Markup tools:
       i. XML
       ii. SGML - Standard Generalized Markup Language
       iii. DTD - Document Type Definition
26. Creation tools:
    c) Extraction tools:
       i. Extract metadata from textual resources.
       ii. Extracted metadata is reviewed and edited.
    d) Conversion tools:
       i. Translate one metadata format to another.
27. Crosswalks:
    a) Facilitate interoperability and exchange of data.
    b) A crosswalk is a mapping of elements, semantics and syntax from one metadata scheme to another.
    c) Allow metadata created by one community to be used by another community.
28. Success of crosswalks depends on:
    a) Degree of similarity between the two metadata schemes.
    b) Compatibility of the rules used to fill the elements of each schema.
    c) Granularity of elements in the target schema compared with the source schema.
29. Crosswalks:
    a) Are very important for virtual collections.
    b) Are expected to work as a single search engine.
    c) Are labor intensive to develop and maintain.
    d) Mapping from less granularity to more granularity is very complex.
30. Repositories:
    a) Central area where the metadata definitions are stored and maintained in a controlled manner.
31. Repositories:
    a) Apply to organizations that transmit data using structures like XML, Web services, EDI (Electronic Data Interchange).
    b) Apply to organizations that need consistent definitions across time, like a DW.
    c) Document multiple schemas or element sets, particularly within a specific field of interest. (Ex: US Environmental Data Registry)
32. Standards defined for metadata:
    a) ISO/IEC (International Electrotechnical Commission) 11179: specification and standardization of data elements.
    b) ANSI X3.285: metamodel for management of sharable data.
33. Some vendors:
    a) Data Foundation Metadata Registry
    b) Oracle Enterprise Metadata Manager
    c) SAS Metadata Repository
    d) Info Librarian metadata integration framework
    e) Masai technology
34. Metadata lifecycle management:
    a) Managing metadata from the starting phases of the project through design and planning.
    b) Different groups working in a project should follow the agreed standards and compatible methods of collecting data.
    c) Metadata should adapt if the base resource changes.
    d) Metadata should merge if two resources merge.
    e) Metadata history should be maintained for changes even if the base resource is deleted.
    f) Change logs or histories are useful in rights management and access restrictions.
35. Internal storage:
    a) MD is stored along with the data.
    b) Allows transferring data and metadata together.
    c) Creates high redundancy.
    d) Not suitable if the data is not centralized.
    e) MD can be manipulated easily.
    f) Not efficient for searching.
36. External storage:
    a) Metadata is stored separately.
    b) Creates less redundancy.
    c) Care should be taken to maintain the link between data and MD.
    d) A change in the data is hard to track and reflect in the MD.
    e) More efficient searching.
37. Storage formats:
    a) Human readable formats:
       i. Stored in files like XML.
       ii. Not optimized for storage capacity.
    b) Non-human readable formats:
       i. Stored as binary.
       ii. Speeds up storage and saves space.
38. MD is:
    a) Too expensive and time consuming
    b) Too complicated
    c) Subjective
    d) There is no end to MD
39. MD quality control:
    a) MD creation and defining the quality should be done by the same person; otherwise terminologies will be inconsistent and mandatory elements may be missed or used incorrectly.
    b) MD creators should be trained.
    c) Interoperability should be maintained at least for the MD defined within the same domain.
    d) Good metadata should be:
       i. Appropriate to the base resource.
       ii. Supports interoperability.
       iii. Uses standard terminology.
       iv. Records should support archiving and persistence.
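The quality-control points above (mandatory elements missed, standard terminology) can be checked mechanically. The following is a minimal sketch under stated assumptions: the element names in MANDATORY, the vocabulary in ALLOWED_TYPES and the function check_record are all hypothetical and are not taken from any particular metadata standard.

```python
# Hypothetical set of mandatory elements for a metadata record.
MANDATORY = {"title", "creator", "date"}
# Hypothetical controlled vocabulary for the "type" element.
ALLOWED_TYPES = {"report", "dataset", "chart"}


def check_record(record):
    """Return a list of quality problems found in one metadata record."""
    problems = []
    # A mandatory element counts as missing if absent or empty.
    for element in sorted(MANDATORY):
        if not record.get(element):
            problems.append(f"missing mandatory element: {element}")
    # The type element, when present, must use the agreed terminology.
    record_type = record.get("type")
    if record_type is not None and record_type not in ALLOWED_TYPES:
        problems.append(f"non-standard term for type: {record_type}")
    return problems


good = {"title": "Sales by region", "creator": "BI team", "date": "2013-07-26", "type": "report"}
bad = {"title": "Sales by region", "type": "spreadsheet"}

print(check_record(good))
print(check_record(bad))
```

A check like this only catches structural problems; whether the metadata is appropriate to the base resource still needs a trained reviewer.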

