Professional Documents
Culture Documents
decision making, business modeling, and operations research activities. OLAP systems are
designed to use both operational and Data Warehouse data.
Geographic information systems (GISs) is a system that creates, manages, analyzes, and maps
all types of data. GIS connects data to a map, integrating location data (where things are) with
all types of descriptive information (what things are like there). This provides a foundation for
mapping and analysis that is used in science and almost every industry. GIS helps users
understand patterns, relationships, and geographic context. The benefits include improved
communication and efficiency as well as better management and decision making.
Spatial OLAP refers to such OLAP operations in a spatial data warehouse, such as drilling into
the detailed spatial locations (e.g., finding detailed fact summaries in a city district), or slicing
of a data cube with a set of constraints (e.g., finding average midnight temperature
distribution in May in the city of Chicago).
World Wide Web Consortium (W3C) is an international organization committed to improving
the web. It is made up of several hundred member organizations from a variety of related IT
industries. W3C sets standards for the World Wide Web (WWW) to facilitate interoperability
and cooperation among all web stakeholders. It was established in 1994 by the creator of the
WWW, Tim Berners-Lee.
Resource Description Framework (RDF) is a standard model for data interchange on the Web.
RDF has features that facilitate data merging even if the underlying schemas differ, and it
specifically supports the evolution of schemas over time without requiring all the data
consumers to be changed.
Web Ontology Language (OWL) is a Semantic Web language designed to represent rich and
complex knowledge about things, groups of things, and relations between things. OWL is a
computational logic-based language such that knowledge expressed in OWL can be exploited by
computer programs, e.g., to verify the consistency of that knowledge or to make implicit
knowledge explicit. OWL documents, known as ontologies, can be published in the World Wide
Web and may refer to or be referred from other OWL ontologies. OWL is part of the W3C’s
Semantic Web technology stack, which includes
ETL (or Extract, Transform, Load) is a process of data integration that encompasses three steps
— extraction, transformation, and loading. In a nutshell, ETL systems take large volumes of raw
data from multiple sources, converts it for analysis, and loads that data into your warehouse
Database Management System (DBMS) is a software package designed to define, manipulate,
retrieve and manage data in a database. A DBMS generally manipulates the data itself, the data
format, field names, record structure and file structure. It also defines rules to validate and
manipulate this data. Database management systems are set up on specific data handling
concepts, as the practice of administrating a database evolves. The earliest databases only
handled individual single pieces of specially formatted data. Today’s more evolved systems can
handle different kinds of less formatted data and tie them together in more elaborate ways.
Microsoft SQL Server is a relational database management system (RDBMS) that supports a
wide variety of transaction processing, business intelligence and analytics applications in
corporate IT environments. Microsoft SQL Server is one of the three market-leading database
technologies, along with Oracle Database and IBM's DB2.
Rectangle: Represents Entity sets.
Ellipses: Attributes
Diamonds: Relationship Set
Lines: They link attributes to Entity Sets and Entity sets to Relationship Set
Double Ellipses: Multivalued Attributes
Dashed Ellipses: Derived Attributes
Double Rectangles: Weak Entity Sets
Double Lines: Total participation of an entity in a relationship set
Entity–Relationship Model (Er Model) describes the structure of a database with the help of a
diagram, which is known as Entity Relationship Diagram (ER Diagram).
An ER model is a design or blueprint of a database that can later be implemented as a database.
The main components of E-R model are: entity set and relationship set.
ER diagram shows the relationship among entity sets. An entity set is a group of similar entities
and these entities can have attributes. In terms of DBMS, an entity is a table or attribute of a
table in database, so by showing relationship among tables and their attributes, ER diagram
shows the complete logical structure of a database.
ER diagram has three main components:
1. Entity
2. Attribute
3. Relationship
ENTITY
An entity is an object or component of data. An entity is represented as rectangle in an
ER diagram.
Weak Entity:
An entity that cannot be uniquely identified by its own attributes and relies on the relationship
with other entity is called weak entity. The weak entity is represented by a double rectangle.
For example – a bank account cannot be uniquely identified without knowing the bank to which
the account belongs, so bank account is a weak entity.
ATTRIBUTE
An attribute describes the property of an entity. An attribute is represented as Oval in
an ER diagram. There are four types of attributes:
1. Key attribute
2. Composite attribute
3. Multivalued attribute
4. Derived attribute
KEY ATTRIBUTE A key attribute can uniquely identify an entity from an entity set. For example,
student roll number can uniquely identify a student from a set of students. Key attribute is
represented by oval same as other attributes however the text of key attribute is underlined.
COMPOSITE ATTRIBUTE An attribute that is a combination of other attributes is known as
composite attribute. For example, In student entity, the student address is a composite
attribute as an address is composed of other attributes such as pin code, state, country.
MULTIVALUED ATTRIBUTE An attribute that can hold multiple values is known as multivalued
attribute. It is represented with double ovals in an ER Diagram. For example – A person can
have more than one phone numbers so the phone number attribute is multivalued.
DERIVED ATTRIBUTE A derived attribute is one whose value is dynamic and derived from
another attribute. It is represented by dashed oval in an ER Diagram. For example – Person age
is a derived attribute as it changes over time and can be derived from another attribute (Date of
birth).
RELATIONSHIP
A relationship is represented by diamond shape in ER diagram, it shows the relationship
among entities. There are four types of relationships:
1. One to One
2. One to Many
3. Many to One
4. Many to Many
1. POPULATION INCLUSION meaning that every instance of the subtype is also an instance
of the supertype. In our example, this means that every temporary employee is also an
employee of the Northwind company.
2. INHERITANCE meaning that all characteristics of the supertype (e.g., attriibutes and
roles) are inherited by the subtype. Thus, in our example, temporary employees also
have, for instance, a name and a title.
Physical database design is to specify how database records are stored, accessed, and related in
order to ensure adequate performance of a database application. Physical database design is
related to query processing, physical data organization, indexing, transaction processing, and
concurrency management, among other characteristics.
Transaction throughput is the number of transactions that can be processed in a given time
interval. In some systems, such as electronic payment systems, a high transaction throughput is
critical.
Response time is the elapsed time for the completion of a single transaction. Minimizing
response time is essential from the user’s point of view.
Disk storage is the amount of disk space required to store the database files. However, a
compromise usually has to be made among these factors. From a general perspective, this
compromise implies the following factors:
File organization is the physical arrangement of data in a file into records and blocks on
secondary storage. There are three main types of file organization. In a heap (or unordered) file
organization, records are placed in the file in the order in which they are inserted. This makes
insertion very efficient.
Sequential (or ordered) files have their records sorted on the values of one or more fields,
called ordering fields. Ordered files allow fast retrieving of records, provided that the search
condition is based on the sorting attribute. However, inserting and deleting records in a
sequential file are problematic, since the order must be maintained.
Online transaction processing (OLTP) systems OR operational databases This paradigm is
focused on queries, in particular, analytical queries. OLAP oriented databases should support a
heavy query load.
Alternate key: all candidate keys not chosen as the primary keycandidate key: a simple or composite key
that is unique (no two rows in a table may have the same value) and minimal (every column is
necessary)
Characteristic entities: entities that provide more information about another table
Dependent entities: these entities depend on other tables for their meaning
Derived attributes: attributes that contain values calculated from other attributes
Entity: a thing or object in the real world with an independent existence that can be differentiated from
other objects
Entity relationship (ER) data model: also called an ER schema, are represented by ER diagrams. These
are well suited to data modelling for use with databases.
Foreign key (FK): an attribute in a table that references the primary key in another table OR it can be
null
Independent entity: as the building blocks of a database, these entities are what other tables are based
on
Key: an attribute or group of attributes whose values can be used to uniquely identify an individual
entity in an entity set
null: a special symbol, independent of data type, which means either unknown or inapplicable; it does
not mean zero or blank
Relationship strength: based on how the primary key of a related entity is defined
Ternary relationship: a relationship type that involves many to many relationships between three
tables.
Unary relationship: one in which a relationship exists between occurrences of the same entity set.