Professional Documents
Culture Documents
Chapter 6
Q. The Data Hierarchy:
A computer system organizes data in a hierarchy that starts with bits and bytes and
progresses to fields, records, files and databases.
Bit: Represent the smallest unit of data that a computer can handle. Example: 0 or
1
Byte: Groups of bits, called a byte. Example: a single character, which can be a letter,
a number or a symbol.
Field: Group of characters as word(s) or number is called a field. Example:
Employee_ID, Employee_name.
Record: Group of related fields, comprises a record. Example:
Product_ID, Product_Name, Product_description
File: Group of records of same type is called a file. Example: Employee File, Product
File
Database: Group of related files makes up a database. Example: Crouse File,
personal File
For example, Student_ID, Cource, Date, and Grade are attribute of the entity course.
The specific values that these attributes can have been found in the fields of the record
describing the entity course.
Q. Problems with the traditional file environment
Data redundancy and inconsistency: Data redundancy refers to the duplication of
data within a system. This can occur when the same data is stored in multiple
locations, or when multiple copies of the same data are created and stored separately.
Data redundancy wastes storage resources and also leads to data inconsistency, where the
same attribute may have different values.
Lack of Flexibility: A traditional file system can deliver routine scheduled reports after
extensive programming efforts, but it cannot deliver as required reports or respond to
unanticipated information requirements in a timely fashion.
Poor security: Because there is little control or management of data, access to and
dissemination of information may be out of control.
Lack of data sharing and availability: Because pieces of information in different files
and different parts of the organization cannot be related to one another, it is virtually
impossible for information to be shared or accessed in timely manner.
The DBMS acts as an interface between application programs and the physical data files.
The DBMS relieves the programmer or end user from the task of understanding where and
how the data are actually stored by separating the logical and physical views of the data.
Q. Relational DBMS
Represent data as two-dimensional tables called relations or files. Each table contains data
on entity and attributes.
The database has a separate table for the entity SUPPLIER and a table for the entity
PART. Each table consists of a grid of columns and rows of data. Each individual element
of data for each entity is stored as a separate field, and each field represents an attribute for
that entity. Fields in a relational database are also called columns. For the entity
SUPPLIER, the supplier identification number, name, street, cities, state, and zip code are
stored as separate fields within the SUPPLIER table and each field represents an attribute
for the entity SUPPLIER.
The field for Supplier_Number in the SUPPLIER table uniquely identifies each record so
that the record can be retrieved, updated, or sorted. It is called a key field. Each table in a
relational database has one field that is designated as its primary key. This key field is the
unique identifier for all the information in any row of the table and this primary key cannot
be duplicated.
Supplier Number is the primary key for the SUPPLIER table and Part Number is the
primary key for the PART table. Note that Supplier_Number appears in both the
SUPPLIER and PART tables. In the SUPPLIER table, Supplier_Number is the primary
key. When the field Supplier_Number appears in the PART table, it is called a foreign key
and is essentially a lookup field to look up data about the supplier of a specific part.
JOIN: Combines relational tables to provide user with more information than available in
individual tables
PROJECT: Creates subset of columns in table, creating tables with only the information
specified.
Q. Using Databases to Improve Business Performance and Decision
Making
Data warehouse: A data warehouse is a database that stores current and historical data of
potential interest to decision makers throughout the company. The data originate in many
cores operational transaction systems. Data warehouse system will provide query,
analysis, and reporting tools.
Data marts: A data mart subset of data warehouse. Summarized or highly focused portion
of firm’s data for use by specific population of users. Typically focuses on single subject
or line of business.
Q. Data Mining
Data mining is more discovery driven than OLAP. Finds hidden patterns, relationships in
large databases and infers rules to predict future behavior. E.g., Finding patterns in
customer data for one-to-one marketing campaigns or to identify profitable customers.
Types of information obtainable from data mining: