Professional Documents
Culture Documents
Typical Environment
• Corporate Enterprise
• Data With Large Homogenous Parts (e.g., Formatted Data)
• Data Relevant Over a Long Time
• Data Used by Many Simultaneous Users (Batch and On-
line Users) for Retrieval & Update
What is a DBMS?
Database Management System is a general purpose software
system that facilitates the process of
• defining,
• constructing and
• manipulating
database for various applications
Main functions of DBMS
• Defining a Database:
Specifying Data Types, Structures, and Constraints
• Constructing a Database:
the Process of Storing the Data Itself on Some Storage Medium
• Manipulating a Database:
Function for Querying Specific Data in the Database and
Updating the Database
Additional functions of DBMS
• Interaction with File Manager
So that Details Related to Data Storage and Access are Removed From
Application Programs
• Integrity Enforcement
Guarantee Correctness, Validity, Consistency
• Security Enforcement
Prevent Data From Illegal Uses
• Concurrency Control
Control the Interference Between Concurrent Programs
• Recovery from Failure
• Query Processing and Optimization
Motivating database management
• Manual Database Management
• Data are Not Stored
• Programmer Defines Both Logical Data Structure and Physical
Structure (Storage)
Motivating database management
• File Processing
• Data are Stored in Files with Interface Between Programs and Files.
• Various Access Methods Exist (E.G., Sequential, Indexed, Random)
• One File Corresponds to One or Several Programs.
Problems with file systems
• Data are Still Highly Redundant
• Sharing Limited and at the File Level
• Data is Unstructured
• “Flat” Files
• High Maintenance Costs
• Data Dependence
• Ensuring Data Consistency and Controlling Access to Data (Concurrent
Access Problematic)
• Difficult to Understand by New Developers
• Difficulties in Developing New Applications
• Almost Impossible to Evolve with New Capabilities
• Risk of Inefficient Applications
Database vs. File System
Coordinates Both Physical Coordinates Only the
and Logical Access to the Physical Access to the
Data Data
Data are Shared by All Data Written by One
Programs Authorized to Program May Not Be
Have Access to It Readable by Another
Program
Flexible Access to Data Pre-determined Access to
(i.e., Queries) Data (I.E., Compiled
Programs)
Multiple Users Accessing No Two Programs Can
the Same Data at Same Concurrently Access the
Time Same File
‘University’ database example
• Has 5 files (‘tables’):
Student
Course
Section
Grade Report
Prerequisite
• See how do they relate to each other!
• How do we define, construct & manipulate?
Characteristic of Database
Approach
• Self-describing nature
• Insulation between program and data
• Support of multiple views of data
• Sharing of data & multiuser transaction
processing
Self-describing nature
• As oppose to unstructured data
• Contains not only database itself, but
also a complete definition of database
structure (metadata)
• In traditional file processing, data
definition is part of the application
program
Insulation between programs &
data
Data Mining
Management
The above gives rise to new research and development in
incorporating new data types, complex data structures, new
operations and storage and indexing schemes in database
systems.
When not to use a DBMS
Main inhibitors (costs) of using a DBMS:
High initial investment and possible need for
additional hardware.
Overhead for providing generality, security,