DB Concepts and Models

DATA BASE MANAGEMENT SYSTEM (P15IS51)
Chapter 2: Database System Concepts and Architecture
PUTTASWAMY B S, Assistant Professor, Department of IS&E

2.1 Data Models, Schemas, and Instances
2.1.1 Categories of Data Models
2.1.2 Schemas, Instances, and Database State
2.2 Three-Schema Architecture and Data Independence
2.2.1 The Three-Schema Architecture
2.2.2 Data Independence
2.3 Database Languages and Interfaces
2.3.1 DBMS Languages
2.3.2 DBMS Interfaces
2.4 The Database System Environment
2.4.1 DBMS Component Modules
2.4.2 Database System Utilities
2.4.3 Tools, Application Environments, and Communications Facilities
Data Models, Schemas, and Instances:

One fundamental characteristic of the database approach is that it provides some level of data
abstraction. Data abstraction generally refers to the suppression of details of data organization
and storage, and the highlighting of the essential features for an improved understanding of data.
Because the database approach supports data abstraction, different users can perceive data at
their preferred level of detail.
Data model:
A data model is a collection of concepts that can be used to describe the structure of a database;
it provides the necessary means to achieve this abstraction. By structure of a database we mean
the data types, relationships, and constraints that apply to the data. Most data models also
include a set of basic operations for specifying retrievals and updates on the database.
A database model shows the logical structure of a database, including the relationships and
constraints that determine how data can be stored and accessed. Individual database models are
designed based on the rules and concepts of whichever broader data model the designers adopt.
Most data models can be represented by an accompanying database diagram.
Categories of Data Models:
There are many kinds of data models. Some of the most common ones include:
• Hierarchical database model
• Relational model
• Network model
• Object-oriented database model
• Entity-relationship model
• Document model
• Entity-attribute-value model
• Star schema
• Object-relational model, which combines the relational and object-oriented models
Relational model:
The most common model, the relational model sorts data into tables, also known as relations, each
of which consists of columns and rows. Each column lists an attribute of the entity in question,
such as price, zip code, or birth date. The set of values that an attribute is allowed to take is
called its domain. A particular attribute or combination of attributes is chosen as a primary key;
when another table refers to that key, the referring attribute is called a foreign key.
Each row, also called a tuple, includes data about a specific instance of the entity in question,
such as a particular employee.
The model also accounts for the types of relationships between those tables, including
one-to-one, one-to-many, and many-to-many relationships.
Within the database, tables can be normalized, or brought to comply with normalization rules
that make the database flexible, adaptable, and scalable. When normalized, each piece of data is
atomic, or broken into the smallest useful pieces.
Relational databases are typically defined and queried using Structured Query Language (SQL). The
model was introduced by E.F. Codd in 1970.
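The primary key / foreign key idea can be sketched with Python's built-in sqlite3 module standing in for a relational DBMS. The department and employee tables and their column names are assumptions made up for this illustration; they do not come from the notes.

```python
import sqlite3

# In-memory database; SQLite only enforces foreign keys when asked to.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")

# dept_id is the primary key of department and a foreign key in employee.
conn.execute("CREATE TABLE department (dept_id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
conn.execute(
    "CREATE TABLE employee ("
    " emp_id INTEGER PRIMARY KEY,"
    " name TEXT NOT NULL,"
    " dept_id INTEGER REFERENCES department(dept_id))"
)

conn.execute("INSERT INTO department VALUES (1, 'IS&E')")
conn.execute("INSERT INTO employee VALUES (10, 'Arun', 1)")  # refers to an existing department

try:
    # Department 99 does not exist, so the foreign key constraint should reject this row.
    conn.execute("INSERT INTO employee VALUES (11, 'Sanjay', 99)")
    fk_rejected = False
except sqlite3.IntegrityError:
    fk_rejected = True

print("row with unknown department rejected:", fk_rejected)
```

Each row of employee is one tuple, and the dept_id column links it to exactly one department row, which is how a one-to-many relationship is usually represented.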
Hierarchical model:
The hierarchical model organizes data into a tree-like structure, where each record has a single
parent and the tree has a single root. Sibling records are sorted in a particular order, and that
order is used as the physical order for storing the database. This model is good for describing
many real-world relationships.
This model was primarily used by IBM's Information Management System (IMS) in the 60s and 70s,
but it is rarely seen today due to certain operational inefficiencies.
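As a rough illustration of the single-parent rule, the tree can be sketched in Python as a mapping from each record to its parent. The record names are hypothetical, and this is not how IMS stores data.

```python
# Each record maps to its single parent; the root has no parent.
parent = {
    "Company": None,
    "Sales": "Company",
    "Engineering": "Company",
    "Preethi": "Sales",
    "Arun": "Engineering",
}

def path_to_root(record):
    """Follow parent links upward; one parent per record means exactly one path."""
    path = [record]
    while parent[path[-1]] is not None:
        path.append(parent[path[-1]])
    return path

def children(record):
    """Sibling records are returned in a fixed (here alphabetical) order."""
    return sorted(child for child, p in parent.items() if p == record)

print(path_to_root("Arun"))
print(children("Company"))
```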
Network model:
The network model builds on the hierarchical model by allowing many-to-many relationships between
linked records, implying multiple parent records. Based on mathematical set theory, the model is
constructed with sets of related records. Each set consists of one owner or parent record and one
or more member or child records. A record can be a member or child in multiple sets, allowing
this model to convey complex relationships.
It was most popular in the 70s after it was formally defined by the Conference on Data Systems
Languages (CODASYL).
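The owner/member idea can be sketched in a few lines of Python. This is only an in-memory illustration under assumed names (the set types SUPPLIES and USES and the record names are invented); it is not CODASYL syntax.

```python
from collections import defaultdict

# Each set occurrence is keyed by (set type, owner record) and lists its member records.
sets = defaultdict(list)

def connect(set_type, owner, member):
    """Add a member record to the set occurrence owned by `owner`."""
    sets[(set_type, owner)].append(member)

def owners_of(member):
    """All owner (parent) records of a member, across every set type."""
    return sorted(owner for (_set_type, owner), members in sets.items() if member in members)

connect("SUPPLIES", "Supplier-1", "Part-7")
connect("SUPPLIES", "Supplier-1", "Part-9")
connect("USES", "Project-A", "Part-7")

# Part-7 is a member of two sets, so it has two parents,
# which a strict hierarchy would not allow.
print(owners_of("Part-7"))
```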

Object-oriented database model:
This model defines a database as a collection of objects, or reusable software elements, with
associated features and methods. There are several kinds of object-oriented databases:
A multimedia database incorporates media, such as images, that are difficult to store in a
relational database.
A hypertext database allows any object to link to any other object. It is useful for organizing
lots of disparate data, but it is not ideal for numerical analysis.
The object-oriented database model is the best known post-relational database model, since it
incorporates tables but is not limited to tables. Such models are also known as hybrid database
models.

Object-relational model:
This hybrid database model combines the simplicity of the
relational model with some of the advanced functionality of the object-oriented database model.
In essence, it allows designers to incorporate objects into the familiar table structure.
Languages and call interfaces include SQL3, vendor languages, ODBC, JDBC, and proprietary
call interfaces that are extensions of the languages and interfaces used by the relational model.
Entity-relationship model:
This model captures the relationships between real-world entities much like the network model,
but it is not as directly tied to the physical structure of the database. Instead, it is often
used for designing a database conceptually.
Here, the people, places, and things about which data points are stored are referred to as
entities, each of which has certain attributes that describe it. The relationships between
entities, and the cardinality of those relationships, are mapped as well.
A design that is commonly drawn in this way is the star schema, in which a central fact table
connects to multiple dimension tables.
The term "database schema" can refer to a visual representation of a database, a set of rules that
govern a database, or to the entire set of objects belonging to a particular user.
1.11 Data Model
A model is an abstraction process that hides superfluous details. Data modeling is used for
representing entities of interest and their relationships in the database.
A data model is a collection of concepts that can be used to describe the structure of a database
and that provides the necessary means to achieve this abstraction. The structure of a database
means:
• Data types
• Relationships
• Constraints

Types of Data Models

1. High Level: Conceptual data model
2. Low Level: Physical data model
3. Relational or Representational data model
4. Object-oriented data models
5. Object-relational models
High-level (conceptual) data model: The user-level data model is the high-level or conceptual
model. It provides concepts that are close to the way that many users perceive data.
Low-level (physical) data model: It provides concepts that describe the details of how data is
stored in the computer. Low-level data models are meant for computer specialists, not for end
users.
Representational data model: It lies between the high-level and low-level data models and
provides concepts that may be understood by end users but that are not too far removed from the
way data is organized within the computer.
The most common data models are
1. Relational Model
The relational model uses a collection of tables to represent both the data and the relationships
among those data. Each table has multiple columns, and each column has a unique name. Below is a
relational database comprising two tables:
Customer table:
Customer-Name  Security-Number  Address       City       Account-Number
Preethi        111-222-3456     Yelhanka      Bangalore  A-101
Sharan         111-222-3457     Hebbal        Bangalore  A-125
Preethi        112-123-9878     Jaynagar      Bangalore  A-456
Arun           123-987-9909     MG Road       Bangalore  A-987
Preethi        111-222-3456     Yelhanka      Bangalore  A-111
Rocky          222-232-0987     Sanjay Nagar  Bangalore  A-111

Account table:
Account-Number  Balance
A-101           1000.00
A-125           1200.00
A-456           5000.00
A-987           1234.00
A-111           3000.00

Customers Preethi and Rocky share the same account number, A-111.
Advantages
1. The main advantage of this model is its ability to represent data in a simplified format.
2. The process of manipulating records is simplified by the use of certain key attributes to
retrieve data.
3. Representation of different types of relationships is possible with this model.
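To show how the key attribute Account-Number ties the two tables together, here is a small sketch using Python's built-in sqlite3 module. The rows come from the Customer and Account tables in these notes (only the name and account columns are kept); the SQL column names are assumptions for the illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE account (account_number TEXT PRIMARY KEY, balance REAL)")
conn.execute(
    "CREATE TABLE customer ("
    " customer_name TEXT,"
    " account_number TEXT REFERENCES account(account_number))"
)
conn.executemany(
    "INSERT INTO account VALUES (?, ?)",
    [("A-101", 1000.00), ("A-125", 1200.00), ("A-456", 5000.00),
     ("A-987", 1234.00), ("A-111", 3000.00)],
)
conn.executemany(
    "INSERT INTO customer VALUES (?, ?)",
    [("Preethi", "A-101"), ("Sharan", "A-125"), ("Preethi", "A-456"),
     ("Arun", "A-987"), ("Preethi", "A-111"), ("Rocky", "A-111")],
)

# Join the two tables on the shared key to list the holders of account A-111.
holders = conn.execute(
    "SELECT c.customer_name, a.balance"
    " FROM customer AS c JOIN account AS a ON a.account_number = c.account_number"
    " WHERE a.account_number = 'A-111'"
    " ORDER BY c.customer_name"
).fetchall()
print(holders)
```

The balance is stored once in the account table, so both holders of A-111 see the same value.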
2. Network Model
The data in the network model are represented by collections of records, and relationships among
data are represented by links, which can be viewed as pointers.
The records in the database are organized as collections of arbitrary graphs.

Advantages:
1. Relationships between entities are implemented using pointers, which allows the representation
of arbitrary relationships.
2. Unlike the hierarchical model, it handles records with more than one parent easily.
3. Data manipulation can be done easily with this model.


3. Hierarchical Model
A hierarchical data model is a data model in which the data is organized into a tree-like
structure. The structure allows repeating information using parent/child relationships: each
parent can have many children, but each child has only one parent. All attributes of a specific
record are listed under an entity type.
Advantages:
1. The representation of records is done using an ordered tree, which is a natural method of
implementing one-to-many relationships.
2. Proper ordering of the tree results in easier and faster retrieval of records.
3. It allows the use of virtual records. This results in a stable database, especially when
modifications are made to the database.
4. Object-oriented Data Models
Several models have been proposed for implementation in a database system.
• One set comprises models of persistent O-O programming languages such as C++ (e.g., in
OBJECTSTORE or VERSANT) and Smalltalk (e.g., in GEMSTONE).
• Additionally, there are systems like O2, ORION (at MCC, later ITASCA), and IRIS (at H.P., used
in Open OODB).
• Object Database Standard: ODMG-93, ODMG version 2.0, ODMG version 3.0.
5. Object-Relational Models
• Most recent trend. Started with Informix Universal Server.
• Relational systems incorporate concepts from object databases, leading to object-relational
systems.
• Exemplified in recent versions of Oracle (10g), DB2, SQL Server and other DBMSs.
• Standards are included in SQL-99 and are expected to be enhanced in future SQL standards.
Database Schema:
A database schema is the skeleton structure that represents the logical view of the entire
database. It defines how the data is organized and how the relations among them are associated.
It formulates all the constraints that are to be applied on the data.
A database schema does not contain any data or information.
A database schema defines its entities and the relationships among them. It contains a
descriptive detail of the database, which can be depicted by means of schema diagrams. Database
designers design the schema to help programmers understand the database and make it useful.
A database schema can be divided broadly into two categories:

1. Physical Database Schema
2. Logical Database Schema
Physical Database Schema: This schema pertains to the actual storage of data and its form of
storage, such as files and indices. It defines how the data will be stored in secondary storage.
Logical Database Schema: This schema defines all the logical constraints that need to be applied
on the data stored. It defines tables, views, and integrity constraints.

Database Instance:
The data in the database at a particular moment in time is called a database state, snapshot, or
instance. It is also called the current set of occurrences or instances in the database. In a
given database state, each schema construct has its own current set of instances.
The DBMS stores the descriptions of the schema constructs and constraints, also called metadata,
in the DBMS catalog so that DBMS software can refer to the schema whenever it needs to.

It is important to distinguish these two terms. The database schema is the skeleton of the
database. It is designed before the database exists, and once the database is operational it is
very difficult to change. A database schema does not contain any data or information.
A database instance is the state of an operational database, with its data, at a given time.
Database instances tend to change with time. A DBMS ensures that every instance (state) is a
valid state by diligently following all the validations, constraints, and conditions that the
database designers have imposed.
Put differently, a database instance is the database state at a specific time, defined by the
currently existing content, relationships and their attributes.
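The difference between schema and state can be seen in a short sketch that uses Python's built-in sqlite3 module as a stand-in DBMS. The student table is a made-up example; sqlite_master is SQLite's own catalog table, where the schema description (metadata) is kept.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE student (roll_no INTEGER PRIMARY KEY, name TEXT)")

def schema():
    """Schema description as stored in SQLite's catalog."""
    return conn.execute(
        "SELECT sql FROM sqlite_master WHERE name = 'student'"
    ).fetchone()[0]

def state():
    """Current database state: the rows present right now."""
    return conn.execute("SELECT roll_no, name FROM student ORDER BY roll_no").fetchall()

schema_before, state_before = schema(), state()

conn.execute("INSERT INTO student VALUES (1, 'Preethi')")
conn.execute("INSERT INTO student VALUES (2, 'Sharan')")

schema_after, state_after = schema(), state()

print("schema changed:", schema_before != schema_after)
print("state before:", state_before)
print("state after: ", state_after)
```

Inserting rows moves the database from one state to another, while the schema stored in the catalog stays the same.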
Three-Schema Architecture and Data Independence:

The main characteristics of the database approach versus the file-processing approach are the
following:
1. Self-describing nature of a database system
2. Insulation between programs and data, and data abstraction
3. Support of multiple views of the data
4. Sharing of data and multiuser transaction processing
In this section we specify an architecture for database systems, called the three-schema
architecture, that was proposed to help achieve and visualize characteristics (1), (2) and (3).
The Three-Schema Architecture:
The goal of the three-schema architecture, illustrated in the figure below, is to separate the
user applications from the physical database.
The three-schema architecture describes a database at three levels according to its use and
structure, and to the roles played by system administrators, designers and end users.
Internal schema:
The internal level has an internal schema, which describes the physical storage structure of the
database. The internal schema uses a physical data model and describes the complete details of
data storage and access paths for the database.
[Physical Level: At the physical level, the information about the location of database objects in
the data store is kept. Users of the DBMS are unaware of the locations of these objects.]
Conceptual schema:
The conceptual level has a conceptual schema, which describes the structure of the whole database
for a community of users. The conceptual schema hides the details of physical storage structures
and concentrates on describing entities, data types, relationships, user operations, and
constraints. Usually, a representational data model is used to describe the conceptual schema
when a database system is implemented. This implementation conceptual schema is often based on a
conceptual schema design in a high-level data model.
[Conceptual Level: At the conceptual level, data is represented in the form of various database
tables. For example, a STUDENT database may contain STUDENT and COURSE tables, which are visible
to users, but users are unaware of how they are stored.]
External schema:
The external or view level includes a number of external schemas or user views. Each external
schema describes the part of the database that a particular user group is interested in and
hides the rest of the database from that user group. As in the previous level, each external
schema is typically implemented using a representational data model, possibly based on an
external schema design in a high-level data model.
[External Level: An external level specifies a view of the data in terms of conceptual-level
tables. Each external-level view caters to the needs of a particular category of users. For
example, the FACULTY of a university are interested in the course details of students, while
STUDENTS are interested in all details related to academics, accounts, courses and hostel. So,
different views can be generated for different users.]
The three-schema architecture is divided into three levels:
1. Physical Level
2. Conceptual Level
3. External Level

In the above diagram:

• It shows the architecture of a DBMS.
• Mapping is the process of transforming requests and results between the levels of the
architecture.
• Mapping takes time, so it may not be worthwhile for small databases.
• In external/conceptual mapping, the DBMS transforms a request on an external schema into a
request against the conceptual schema.
• In conceptual/internal mapping, the request is transformed from the conceptual level to the
internal level.
1. Physical Level
• The physical level describes the physical storage structure of data in the database.
• This level is very close to the physical storage of data.
• At the lowest level, data is stored in the form of bits with physical addresses on the
secondary storage device.
• At the highest level, it can be viewed in the form of files.
• The internal schema defines the various stored data types.
• It uses a physical data model.

2. Conceptual Level
• The conceptual level describes the structure of the whole database for a group of users.
• It is also referred to as the data model.
• The conceptual schema is a representation of the entire content of the database.
• This schema contains all the information needed to build the relevant external records.
• It hides the internal details of physical storage.
3. External Level
• The external level is related to the data as viewed by individual end users.
• This level includes a number of user views or external schemas.
• This level is closest to the user.
• An external view describes the segment of the database that is required by a particular user
group and hides the rest of the database from that user group.
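In SQL systems, external schemas are commonly realized as views over the conceptual-level tables. The sketch below uses Python's built-in sqlite3 module; the student table, its columns, and the two view names are assumptions chosen to echo the faculty example in these notes.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Conceptual level: one table describing students for the whole community of users.
conn.execute(
    "CREATE TABLE student ("
    " roll_no INTEGER PRIMARY KEY, name TEXT, course TEXT, fees_due REAL)"
)
conn.executemany(
    "INSERT INTO student VALUES (?, ?, ?, ?)",
    [(1, "Preethi", "DBMS", 0.0), (2, "Sharan", "DBMS", 1500.0)],
)

# External level: each user group gets a view that hides the rest of the table.
conn.execute("CREATE VIEW faculty_view AS SELECT roll_no, name, course FROM student")
conn.execute("CREATE VIEW accounts_view AS SELECT roll_no, fees_due FROM student")

faculty_rows = conn.execute("SELECT * FROM faculty_view ORDER BY roll_no").fetchall()
accounts_rows = conn.execute("SELECT * FROM accounts_view ORDER BY roll_no").fetchall()
print(faculty_rows)
print(accounts_rows)
```

How SQLite lays the rows out on disk (the internal level) never appears in either view or in the table definition.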
Data Independence:
Data independence is the ability to modify a schema definition at one level without affecting a
schema definition at the next higher level.
[Data independence: the facility to modify a database schema (logical or storage schema) with no
consequent requirement to modify user views or programs interacting with the database, nor any
need to reload data. Providing data independence has been a main motivation for the development
of database management software.]
There are two levels of data independence:
1. Physical Data Independence
2. Logical Data Independence

Logical Data Independence:

Logical data is data about the database, that is, information about how data is organized
inside it: for example, a table (relation) stored in the database and all the constraints applied
on that relation.
Logical data independence is the mechanism that decouples this logical structure from the actual
data stored on the disk. If we change the format of a table, the data already residing on the
disk should not have to change.
[Logical data independence means that the conceptual schema can be altered without affecting the
existing external schemas. The mapping between the external level and the conceptual level
absorbs the alterations. It also insulates application programs from actions such as combining
two records into a single record or splitting a single record into two or more records.
Logical data independence is harder to achieve than physical data independence because it
requires flexibility in the design of the database. For example, suppose two users A and B select
the same fields 'student name' and 'student roll number', and user A then adds a new column
'student age' to the table. This does not affect the external view of user B, although the
underlying schema shared by both users has changed.]
• Logical data independence is the ability to modify the conceptual schema without requiring any
change in application programs.
• Modification at the logical level is necessary whenever the logical structure of the database
is altered.
• Logical data independence separates the external level from the conceptual level.
• It is comparatively difficult to achieve logical data independence.
• Application programs depend heavily on the logical structure of the data they access, so a
change in logical structure may still require the programs that use the changed part to change.
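The 'student age' example can be tried out with Python's built-in sqlite3 module. This is a sketch under assumptions: user B's external schema is modelled as a view named student_names, and the table and column names are invented for the illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE student (roll_no INTEGER PRIMARY KEY, name TEXT)")
conn.executemany("INSERT INTO student VALUES (?, ?)", [(1, "Preethi"), (2, "Sharan")])

# User B's external schema: only roll number and name.
conn.execute("CREATE VIEW student_names AS SELECT roll_no, name FROM student")

def user_b_query():
    """User B's program only ever talks to the view."""
    return conn.execute("SELECT roll_no, name FROM student_names ORDER BY roll_no").fetchall()

before = user_b_query()

# User A changes the conceptual schema by adding a column.
conn.execute("ALTER TABLE student ADD COLUMN age INTEGER")

after = user_b_query()
print("user B sees the same result:", before == after)
```

The view definition plays the role of the external/conceptual mapping: because it names its columns explicitly, the added column does not leak into user B's result.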
Physical Data Independence:
Physical data independence means that physical storage structures or devices can be changed
without affecting the conceptual schema. The change is absorbed by the mapping between the
internal and conceptual levels. Physical data independence is achieved through the presence of
the internal level of the database and the mapping from the conceptual level to the internal
level.

[All the schemas are logical, and the actual data is stored in bit format on the disk. Physical
data independence is the ability to change the physical data without impacting the schema or
logical data. For example, if we want to change or upgrade the storage system itself, say by
replacing hard disks with SSDs, it should not have any impact on the logical data or schemas.]
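A change that is easy to try is adding an index, which alters the access paths at the internal level only. The sketch below uses Python's built-in sqlite3 module; the account rows are taken from the Account table in these notes, and the index name is an assumption.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE account (account_number TEXT PRIMARY KEY, balance REAL)")
conn.executemany(
    "INSERT INTO account VALUES (?, ?)",
    [("A-101", 1000.00), ("A-125", 1200.00), ("A-456", 5000.00)],
)

# The application's query is written against the conceptual schema only.
query = "SELECT account_number FROM account WHERE balance > 1100 ORDER BY account_number"

before = conn.execute(query).fetchall()

# Internal-level change: a new access path. The table definition is untouched.
conn.execute("CREATE INDEX idx_account_balance ON account (balance)")

after = conn.execute(query).fetchall()
print("same answer before and after indexing:", before == after)
```

The query text and its result are unchanged; only the way the DBMS may locate the rows differs.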
Difference between Physical data independence and logical data independence:
Physical data independence is the ability to change the internal schema without changing the
conceptual schema. Logical data independence is the ability to change the logical (conceptual)
schema without changing external schemas or application programs. In database applications,
logical data independence relates to views, through which users see the data rather than the
underlying structure.
Example:
Entities, relationships, and attributes can be added to or removed from the conceptual schema
without altering the external schemas that do not use them.
Attributes or tables can be renamed without changing the external schemas, provided the
external/conceptual mapping is updated.
Advantages of Data independence

The advantages of data independence in a DBMS are as follows:
• Ability to improve performance
• Alterations in data structure do not require alterations in application programs
• Implementation details can be hidden from the users
• Reduction of inconsistency
• Flexibility in improving the system
• Lower cost of maintaining the system
• Providing the best services to the users
• Users can focus on the general structure
• Enforcement of standards
• Improvement of security
• Data integrity can be improved

Database Languages and Interfaces:

A DBMS must provide appropriate languages and interfaces for each category of users to express
database queries and updates.
Database Languages:
Database languages are used to create and maintain a database on a computer, that is, to read,
update and store data in it.
Database products such as Oracle, MySQL, MS Access, dBase, FoxPro and Sybase each provide
languages for this purpose.
Database languages can be categorized into the following types:
1. Data Definition Language (DDL)
2. Data Manipulation Language (DML)
3. Data Control Language (DCL)
4. Transaction Control Language (TCL)
5. Storage Definition Language (SDL)
6. View Definition Language (VDL)
Data Definition Language (DDL):

It is a language that allows users to define data and their relationship to other types of data
(the DDL is used for specifying the database schema).
It is mainly used to create files, databases, the data dictionary and tables within databases. It
is also used to specify the structure of each table, the set of values associated with each
attribute, integrity constraints, security and authorization information for each table, and the
physical storage structure of each table on disk.

[The language used to define the data in the database is called the Data Definition Language
(DDL). In simple words, the DDL is used to create, alter and delete the database.]
The tasks that come under DDL:
• CREATE - used to create objects in the database
• ALTER - used to alter the structure of the database
• DROP - used to delete objects from the database
• TRUNCATE - used to remove all records from a table, including all space allocated for the
records
• COMMENT - used to add comments to the data dictionary
• RENAME - used to rename an object
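A few of these statements can be tried with Python's built-in sqlite3 module. The student table is a made-up example, and SQLite's dialect differs from the list above: it has no TRUNCATE or COMMENT statement, and renaming is done through ALTER TABLE.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# CREATE: define a new table (a schema object).
cur.execute("CREATE TABLE student (roll_no INTEGER PRIMARY KEY, name TEXT NOT NULL)")

# ALTER: change the structure by adding a column.
cur.execute("ALTER TABLE student ADD COLUMN age INTEGER")

# RENAME: in SQLite this is a form of ALTER TABLE.
cur.execute("ALTER TABLE student RENAME TO learner")

# Column names of the renamed table, read from SQLite's table_info pragma.
columns = [row[1] for row in cur.execute("PRAGMA table_info(learner)")]

# DROP: remove the table definition together with its data.
cur.execute("DROP TABLE learner")
remaining = cur.execute("SELECT name FROM sqlite_master WHERE type = 'table'").fetchall()

print(columns)
print(remaining)
```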
Data Manipulation Language (DML):

It is a language that provides a set of operations to support the basic data manipulation
operations on the data held in the database. It allows users to insert, update, delete and
retrieve data from the database. The part of the DML that involves data retrieval is called a
query language.
[The language used to manipulate the data in the database is called the Data Manipulation
Language (DML). In simple words, the DML is used to retrieve data from the database, insert new
data into the database, and delete or modify existing data. The operation of retrieving data
from the database using the DML is called a query.]
The tasks that come under DML:

1. SELECT - It retrieves data from a database
2. INSERT - It inserts data into a table
3. UPDATE - It updates existing data within a table
4. DELETE - It deletes records from a table (all records if no condition is given); the space for
the records remains allocated
5. MERGE - UPSERT operation (insert or update)
6. CALL - It calls a PL/SQL or Java subprogram (Oracle)
7. EXPLAIN PLAN - It explains the access path to data
8. LOCK TABLE - It controls concurrency
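The four basic DML operations can be sketched with Python's built-in sqlite3 module. The account numbers and balances are borrowed from the Account table in these notes; the column names are assumptions. MERGE, CALL and LOCK TABLE are not shown because SQLite does not provide them in this form.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE account (account_number TEXT PRIMARY KEY, balance REAL)")

# INSERT: add new rows.
cur.executemany(
    "INSERT INTO account VALUES (?, ?)",
    [("A-101", 1000.00), ("A-125", 1200.00), ("A-456", 5000.00)],
)

# UPDATE: modify existing rows that match a condition.
cur.execute("UPDATE account SET balance = balance + 500 WHERE account_number = ?", ("A-101",))

# DELETE: remove the rows that match a condition (without WHERE it removes every row).
cur.execute("DELETE FROM account WHERE account_number = ?", ("A-456",))

# SELECT: the query part of the DML.
rows = cur.execute(
    "SELECT account_number, balance FROM account ORDER BY account_number"
).fetchall()
print(rows)
```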
Data Control Language (DCL):
It is used to control privileges in the database. To perform any operation in the database, such
as creating tables, sequences or views, we need privileges.
DCL statements control access to data and the database using statements such as GRANT and
REVOKE. A privilege can be granted to a user with the GRANT statement. The privileges assigned
can be SELECT, ALTER, DELETE, EXECUTE, INSERT, INDEX, etc. A privilege that has been granted can
also be taken back with the REVOKE command.
The tasks that come under DCL:
• To grant access to a user: GRANT
• To revoke access from a user: REVOKE
Transaction Control Language (TCL):
Transaction control statements are used to manage the changes made by DML statements. They allow
statements to be grouped together into logical transactions.
• COMMIT - It saves the work done
• SAVEPOINT - It identifies a point in a transaction to which you can later roll back
• ROLLBACK - It restores the database to its state as of the last COMMIT
• SET TRANSACTION - It changes transaction options such as the isolation level and which
rollback segment to use.
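The effect of COMMIT, SAVEPOINT and ROLLBACK can be traced with Python's built-in sqlite3 module. In this sketch the connection is opened with isolation_level=None so that the transaction statements are issued explicitly; the account data and the savepoint name are assumptions for the illustration.

```python
import sqlite3

# isolation_level=None: Python does not open transactions implicitly,
# so BEGIN / COMMIT / ROLLBACK below are exactly what the DBMS sees.
conn = sqlite3.connect(":memory:", isolation_level=None)
cur = conn.cursor()
cur.execute("CREATE TABLE account (account_number TEXT PRIMARY KEY, balance REAL)")
cur.execute("INSERT INTO account VALUES ('A-101', 1000.0), ('A-125', 1200.0)")

def balances():
    return dict(cur.execute("SELECT account_number, balance FROM account").fetchall())

# Transaction 1: two updates grouped into one logical unit, then made permanent.
cur.execute("BEGIN")
cur.execute("UPDATE account SET balance = balance - 200 WHERE account_number = 'A-101'")
cur.execute("UPDATE account SET balance = balance + 200 WHERE account_number = 'A-125'")
cur.execute("COMMIT")

# Transaction 2: partial rollback to a savepoint, then a full rollback.
cur.execute("BEGIN")
cur.execute("UPDATE account SET balance = 0 WHERE account_number = 'A-101'")
cur.execute("SAVEPOINT before_second_update")
cur.execute("UPDATE account SET balance = 0 WHERE account_number = 'A-125'")
cur.execute("ROLLBACK TO SAVEPOINT before_second_update")  # undoes only the second update
after_savepoint_rollback = balances()
cur.execute("ROLLBACK")  # undoes everything since BEGIN
after_full_rollback = balances()

print(after_savepoint_rollback)
print(after_full_rollback)
```

After the full rollback the balances are back to what the last COMMIT left behind.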

Storage Definition Language (SDL):

It is a language used to specify the internal schema. The mappings between the conceptual and
internal schemas may be specified in either the DDL or the SDL. In most relational DBMSs today,
there is no specific language that performs the role of SDL. Instead, the internal schema is
specified by a combination of functions, parameters, and specifications related to storage. These
permit the DBA staff to control indexing choices and the mapping of data to storage.
View Definition Language (VDL):

It is used to specify user views and their mappings to the conceptual schema, but in most DBMSs
the DDL is used to define both conceptual and external schemas.
DBMS Interfaces:
A database management system (DBMS) interface is a user
interface which allows for the ability to input queries to a database without using the query
language itself. A DBMS interface could be a web client, a local client that runs on a desktop
computer, or even a mobile app.
Various types of DBMS interfaces are,
1. Menu-Based Interfaces for Web Clients or Browsing
2. Forms-Based Interfaces
3. Graphical User Interfaces
4. Natural Language Interfaces
5. Interfaces for Parametric Users
6. Interfaces for the DBA
Menu-Based Interfaces for Web Clients or Browsing:
These interfaces present the user with lists of options, called menus, that lead the user through
the formulation of a request.
Menus do away with the need to memorize the specific commands and syntax of a query language;
rather, the query is composed step by step by picking options from a menu that is displayed by
the system.

Pull-down menus are a very popular technique in Web-based user interfaces. They are also often
used in browsing interfaces, which allow a user to look through the contents of a database in an
exploratory and unstructured manner.
Forms-Based Interfaces:
A forms-based interface displays a form to each user. Users can fill out all of the form entries
to insert new data, or they can fill out only certain entries, in which case the DBMS will
retrieve matching data for the remaining entries.
Forms are usually designed and programmed for naive users as interfaces to canned transactions.
Many DBMSs have forms specification languages, which are special languages that help programmers
specify such forms.
Graphical User Interfaces:
A graphical user interface (GUI) typically displays a schema
to the user in diagrammatic form. The user can then specify a query by manipulating the
diagram. In many cases, GUIs utilize both menus and forms. Most GUIs use a pointing device,
such as a mouse, to pick certain parts of the displayed schema diagram.
Natural Language Interfaces:
These interfaces accept requests written in English or some
other language and attempt to "understand" them. A natural language interface usually has its
own "schema," which is similar to the database conceptual schema, as well as a dictionary of
important words.
The natural language interface refers to the words in its schema, as well as to the set of standard
words in its dictionary, to interpret the request. If the interpretation is successful, the interface
generates a high-level query corresponding to the natural language request and submits it to the
DBMS for processing; otherwise, a dialogue is started with the user to clarify the request.
Interfaces for Parametric Users:
Parametric users, such as bank tellers, often have a
small set of operations that they must perform repeatedly. Systems analysts and programmers
design and implement a special interface for each known class of naive users. Usually, a small
set of abbreviated commands is included, with the goal of minimizing the number of keystrokes
required for each request.

Interfaces for the DBA:

Most database systems contain privileged commands that can be
used only by the DBA's staff. These include commands for creating accounts, setting system
parameters, granting account authorization, changing a schema, and reorganizing the storage
structures of a database.
The Database System Environment:

A DBMS is a complex software
system. In this section we discuss the types of software components that constitute a DBMS and
the types of computer system software with which the DBMS interacts.
DBMS Component Modules:

The figure is divided into two halves. The top half of the figure refers to the various users of the
database environment and their interfaces. The lower half shows the internals of the DBMS
responsible for storage of data and processing of transaction.
The database and the DBMS catalog are usually stored on disk. Access to the disk is primarily
controlled by operating system(OS).which inclues disk input/Output. A higher level stored data
manager module of DBMS controls access to DBMS information that is stored on the disk.
If we consider the top half of the figure, It shows interfaces to DBA staff, casual users,
application programmers and parametric users.
The DDL compiler processes schema definitions, specified in the DDL, and stores the
description of the schema in the DBMS catalog. The catalog includes information such as the
names and sizes of files, the names and data types of data items, storage details of each file,
mapping information among schemas, and constraints.
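The relationship between DDL statements and the catalog can be observed in a small sketch. It uses SQLite through Python's standard sqlite3 module purely as a convenient stand-in for a DBMS; the student table and its columns are assumptions made for this example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# The DDL statement is processed by the DBMS, which records the schema in its catalog.
conn.execute("CREATE TABLE student (roll_no INTEGER PRIMARY KEY, name TEXT NOT NULL)")

# SQLite exposes its catalog as the sqlite_master table.
catalog_rows = conn.execute(
    "SELECT type, name, sql FROM sqlite_master WHERE name = 'student'"
).fetchall()

# PRAGMA table_info lists each column with its declared data type.
columns = [(row[1], row[2]) for row in conn.execute("PRAGMA table_info(student)")]
```

Here catalog_rows holds the stored schema description and columns holds the column names with their data types, both read back from the catalog rather than from the data itself.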
Casual users and persons with an occasional need for information from the database interact
using some form of interactive query interface. The queries are parsed and analyzed for
correctness of the operations for the model, the names of the data elements, and so on by a query
compiler that compiles them into an internal form. The internal query is then subjected to query
optimization. The query optimizer is concerned with the rearrangement and possible reordering
of operations and the elimination of redundancies.
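One way to see the result of query compilation and optimization is to ask a DBMS for the plan it chose. The sketch below does this with SQLite's EXPLAIN QUERY PLAN through Python's sqlite3 module; the employee table and the index name are assumptions for illustration, and the wording of the plan text differs between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employee (emp_id INTEGER PRIMARY KEY, dept TEXT, salary REAL)")
conn.execute("CREATE INDEX idx_employee_dept ON employee (dept)")

# EXPLAIN QUERY PLAN reports the plan chosen by the optimizer without running the query.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT emp_id FROM employee WHERE dept = ?", ("IS&E",)
).fetchall()

# The last column of each row is a human-readable description of one plan step.
plan_steps = [row[-1] for row in plan]
```

Printing plan_steps shows whether the optimizer decided to search through the index or scan the whole table.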

Application programmers write programs in host languages. The precompiler extracts DML
commands from an application program.
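The extraction step can be sketched as a simple text scan. The example assumes the common embedded-SQL convention in which each statement starts with EXEC SQL and ends with a semicolon; the host program text and the extract_dml function are hypothetical, and a real precompiler does considerably more (for example, replacing each statement with calls into the DBMS library).

```python
import re

# A hypothetical C-like host program with embedded SQL marked by the EXEC SQL prefix.
HOST_PROGRAM = """
int main(void) {
    int total = 0;
    EXEC SQL SELECT COUNT(*) INTO :total FROM employee;
    EXEC SQL UPDATE employee
             SET salary = salary * 1.1 WHERE dept = 'IS';
    return 0;
}
"""

# Match everything between "EXEC SQL" and the next semicolon, across line breaks.
EMBEDDED_SQL = re.compile(r"EXEC\s+SQL\s+(.*?);", re.DOTALL)


def extract_dml(source):
    """Return the embedded SQL statements found in the host-language source."""
    return [" ".join(stmt.split()) for stmt in EMBEDDED_SQL.findall(source)]


dml_statements = extract_dml(HOST_PROGRAM)
```

The extracted statements would go to the DML compiler, while the rest of the program goes to the host language compiler.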
Various Components of Database Management System (DBMS)
Query Processor
Machines understand only low-level language, so it is the task of the query
processor to convert users' queries into a series of low-level instructions. It then sends these
instructions to the database manager for execution. The query processor has several
components.
DDL Compiler:
It records DDL statements in a set of tables that make up the data dictionary. It
converts DDL statements from source form into object form.
DML Compiler:
It converts DML statements into low-level instructions that the query evaluation
engine can execute.
Query Evaluation Engine:
Queries compiled by the DML compiler are executed by the query
evaluation engine.
Database Manager:
The database manager provides the interface between the low-level
data, application programs, and queries. It is responsible for the backup and recovery operations
of the database. It also controls the consistency and integrity of data, and enforces security.
Authority and integrity manager:
The authority manager is responsible for checking the
authority of users to access and use information in the database. The integrity manager checks
integrity constraints such as keys (primary, unique, etc.).
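The integrity-checking half of this component can be demonstrated with a short sketch using SQLite through Python's sqlite3 module. The student table and sample rows are assumptions for illustration. Authority checking is not shown because SQLite has no user accounts.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE student ("
    " roll_no INTEGER PRIMARY KEY,"
    " email TEXT UNIQUE NOT NULL)"
)
conn.execute("INSERT INTO student VALUES (1, 'a@example.com')")

violations = []
# Each row breaks one constraint: duplicate primary key, duplicate unique value, missing value.
for row in [(1, "b@example.com"), (2, "a@example.com"), (3, None)]:
    try:
        conn.execute("INSERT INTO student VALUES (?, ?)", row)
    except sqlite3.IntegrityError as exc:
        # The DBMS rejects the row and reports which constraint failed.
        violations.append(str(exc))

row_count = conn.execute("SELECT COUNT(*) FROM student").fetchone()[0]
```

All three inserts are rejected, so the table still holds only the original row.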
Data Structure
The following data structures are required by a DBMS.

Data Files: These are the files that store the database.
Data Dictionary: The data dictionary carries data about data; that is, it stores the metadata
describing the structure of the data.
Access Aids: Access aids such as indexes are used to improve the performance of the database
management system. They make search operations very fast.
Statistical Data: It stores information that the query processor uses to choose the best
way to execute queries.
 Transaction Manager Component: It handles the atomicity and concurrency of transactions
and the consistency and durability of the database.
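Atomicity can be illustrated with a funds transfer that fails halfway. This is a minimal sketch using SQLite through Python's sqlite3 module; the account table, the CHECK constraint, and the transfer function are assumptions made for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE account (id TEXT PRIMARY KEY, balance REAL CHECK (balance >= 0))")
conn.executemany("INSERT INTO account VALUES (?, ?)", [("A", 100.0), ("B", 50.0)])
conn.commit()


def transfer(conn, src, dst, amount):
    """Move money between accounts; either both updates apply or neither does."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute("UPDATE account SET balance = balance + ? WHERE id = ?", (amount, dst))
            conn.execute("UPDATE account SET balance = balance - ? WHERE id = ?", (amount, src))
        return True
    except sqlite3.IntegrityError:
        return False


ok = transfer(conn, "A", "B", 500.0)  # would overdraw A, so the CHECK constraint fails
balances = dict(conn.execute("SELECT id, balance FROM account"))
```

The credit to B is applied first, but because the debit from A violates the constraint, the whole transaction is rolled back and both balances stay as they were.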
Query Processor Component –

Query Processor- The query processor interprets online user queries and converts them into
an efficient series of operations in a form that can be sent to the data manager for execution.
The query processor uses the data dictionary to find the structure of the relevant portion of the
database and uses this information to modify the query and prepare an optimal plan for accessing
the database. It is a program module that provides the interface between the database and the
application programs/queries. The Query Processor Components include –
1. Data Definition Language (DDL) Compiler- The DDL compiler takes data definition
statements in source form and converts them into object form; that is, it interprets DDL
commands and records them in the data dictionary.
2. Data Manipulation Language (DML) Compiler – translates DML commands into query
evaluation plans.
3. Query Evaluation Engine – executes queries according to the plans.
Storage Manager Component –

A Storage Manager is a component or program module that provides the interface between the
low-level data stored in the database and the application programs/queries submitted to
the system. The Storage Manager Components include –

1. File Manager- The file manager manages file space and takes care of the structure of
files. It manages the allocation of space on disk storage and the data structures used to
represent information stored on other media.
2. Buffer Manager – It transfers blocks between disk (or other devices) and main memory.
DMA (Direct Memory Access) is a form of input/output that controls the exchange of
blocks. When the processor receives a request to transfer a block, it passes the request to
the DMA controller, which then transfers the block uninterrupted.
3. Disk Manager- The disk manager transfers the blocks requested by the file
manager.
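The buffer manager's role of keeping recently used blocks in main memory can be sketched with a toy buffer pool. The BufferPool class, the dictionary standing in for the disk, and the least-recently-used (LRU) replacement policy are assumptions for illustration; the text above does not name a replacement policy, and writing modified blocks back to disk is left out.

```python
from collections import OrderedDict


class BufferPool:
    """Toy buffer manager: caches disk blocks in memory with LRU replacement."""

    def __init__(self, disk, capacity):
        self.disk = disk              # dict block_id -> bytes, standing in for the disk
        self.capacity = capacity      # maximum number of blocks held in main memory
        self.frames = OrderedDict()   # block_id -> block contents, oldest first
        self.disk_reads = 0

    def get_block(self, block_id):
        if block_id in self.frames:
            self.frames.move_to_end(block_id)   # mark as most recently used
            return self.frames[block_id]
        if len(self.frames) >= self.capacity:
            self.frames.popitem(last=False)     # evict the least recently used block
        self.disk_reads += 1
        self.frames[block_id] = self.disk[block_id]
        return self.frames[block_id]


disk = {n: f"block-{n}".encode() for n in range(4)}
pool = BufferPool(disk, capacity=2)
for block_id in [0, 1, 0, 2, 1]:
    pool.get_block(block_id)
```

Of the five requests, the second request for block 0 is served from memory, so only four disk reads are needed.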
The Structures maintained by Storage manager are-

1. Data Files- Data files contain the data portion of the database.
2. Data Dictionary- A DBMS must provide a data dictionary function. The dictionary contains
data about the data rather than just raw data. Information about attributes, entities,
mappings, and cross-references is contained in the data dictionary.
3. Indices or Indexing and Access Aids – An index is a small table with two columns: the
first column contains a copy of the primary or candidate key of a table, and the
second column contains a set of pointers holding the address of the disk block where that
particular key value can be found. The advantage of using indices is that they make
search operations very fast.
In a database system, a set of access aids in the form of indexes is usually provided to
improve the performance of the database system.
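The two-column index described above can be sketched as a sorted list of (key, block number) pairs that is searched with binary search. The blocks dictionary, the sample records, and the lookup function are hypothetical; real DBMS indexes are usually tree or hash structures stored on disk.

```python
import bisect

# Toy "disk": block number -> list of (roll_no, name) records.
blocks = {
    0: [(17, "Asha"), (42, "Ravi")],
    1: [(8, "Kiran"), (23, "Meena")],
}

# Build the index: sorted (key, block number) pairs, i.e. the two columns of the index.
index = sorted((key, block_no) for block_no, recs in blocks.items() for key, _ in recs)
index_keys = [key for key, _ in index]


def lookup(roll_no):
    """Binary-search the index, then read only the one block it points to."""
    pos = bisect.bisect_left(index_keys, roll_no)
    if pos == len(index_keys) or index_keys[pos] != roll_no:
        return None
    block_no = index[pos][1]
    return next(rec for rec in blocks[block_no] if rec[0] == roll_no)
```

Without the index, finding roll number 23 would mean reading every block; with it, lookup(23) goes straight to block 1.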
Database System Utilities:

DBMSs have database utilities that help the DBA manage the database system. Common
utilities have the following types of functions:
Loading:
A loading utility is used to load existing data files such as text files or sequential files
into the database. Usually, the current (source) format of the data file and the desired (target)
database file structure are specified to the utility, which then automatically reformats the data
and stores it in the database.
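A loading step of this kind can be sketched in a few lines. The example reads CSV text, converts each field to the target column type, and stores the rows in SQLite through Python's sqlite3 module; the file contents, the student table, and its columns are assumptions for illustration.

```python
import csv
import io
import sqlite3

# Source data in CSV text form; io.StringIO stands in for an existing data file.
source_file = io.StringIO("roll_no,name,marks\n1,Asha,81.5\n2,Ravi,74\n")

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE student (roll_no INTEGER PRIMARY KEY, name TEXT, marks REAL)")

# Reformat each text row into the target column types, then store it.
reader = csv.DictReader(source_file)
rows = [(int(r["roll_no"]), r["name"], float(r["marks"])) for r in reader]
with conn:
    conn.executemany("INSERT INTO student VALUES (?, ?, ?)", rows)

loaded = conn.execute("SELECT roll_no, name, marks FROM student ORDER BY roll_no").fetchall()
```

The source format (CSV columns) and the target structure (the table definition) are the two descriptions a loading utility needs.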
Backup:

A backup utility creates a backup copy of the database, usually by dumping the entire
database onto tape or other mass storage medium. The backup copy can be used to restore the
database in case of catastrophic disk failure. Incremental backups are also often used, where only
changes since the previous backup are recorded. Incremental backup is more complex, but saves
storage space.
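A full backup and restore can be sketched with the backup method of Python's sqlite3 connections (available since Python 3.7). The student table is an assumption for illustration, and the in-memory target stands in for a backup file on separate storage. Incremental backup is not shown.

```python
import sqlite3

source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE student (roll_no INTEGER PRIMARY KEY, name TEXT)")
source.execute("INSERT INTO student VALUES (1, 'Asha')")
source.commit()

# Connection.backup copies the whole database into another connection.
# A real backup would target a file on separate storage; ":memory:" keeps the sketch self-contained.
backup_copy = sqlite3.connect(":memory:")
source.backup(backup_copy)

# Simulate losing the original, then read the data back from the backup copy.
source.close()
restored = backup_copy.execute("SELECT roll_no, name FROM student").fetchall()
```

After the original connection is closed, the data is still available from the backup copy.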
Database storage reorganization:
This utility can be used to reorganize a set of database
files into different file organizations, and create new access paths to improve performance.
Performance monitoring:
Such a utility monitors database usage and provides statistics to
the DBA. The DBA uses the statistics in making decisions such as whether or not to reorganize
files or whether to add or drop indexes to improve performance.
Other utilities may be available for sorting files, handling data compression, monitoring access
by users, interfacing with the network, and performing other functions.
Tools, Application Environments, and Communications Facilities:
 CASE Tools – used in the design phase to help speed up the development process.
 Data dictionary system – stores catalog information about schemas and constraints, as
well as design decisions, usage standards, application program descriptions, and user
information. Also called an information repository. It can be accessed directly by the DBA or
users when needed.
 Application development environments – (e.g., JBuilder) provide an environment for
developing database applications, and include facilities that help in database design, GUI
development, querying and updating, and application development.
 Communication software – allows users at remote locations to access the database
through computer terminals, workstations, or personal computers. These are connected to the
database through data communications hardware such as phone lines and local area
networks.

