You are on page 1of 19

Team:DBMS

Distributed Database
By Navdeep Kaur
Assistant Professor
Computer Science Department
Distributed Database
Distributed Database

 A distributed database is basically a database that is


not limited to one system, it is spread over
different sites, i.e, on multiple computers or over
a network of computers
A distributed database system is located on various
sited that don’t share physical components
Need of Distributed Database

  This maybe required when a particular database


needs to be accessed by various users globally
Distributed Database
Distributed Database
Distributed Data Storage
There are 2 ways in which data can be stored on
different sites 
  Replication
 Fragmentation
Distributed Database
Distributed Data Storage
  Replication
If the entire database is available at all sites, it is a fully redundant database
Hence, in replication, systems maintain copies of data

Advantageous
It increases the availability of data at different sites
Also, now query requests can be processed in parallel
Disadvantages
Data needs to be constantly updated
Concurrency control becomes way more complex as concurrent access now
needs to be checked over a number of sites
Distributed Database
Fragmentation
•In this approach, the relations are fragmented (i.e., they’re divided into smaller
parts) and each of the fragments is stored in different sites where they’re required
Advantage
Fragmentation is advantageous as it doesn’t create copies of data, consistency is
not a problem
Fragmentation of relations can be done in two ways:
Horizontal fragmentation – Splitting by rows – The relation is fragmented into
groups of tuples so that each tuple is assigned to at least one fragment
Vertical fragmentation – Splitting by columns – The schema of the relation is
divided into smaller schemas
Each fragment must contain a common candidate key so as to ensure lossless join
Types Distributed Database
Types Distributed Database
Homogeneous Distributed Databases
In a homogeneous distributed database, all the sites use identical DBMS
and operating systems
There are two types of homogeneous distributed database −
Autonomous − Each database is independent that functions on its own
They are integrated by a controlling application and use message
passing to share data updates.
Non-autonomous − Data is distributed across the homogeneous nodes
and a central or master DBMS co-ordinates data updates across the
sites.
Architecture of Distributed Database
Architecture of Distributed Database
Client - Server Architecture for
DDBMS
• This is a two-level architecture
where the functionality is
divided into servers and clients
• The two different client - server
architecture are −
• Single Server Multiple Client
• Multiple Server Multiple Client
Architecture of Distributed Database

Peer- to-Peer Architecture


for DDBMS
In these systems, each peer
acts both as a client and a
server for imparting
database services
Architecture of Distributed Database

Multi - DBMS Architectures


• This is an integrated database system formed by a collection
of two or more autonomous database systems
• There are two design alternatives for multi-DBMS −
• Model with multi-database conceptual level
• Model without multi-database conceptual level
Types Distributed Database
Heterogeneous Distributed Databases
In a heterogeneous distributed database, different sites have
different operating systems, DBMS products and data models
Types of Heterogeneous Distributed Databases
• Federated − The heterogeneous database systems are
independent in nature and integrated together so that they
function as a single database system.
• Un-federated − The database systems employ a central
coordinating module through which the databases are accessed.
Distributed Database
Features
It ensures that the data modified at any site is
universally updated
It is used in application areas where large
volumes of data are processed and accessed
by numerous users simultaneously
It maintains confidentiality and data integrity
of the databases
Advantages Distributed Database
Increased availability –
Availability is defined as the probability that the system is continuously
available during a time interval
When the data and DBMS software are distributed over several sites one site
may fail while other sites continue to operate
Easier Expansion –
In a distributed environment expansion of the system in terms of adding
more data, increasing database sizes, or adding more data, increasing
database sizes or adding more processor is much easier
Improved Performance –
By executing multiple queries at different sites by breaking up a query
into a number of subqueries that basically executes in parallel which
basically leads to improvement in performance
Advantages Distributed Database
More Reliable 
− Reliability is basically defined as the probability that a system is running at
a certain time
In case of database failures, the total system of centralized databases comes to
a halt
However, in distributed systems, when a component fails, the functioning of
the system continues may be at a reduced performance. Hence DDBMS is
more reliable
Better Response − If data is distributed in an efficient manner, then user
requests can be met from local data itself, thus providing faster response.
Lower Communication Cost −
In distributed database systems, if data is located locally where it is mostly
used, then the communication costs for data manipulation can be
minimized
Be Passionate and Be creative

You might also like