Professional Documents
Culture Documents
Part-2-
Promises of DDBSs
• Transparent management of distributed,
fragmented, and replicated data
• Improved reliability and availability through
distributed transactions
• Improved performance
• Easier and more economical system
expansion
Transparency
• Transparency refers to separation of the high-level semantics
of a system from low-level implementation details.
• The goal of transparency is to provide a system where users
do not have to worry about representation, fragmentation,
location, and replication of data.
• Fundamental issue is to provide data independence in the
distributed environment
• Various forms of transparency can be distinguished for
DDBSs:
– Network transparency (also called distribution transparency)
• Location transparency
• Naming transparency
– Replication transparency
– Fragmentation transparency
• Horizontal fragmentation: selection
• Vertical fragmentation: projection
• hybrid
Transparency (cont.)
• Network (or distribution) transparency allows a user to perceive a
DDBS as a single, logical entity
• The user is protected from the operational details of the network (or
even does not know about the existence of the network)
• Two types of network transparency can be identified: location
transparency and naming transparency.
• The user does not need to know the location of data items and a
command used to perform a task is independent from the location of
the data and the site the task is performed (location transparency)
• A unique name is provided for each object in the database (naming
transparency)
• In absence of this, users are required to embed the location name
as part of an identifier
Transparency (cont.)
• Replication transparency ensures that the user is not involved in
the managment of copies of some data
The user should even not be aware about the existence of replicas,
rather should work as if there exists a single copy of the data
Replication of data is needed for various reasons
e.g., increased efficiency for read-only data access
DDistributed Database
Distributed DBMS – Reality
DBMS
User Query
Software
User Application
DBMS
Software
Communication
DBMS Subsystem
soSoftware
User Query
User Query
ASG
– intra-query parallelism
• Intra-query parallelism is a form of parallelism in the evaluation of
database queries, in which a single query is decomposed into
smaller tasks that execute concurrently on multiple processors.
Physical storage
Physical storage London
New York
Rule 5: Fragmentation Independence
(Transparency)
Users should be able to behave (at least from a logical
standpoint) as if the data were in fact not fragmented at
all.
Replication is
desirable for at
least two reasons Replica of New York Segment
§ Performance Replica of London Segment
§ Availability
Rule 6: Replication Independence
(Transparency) (cont.)
User should be able to behave as if the data were in fact
not replicated at all.
• User should be able to behave as if the data were in fact not
replicated at all.
• Replication, like fragmentation, should be ``transparent to the
user'‘.
• Update propagation problem
• Replication independence (like location and fragmentation
independence) is desirable because it simplifies user
programs and terminal activities.
Rule 7: Distributed Query Processing
It is crucially important for distributed database systems
to choose a good strategy for distributed query processing.
Directory Management
Concurrency Control
Deadlock Management
Question & Answer