Professional Documents
Culture Documents
Chapter 10
Distributed Database
Management Systems
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 2
10
Evolution of DDBMS
• Decentralized database management systems (DDBMS)
– Interconnected computer systems
– Data/processing functions reside on multiple sites
• 1970’s: Centralized DBMS
• 1980’s: Social and Technical Changes
– Ad hoc capability required
– Decentralized management structure common
• 1990’s: New forces
– Internet and the World Wide Web used for data access and
distribution
– Data analysis through data mining and data warehousing
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 3
10
DDBMS Advantages
• Data located near site with greatest demand
• Faster data access
• Faster data processing
• Growth facilitation
• Improved communications
• Reduced operating costs
• User-friendly interface
• Less danger of single-point failure
• Processor independence
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 4
10
DDBMS Disadvantages
• Complexity of management and control
• Security
• Lack of standards
• Increased storage requirements
• Greater difficulty in managing data environment
• Increased training costs
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 5
10
Distributed Processing
Shares database’s logical processing among
physically, networked independent sites
Figure 10.1
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 6
10
Distributed Database
Stores logically related database over physically
independent sites
Figure 10.2
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 7
10
Distributed Database
vs. Distributed Processing
• Distributed processing
– Does not require distributed database
– May be based on a single database on single
computer
– Copies or parts of database processing functions
must be distributed to all data storage sites
• Distributed database
– Requires distributed processing
• Both
– Require a network to connect components
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 8
10
Functions of DDBMS
• Application/end user interface
• Validation to analyze data requests
• Transformation to determine request components
• Query optimization to find the best access strategy
• Mapping to determine the data location
• I/O interface to read or write data
• Formatting to prepare the data for presentation
• Security to provide data privacy
• Backup and recovery
• DB Administration
• Concurrency Control
• Transaction Management
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 9
10
Centralized Database
Figure 10.3
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 10
10
Fully Distributed Database
Management System
Figure 10.4 11
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel
10
DDBMS Components
• Computer workstations
• Network hardware and software components
• Communications media
• Transaction processor (TP)
– Also called application manager (AP) or
transaction manager (TM)
• Data processor (DP)
– Also called data manager (DM)
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 12
10
Figure 10.5
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 13
10
DDBMS Protocols
• Interface with network to transport data and
commands between DPs and TPs
• Synchronize data received from DPs and route to
appropriate TPs
• Ensure common database functions
– Security
– Concurrency control
– Backup and recovery
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 14
10
Table 10.1
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 15
10
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 16
10
Figure 10.6
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 17
10
Multiple-Site Processing, Single-Site
Data (MPSD)
• Requires network file server
• Applications accessed through LAN
• Variation known as client/server architecture
Figure 10.7
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 18
10
Multiple-Site Processing,
Multiple-Site Data (MPMD)
• Fully distributed DDBMS with support for multiple
DPs and TPs at multiple sites
– Homogeneous I
• Integrate one type of centralized DBMS over the
network
– Heterogeneous
• Integrate different types of centralized DBMSs over a
network
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 19
10
Heterogeneous Distributed Database
Scenario
Figure 10.8
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 20
10
Distributed DB Transparency
• Allows end users to feel like only database user
• Hides complexities of distributed database
• Transparency features
– Distribution
– Transaction
– Failure
– Performance
– Heterogeneity
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 21
10
Distribution Transparency
• Allows management of a physically dispersed
database as though it were centralized
• Three Levels
– Fragmentation transparency
– Location transparency
– Local mapping transparency
Table 10.2
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 22
10
Transaction Transparency
• Ensures transactions maintain integrity and
consistency
• Completed only if all involved database sites
complete their part of the transaction
• Management mechanisms
– Remote request
– Remote transaction
– Distributed transaction
– Distributed request
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 23
10
Remote Request
Figure 10.10
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 24
10
Remote Transaction
Figure 10.11
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 25
10
Distributed Transaction
Figure 10.12
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 26
10
Distributed Requests
Figure 10.13
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 27
10
Figure 10.14
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 28
10
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 29
10
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 30
10
Performance Transparency
and Query Optimization
• Objective: Minimize total cost associated with execution
of request
• Main costs
– Access time
– Communication
– CPU time
• Basis for query optimization algorithms
– Optimum execution order
– Sites accessed to minimize communication costs
• Dynamic or static optimization
• Statistically based vs. rule-based query optimization
algorithms
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 31
10
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 32
10
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 33
10
Client/Server Disadvantages
• Creates more complex environment with different
platforms
• Increased number of users and sites creates
security problems
• Training issues become more complex and
expensive
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 34
10
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 35
10
Database Systems: Design, Implementation, & Management, 5 th Edition, Rob & Coronel 36