Professional Documents
Culture Documents
FDMS - Chapter Seven
FDMS - Chapter Seven
3
Data Recovery
Introduction
Recovery algorithms
Recovery concepts
◦ Write-ahead logging
◦ In-place versus shadow updates
◦ Rollback
◦ Deferred update
◦ Immediate update
Certain recovery techniques best used with
specific concurrency control methods
5
Database Recovery
Purpose of Database Recovery
◦ To bring the database into the last consistent state, which
existed prior to the failure.
◦ To restored the database to the most recent consistent
state just before the failure.
◦ To preserve transaction properties (Atomicity,
Consistency, Isolation and Durability).
Example:
◦ If the system crashes before a fund transfer transaction
completes its execution, then either one or both
accounts may have incorrect value. Thus, the database
must be restored to the state before the transaction
modified any of the accounts.
Slide 19- 6
Types of Failure
The database may become unavailable for use due to
A hardware or software error occurs during transaction execution. System may fail because
of addressing error, application error, operating system fault, RAM failure, etc.
Transaction failure:
Transactions may fail because of Some operation error such as integer overflow or division
by zero.
synchronization.
8
Recovery Concepts (cont’d.)
Deferred update techniques
◦ Do not physically update the database until after
transaction commits
◦ Undo is not needed; redo may be needed
Immediate update techniques
◦ Database may be updated by some operations of
a transaction before it reaches commit point
◦ Operations also recorded in log
◦ Recovery still possible
9
Recovery Concepts (cont’d.)
Undo and redo operations required to be
idempotent
◦ Executing operations multiple times equivalent to
executing just once
◦ Entire recovery process should be idempotent
Caching (buffering) of disk blocks
◦ DBMS cache: a collection of in-memory buffers
◦ Cache directory keeps track of which database
items are in the buffers
10
Recovery Concepts (cont’d.)
Cache buffers replaced (flushed) to make
space for new items
Dirty bit associated with each buffer in the
cache
◦ Indicates whether the buffer has been modified
Contents written back to disk before flush if
dirty bit equals one
Pin-unpin bit
◦ Page is pinned if it cannot be written back to disk
yet
11
Recovery Concepts (cont’d.)
Main strategies
◦ In-place updating
Writes the buffer to the same original disk location
Overwrites old values of any changed data items
◦ Shadowing
Writes an updated buffer at a different disk location, to
maintain multiple versions of data items
Not typically used in practice
Before-image: old value of data item
After-image: new value of data item
12
Recovery Concepts (cont’d.)
Write-ahead logging
◦ Ensure the before-image (BFIM) is recorded
◦ Appropriate log entry flushed to disk
◦ Necessary for UNDO operation if needed
UNDO-type log entries
REDO-type log entries
13
Recovery Concepts (cont’d.)
Steal/no-steal and force/no-force
◦ Specify rules that govern when a page from the
database cache can be written to disk
No-steal approach
◦ Cache buffer page updated by a transaction
cannot be written to disk before the transaction
commits
Steal approach
◦ Recovery protocol allows writing an updated
buffer before the transaction commits
14
Recovery Concepts (cont’d.)
Force approach
◦ All pages updated by a transaction are
immediately written to disk before the
transaction commits
◦ Otherwise, no-force
Typical database systems employ a steal/no-
force strategy
◦ Avoids need for very large buffer space
◦ Reduces disk I/O operations for heavily updated
pages
15
Recovery Concepts (cont’d.)
Write-ahead logging protocol for recovery
algorithm requiring both UNDO and REDO
◦ BFIM of an item cannot be overwritten by its after
image until all UNDO-type log entries have been
force-written to disk
◦ Commit operation of a transaction cannot be
completed until all REDO-type and UNDO-type
log records for that transaction have been force-
written to disk
16
Checkpoints in the System Log and Fuzzy Checkpointing
Taking a checkpoint
◦ Suspend execution of all transactions temporarily
◦ Force-write all main memory buffers that have been
modified to disk
◦ Write a checkpoint record to the log, and force-write
the log to the disk
◦ Resume executing transactions
DBMS recovery manager decides on checkpoint interval
Fuzzy checkpointing
◦ System can resume transaction processing after a
begin_checkpoint record is written to the log
◦ Previous checkpoint record maintained until
end_checkpoint record is written
17
Transaction Rollback
Transaction failure after update but before
commit
◦ Necessary to roll back the transaction
◦ Old data values restored using undo-type log
entries
Cascading rollback
◦ If transaction T is rolled back, any transaction S
that has read value of item written by T must also
be rolled back
◦ Almost all recovery mechanisms designed to
avoid this
18
Figure 22.1 Illustrating cascading rollback (a process that never occurs
in strict or cascadeless
schedules) (a) The read and write operations of three transactions (b) System log
at point of crash (c) Operations before the crash 19
Transactions that Do Not Affect the Database
20
NO-UNDO/REDO Recovery Based on Deferred Update
Deferred update concept
◦ Postpone updates to the database on disk until the transaction
completes successfully and reaches its commit point
◦ Redo-type log entries are needed
◦ Undo-type log entries not necessary
◦ Can only be used for short transactions and transactions that
change few items
Buffer space an issue with longer transactions
Deferred update protocol
◦ Transaction cannot change the database on disk until it reaches
its commit point
All buffers changed by the transaction must be pinned until the
transaction commits (no-steal policy)
◦ Transaction does not reach its commit point until all its REDO-
type log entries are recorded in log and log buffer is force-
written to disk
21
NO-UNDO/REDO Recovery Based
on Deferred Update (cont’d.)
23
Figure 22.3 An example of recovery using deferred update with concurrent
transactions (a) The READ and WRITE operations of four transactions (b)
System log at the point of crash 24
Shadow Paging
No log required in a single-user environment
◦ Log may be needed in a multiuser environment
for the concurrency control method
Shadow paging considers disk to be made of
n fixed-size disk pages
◦ Directory with n entries is constructed
◦ When transaction begins executing, directory
copied into shadow directory to save while
current directory is being used
◦ Shadow directory is never modified
25
Shadow Paging (cont’d.)
New copy of the modified page created and
stored elsewhere
◦ Current directory modified to point to new disk
block
◦ Shadow directory still points to old disk block
Failure recovery
◦ Discard current directory
◦ Free modified database pages
◦ NO-UNDO/NO-REDO technique
26
Shadow Paging (cont’d.)
27
Database Backup and Recovery from Catastrophic Failures
Database backup
◦ Entire database and log periodically copied onto
inexpensive storage medium
◦ Latest backup copy can be reloaded from disk in case
of catastrophic failure
Backups often moved to physically separate
locations
◦ Subterranean storage vaults
Backup system log at more frequent intervals
and copy to magnetic tape
◦ System log smaller than database
Can be backed up more frequently
◦ Benefit: users do not lose all transactions since last
database backup
28
Summary
Main goal of recovery
◦ Ensure atomicity property of a transaction
Caching
In-place updating versus shadowing
Before and after images of data items
UNDO and REDO operations
Deferred versus immediate update
Shadow paging
Catastrophic failure recovery
29
Concurrency
Introduction
Concurrency control protocols
◦ Set of rules to guarantee serializability
Two-phase locking protocols
◦ Lock data items to prevent concurrent access
Timestamp
◦ Unique identifier for each transaction
31
Two-Phase Locking Techniques for Concurrency Control
Lock
◦ Variable associated with a data item describing
status for operations that can be applied
◦ One lock for each item in the database
Binary locks
◦ Two states (values)
Locked (1)
Item cannot be accessed
Unlocked (0)
Item can be accessed when requested
32
Two-Phase Locking Techniques for Concurrency Control
(cont’d.)
Transaction requests access by issuing a
lock_item(X) operation
34
Figure 21.2 Locking and unlocking operations for two-mode (read/write, or
shared/exclusive) locks 35
Two-Phase Locking Techniques for Concurrency Control
(cont’d.)
Lock conversion
◦ Transaction that already holds a lock allowed to
convert the lock from one state to another
Upgrading
◦ Issue a read_lock operation then a write_lock
operation
Downgrading
◦ Issue a read_lock operation after a write_lock
operation
36
Guaranteeing Serializability by Two-Phase Locking
37
Figure 21.3 Transactions that do not obey two-phase locking (a) Two transactions
T1 and T2 (b) Results of possible serial schedules of T1 and T2 (c) A
nonserializable schedule S that uses locks
38
Guaranteeing Serializability by Two-Phase Locking
39
Variations of Two-Phase Locking
Basic 2PL
◦ Technique described on previous slides
Conservative (static) 2PL
◦ Requires a transaction to lock all the items it
accesses before the transaction begins
Predeclare read-set and write-set
◦ Deadlock-free protocol
Strict 2PL
◦ Transaction does not release exclusive locks until
after it commits or aborts
40
Variations of Two-Phase Locking (cont’d.)
Rigorous 2PL
◦ Transaction does not release any locks until after
it commits or aborts
Concurrency control subsystem responsible
for generating read_lock and write_lock
requests
Locking generally considered to have high
overhead
41
Dealing with Deadlock and Starvation
Deadlock
◦ Occurs when each transaction T in a set is waiting
for some item locked by some other transaction
T’
◦ Both transactions stuck in a waiting queue
Figure 21.5 Illustrating the deadlock problem (a) A partial schedule of T1′ and T2′ that
is in a state of deadlock (b) A wait-for graph for the partial schedule in (a)
42
Dealing with Deadlock and Starvation (cont’d.)
Deadlock prevention protocols
◦ Every transaction locks all items it needs in advance
◦ Ordering all items in the database
Transaction that needs several items will lock them in that order
◦ Both approaches impractical
Protocols based on a timestamp
◦ Wait-die
◦ Wound-wait
No waiting algorithm
◦ If transaction unable to obtain a lock, immediately aborted
and restarted later
Cautious waiting algorithm
◦ Deadlock-free
43
Dealing with Deadlock and Starvation (cont’d.)
Deadlock detection
◦ System checks to see if a state of deadlock exists
◦ Wait-for graph
Victim selection
◦ Deciding which transaction to abort in case of
deadlock
Timeouts
◦ If system waits longer than a predefined time, it
aborts the transaction
Starvation
◦ Occurs if a transaction cannot proceed for an
indefinite period of time while other transactions
continue normally
◦ Solution: first-come-first-served queue
44
Concurrency Control Based on Timestamp Ordering
Timestamp
◦ Unique identifier assigned by the DBMS to identify a transaction
◦ Assigned in the order submitted
◦ Transaction start time
Concurrency control techniques based on timestamps do
not use locks
◦ Deadlocks cannot occur
Generating timestamps
◦ Counter incremented each time its value is assigned to a
transaction
◦ Current date/time value of the system clock
Ensure no two timestamps are generated during the same tick of the
clock
General approach
◦ Enforce equivalent serial order on the transactions based on
their timestamps
45
Concurrency Control Based on Timestamp Ordering
Timestamp ordering (TO)
◦ Allows interleaving of transaction operations
◦ Must ensure timestamp order is followed for each pair of
conflicting operations
Each database item assigned two timestamp values
◦ read_TS(X)
◦ write_TS(X)
Basic TO algorithm
◦ If conflicting operations detected, later operation rejected
by aborting transaction that issued it
◦ Schedules produced guaranteed to be conflict serializable
◦ Starvation may occur
Strict TO algorithm
◦ Ensures schedules are both strict and conflict serializable
46
Concurrency Control Based on Timestamp Ordering
47
Summary
Concurrency control techniques
◦ Two-phase locking
◦ Timestamp-based ordering
48
Data Security
Outline
Database Security and Authorization
Introduction to Database Security Issues
Types of Security
Database Security and DBA
Access Protection, User Accounts, and Database Audits
Discretionary Access Control Based on
Granting Revoking Privileges
Types of Discretionary Privileges
Specifying Privileges Using Views
Revoking Privileges
Propagation of Privileges Using the GRANT OPTION
Specifying Limits on Propagation of Privileges
Introduction to Database Security Issues
Database
The collection of data or information
Security
The State of being free from danger or injury.
Database Security :
◦ Securing databases against a variety of threats.
◦ Securing the database from any kinds of damage
or attack
◦ Preventing database access privileges to
unauthorized users.
Introduction to Database Security Issues
Types of Security
Legal and ethical issues
regarding the right to access certain information.
Policy issues
at the governmental, institutional, or corporate level as
to what kinds of information should not be made
publicly available
System-related issues
the system levels at which various security functions
should be enforced.
The need to identify multiple security levels
categorize the data and users based on these
classifications, For example top secret, secret,
confidential, and unclassified
Introduction to Database Security Issues
Threats to databases
Loss of confidentiality
Database confidentiality refers to the protection of data from
unauthorized disclosure
Loss of integrity
Database integrity refers to the requirement that information be
protected from improper modification
Loss of availability
Database availability refers to making objects available to a human
user or a program to which they have a legitimate right
To protect databases against these types of
threats four kinds of countermeasures can
be implemented:
Access control
Inference control
Flow control
Encryption
Introduction to Database Security Issues
A DBMS typically includes a database security and
authorization subsystem that is responsible for ensuring the
security portions of a database against unauthorized access.
66