0% found this document useful (0 votes)

23 views96 pages

Transaction Processing

This document introduces transaction processing concepts, detailing the structure and properties of transactions, including their execution and recovery mechanisms. It outlines the importance of concurrency control, the ACID properties of transactions, and the classification of schedules based on recoverability. Additionally, it discusses the components of a transaction log and the significance of commit points in ensuring data integrity during transaction processing.

Uploaded by

aishanyaz123

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views96 pages

Transaction Processing

Uploaded by

aishanyaz123

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

UNIT V

Introduction to Transaction
Processing Concepts and Theory
Chapter Outline
1 Introduction to Transaction Processing
2 Transaction and System Concepts
3 Desirable Properties of Transactions
4 Characterizing Schedules based on
Recoverability
5 Characterizing Schedules based on Serializability

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 2

Introduction to Transaction Processing
■ A Transaction

■ Logical unit of database processing

■ includes one or more access operations
(read -retrieval, write - insert or update, delete).

■ A transaction (set of operations) may be stand-alone

specified in a high level language like SQL submitted
interactively, or may be embedded within a program.

■ Transaction boundaries:
■ Begin and End transaction.

■ An application program may contain several

transactions separated by the Begin and End transaction
boundaries.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Transaction - Example
Suppose a bank employee transfers Rs 500 from A's account to B's
account. This very simple and small transaction involves several low-
level tasks

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

SIMPLE MODEL OF A DATABASE

■ A database is a collection of named data items

■ Granularity of data - a field, a record , or a whole disk

block (Concepts are independent of granularity)

■ Basic operations are read and write

■read_item(X): Reads a database item named X into a
program variable
■write_item(X): Writes the value of program variable X
into the database item named A.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 5

Problems in Transaction Processing

■Concurrency Control in multi-user system

■Recovery techniques - failure

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Introduction to Transaction Processing

■Single-User System
■At most one user at a time can use the system.
■Multiuser System:
■Many users can access the system concurrently.
■Concurrency
■Interleaved processing:
■Concurrent execution of processes is interleaved
in a single CPU
■Parallel processing:
■Processes are concurrently executed in multiple
CPUs.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 7

Introduction to Transaction Processing
READ AND WRITE OPERATIONS:
■ Basic unit of data transfer from the disk to the computer
main memory is one block.
■ read_item(X) command includes the following steps:

■ Find the address of the disk block that contains item

X.
■ Copy that disk block into a buffer in main memory

■ Copy item X from the buffer to the program variable

named X.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 8

Introduction to Transaction Processing
READ AND WRITE OPERATIONS

■ write_item(X) command includes the following steps:

■ Find the address of the disk block that contains item A.

■ Copy that disk block into a buffer in main memory

■ Copy item X from the program variable named X into its

correct location in the buffer.

■ Store the updated block from the buffer back to disk (either
immediately or at some later point in time).

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 9

Two sample transactions
■FIGURE 17.2 Two sample transactions:
■(a) Transaction T1
■(b) Transaction T2

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 10

Why Concurrency Control is needed
■ The Lost Update Problem
■ This occurs when two transactions that access the same database
items have their operations interleaved in a way that makes the
value of some database item incorrect.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 11

Why Concurrency Control is needed
The Temporary Update (or Dirty Read) Problem

■ This occurs when one transaction updates a database item and

then the transaction fails for some reason
■ The updated item is accessed by another transaction before it is
changed back to its original value.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Why Concurrency Control is needed
The Incorrect Summary Problem
■ If one transaction is calculating an aggregate summary function on a
number of records while other transactions are updating some of
these records, the aggregate function may calculate some values
before they are updated and others after they are updated.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 13

Why Recovery is needed
What causes a Transaction to fail

1. A computer failure (system crash):

A hardware or software error occurs in the computer system
during transaction execution. If the hardware crashes, the
contents of the computer’s internal memory may be lost.
2. A transaction or system error:
Some operation in the transaction may cause it to fail
■ integer overflow or division by zero.
■ logical programming error.
■ The user may interrupt the transaction during its
execution.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 14

Why Recovery is needed
What causes a Transaction to fail
3. Local errors or exception conditions detected by the
transaction:
■ Certain conditions necessitate cancellation of the
transaction.
■ insufficient account balance in a banking database,
■ A programmed abort in the transaction causes it to fail.
4. Concurrency control enforcement:
The concurrency control method may decide to abort the
transaction, to be restarted later, because it violates
serializability or because several transactions are in a state of
deadlock

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 15

Why Recovery is needed
What causes a Transaction to fail
5. Disk failure:
Some disk blocks may lose their data because of a
read or write malfunction or because of a disk
read/write head crash. This may happen during a
read or a write operation of the transaction.
6. Physical problems and catastrophes:
This refers to an endless list of problems that
includes power or air-conditioning failure, fire, theft,
sabotage, overwriting disks or tapes by mistake,
and mounting of a wrong tape by the operator.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 16

2 Transaction and System Concepts
■A transaction is an atomic unit of work that is
either completed in its entirety or not done at all.
■For recovery purposes, the system needs to
keep track of when the transaction starts,
terminates, and commits or aborts.
■Transaction states:
■Active state
■Partially committed state
■Committed state
■Failed state
■Terminated State

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 17

State transition diagram illustrating
the states for transaction execution

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 18

Recovery manager keeps track of the following
operations
■ begin_transaction: This marks the beginning of transaction
execution.
■ read or write: These specify read or write operations on the
database items that are executed as part of a transaction.
■ end_transaction: This specifies that read and write transaction
operations have ended and marks the end limit of transaction
execution.
■ commit_transaction: This signals a successful end of the
transaction so that any changes (updates) executed by the
transaction can be safely committed to the database and will not
be undone.
■ rollback (or abort): This signals that the transaction has ended
unsuccessfully, so that any changes or effects that the
transaction may have applied to the database must be undone.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 19

Recovery techniques use the following
operators
■ undo: Similar to rollback except that it applies to a
single operation rather than to a whole transaction.

■ redo: This specifies that certain transaction

operations must be redone to ensure that all the
operations of a committed transaction have been
applied successfully to the database.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 20

The System Log
■ Log or Journal: The log keeps track of all
transaction operations that affect the values of
database items.
■This information may be needed to permit recovery
from transaction failures.
■The log is kept on disk, so it is not affected by any
type of failure except for disk or catastrophic failure.
■In addition, the log is periodically backed up to
archival storage (tape) to guard against such
catastrophic failures.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 21

System Log Record
T in the following discussion refers to a unique transaction-id
that is generated automatically by the system and is used to
identify each transaction:

■ Types of log record:

■[start_transaction,T]: Records that transaction T has started
execution.
■[write_item,T,X,old_value,new_value]: Records that
transaction T has changed the value of database item X from
old_value to new_value.
■[read_item,T,X]: Records that transaction T has read the
value of database item X.
■[commit,T]: Records that transaction T has completed
successfully, and affirms that its effect can be committed
(recorded permanently) to the database.
■[abort,T]: Records that transaction T has been aborted.
22
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Commit Point of a Transaction

■ Definition a Commit Point:

■ A transaction T reaches its commit point when all its
operations that access the database have been executed
successfully and the effect of all the transaction operations on
the database has been recorded in the log.
■ Beyond the commit point, the transaction is said to be
committed, and its effect is assumed to be permanently
recorded in the database.
■ The transaction then writes an entry [commit,T] into the log.
■ Roll Back of transactions:
■ Needed for transactions that have a [start_transaction,T] entry
into the log but no commit entry [commit,T] into the log.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 23

Commit Point of a Transaction

■ Redoing transactions:
■ Transactions that have written their commit entry in the log must
also have recorded all their write operations in the log; otherwise
they would not be committed, so their effect on the database can
be redone from the log entries. (Notice that the log file must be
kept on disk.
■ At the time of a system crash, only the log entries that have been
written back to disk are considered in the recovery process
because the contents of main memory may be lost.)
■ Force writing a log:
■ Before a transaction reaches its commit point, any portion of the
log that has not been written to the disk yet must now be written
to the disk.
■ This process is called force-writing the log file before committing
a transaction.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 24

Desirable Properties of Transactions
ACID properties

■ Atomicity: A transaction is an atomic unit of processing; it is either

performed in its entirety or not performed at all.

■ Consistency preservation: A correct execution of the transaction

must take the database from one consistent state to another.

■ Isolation: A transaction should not make its updates visible to other

transactions until it is committed; this property, when enforced
strictly, solves the temporary update problem and makes cascading
rollbacks of transactions unnecessary

■ Durability or permanency: Once a transaction changes the

database and the changes are committed, these changes must
never be lost because of subsequent failure.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 25
Characterizing Schedules based on
Recoverability
■ Transaction schedule or history:
When transactions are executing concurrently in an interleaved
fashion, the order of execution of operations from the
various transactions forms what is known as a transaction
schedule (or history).

■ A schedule (or history) S of n transactions T1, T2, …, Tn:

■ It is an ordering of the operations of the transactions subject

to the constraint that, for each transaction Ti that participates
in S, the operations of T1 in S must appear in the same
order in which they occur in T1.

■ Note, however, that operations from other transactions Tj can

be interleaved with the operations of Ti in S.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 26

Schedule - Example

Sa : r1(X);r2(X);w1(X);r1(Y);w2(X);w1(Y);

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Conflict Schedule - Example
Two operations in a schedule are said
to conflict if they satisfy all three of the
following conditions

● they belong to different

transactions
● they access the same item X
● At least one of the operations is a
write_item(X)

Sa : r1(X);r2(X);w1(X);r1(Y);w2(X);w1(Y);

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Characterizing Schedules based on
Recoverability

■Recoverable schedule

■Cascadeless schedule

■Strict Schedules

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Characterizing Schedules based on
Recoverability
Schedules classified on recoverability:
■Recoverable schedule

A schedule S is recoverable if no
transaction T in S commits until all
transactions T’ that have written an item
that T reads have committed.

Example
Sa’: r1(X);r2(X);w1(X);r1(Y);w2(X);c2;w1(Y);c1;

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 30

Characterizing Schedules based on
Recoverability

Example (non recoverable schedule)

Sc:r1(X);w1(X);r2(X);r1(Y);w2(X);c2;a1;

Converting Sc recoverable
Sd: r1(X);w1(X);r2(X);r1(Y);w2(X);w1(Y);c1;c2;

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Characterizing Schedules based on
Recoverability
■Cascadeless schedule:

One where every transaction reads only the

items that are written by committed transactions.

Se:r1(X);w1(X);r2(X);r1(Y);w2(X);w1(X);a1;a2;

r2(X) command in the above schedule must

be postponed until after T1 has committed (or)
aborted

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 32

Characterizing Schedules based on
Recoverability
Schedules classified on recoverability

■Strict Schedules:

A schedule in which a transaction can

neither read or write an item X until the
last transaction that wrote X has
committed.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 33

Consider following schedule. Determine whether each
schedule is Strict, Cascadeless, Recoverable or non-
recoverable. Provide justification to your answer

1. R1(X), W1(X), R1(Y), W1(Y), C1, R2(X), W2(X), C2

2. R1(X), W1(X), R1(Y), W1(Y), R2(X), W2(X), C2, C1

3. R1(X), R2(X), W1(X), R1(Y), W1(Y), C1, W2(X), C2

4. R1(X), R2(X), W2(X), W1(X), C2, R1(Y), W1(Y), C1

5. R2(X), R1(X), W1(X), R1(Y), W1(Y), C1, W2(X), C2

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Consider following schedule. Determine whether each
schedule is Strict, Cascadeless, Recoverable or non-
recoverable. Provide justification to your answer

1. R1(X), W1(X), R1(Y), W1(Y), C1, R2(X), W2(X), C2

Strict
2. R1(X), W1(X), R1(Y), W1(Y), R2(X), W2(X), C2, C1
Non Recoverable
3. R1(X), R2(X), W1(X), R1(Y), W1(Y), C1, W2(X), C2
Strict
4. R1(X), R2(X), W2(X), W1(X), C2, R1(Y), W1(Y), C1
Cascadeless
5. R2(X), R1(X), W1(X), R1(Y), W1(Y), C1, W2(X), C2
Strict
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
■R1(X), W1(X), R1(Y), W1(Y), C1, R2(X), W2(X), C2
Strict because T1 has updated data item X, it is committing and after the commit of T1, T2 is
doing the conflicting operations R2(x) and W2(x).

■R1(X), W1(X), R1(Y), W1(Y), R2(X), W2(X), C2, C1

Not recoverable because T2 is reading data item X which was updated by T1, but T2 is
committing before T1.
■R1(X), R2(X), W1(X), R1(Y), W1(Y), C1, W2(X), C2
Strict because T1 has updated data item X, it is committing and after the commit of T1, T2 is
doing the conflicting operationW2(X).
■R1(X), R2(X), W2(X), W1(X), C2, R1(Y), W1(Y), C1
It is cascadeless because transaction T1 is not reading any data item previously updated by
T2 and vice versa. So, there is no chance that if one transaction fails, the other one will also
fail.

■R2(X), R1(X), W1(X), R1(Y), W1(Y), C1, W2(X), C2

Strict because T1 has updated data item X, it is committing and after the commit of T1, T2 is
doing the conflicting operations W2(x).
Recoverable schedule ?

T1 T2 T3 Operation#
read(C) 1
read (B) 2
read (C) 3
write (B) 5
write (A) 6
read (B) 7
read (A) 8
Write (C) 9
Write (B ) 10
Commit 11
Commit 12
Commit 13

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Characterizing Schedules based on
Serializability

■Serial
■Non Serial
■Result Equivalent
■Conflict Equivalent
■Conflict –Serializable

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Characterizing Schedules based on
Serializability
■It characterize the types of schedules that are considered
correct when concurrent transactions are executing
■When two transactions T1 and T2 are submitted at the
same time, if there is no interleaving of operations is
permitted, there are only two possible outcomes

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Characterizing Schedules based on
Serializability
■Serial schedule:
■A schedule S is serial if, for every transaction T
participating in the schedule, all the operations of
T are executed consecutively in the schedule.
■Otherwise, the schedule is called nonserial
schedule.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 40

Disadvantages of Serial schedule
■Serial schedules have less resource
utilization and low throughput.
■To improve it, two are more transactions
are run concurrently.
■But concurrency of transactions may lead
to inconsistency in database.
■To avoid this, we need to check whether
these concurrent schedules are serializable
or not.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Non Serial Schedule

They are called nonserial because each sequence

interleaves operations from the two transactions

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Serializable
■A schedule S is serializable if it is
equivalent to some serial schedule of the
same n transactions

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Characterizing Schedules based on
Serializability
■Result equivalent:
■Two schedules are called result equivalent if they
produce the same final state of the database.
S1 S2
read_item(X); read_item(X);
X:=X+10; X:=X*1.1;
write_item(X); write_item(X);

The above two schedules that are result equivalent for the
initial values of x=100 but are not result equivalent in general

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 44

Characterizing Schedules based on
Serializability
■Conflict equivalent:
■Two schedules are said to be conflict equivalent if
the order of any two conflicting operations is the
same in both schedules.
■Conflict serializable:
■A schedule S is said to be conflict serializable if it
is conflict equivalent to some serial schedule S’.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Revisit
■ Conflicting operations pair (R1(A), W2(A)) because they belong
to two different transactions on same data item A and one of
them is write operation.
■ Similarly, (W1(A), W2(A)) and (W1(A), R2(A)) pairs are also
conflicting.
■ On the other hand, (R1(A), W2(B)) pair is non-conflicting
because they operate on different data item.
■ Similarly, ((W1(A), W2(B)) pair is non-conflicting.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Testing for conflict serializability: Algorithm

■Looks at only read_Item (X) and write_Item (X)

operations

■Constructs a precedence graph (serialization graph) - a

graph with directed edges

■An edge is created from Ti to Tj if one of the operations

in Ti appears before a conflicting operation in Tj

■The schedule is serializable if and only if the precedence

graph has no cycles.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 48

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Constructing the Precedence Graphs
■ FIGURE 17.7 Constructing the precedence graphs for schedules A and D from
Figure 17.5 to test for conflict serializability.
■ (a) Precedence graph for serial schedule A.
■ (b) Precedence graph for serial schedule B.
■ (c) Precedence graph for schedule C (not serializable).
■ (d) Precedence graph for schedule D (serializable, equivalent to schedule A).

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 50

Example
S = r1(A) r2(A) w1(A) w2(A)

Answer

The schedule is not conflict-serializable, since there

is a cycle in the precedence graph

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Example
S = r1(A) r2(B) w3(A) r2(A) r1(B)

Answer

The schedule is conflict-serializable, since its

precedence graph does not contain any cycle.

The only conflict-equivalent serial schedule is T1,

T3, T2.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Exercise
Consider the following schedule S, consisting of
transactions T1, T2, and T3

●Give the precedence graph of S

●Is S conflict serializable ? Justify your answer
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Answer

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe

Example

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe 55

Example

Is it Conflict Serializable?

Characterizing Schedules based on
Serializability
■View equivalence:
■A less restrictive definition of equivalence of
schedules

■View serializability:
■Definition of serializability based on view
equivalence.
■A schedule is view serializable if it is view
equivalent to a serial schedule.

Database Concurrency Control
Purpose of Concurrency Control
■ To enforce Isolation (through mutual exclusion) among
conflicting transactions.
■ To preserve database consistency through consistency
preserving execution of transactions.
■ To resolve read-write and write-write conflicts.

Two-Phase Locking Techniques for
Concurrency Control
■A lock is a variable associated with a data item
that describes the status of the item with respect
to possible operations that can be applied to it
■Locks are used as a means of synchronization
the access by concurrent transactions to the
database items

Types of Locks
■Binary Locks
■Shared / Exclusive (or) Read/ Write Locks

Binary Locks
A binary lock can have two states or values : Locked or
unlocked
■ Locking is an operation which secures
■ (a) permission to Read
■ (b) permission to Write a data item for a transaction.
■ Example:
■ Lock (X). Data item X is locked in behalf of the requesting
transaction.
■ Unlocking is an operation which removes these
permissions from the data item.
■ Example:
■ Unlock (X): Data item X is made available to all other
transactions.
■ Lock and Unlock are Atomic operations.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 18- 63
Binary Locks
lock_item(X)
B: if LOCK (X) = 0 (*item is unlocked*)
then LOCK (X) ← 1 (*lock the item*)
else begin
wait (until lock (X) = 0) and
the lock manager wakes up the transaction);
goto B
end;
Unlock_item(X)
LOCK (X) ← 0 (*unlock the item*)
if any transactions are waiting then
wake up one of the waiting the transactions;

Shared/ Exclusive Locks
■ Two locks modes:
■ (a) shared (read) (b) exclusive (write).
■ Shared mode: shared lock (X)
■ More than one transaction can apply share lock on X for
reading its value but no write lock can be applied on X by
any other transaction.
■ Exclusive mode: Write lock (X)
■ Only one write lock on X can exist at any time and no
shared lock can be applied by any other transaction on X.
■ Conflict matrix

Shared /Exclusive Locks
Read_lock(X):

B: if LOCK (X) = “unlocked” then

begin LOCK (X) ← “read-locked”;
no_of_reads (X) ← 1;
end
else if LOCK (X) ← “read-locked” then
no_of_reads (X) ← no_of_reads (X) +1
else begin wait (until LOCK (X) = “unlocked” and
the lock manager wakes up the transaction);
go to B
end;

Shared /Exclusive Locks
write_lock(X):
B: if LOCK (X) = “unlocked” then
begin LOCK (X) ← “write-locked”;
else begin wait (until LOCK (X) = “unlocked” and
the lock manager wakes up the transaction);
go to B
end;

Two Phase Locking Protocol
A transaction is said to follow the two-phase
locking protocol if all locking operations precede
the first unlock operation in the transaction
■Two Phases:
■(a) Locking (Growing)
■(b) Unlocking (Shrinking).

Two Phase Locking Protocol
■Locking (Growing) Phase:
■ During which new locks on items can be acquired but
none can be released
■Unlocking (Shrinking) Phase:
■ During which existing locks can be released but no
new locks can be acquired

Two Phase Locking Protocol
Two-Phase Locking Techniques: The algorithm

T1 T2
read_lock (Y); read_lock (X);
read_item (Y); read_item (X);
unlock (Y); unlock (X);
write_lock (X); Write_lock (Y);
read_item (X); read_item (Y);
X:=X+Y; Y:=X+Y;
write_item (X); write_item (Y);
unlock (X); unlock (Y);

Two Phase Locking Protocol

T’1 T’2
read_lock (Y); read_lock (X);
read_item (Y); read_item (X);
write_lock (X); write_lock (Y);
unlock (Y); unlock (X);
read_item (X); read_item (Y);
X:=X+Y; Y:=X+Y;
write_item (X); write_item (Y);
unlock (X); unlock (Y);

Database Recovery
Purpose of Database Recovery
■ To bring the database into the last consistent state,
which existed prior to the failure.
■ To preserve transaction properties (Atomicity,
Consistency, Isolation and Durability).
■ Example:
■ If the system crashes before a fund transfer
transaction completes its execution, then either one
or both accounts may have incorrect value. Thus, the
database must be restored to the state before the
transaction modified any of the accounts.

Database Recovery
Types of Failure
■The database may become unavailable for use
due to
■ Transaction failure: Transactions may fail
because of incorrect input, deadlock, incorrect
synchronization.
■ System failure: System may fail because of
addressing error, application error, operating
system fault, RAM failure, etc.
■ Media failure: Disk head crash, power disruption,
etc.

Database Recovery
Transaction Log
■ For recovery from any type of failure data values prior to
modification (BFIM - BeFore Image) and the new value
after modification (AFIM – AFter Image) are required.
■ These values and other information is stored in a
sequential file called Transaction log. A sample log is
given below. Back P and Next P point to the previous and
next log records of the same transaction.

Database Recovery
Data Update
■ Immediate Update: As soon as a data item is modified in
cache, the disk copy is updated.
■ Deferred Update: All modified data items in the cache is
written either after a transaction ends its execution or after
a fixed number of transactions have completed their
execution.
■ Shadow update: The modified version of a data item
does not overwrite its disk copy but is written at a separate
disk location.
■ In-place update: The disk version of the data item is
overwritten by the cache version.

Database Recovery
Data Caching
■Data items to be modified are first stored into
database cache by the Cache Manager (CM) and
after modification they are flushed (written) to the
disk.
■The flushing is controlled by dirty bit and Pin-
Unpin bits.
■ Pin-Unpin: Instructs the operating system not to
flush the data item.
■ Dirty bit: Indicates the AFIM of the data item.

Database Recovery
Transaction Roll-back (Undo) and Roll-Forward
(Redo)
■To maintain atomicity, a transaction’s operations
are redone or undone.
■ Undo: Restore all BFIMs on to disk (Remove all
AFIMs).
■ Redo: Restore all AFIMs on to disk.
■ Database recovery is achieved either by performing
only Undos or only Redos or by a combination of the
two. These operations are recorded in the log as they
happen.

Transaction Rollback

Database Recovery
Write-Ahead Logging
■ When in-place update (immediate or deferred) is used
then log is necessary for recovery and it must be available
to recovery manager. This is achieved by Write-Ahead
Logging (WAL) protocol. WAL states that
■ For Undo: Before a data item’s AFIM is flushed to the
database disk (overwriting the BFIM) its BFIM must be
written to the log and the log must be saved on a stable
store (log disk).
■ For Redo: Before a transaction executes its commit
operation, all its AFIMs must be written to the log and the log
must be saved on a stable store.

Database Recovery
Checkpointing
■ Time to time (randomly or under some criteria) the
database flushes its buffer to database disk to minimize
the task of recovery. The following steps defines a
checkpoint operation:
1. Suspend execution of transactions temporarily.
2. Force write modified buffer data to disk.
3. Write a [checkpoint] record to the log, save the log to disk.
4. Resume normal transaction execution.
■ During recovery redo or undo is required to transactions
appearing after [checkpoint] record.

Database Recovery
Steal/No-Steal and Force/No-Force
■ Possible ways for flushing database cache to database
disk:
1. Steal: Cache can be flushed before transaction commits.
2. No-Steal: Cache cannot be flushed before transaction
commit.
3. Force: Cache is immediately flushed (forced) to disk.
4. No-Force: Cache is deferred until transaction commits
■ These give rise to four different ways for handling
recovery:
■ Steal/No-Force (Undo/Redo)
■ Steal/Force (Undo/No-redo)
■ No-Steal/No-Force (Redo/No-undo)
■ No-Steal/Force (No-undo/No-redo)

Recovery Scheme
■Deferred Update (No Undo/Redo)
■A transaction can not change the database
on disk until it reaches its common points

■A transaction does not reach its commit

point until all its update operations are
recorded in the log and the log is force-
written to disk

■Redo is needed in case the system fails

Deferred Update – Single user
environment

Deferred Update with concurrent
users
■ This environment requires some concurrency control
mechanism to guarantee isolation property of
transactions. In a system recovery transactions which
were recorded in the log after the last checkpoint were
redone. The recovery manager may scan some of the
transactions recorded before the checkpoint to get the
AFIMs.

Database Recovery
Immediate Update
■Undo/No-redo Algorithm
■In this algorithm AFIMs of a transaction are
flushed to the database disk under WAL before it
commits.
■For this reason the recovery manager undoes all
transactions during recovery.
■No transaction is redone.
■It is possible that a transaction might have
completed execution and ready to commit but this
transaction is also undone.

Shadow Paging
■ The AFIM does not overwrite its BFIM but recorded at
another place on the disk. Thus, at any time a data item
has AFIM and BFIM (Shadow copy of the data item) at
two different places on the disk.

X and Y: Shadow copies of data items

X' and Y': Current copies of data items

Shadow Paging
■ To manage access of data items by concurrent
transactions two directories (current and shadow) are
used.
■ The directory arrangement is illustrated below. Here a
page is a data item.

The ARIES Recovery Algorithm
■ARIES uses a steal/no force approach for writing
and it is based on three concepts
■Write-ahead logging
■Repeating history during redo
■Logging changes during undo

The ARIES Recovery Algorithm
■WAL (Write Ahead Logging)
■Repeating history during redo:
■ ARIES will retrace all actions of the database system
prior to the crash to reconstruct the database state
when the crash occurred.
■ Transaction that were uncommitted at the time of the
crash (active transactions) are undone
■Logging changes during undo:
■ It will prevent ARIES from repeating the completed
undo operations if a failure occurs during recovery,
which causes a restart of the recovery process.

The ARIES Recovery Algorithm
■ The ARIES recovery algorithm consists of three steps:
1. Analysis: step identifies the dirty (updated) pages in the
buffer and the set of transactions active at the time of
crash. The appropriate point in the log where redo is to
start is also determined.
2. Redo: necessary redo operations are applied.
3. Undo: log is scanned backwards and the operations of
transactions active at the time of crash are undone in
reverse order.

The ARIES Recovery Algorithm

The ARIES Recovery Algorithm
■ The Log and Log Sequence Number (LSN)
■ A unique LSN is associated with every log record.
■ LSN increases monotonically and indicates the disk address
of the log record it is associated with.
■ In addition, each data page stores the LSN of the latest log
record corresponding to a change for that page.
■ A log record stores
■ (a) the previous LSN of that transaction
■ (b) the transaction ID
■ (c) the type of log record.

The ARIES Recovery Algorithm
The Transaction table and the Dirty Page table
■For efficient recovery following tables are also
stored in the log during checkpointing:
■ Transaction table: Contains an entry for each
active transaction, with information such as
transaction ID, transaction status and the LSN of
the most recent log record for the transaction.
■ Dirty Page table: Contains an entry for each dirty
page in the buffer, which includes the page ID and
the LSN corresponding to the earliest update to
that page.

The ARIES Recovery Algorithm
■ Checkpointing
■ A checkpointing does the following:
■ Writes a begin_checkpoint record in the log
■ Writes an end_checkpoint record in the log. With this record
the contents of transaction table and dirty page table are
appended to the end of the log.
■ Writes the LSN of the begin_checkpoint record to a special
file. This special file is accessed during recovery to locate
the last checkpoint information.
■ To reduce the cost of checkpointing and allow the system
to continue to execute transactions, ARIES uses “fuzzy
checkpointing”.

The ARIES Recovery Algorithm
The ARIES Recovery Algorithm (contd.)
■ The following steps are performed for recovery
■ Analysis phase: Start at the begin_checkpoint record and
proceed to the end_checkpoint record. Access transaction table
and dirty page table are appended to the end of the log. Note
that during this phase some other log records may be written to
the log and transaction table may be modified. The analysis
phase compiles the set of redo and undo to be performed and
ends.
■ Redo phase: Starts from the point in the log up to where all dirty
pages have been flushed, and move forward to the end of the
log. Any change that appears in the dirty page table is redone.
■ Undo phase: Starts from the end of the log and proceeds
backward while performing appropriate undo. For each undo it
writes a compensating record in the log.
■ The recovery completes at the end of undo phase.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 19- 95
Revisit
Conflict Serializable: A schedule is called conflict
serializable if it can be transformed into a serial
schedule by swapping non-conflicting operations.

Conflicting operations: Two operations are said to be conflicting if all

conditions satisfy:

● They belong to different transactions

● They operate on the same data item
● At Least one of them is a write operation

Elmasri 6e Ch21
No ratings yet
Elmasri 6e Ch21
54 pages
U5Transaction Management, Concurrency Control and Recovery Techniques
No ratings yet
U5Transaction Management, Concurrency Control and Recovery Techniques
115 pages
Chapter 5
No ratings yet
Chapter 5
83 pages
Chapter 17
No ratings yet
Chapter 17
56 pages
Chapter 1 Int To Transaction Processig
No ratings yet
Chapter 1 Int To Transaction Processig
54 pages
Chapter 3
No ratings yet
Chapter 3
55 pages
Chapter THREE
No ratings yet
Chapter THREE
46 pages
Topic 3 - Part 1
No ratings yet
Topic 3 - Part 1
15 pages
Transaction Manag PDF
No ratings yet
Transaction Manag PDF
55 pages
Transaction 2020 Modified
No ratings yet
Transaction 2020 Modified
40 pages
Understanding Transaction Processing Basics
No ratings yet
Understanding Transaction Processing Basics
55 pages
Transaction 2
No ratings yet
Transaction 2
66 pages
Chapter 20
No ratings yet
Chapter 20
60 pages
W6 DBMS Chapter22
No ratings yet
W6 DBMS Chapter22
50 pages
31-M5 - Transaction Concepts - Transaction States-30-09-2024
No ratings yet
31-M5 - Transaction Concepts - Transaction States-30-09-2024
46 pages
Database Transaction Processing Overview
No ratings yet
Database Transaction Processing Overview
44 pages
Interleaved Transaction Processing Overview
No ratings yet
Interleaved Transaction Processing Overview
76 pages
18CS53 - 2022 - 23 - Module5 - DBMS
No ratings yet
18CS53 - 2022 - 23 - Module5 - DBMS
89 pages
Chapter 20
No ratings yet
Chapter 20
24 pages
Transaction Processing Concepts Explained
No ratings yet
Transaction Processing Concepts Explained
54 pages
Transactionprocessing Unit IV 1
No ratings yet
Transactionprocessing Unit IV 1
27 pages
Transaction Processing Concepts Overview
No ratings yet
Transaction Processing Concepts Overview
52 pages
Transaction Processing Concepts and Theory
No ratings yet
Transaction Processing Concepts and Theory
56 pages
Transaction Processing Concepts Overview
No ratings yet
Transaction Processing Concepts Overview
68 pages
Introduction To Transaction Processing Concepts and Theory
No ratings yet
Introduction To Transaction Processing Concepts and Theory
47 pages
Introduction to Transaction Processing
No ratings yet
Introduction to Transaction Processing
37 pages
Transaction Processing in Database Systems
No ratings yet
Transaction Processing in Database Systems
87 pages
WINSEM2024-25 BCSE302L TH VL2024250503831 2025-02-28 Reference-Material-I
No ratings yet
WINSEM2024-25 BCSE302L TH VL2024250503831 2025-02-28 Reference-Material-I
44 pages
Chapter 3 Transaction Processing Conceptes
100% (1)
Chapter 3 Transaction Processing Conceptes
75 pages
Transaction Processing in Databases
No ratings yet
Transaction Processing in Databases
39 pages
Transaction Processing in Databases
No ratings yet
Transaction Processing in Databases
47 pages
Transaction Management in Databases
No ratings yet
Transaction Management in Databases
55 pages
Transaction Processing Concepts Overview
No ratings yet
Transaction Processing Concepts Overview
29 pages
Module 4 TransactionProcessing
No ratings yet
Module 4 TransactionProcessing
73 pages
ADB Slides 6
No ratings yet
ADB Slides 6
64 pages
Transaction Processing Concepts Explained
No ratings yet
Transaction Processing Concepts Explained
57 pages
Understanding Transaction Processing Systems
No ratings yet
Understanding Transaction Processing Systems
33 pages
Transaction Processing in Advanced DBMS
No ratings yet
Transaction Processing in Advanced DBMS
61 pages
20 Transaction
No ratings yet
20 Transaction
44 pages
Transaction Processing Systems
No ratings yet
Transaction Processing Systems
71 pages
Transaction Processing Fundamentals
No ratings yet
Transaction Processing Fundamentals
62 pages
Transaction Processing Fundamentals
No ratings yet
Transaction Processing Fundamentals
65 pages
Transaction Processing Concepts Explained
No ratings yet
Transaction Processing Concepts Explained
45 pages
8-Chapter17-Transaction - Updated
No ratings yet
8-Chapter17-Transaction - Updated
32 pages
Transaction Processing Concepts Overview
No ratings yet
Transaction Processing Concepts Overview
46 pages
Transaction Processing Concepts Explained
No ratings yet
Transaction Processing Concepts Explained
39 pages
Chapter 3 ADB
No ratings yet
Chapter 3 ADB
40 pages
Transaction Processing Concepts Overview
No ratings yet
Transaction Processing Concepts Overview
60 pages
Understanding Transaction Processing Basics
No ratings yet
Understanding Transaction Processing Basics
15 pages
Chapter 3
No ratings yet
Chapter 3
55 pages
DBMS
No ratings yet
DBMS
160 pages
Transaction Processing I-1
No ratings yet
Transaction Processing I-1
27 pages
Chapter Four: Introduction To Transaction Processing Concepts and Theory
No ratings yet
Chapter Four: Introduction To Transaction Processing Concepts and Theory
36 pages
Transactions Management
No ratings yet
Transactions Management
68 pages
Transaction Processing Concepts Overview
No ratings yet
Transaction Processing Concepts Overview
45 pages
Transaction Processing Fundamentals
No ratings yet
Transaction Processing Fundamentals
69 pages
Transaction Processing Concepts Explained
No ratings yet
Transaction Processing Concepts Explained
41 pages
Transaction Management in Databases
No ratings yet
Transaction Management in Databases
25 pages
Distributed Transactions, ACID, BLOB
No ratings yet
Distributed Transactions, ACID, BLOB
3 pages
DBMS Concurrency Control - Two Phase, Timestamp, Lock-Based Protocol
No ratings yet
DBMS Concurrency Control - Two Phase, Timestamp, Lock-Based Protocol
11 pages
3 Semaphores
No ratings yet
3 Semaphores
26 pages
DBMS Revision Notes
No ratings yet
DBMS Revision Notes
7 pages
Critical Sections in Operating Systems Neha Yadav232
No ratings yet
Critical Sections in Operating Systems Neha Yadav232
5 pages
OS Lecture29 Locking in Xv6
No ratings yet
OS Lecture29 Locking in Xv6
8 pages
SPOS Assignment No5
No ratings yet
SPOS Assignment No5
16 pages
Advanced Transaction Models
No ratings yet
Advanced Transaction Models
93 pages
Timestamp Based Protocol
100% (2)
Timestamp Based Protocol
3 pages
Process Synchronization Descriptive Questions
No ratings yet
Process Synchronization Descriptive Questions
3 pages
Process Synchronization - Semaphore: Lecture - 10
No ratings yet
Process Synchronization - Semaphore: Lecture - 10
11 pages
DB2 Locks and Isolation Levels Explained
No ratings yet
DB2 Locks and Isolation Levels Explained
9 pages
Process Synchronization Guide
No ratings yet
Process Synchronization Guide
5 pages
DBMS Unit-4 Notes
No ratings yet
DBMS Unit-4 Notes
62 pages
ACID Properties & Transaction Management in DBMS
No ratings yet
ACID Properties & Transaction Management in DBMS
2 pages
Detailed Answers For Selected Quest
No ratings yet
Detailed Answers For Selected Quest
9 pages
Database Transactions and Locking Mechanisms
No ratings yet
Database Transactions and Locking Mechanisms
12 pages
Two-Phase Commit Protocol Explained
No ratings yet
Two-Phase Commit Protocol Explained
10 pages
4.2 Transaction-Management-And-Concurrency-Control
No ratings yet
4.2 Transaction-Management-And-Concurrency-Control
61 pages
5 Test and Set - BCS401
No ratings yet
5 Test and Set - BCS401
11 pages
Transaction Processing in Advanced DBMS
No ratings yet
Transaction Processing in Advanced DBMS
48 pages
Concurrency Control With Locking
No ratings yet
Concurrency Control With Locking
7 pages
503 ADBMS April May 2025
No ratings yet
503 ADBMS April May 2025
2 pages
Introduction To Database Failures
No ratings yet
Introduction To Database Failures
12 pages
Timestamp Notes Dbms
No ratings yet
Timestamp Notes Dbms
7 pages
Transaction Processing Basics
No ratings yet
Transaction Processing Basics
10 pages
DBMS Unit 5 Notes
No ratings yet
DBMS Unit 5 Notes
31 pages
Process Synchronization Guide
No ratings yet
Process Synchronization Guide
19 pages
Deadlock Prevention
No ratings yet
Deadlock Prevention
2 pages
DBMS Module5 Notes
No ratings yet
DBMS Module5 Notes
16 pages