Professional Documents
Culture Documents
Data Replication Techniques PDF
Data Replication Techniques PDF
Abstract— Replication is the one of the good way to increase the performance of database system by separate out
database by maintaining different servers. Database replication is used to provide high availability and performance
for accessing data. In this paper we are discussing the eager replication technique of data replication to ensure
consistency using three parameters; this helps the database designer to examine various components and find the
most efficient and reliable replication technique.
Keywords—Database Replication, Data Consistency, Eager Replication, Data Replication Strategy, Eager Replication
Parameters.
I. Introduction
Distributed system is that system in which systems are logically connected. Data may be distributed or stored on multiple
systems located at same physical location or may be distributed over a network of interconnected systems. In order to
distribute database load to different database servers rather having a single mainframe server we replicate data. In this
way work load on single server can be decreased.
Replication is a process in which data is stored in more than one site for reliability and performance reasons. Database
replication having two types:
(1) Full Data Replication :- In this copy of whole database is stored at every site.
(2) Partial Data Replication :- In this some fragment of database are replicated.
(ii) Update everywhere: Update everywhere provide update to data item to be performed on any server in a cluster
regardless of whether a transaction is update, delete, insert transaction or a simple transaction. The server on which
data update is performed will delegate the transaction to all other servers using multicast transaction primitive
propagation such as Total Order Broadcast. It removes the limitation of a single point failure in primary copy by
updating two different copies of same data item. It also removes the limitation of bottlenecks.
(b) Server Interaction:- This second parameter is necessary to performance of a replication protocol in distributed
systems. This provides the degree of communication between servers during execution of a transaction. Server
Interaction provides us with the amount of network traffic generated by transaction algorithm and overall overhead
of execution. This is considered as function of number of messages required to handle the operation of transaction. If
interaction between servers will be high, then consistency will also be high but low performance. On the other hand,
if interaction is low then consistency will be low and performance will be high. The reason for this relation between
consistency and performance is that server interaction is the result of consistency checking protocols such as two-
phase commit or 1-safe which is a trimmed down version of two-phase commit. In this we consider two cases:
(i) Constant Interaction: Constant Interaction is a technique in which constant number of messages is directly
related to number of transactions received independent of the operations in the transaction by means of message
grouping. In constant interaction protocols exchange a single message by grouping all the operations in a single
message. By grouping all the operation of a transaction, node communication can be limited.
(ii) Linear interaction: Linear interaction is a technique in which a database server transmits each operation of a
transaction on individual basis. This is mostly suited for small minimal node environment but when it is applied to
large distributed systems then scalability performance is limited but when limited nodes are replicated then overhead
becomes negligible. As propagation is done on individual basis then overhead reaches unacceptable levels. Database
server sent each operation as SQL statement or as log records.
(c) Transaction Termination: The last parameter in eager replication is way to terminate the transaction. It is a
process in which all servers either abort or commit the transaction. It is done to ensure Atomicity. Protocols use
transaction manager to manage transaction termination. Some protocols use single transaction manager while some
use multiple transaction managers. Transaction Termination can affect replication performance, data availability,
and data consistency. It can be broken down into two categories:
(i) Voting Termination: Voting Termination requires round of messages to coordinate different replicas to ensure
atomicity. The complexity of round messages in Voting Mechanism can be complex or simple. It can be complex as
an atomic commitment protocol which generally includes two phase commitment protocol and it can be simple as
single confirmation from a slave, show in figure.
(ii) Non Voting Termination: In non-voting termination technique individual sites determine whether to commit or abort
a transaction on their own. It is also called lazy replication. To have replicas in non-voting technique is not necessary
because it will only help to behave deterministically and determinism only affects transaction which are serializable. Its
main advantage is that it is non-blocking and is doing so on the expense of data consistency. The idea behind non-voting
technique very simple. In this technique, master copy writes everything in the log file that it believes should be
committed or aborted and then the log file is read by all of its slaves. No voting or interaction between the masters and
slaves or between the slaves occurs during read of log file hence the name NON-VOTING.
VII. Conclusion
Database Replication is one of the best processes to find out different components of data that is used during share,
access, store, fetch or retrieve the data components from databases. Eager Replication is the technique that helps to
database designers and programmers to find out their necessity components of data during processing of databases.
References
[1]http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=885408&url=http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%
2Fabs_all.jsp%3Farnumber%3D885408
[2]http://www.dbjournal.ro/archive/2/4_Cristian_Mazilu.pdf
[3]http://infoscience.epfl.ch/record/49996/files/WPS%2B00b.pdf
[4]http://www2.stetson.edu/mathcs/people/students/research/pdf/2007/acaloiaro/final.pdf
[5]http://www.waset.org/journals/waset/v59/v59-61.pdf
[6]http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.17.9166&rep=rep1&type=pdf
[7]http://jarrett.cis.unimelb.edu.au/papers/DataReplicationInDSChapter2006.pdf