Professional Documents
Culture Documents
Normalization
What Is Normalization?
Normalization is the branch of relational theory that provides design
insights. It is the process of determining how much redundancy exists
in a table. The goals of normalization are to:
Be able to characterize the level of redundancy in a relational
schema
Provide mechanisms for transforming schemas in order to remove
redundancy
How to produce a good relation schema?
2
Data Redundancy and Update Anomalies
Redundant
data
3
Data Redundancy and Update Anomalies
4
Relation Decomposition
• Normalization process involve decomposing a relation.
• Decomposition require to be reversible.
• Functional dependencies guarantee decomposition to be
reversible.
5
Functional Dependencies
• Describes the relationship between attributes in a relation.
• If A and B are attributes of relation R,
• B is functionally dependent on A, denoted A B, if each value
of A is associated with exactly one value of B. B may have
several values of A.
B is functionally
A dependent on A B
Determinant Dependent
6
An Example Functional Dependency
1:M relationship
between attributes in a
relation
7
Example Functional Dependency that holds for all Time
staffNo → sName
sName → staffNo
9
Example Full Functional Dependency
10
Transitive Dependencies
• Important to recognize a transitive dependency because its
existence in a relation can potentially cause update anomalies.
11
Example Transitive Dependency
• Consider functional dependencies in the StaffBranch relation.
staffNo → sName, position, salary, branchNo, bAddress
branchNo → bAddress
13
Identifying the Primary Key for a Relation
using Functional Dependencies
• Main purpose of identifying a set of functional dependencies for a
relation is to specify the set of integrity constraints that must hold
on a relation.
14
Example - Identify Primary Key for
StaffBranch Relation
• StaffBranch relation has five functional dependencies.
staffNo → sName, position, salary, branchNo, bAddress
branchNo → bAddress
bAddress → branchNo
branchNo, position → salary
bAddress, position → salary
15
Inference Rules for Functional
Dependencies
Armstrong’s axioms (inference rules): The set of inference rules specifies how
functional dependencies can be inferred from given one.
Inference rules:
Reflexivity If B A, then A B
16
The Process of Normalization
Normalization is a technique for analyzing relations based on their Candidate Key &
Functional dependencies.
1NF
2NF s
lie
a
m
3NF at no
o rm e a
f at
BCNF n
i pd
r
e u
g
n to
4NF o
r le
St rab
5NF lne
vu
Higher Normal ess
L
Form
17
The Purpose of Normalization
Normalization is a bottom-up approach to database design that begins
by examining the relationships between attributes. It is performed as a
series of tests on a relation to determine whether it satisfies or violates
the requirements of a given normal form.
Purpose:
Guarantees no redundancy due to FDs
Guarantees no update anomalies
Normal Forms:
First Normal Form (1NF)
Second Normal Form (2NF)
Third Normal Form (3NF) 18
First Normal Form (1NF)
Un normalized form (UNF): A relation that contains one or more repeating groups.
First normal form (1NF): A relation in which the intersection of each row and column
contains one & only one value.
CLIENT_PROPERTY
Client No Name PropertyNo
CR76 John Key PG4
PG16
CR56 Aline Stewart PG4
PG36
PG16
Un normalized relation
19
UNF 1NF
Approach 1
Expand the key so that there will be a separate tuple in the original relation for
each repeated attribute(s). Primary key becomes the combination of primary key
and redundant value.
CLIENT_PROPERTY
ClientNo Name PropertyNo
CR76 John Key PG4
CR76 John Key PG16
CR56 Aline Stewart PG4
CR56 Aline Stewart PG36
CR56 Aline Stewart PG16
1NF relation
CLIENT_PROPERTY
ClientNo Name PropertyNo1 PropertyNo2 PropertyNo3
CR76 John Key PG4 PG16 NULL
CR56 Aline Stewart PG4 PG36 PG16
1NF relation
21
UNF 1NF
Approach 3
Remove the attribute that violates the 1NF and place it in a separate relation along
with a copy of the primary key.
CLIENT PROPERTY
ClientNo Name ClientNo PropertyNo
1NF relation
22
Second Normal Form (2NF)
Second normal form (2NF): A 1NF relation in which every attribute is fully
nontrivial functionally dependent on the PK. (non-prime attributes fully
dependent on PK.)
CLIENT_RENTAL
ClientNo PropertyNo cName pAddress RentStart RentFinish Rent OwnerNo OName
1NF relation
23
Third Normal Form (3NF)
2NF relation
24