Relational Algebra-2023

Relational
Algebra
Key = attribute set that uniquely identifies each
row in the relation/table
Keys:
Customer ID
Coustomer ID, Tax ID
……
Superkey
• set of attributes of table for which

every row has distinct set of
values
Candidate key
Keys • Minimal such set of attributes
Primary key
• DB Chosen Candidate key

• Plays a very important role
• E.g. relations typically sorted by
this
• Superkey
• {代碼}, {名稱} ,{電話} , {代碼, 名稱}, {代碼, 電話}, {代碼,
Example of 口罩剩餘數}
• Candidate key:
FaceMask • {代碼, 電話} is not a miminal set. Since {電話} could be
removed from {代碼, 電話} while the remining attribute
dataset {代碼} is still a key
• {代碼}, {名稱}, {電話}, {地址}
Introduction
• Query languages are specialized languages for
asking questions or queries, that involve the
data in a database.
• Relational algebra : queries in terms of

operators
• Every operator in relational algebra accepts

(one or two) relation instances as arguments
and returns a relation instance as the result.
( 1+2=3 ) +(1,2) = 3
Introduction
( 1+2=3 ) +(1,2) = 3
• A relational algebra expression is

recursively defined to be a relation.
• Relational algebra is a procedural query

language
• Defines a step-by-step procedure for
computing the answer.
Basic
Operators
• There are six basic
operators in relation algebra:
• select ( s )
• project ( P )
• union ( U )
• set different ( － )
• Cartesian product ( x )
• rename ( r )
Selection ( s )
Relation R
s s a=b Ùd>5 ( R )
a b c d
a a 1 7 a b c d
a b 5 7 a a 1 7
b b 12 3 b b 23 10
b b 23 10
Selection ( s )
• The selection operation s specifies the tuples to retain

through a selection condition.
• Selection condition is a Boolean combination (an

expression using logical connectives Ù and Ú) of terms that
have the form
• attribute op constant ( e.g. A = 3)
• attribute1 op attribute2 ( e.g. X < Y)
(op is one of the comparison operators
<, £, =, ¹, >, ³).
Projection ( P )
P
P a,c ( R )
Relation R
a c a c
a b c a 1 a 1
a 10 1 a 1 Eliminate
b 1
Duplicated
a 20 1 b 1 rows b 2
b 30 1 b 2
b 40 2
Projection ( P )
• The projection operator P allows us to extract columns from

a relation.
• The subscript specifies the fields to be retained.

(The other fields are ‘projected out’).
• The schema of the result of a projection is determined by

the fields that are projected.
• Duplicated row will be eliminated in the final result.

• This follows from the definition of a relation as a set of
tuples.
Employee
Find all employees who works in department 4 and whose salary is

greater than 25000.
s dno=4 Ùsalary>25000 (Employee)

1. Find the employee names and department number of all
employees
2. Find the gender and department number of all employees
P lname,fname,dno ( Employee ) P gender,dno ( Employee )

• Compositing operations
• Substituting an expression where a relation is expected
• The result of an relational-algebra expression is always a relation.
P lname,fname ( s dno=4^salary>25000 ( Employee ) )

Union
a b
a b
a b a 1
a 1
R S a 2 RUS a 2
a 2
b 3 b 1
b 1
b 3
• R È S returns a relation instance containing all tuples
that occur in either relation R or relation S (or both).
• The union operation is commutative: R È S = S È R
• Duplicated tuples are eliminated.

• R and S must be union-compatible.
• They have the same number of fields,
• The corresponding fields have the same domains.
• The field names are not so important
• The schema of the result is defined to be identical to
the schema of R.
• The fields of R È S inherit names from R.
Set Difference
a b
a b a b
a 1
R S a 2 R-S a 1
a 2
b 3 b 1
b 1
Set Difference
• Set difference R – S
returns a relation instance containing all the tuples that occur in R but
not in S.
• Set difference is not commutative:

in general, R – S ¹ S – R.
• The relations R and S must be union-compatible.

• The schema of the result is defined to be identical to the
schema of R.
Intersection
a b
a b a b
a 1
R S a 2 RÇS a 2
a 2
b 3
b 1
Intersection
• R Ç S returns a relation instance containing all tuples that

occur in both R and S.
• The relations R and S must be union-compatible.
• The schema of the result is defined to be identical to the
schema of R.
• Intersection is not considered a basic operation, as it can be

derived from the basic operations as shown below:
R Ç S = R - (R - S )
Cartesian-Product ( ´ ) R´S
a b c d e
S a 1 a 10 +
R c d e a 1 b 10 +
a b a 10 + a 1 b 20 -
a 1 b 10 + a 1 g 10 -
b 2 b 20 - b 2 a 10 +
g 10 - b 2 b 10 +
b 2 b 20 -
b 2 g 10 -
Cartesian product (Cross product)
• R ´ S returns a relation instance whose schema

contains all the fields of R followed by all the
fields of S.
• The result contains one tuple <r,s>

(concatenation of tuples r and s)
for each pair of tuples r Î R, s Î S.
Query: for each female employee, retrieve employee’s
name and a list of the names of her dependents.
EMPLOYEE
f_name l_name id bdate addr gender salary super_id dno
DEPENDENT
eid dependent-name gender bdate relationship

Female_emps ¬ s gender=‘F’ ( Employee )
Empnames ¬ P fname,lname,id ( Female_emps )
Female_emps
Empnames
Dependents
Emp_dependents ¬ Empnames x Dependents
Emp_dependents
Query: for each female employee, retrieve employee’s name
and a list of the names of her dependents
Actual_dependents ¬ s id=eid ( Emps_dependents )
Result ¬ P fname,lname,dep_name ( Actual_dependents )
Actual_dependents
Result
Join
• Because the sequence of operations, X followed by s is

quite common, a special operations, called the “join”
operation was created to specify these as a single
operation.
• Join can be defined as a cross-product followed by

selections and sometimes with projections.
Query: for each female employee, retrieve employee’s
name and a list of the names of her dependents.
Female_emps ¬ s gender=‘F’ (Employee)

Empnames ¬ p fname,lname,id ( Female_emps )
Actual_dependents ¬ empnames id=eid Dependents
Result ¬ p fname,lname,dep_name (Actual_dependents )
Female_emps
Empnames id=eid
Dependents
Actual_dependents
Result
Conditional Join
• The most general version of join operation accepts a join

condition c.
• The join condition is identical to a selection condition in form.
• The operation is defined as follows:

R  C S = s C ( R ´ S )
S R
sid sname rating age sid bid day
22 Dustin 7 45.0 22 101 10/10/96
31 Lubber 8 55.5 58 103 11/12/96
58 Rusty 10 35.5
S  S .sid < R.sid R

(sid) sname rating age (sid) bid day
22 Dustin 7 45.0 58 103 11/12/96
31 Lubber 8 55.5 58 103 11/12/96
Example
Query: Find the names of employees with the highest salary.

Retrieve employee
whose salary is
P lname,fname ( Employee ) - smaller than others
P Employee.lname, Employee.fname (
Employee Employee.salary<F.salary r ( F, Employee ) )
Employee Employee.salary>F.salary
r ( F, Employee )
Cannot work ? Why ?

Employee Employee as F
eid lname salary eid lname salary
22 Dustin 7 22 Dustin 7
31 Lubber 8 31 Lubber 8
58 Rusty 10 58 Rusty 10
Employee Employee.salary<F.salary F
eid lname salary (eid) lname salary
22 Dustin 7 31 Lubber 8
31 Dustin 7 58 Rusty 10
31 Lubber 8 58 Rusty 10
Equi-Join
A condition join where the condition contains only equalities.
The conditions of the join have the form R.name1 = S.name2,
where we want to join R with S.
In the resulting relation, S.name2 will be dropped.
R R.b=S.b S
a b c b e f a b c e f
a 1 a 1 X a
b a 1 a X a
R
b 5 g S
3 X a 1 a X g
g 4 b 1 X g a 1 g X a
a 1 g 2 Y d 1
3 Y
a g X g
d 2 b e d 2 b Y d
Natural Join
An Equijoin on all common fields.
For R S, the resulting schema contains the attributes of R
followed by the attributes in S that are not in R.
If two relations have no attributes in common, the result is
simply the cross-product.
R S
a b c d b d e R S
a 1 a X 1 X a a b c d e
b 2 g X 3 X b a 1 a X a
g 4 b Y 1 X g a 1 a X g
a 1 g Y 2 Y d d 2 b Y d
d 2 b Y 3 Y e
Division
x y y x
a 1 B 1 A/B a
a 2 2 e
a 3
b 1
A g 1 The division operation A/B is the
d 1 set of all x values
(in the form of unary tuples)
d 3
d 4 such that for every y value in a
e 1 tuple of B,
e 2 there is a tuple <x,y> in A.
x y y x
a 1 B 1 A/B a
a 2 2 e
a 3
b 1
Another way to understand division:
A g 1
d 1 For each x value in A, consider the
d 3 set of y values that appear in tuples
d 4 of A with that x value.
e 1
e 2 If this set contains all y values in B,
the x value is in the result of A/B.
Examples of Division A/B
sno pno pno pno pno
s1 p1 p2 p2 p1
s1 p2 p4 p2
s1 p3
B1 p4
s1 p4
B2
s2 p1 sno B3
s2 p2 s1
s3 p2 s2 sno
s4 p2 s3 s1 sno
s4 p4 s4 s4 s1
A A/B1 A/B2 A/B3

Employee fname lname id bdate address salary sid dno
Works_on id pno
Query: Retrieve the names of employees who work on all the

projects that `Chris Peng’ works on.
PENG ¬ s fname=“Chris” ^ lname=“Peng” ( Employee )

PENG_pnos ¬ P pno ( Works_on  PENG )
fname lname id bdate address salary sid dno pno pno
Result_id ¬ Works_on / PENG_pnos

Result ¬ P fname,lname ( Result_id  Employee )
User (ID, PID, Name)
Inbound (PID, Origin, flight_no, date)
Airport (IATA code, city, country)
Query 1: Find PID of users who traveled UK and were
back Taiwan on 2020/03/16
- Solution 1
P PID ( s country=UK (Airport ) s date=2020/03/16 (Inbound ))
- Solution 2:
P PID( s country=UK and date=2020/03/16 ( Airport Inbound ))
Query 2: Find the ID of users who traveled to UK and were
back Taiwan on 2020/03/16
- Solution 1:
P ID (( s country=UK (Airport ) s date=2020/03/16 (Inbound)) User)
- Solution 2:
P ID ( P PID ( ( P LATA code (s country=UK Airport )) s date=2020/03/16
(Inbound)) User )
Query 3: Find the ID of users who were back from UK or Japan.
- Solution 1:
r ( TempAirport, ( s country=UK V country=Japan Airport ) )
P ID ( TempAirport Inbound User )
- Solution 2
P ID (( ( s country=UK Airport ) Inbound ) User ) U
P ID (( ( s country=Japan Airport) Inbound ) User )
Query 4: Find the PID of users who were in the same plant of one
confirmed case with his PID as TW0001 on 2020/02/28
-Solution 1
P PID, flight_no (s Date=2020/02/28 Inbound )/P flight_no(s PID=TW001 and
Date=2020/02/28 Inbound)
-Any other solution ?

Covid 2019 HackMD Open Data Taiwan FB Group
Maskdata (ID, name, address, Tel, adult_no, child _no, time)

Additional Relational Operations
• AGGREGATE FUNCTIONS
• Functions such as SUM, COUNT, AVERAGE, MIN, MAX are
often applied to sets of values or sets of tuples in
database applications
• <grouping attributes> F <function list> (R)
• The grouping attributes are optional
• e.g. Retrieve the average salary of all employees (no grouping
needed):
R(AVGSAL) ß F AVERAGE SALARY (EMPLOYEE)
Query 1: Find the average number of adult face mask in each
clinic .
R(ID, ave_adult)=ID F avg(adult_no) (Maskdata)

Query 2: Find the phone number of the clinic ID that has the maximal
number of adult face mask
Step 1: R(ID, MAX_ID_adult)=ID F MAX(adult_no) (Maskdata)

ID MAX_ID_adult
Step 2: R(ID, Final_MAX)=ID F MAX(MAX_ID_adult) (Maskdata)

ID Final_MAX
Step 3: P tel (R(ID, Final_MAX) Maskdata)

Query 3: Given one location (longitude, latitude),
find the nearest clinic ID that has face mask
Geocoding API Google

Openstreet
ID Name (X, Y) Tel
2312010014 新竹市東區衛生所 (122.1, 34.123)
2312041028 新竹市北區衛生所 (122.1, 35.12131)

Spatial
Query in
DB will
rescue
you !
Summary
• The relational model has rigorously defined query
languages that are simple and powerful.
• Relational algebra is operational; useful as an

internal representation for query evaluation steps.
• Typically there are several ways of expressing a

given query; a query optimizer should choose an
efficient version.
• Reading assignment: Chapter X
•

Relational Algebra-2023

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Relational Algebra-2023

Uploaded by

Copyright:

Available Formats

Relational

• set of attributes of table for which

Keys • Minimal such set of attributes

• DB Chosen Candidate key

• Relational algebra : queries in terms of

• Every operator in relational algebra accepts

• A relational algebra expression is

• Relational algebra is a procedural query

• The selection operation s specifies the tuples to retain

• Selection condition is a Boolean combination (an

• The projection operator P allows us to extract columns from

• The subscript specifies the fields to be retained.

• The schema of the result of a projection is determined by

• Duplicated row will be eliminated in the final result.

Find all employees who works in department 4 and whose salary is

s dno=4 Ùsalary>25000 (Employee)

2. Find the gender and department number of all employees

P lname,fname,dno ( Employee ) P gender,dno ( Employee )

P lname,fname ( s dno=4^salary>25000 ( Employee ) )

• Duplicated tuples are eliminated.

• Set difference is not commutative:

• The relations R and S must be union-compatible.

• R Ç S returns a relation instance containing all tuples that

• Intersection is not considered a basic operation, as it can be

• R ´ S returns a relation instance whose schema

• The result contains one tuple <r,s>

f_name l_name id bdate addr gender salary super_id dno

eid dependent-name gender bdate relationship

• Because the sequence of operations, X followed by s is

• Join can be defined as a cross-product followed by

Female_emps ¬ s gender=‘F’ (Employee)

• The most general version of join operation accepts a join

• The join condition is identical to a selection condition in form.

• The operation is defined as follows:

S  S .sid < R.sid R

Query: Find the names of employees with the highest salary.

Cannot work ? Why ?

A A/B1 A/B2 A/B3

Query: Retrieve the names of employees who work on all the

PENG ¬ s fname=“Chris” ^ lname=“Peng” ( Employee )

fname lname id bdate address salary sid dno pno pno

Result_id ¬ Works_on / PENG_pnos

-Any other solution ?

Maskdata (ID, name, address, Tel, adult_no, child _no, time)

R(ID, ave_adult)=ID F avg(adult_no) (Maskdata)

Step 1: R(ID, MAX_ID_adult)=ID F MAX(adult_no) (Maskdata)

Step 2: R(ID, Final_MAX)=ID F MAX(MAX_ID_adult) (Maskdata)

Step 3: P tel (R(ID, Final_MAX) Maskdata)

Geocoding API Google

ID Name (X, Y) Tel

2312010014 新竹市東區衛生所 (122.1, 34.123)

2312041028 新竹市北區衛生所 (122.1, 35.12131)

• Relational algebra is operational; useful as an

• Typically there are several ways of expressing a

You might also like