Indexing and Query Processing: Introduction To Databases Compsci 316 Fall 2017

Indexing
and
Query Processing
Introduction to Databases
CompSci 316 Fall 2017
2
Announcements (Wed., Mar. 8)

• Homework #3
• follow piazza posts for updates after each lecture
• Project
• Comments on milestone-1 tomorrow
• Keep working on it
• No class next week

• spring break
3
Today:
• Finish B+ tree and index
• Start query processing
4
Index
5
Recall: What are indexes for?

• Given a value (search key), locate the record(s)
with this value, or range search
• SELECT * FROM R WHERE A = value;
• SELECT * FROM R, S WHERE R.A = S.B;
• SELECT * FROM R WHERE A > value;
• Search key ≠ key in a relation (unique attributes)

• “Key” is highly overloaded in databases
• Recap: index structure on whiteboard

6
Recall: Index classification

• Dense vs. Sparse
• Clustered vs. unclustered
• Primary vs. Secondary
• Tree-based vs. Hash-based
• we will only do tree indexes in 316
7
Recall: B+-tree
• A hierarchy of nodes with intervals
• Balanced (more or less): good performance guarantee
• Disk-based: one node per block; large fan-out
Max fan-out: 4
100
120
150
180
30
179
120
156
180
100
101
130
150
200
110
30
11
35
5
3
8
Recall: Sample B+-tree nodes

to keys
100 ≤ 𝑘
Max fan-out: 4
150
180
120
Non-leaf
to keys to keys to keys to keys

100 ≤ 𝑘 < 120 120 ≤ 𝑘 < 150 150 ≤ 𝑘 < 180 180 ≤ 𝑘
120
130
Leaf to next leaf node in sequence
to records with these 𝑘 values;

or, store records directly in leaves
9
Recall: B+-tree balancing properties

• Height constraint: all leaves at the same lowest level
• Fan-out constraint: all nodes at least half full
(except root)
Max # Max # Min # Min #

pointers keys active pointers keys
Non-leaf 𝑓 𝑓−1 ⌈𝑓/2⌉ ⌈𝑓/2⌉ − 1
Root 𝑓 𝑓−1 2 1
Leaf 𝑓 𝑓−1 ⌊𝑓/2⌋ ⌊𝑓/2⌋
End of lecture 14
10
Lookups
• SELECT * FROM R WHERE k = 179;
• SELECT * FROM R WHERE k = 32;
Max fan-out: 4
100
120
150
180
30
Not found
179
120
156
180
100
101
130
150
200
110
30
11
35
5
3
11
Range query
• SELECT * FROM R WHERE k > 32 AND k < 179;
Max fan-out: 4
100
120
150
180
30
Look up 32…
179
180
200
120
156
100
101
130
150
110
30
11
35
5
3
And follow next-leaf pointers until you hit upper bound

12
Insertion
• Insert a record with search key value 32
Max fan-out: 4
100
120
150
180
30
Look up where the

inserted key
should go…
32
179
120
156
180
100
101
130
150
200
110
30
11
35
5
3
And insert it right there

13
Another insertion example

• Insert a record with search key value 152
Max fan-out: 4
100
120
150
180
152
179
120
156
180
100
101
130
150
200
110
Oops, node is already full!

14
Node splitting
Max fan-out: 4
100
Oops, that node
120
150
180
becomes full!
156
Need to add to parent node a pointer
to the newly created node
179
150
156
180
100
120
101
130
200
152
110
15
More node splitting
Max fan-out: 4
156
100
Need to add to parent node a pointer
to the newly created node
180
120
150
179
150
156
180
100
120
101
130
200
152
110
• In the worst case, node splitting can “propagate” all the way up
to the root of the tree (not illustrated here)
• Splitting the root introduces a new root of fan-out 2 and causes the tree
to grow “up” by one level
16
Deletion
• Delete a record with search key value 130
Max fan-out: 4
100
120
150
180
Look up the key If a sibling has more
to be deleted… than enough keys,
steal one!
179
120
156
180
100
101
130
150
200
110
And delete it
Oops, node is too empty!
17
Stealing from a sibling
Max fan-out: 4
100
150 156
Remember to fix the key
120
180
in the least common ancestor
of the affected nodes
179
120
156
180
100
101
150
200
110
18
Another deletion example

• Delete a record with search key value 179
Max fan-out: 4
100
120
156
180
179
120
156
180
100
101
150
200
110
Cannot steal from siblings

Then coalesce (merge) with a sibling!
19
Coalescing
Max fan-out: 4
100
Remember to delete the
120
156
180
appropriate key from parent
180
200
120
156
100
101
150
110
• Deletion can “propagate” all the way up to the root of the

tree (not illustrated here)
• When the root becomes empty, the tree “shrinks” by one level
20
Performance analysis
• How many I/O’s are required for each operation?
• ℎ, the height of the tree (more or less)
• Plus one or two to manipulate actual records
• Plus 𝑂 ℎ for reorganization (rare if 𝑓 is large)
• Minus one if we cache the root in memory
• How big is ℎ?
• Roughly log 56789: 𝑁, where 𝑁 is the number of records
• B+-tree properties guarantee that fan-out is least 𝑓/2 for
all non-root nodes
• Fan-out is typically large (in hundreds)—many keys and
pointers can fit into one block
• A 4-level B+-tree is enough for “typical” tables (next
slide)
21
Typicall B+ Trees in Practice

• Typical max entries: 200.
• Typical fill-factor: 67%
• average fanout F = 133
• Typical capacities:
• Height 4: 1334 = 312,900,700 records
• Height 3: 1333 = 2,352,637 records
• Can often hold top levels in buffer pool:
• Level 1 = 1 page = 8 Kbytes
• Level 2 = 133 pages = 1 Mbyte
• Level 3 = 17,689 pages = 133 MBytes
22
B+-tree in practice
• Complex reorganization for deletion often is not
implemented (e.g., Oracle)
• Leave nodes less than half full and periodically
reorganize
• Most commercial DBMS use B+-tree instead of
hashing-based indexes because B+-tree handles
range queries
23
The Halloween Problem

• Story from the early days of System R…
UPDATE Payroll
SET salary = salary * 1.1
WHERE salary <= 25000;
• There is a B+-tree index on Payroll(salary)
• The update never stopped until all employees earned 25k
(why?)
• Solutions?
• Scan index in reverse, or
• Before update, scan index to create a “to-do” list, or
• During update, maintain a “done” list, or
• Tag every row with transaction/statement id
https://en.wikipedia.org/wiki/Halloween_Problem
24
B+-tree versus ISAM

• ISAM is more static; B+-tree is more dynamic
• ISAM can be more compact (at least initially)
• Fewer levels and I/O’s than B+-tree
• Overtime, ISAM may not be balanced
• Cannot provide guaranteed performance as B+-tree does
• Due to “skew”
25
B+-tree versus B-tree

• B-tree: why not store records (or record pointers)
in non-leaf nodes?
• These records can be accessed with fewer I/O’s
• Problems?
• Storing more data in a node decreases fan-out and
increases ℎ
• Records in leaves require more I/O’s to access
• Vast majority of the records live in leaves!
26
B+ tree vs. Hash-based indexes

• Extensible hashing, linear hashing, etc.
• Can only handle “=” in join or selection
• Cannot handle range predicates >, ≥, <, ≤
Index organized file hashed on AGE, with Auxiliary index on SAL
Smith, 44, 3000 h2(AGE) = 00

Jones, 40, 6003 3000
h1(AGE) = 00 3000
Tracy, 44, 5004
5004 h2
5004
h1(AGE) = 01
AGE Ashby, 25, 3000
h1 Basu, 33, 4003 4003 h2(SAL) = 01
Bristow, 29, 2007 2007
6003
6003
h1(AGE) = 10 Cass, 50, 5004
Daniels, 22, 6003
File of <SAL, rid> pairs hashed on SAL
Employee File hashed on AGE

27
Beyond ISAM, B-, and B+-trees, and hash

• Other tree-based indexes: R-trees and variants,
GiST, etc.
• How about binary tree?
vs.
• Text indexes: inverted-list index, suffix arrays, etc.

• Other tricks: bitmap index, bit-sliced index, etc.
28
Query Processing
29
Overview
• Many different ways of processing the same query
• Scan? Sort? Hash? Use an index?
• All have different performance characteristics and/or
make different assumptions about data
• Best choice depends on the situation
• Implement all alternatives
• Let the query optimizer choose at run-time
30
Notation
• Relations: 𝑅, 𝑆
• Tuples: 𝑟, 𝑠
• Number of tuples: 𝑅 , 𝑆
• Number of disk blocks: 𝐵 𝑅 , 𝐵 𝑆
• Number of memory blocks available: 𝑀
• Cost metric
• Number of I/O’s
• Memory requirement
• We do not count the cost of final write to disk
• Do not try to memorize the formulas for cost

estimation!
• understand the logic
• recall the diagram of disk and memory on whiteboard
31
Scanning-based algorithms
32
Table scan
• Scan table R and process the query
• Selection over R
• Projection of R without duplicate elimination
• I/O’s: 𝐵 𝑅
• Trick for selection: stop early if it is a lookup by key
• Memory requirement: 2
• Not counting the cost of writing the result out
• Same for any algorithm!
• Maybe not needed—results may be pipelined into
another operator
33
Nested-loop join
𝑅 ⋈D 𝑆
• For each block of 𝑅, and for each 𝑟 in the block:
For each block of 𝑆, and for each 𝑠 in the block:
Output 𝑟𝑠 if 𝑝 evaluates to true over 𝑟 and 𝑠
• 𝑅 is called the outer table; 𝑆 is called the inner table
• I/O’s: 𝐵 𝑅 + 𝑅 ⋅ 𝐵 𝑆
• Memory requirement: 3
Improvement: block-based nested-loop join
• For each block of 𝑅, for each block of 𝑆:
For each 𝑟 in the 𝑅 block, for each 𝑠 in the 𝑆 block: …
• I/O’s: 𝐵 𝑅 + 𝐵 𝑅 ⋅ 𝐵 𝑆
End of lecture 15
• Memory requirement: same as before

Indexing and Query Processing: Introduction To Databases Compsci 316 Fall 2017

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Indexing and Query Processing: Introduction To Databases Compsci 316 Fall 2017

Uploaded by

Copyright:

Available Formats

Indexing

Announcements (Wed., Mar. 8)

• No class next week

Recall: What are indexes for?

• Search key ≠ key in a relation (unique attributes)

• Recap: index structure on whiteboard

Recall: Index classification

Recall: Sample B+-tree nodes

to keys to keys to keys to keys

Leaf to next leaf node in sequence

to records with these 𝑘 values;

Recall: B+-tree balancing properties

Max # Max # Min # Min #

And follow next-leaf pointers until you hit upper bound

Look up where the

And insert it right there

Another insertion example

Oops, node is already full!

More node splitting

Stealing from a sibling

Another deletion example

Cannot steal from siblings

• Deletion can “propagate” all the way up to the root of the

Typicall B+ Trees in Practice

The Halloween Problem

B+-tree versus ISAM

B+-tree versus B-tree

B+ tree vs. Hash-based indexes

Smith, 44, 3000 h2(AGE) = 00

Employee File hashed on AGE

Beyond ISAM, B-, and B+-trees, and hash

• Text indexes: inverted-list index, suffix arrays, etc.

• Do not try to memorize the formulas for cost

You might also like