0% found this document useful (0 votes)

40 views29 pages

Understanding Dataflow Analysis Techniques

Dataflow analysis is a common framework for expressing algorithms that compute information about a program. It provides a common language which makes analyses easier to communicate, compare, adapt, and reuse. Global iterative dataflow analysis collects information about how procedures manipulate data by constructing equations that model how information flows through basic blocks and solving them iteratively. Reaching definitions analysis determines which variable definitions may reach a use by modeling definitions, kills, and the flow of information through a control flow graph.

Uploaded by

krishnanand

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

40 views29 pages

Understanding Dataflow Analysis Techniques

Uploaded by

krishnanand

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Dataflow analysis

Dataflow analysis: what is it?

A common framework for expressing algorithms that compute

information about a program

Why is such a framework useful?

It provides a common language, which makes it easier to:

communicate your analysis to others
compare analyses
adapt techniques from one analysis to another
reuse implementations (eg: dataflow analysis frameworks)

Data flow analysis

Goal :
collect information about how a procedure manipulates its data

This information is used in various optimizations

For example, knowledge about what expressions are available at
some point helps in common subexpression elimination.

IMPORTANT!
Soundness is a must: Data flow analysis should never tell us that
a transformation is safe when in fact it is not.
It is better to not perform a valid optimization that to perform one
that changes the function of the program.

Soundness is a must!
Data flow analysis should never tell us that a transformation is
safe when in fact it is not.
When doing data flow analysis we must be

Conservative
Do not consider information that may not preserve the
behavior of the program

Aggressive
Try to collect information that is as exact as possible, so
we can get the greatest benefit from our optimizations.

Global Iterative Data Flow Analysis

Global:
Performed on the control flow graph
Goal = to collect information at the beginning and end of each
basic block

Iterative:
Construct data flow equations that describe how information
flows through each basic block and solve them by iteratively
converging on a solution.
The ingredients of the equations:
Algebraic representation of the property of interest
Labels associated to the control flow diagrams

Global Iterative Data Flow Analysis

Components of data flow equations
Sets containing collected information
In (or entry) set: information coming into the BB from
outside (following flow of dats)
gen set: information generated/collected within the BB
kill set: information that, due to action within the BB, will
affect what has been collected outside the BB
out (or exit) set: information leaving the BB
Functions (operations on these sets)
Transfer functions describe how information changes as it
flows through a basic block
Meet functions describe how information from multiple
paths is combined.

Global Iterative Data Flow Analysis

Algorithm sketch
Typically, a bit vector is used to store the information.
For example, in reaching definitions, each bit position corresponds to
one definition.
We use an iterative fixed-point algorithm.
Depending on the nature of the problem we are solving, we may need to
traverse each basic block in a forward (top-down) or backward direction.
The order in which we "visit" each BB is not important in terms of
algorithm correctness, but is important in terms of efficiency.
In & Out sets should be initialized in a conservative and aggressive way.
Initialize gen and kill sets
Initialize in or out sets (depending on "direction")
while there are no changes in in and out sets {
for each BB {
apply meet function
apply transfer function
}
}

Typical problems
Reaching definitions
For each use of a variable, find all definitions that reach it.

Upward exposed uses

For each definition of a variable, find all uses that it reaches.

Live variables
For a point p and a variable v, determine whether v is live at p.

Available expressions
Find all expressions whose value is available at some point p.

Very Busy expressions

Find all expressions whose value will be used in all the next
paths

Reaching definitions

Determine which definitions of a variable may reach a

use of the variable.

For each use, list the definitions that reach it. This is also
called a ud-chain.
In global data flow analysis, we collect such information at the
endpoints of a basic block, but we can do additional local
analysis within each block.

Uses of reaching definitions :

constant propagation

copy propagation

we need to know that all the definitions that reach a variable

assign it to the same constant
we need to know whether a particular copy statement is the only
definition that reaches a use.

code motion

we need to know whether a computation is loop-invariant

Something obvious
boolean x = true;
while (x) {
. . . // no change to x (and no exit/return stm)

The program doesnt terminate.

Proof: the only assignment to x is at top, so x is always
true.

As a Control Flow Graph

x = true

if x == true

body

Formulation: Reaching Definitions

Each place some variable x is assigned is a definition.
Ask:
for this use of x, where could x last have been defined?
In our example:
only at x=true.

Example: Reaching Definitions

d1: x = true
d1

d2
d1

if x == true

d2: a = 10

Clincher
Since at x == true, d1 is the only definition of x that
reaches, it must be that x is true at that point.
The conditional is not really a conditional and can be
replaced by a branch.

Not Always That Easy

int i = 2; int j = 3;

while (i != j) {
if (i < j) i += 2;
else j += 2;

Well develop techniques for this problem, but later

The Control Flow Graph

d1: i = 2
d2: j = 3
d1

d4
if i != j

d 1, d 2, d 3, d 4
if i < j

d 2, d 3, d 4

d1, d 2, d3, d4
d3: i = i+2

d1, d2, d3 , d4

d 1, d 3, d 4

d4: j = j+2
16

DFA is Sufficient Only

In this example, i can be defined in two places, and j
in two places.
No obvious way to discover that i!=j is always true.

But OK, because reaching definitions is sufficient to

catch most opportunities for constant folding
(replacement of a variable by its only possible value).

Example: Be Conservative
boolean x = true;
while (x) {

. . . *p = false; . . .
}
Is it possible that p points to x?

As a Control Flow Graph

d1: x = true
Another
def of x

if x == true

d2: *p = false

Possible Resolution
Just as data-flow analysis of reaching definitions can
tell what definitions of x might reach a point, another
DFA can eliminate cases where p definitely does not
point to x.

Example: the only definition of p is

p = &y and there
is no possibility that y is an alias of x.

Formalization:
Reaching definitions Analysis

How can we formalize a definition D?

By a pair (x,n) where x is the variable modified by D, and n
identifies the assignment D.

A definition D reaches a point p if there is a path from D to p along

which D is not killed.

A definition D of a variable x is killed when there is a redefinition

of x.

How can we represent the set of definitions reaching a point?

Reaching definitions

What is safe?

To assume that a definition reaches a point even if it turns

out not to.

The computed set of definitions reaching a point p will be a

superset of the actual set of definitions reaching p

Its a possible, not a definite property

Goal : make the set of reaching definitions as small as

possible (i.e. as close to the actual set as possible)

Reaching definitions

How are the gen and kill sets defined?

gen[B] = {definitions that appear in B and reach the end of B}

kill[B] = {all definitions that never reach the end of B}

What is the direction of the analysis?

forward
out[B] = gen[B] (in[B] - kill[B])

Reaching definitions

What is the confluence operator?

union

in[P] = out[Q], over the predecessors Q of P

How do we initialize?

start small
Why? Because we want the resulting set to be as small as
possible

for each block B initialize out[B] = gen[B]

Formal specification

The reaching Definition Analysis is specified by the folllowing

equations:

For each program point,

i if p is the initial point in the control graph

RDin(p)=

U { RDout(q) | there is an arrow from q to p in the CFD}

RDout(p)= genRD(p) U (RDin(p) \ killRD(p) )

RDin(1)= {(n,?),(m,?)}
RDout(1) = {(n,?),(m,?)}

input n;

RDin(2)= {(n,?),(m,?)}
RDout(2)= {(n,?),(m,2)}

m:= 1;
3

n>1;

m:= m*n;
5
6

output m;

n:= n-1;

RDin(3)= RDout(2) U RDout(5)

={(n,?),(n,5),(m,2),(m,4)}
RDout(3)= {(n,?),(n,5),(m,2),(m,4)}

RDin(4)= {(n,?),(n,5),(m,2),(m,4)}
RDout(4)= {(n,?),(n,5),(m,4)}
RDin(5)= {(n,?),(n,5),(m,4)}
RDout(5)= {(n,5),(m,4)}
RDin(6)= {(n,?),(n,5),(m,2),(m,4)}
RDout(6)= {(n,?),(n,5),(m,2),(m,4)}

Algorithm
Input: Control Graph Diagram

Output : RD
Steps:
step 1 (inizialization):
RDin(p) is the emptyset for each p
RDin(1) = i = {(x,?) | x is a program variable}

Step 2 (iteration)
Flag =TRUE;
while Flag
Flag = FALSE;
for each program point p
new = U{f(RD,q) | (q,p) is an edge of the graph}
if RDin(p) != new
Flag = TRUE;
RDin(p) = new;
where f(RD,q)= genRD(q) U (RDin(q) \ killRD(q) )

Example
[ input n; ]1

RDin(1)= {(n,?), (m,?)}

[ m:= 1; ]2

RDin(2)= {(n,?), (m,?)}

[ while n>1 do ]3
[ m:= m * n; ]4
[ n:= n - 1; ]5

RDin(3)= {(n,?), (n,5), (m,2), (m,4)}

RDin(4)= {(n,?), (n,5), (m,4)}

[ output m; ]6
RDin(5)= {(n,5), (m,4)}
RDin(6)= {(n,?), (n,5), (m,2), (m,4)}

Data Flow Analysis Techniques Explained
No ratings yet
Data Flow Analysis Techniques Explained
40 pages
Global Iterative Data Flow Analysis
No ratings yet
Global Iterative Data Flow Analysis
28 pages
Introduction To Data Flow Analysis
No ratings yet
Introduction To Data Flow Analysis
23 pages
Data Flow Analysis in Compiler Design
No ratings yet
Data Flow Analysis in Compiler Design
16 pages
1 Dfa
No ratings yet
1 Dfa
33 pages
Data Flow Analysis in Compilers
No ratings yet
Data Flow Analysis in Compilers
44 pages
Unit 5 Part 3
No ratings yet
Unit 5 Part 3
18 pages
Data Flow Analyaia
No ratings yet
Data Flow Analyaia
42 pages
Data-Flow Analysis in Compiler Design
No ratings yet
Data-Flow Analysis in Compiler Design
21 pages
Static vs. Dynamic Program Analysis
No ratings yet
Static vs. Dynamic Program Analysis
108 pages
Data-Flow Analysis Techniques Explained
No ratings yet
Data-Flow Analysis Techniques Explained
26 pages
Data-Flow Analysis Techniques Explained
No ratings yet
Data-Flow Analysis Techniques Explained
26 pages
Data Flow Analysis Lecture Notes
No ratings yet
Data Flow Analysis Lecture Notes
11 pages
Iterative Data Flow Analysis Techniques
No ratings yet
Iterative Data Flow Analysis Techniques
88 pages
Reaching Definitions in Data-Flow Analysis
No ratings yet
Reaching Definitions in Data-Flow Analysis
18 pages
Data-Flow Analysis Overview
No ratings yet
Data-Flow Analysis Overview
13 pages
Data Flow Analysis Fundamentals
No ratings yet
Data Flow Analysis Fundamentals
7 pages
Compiler Design
No ratings yet
Compiler Design
9 pages
Overview of Static Program Analysis Techniques
No ratings yet
Overview of Static Program Analysis Techniques
16 pages
Flow Sensitive Program Analysis Techniques
No ratings yet
Flow Sensitive Program Analysis Techniques
42 pages
Dataflow Handout
No ratings yet
Dataflow Handout
10 pages
Unit-4-Cd Dfa
No ratings yet
Unit-4-Cd Dfa
6 pages
Compiler Design: Optimization Techniques
No ratings yet
Compiler Design: Optimization Techniques
36 pages
Unit4 Contd CD
No ratings yet
Unit4 Contd CD
49 pages
DataFlowAnalysis v1
No ratings yet
DataFlowAnalysis v1
31 pages
Compiler Design
No ratings yet
Compiler Design
7 pages
4 Data-Testing PDF
No ratings yet
4 Data-Testing PDF
79 pages
Compiler Code Optimization Techniques
No ratings yet
Compiler Code Optimization Techniques
33 pages
Data Flow Coverage Analysis Techniques
No ratings yet
Data Flow Coverage Analysis Techniques
20 pages
USC Data-Flow Analysis Exercises
No ratings yet
USC Data-Flow Analysis Exercises
6 pages
Data Flow Testing in Software Quality
No ratings yet
Data Flow Testing in Software Quality
32 pages
Data Flow Analysis and Code Generation
No ratings yet
Data Flow Analysis and Code Generation
20 pages
CD Unit-5
No ratings yet
CD Unit-5
15 pages
Compiler Optimization Techniques
No ratings yet
Compiler Optimization Techniques
33 pages
Code Optimization Techniques in Compilers
No ratings yet
Code Optimization Techniques in Compilers
40 pages
Levels of Software Testing Explained
No ratings yet
Levels of Software Testing Explained
40 pages
CH 4 Part II
No ratings yet
CH 4 Part II
24 pages
Compiler Design: Code Optimization Techniques
No ratings yet
Compiler Design: Code Optimization Techniques
48 pages
Available Expressions in Compiler Optimization
No ratings yet
Available Expressions in Compiler Optimization
8 pages
Software Testing Techniques Overview
No ratings yet
Software Testing Techniques Overview
40 pages
Compiler Design
No ratings yet
Compiler Design
25 pages
Lecture 11 Part 2
No ratings yet
Lecture 11 Part 2
19 pages
Module 5 - Code Optimization
No ratings yet
Module 5 - Code Optimization
72 pages
Basics of Data Flow Testing
100% (1)
Basics of Data Flow Testing
13 pages
Foundation of Data Flow Analysis
No ratings yet
Foundation of Data Flow Analysis
31 pages
CH 13
No ratings yet
CH 13
12 pages
CD Unit-5 Full Notes Ok
No ratings yet
CD Unit-5 Full Notes Ok
49 pages
Chapter - Four Structural (White Box) Testing Part II
No ratings yet
Chapter - Four Structural (White Box) Testing Part II
34 pages
Data Flow Analysis in Control Flow
No ratings yet
Data Flow Analysis in Control Flow
18 pages
Chapter-5-Data Flow Models - Testing
No ratings yet
Chapter-5-Data Flow Models - Testing
52 pages
Program Analysis
No ratings yet
Program Analysis
73 pages
Compiler Data-Flow Analysis
No ratings yet
Compiler Data-Flow Analysis
10 pages
Code Optimization
No ratings yet
Code Optimization
32 pages
Dataflow Analysis in Compiler Optimization
No ratings yet
Dataflow Analysis in Compiler Optimization
20 pages
Compiler Optimization Techniques Guide
No ratings yet
Compiler Optimization Techniques Guide
10 pages
Data-Flow Analysis
No ratings yet
Data-Flow Analysis
11 pages
CD Unit 5
No ratings yet
CD Unit 5
12 pages
Reverse Engineering: Key Activities Explained
No ratings yet
Reverse Engineering: Key Activities Explained
50 pages
Chapter 06 Test Records For Retail Banking PDF
100% (1)
Chapter 06 Test Records For Retail Banking PDF
11 pages
CS 425 Software Engineering Principles
No ratings yet
CS 425 Software Engineering Principles
1 page
Chap03 CodesPostInstall
No ratings yet
Chap03 CodesPostInstall
1 page
IT Service Management in Cloud Computing
No ratings yet
IT Service Management in Cloud Computing
88 pages
Data Science & Predictive Analytics Solutions
No ratings yet
Data Science & Predictive Analytics Solutions
2 pages
Cassandra Succinctly PDF
No ratings yet
Cassandra Succinctly PDF
121 pages
AI 2 Marks PDF
No ratings yet
AI 2 Marks PDF
14 pages
M.SC - Bioinformatics
No ratings yet
M.SC - Bioinformatics
43 pages
UBRP201617 ITSpecilaist Officer Recruitment Notification
No ratings yet
UBRP201617 ITSpecilaist Officer Recruitment Notification
17 pages
Army Welfare Fund for Battle Casualties
No ratings yet
Army Welfare Fund for Battle Casualties
1 page
Sample CV for Fresh Graduates
100% (1)
Sample CV for Fresh Graduates
2 pages
Official SBI Specialist Officers Eligibility & Recruitment Notification 2016
No ratings yet
Official SBI Specialist Officers Eligibility & Recruitment Notification 2016
6 pages
Green Energy Expertise in Pondicherry
No ratings yet
Green Energy Expertise in Pondicherry
3 pages
Director General Position at IIDT
No ratings yet
Director General Position at IIDT
2 pages
Cloud Computing Conference 2016 Deadlines
No ratings yet
Cloud Computing Conference 2016 Deadlines
1 page
President (IT Promotion) Role
No ratings yet
President (IT Promotion) Role
2 pages
System Identification With Matlab. Linear Models
No ratings yet
System Identification With Matlab. Linear Models
267 pages
Fall 2012 Lab Scores Overview
No ratings yet
Fall 2012 Lab Scores Overview
3 pages
Understanding Cryptographic Hash Functions
No ratings yet
Understanding Cryptographic Hash Functions
8 pages
Railway RRB Solved Paper 2016 Questions
No ratings yet
Railway RRB Solved Paper 2016 Questions
21 pages
Client and Product Database Queries
No ratings yet
Client and Product Database Queries
2 pages
Computational Thinking Course Syllabus
No ratings yet
Computational Thinking Course Syllabus
2 pages
Mutual Exclusion Problem Solved
No ratings yet
Mutual Exclusion Problem Solved
3 pages
Beejs Guide 2001
No ratings yet
Beejs Guide 2001
47 pages
Partial Differential Equations Overview
No ratings yet
Partial Differential Equations Overview
5 pages
Differential Calculus
No ratings yet
Differential Calculus
22 pages
Student Exploration: Radical Functions
No ratings yet
Student Exploration: Radical Functions
10 pages
A Combined Genetic Adaptive Search (Geneas) For Engineering Design
100% (3)
A Combined Genetic Adaptive Search (Geneas) For Engineering Design
34 pages
SAT Math Problem Set 2 Answers and Explanations
No ratings yet
SAT Math Problem Set 2 Answers and Explanations
5 pages
Derivation of The Inverse Hyperbolic Trig Functions: y Sinh X
No ratings yet
Derivation of The Inverse Hyperbolic Trig Functions: y Sinh X
4 pages
Solving Equations Involving Complex Numbers
No ratings yet
Solving Equations Involving Complex Numbers
25 pages
Year 12 Math AA SL Mock Exam Paper 1
No ratings yet
Year 12 Math AA SL Mock Exam Paper 1
13 pages
Mean Value Theorem
No ratings yet
Mean Value Theorem
2 pages
Digital Signal Processing Overview
No ratings yet
Digital Signal Processing Overview
2 pages
Abstract Algebra Fourth Edition John A. Beachy Ebook Latest Full Release
100% (1)
Abstract Algebra Fourth Edition John A. Beachy Ebook Latest Full Release
50 pages
Mathopolis Inverse Function Quiz
No ratings yet
Mathopolis Inverse Function Quiz
2 pages
Spde No R PDF
No ratings yet
Spde No R PDF
26 pages
IMOMATH - Classical Inequalities
100% (1)
IMOMATH - Classical Inequalities
24 pages
SRG AOD Questions
No ratings yet
SRG AOD Questions
14 pages
Discrete Mathematics
No ratings yet
Discrete Mathematics
63 pages
Numerical Methods For Civil Engineering PDF
No ratings yet
Numerical Methods For Civil Engineering PDF
256 pages
H2 Further Mathematics (To Be Implemented in 2016) Brief Notes and Examples
No ratings yet
H2 Further Mathematics (To Be Implemented in 2016) Brief Notes and Examples
9 pages
IGNOU Real Analysis Assignment
No ratings yet
IGNOU Real Analysis Assignment
16 pages
Graph Theory Exercises and Concepts
No ratings yet
Graph Theory Exercises and Concepts
4 pages
Complex Numbers - Part 1
No ratings yet
Complex Numbers - Part 1
3 pages
Differential Forms - Tutorial
No ratings yet
Differential Forms - Tutorial
3 pages
CMI Maths Paper For PG
100% (1)
CMI Maths Paper For PG
17 pages
11th Class Notes 2024 Physics CH 2
No ratings yet
11th Class Notes 2024 Physics CH 2
28 pages
ITMO Students: Math Exam Guidelines
No ratings yet
ITMO Students: Math Exam Guidelines
6 pages
Differential Equations
No ratings yet
Differential Equations
28 pages
FCP BK 1 CH 7 Linear Transformations Qns
No ratings yet
FCP BK 1 CH 7 Linear Transformations Qns
2 pages
Multiple Choice Questions on Real Numbers
No ratings yet
Multiple Choice Questions on Real Numbers
2 pages
Engineering Mathematics - III BTBS301 - BTES301
No ratings yet
Engineering Mathematics - III BTBS301 - BTES301
2 pages
AOPS Books Part 7
No ratings yet
AOPS Books Part 7
3 pages

Understanding Dataflow Analysis Techniques

Uploaded by

Understanding Dataflow Analysis Techniques

Uploaded by

Dataflow analysis

Dataflow analysis: what is it?

A common framework for expressing algorithms that compute

Why is such a framework useful?

It provides a common language, which makes it easier to:

Data flow analysis

This information is used in various optimizations

Global Iterative Data Flow Analysis

Global Iterative Data Flow Analysis

Global Iterative Data Flow Analysis

Upward exposed uses

Very Busy expressions

Determine which definitions of a variable may reach a

Uses of reaching definitions :

we need to know that all the definitions that reach a variable

we need to know whether a computation is loop-invariant

The program doesnt terminate.

As a Control Flow Graph

Formulation: Reaching Definitions

Example: Reaching Definitions

Not Always That Easy

Well develop techniques for this problem, but later

The Control Flow Graph

DFA is Sufficient Only

But OK, because reaching definitions is sufficient to

As a Control Flow Graph

Example: the only definition of p is

How can we formalize a definition D?

A definition D reaches a point p if there is a path from D to p along

A definition D of a variable x is killed when there is a redefinition

How can we represent the set of definitions reaching a point?

To assume that a definition reaches a point even if it turns

The computed set of definitions reaching a point p will be a

Its a possible, not a definite property

Goal : make the set of reaching definitions as small as

How are the gen and kill sets defined?

gen[B] = {definitions that appear in B and reach the end of B}

What is the direction of the analysis?

What is the confluence operator?

in[P] = out[Q], over the predecessors Q of P

for each block B initialize out[B] = gen[B]

The reaching Definition Analysis is specified by the folllowing

For each program point,

i if p is the initial point in the control graph

U { RDout(q) | there is an arrow from q to p in the CFD}

RDin(3)= RDout(2) U RDout(5)

RDin(1)= {(n,?), (m,?)}

RDin(2)= {(n,?), (m,?)}

RDin(3)= {(n,?), (n,5), (m,2), (m,4)}

You might also like