Professional Documents
Culture Documents
Symbol Table
3
Over View..
Language can also be classified using generations as well.
4
Over View...
3rd generation programming language (3GL)
C, C++, C#, Java, Basic, COBOL, Lisp and ML.
5
Over View...
Modeling in Compiler Design
Compiler design is one of the places where theory has had the most
impact on practice.
6
Over View…
Optimization is to produce code that is more efficient than the
obvious code.
The optimization must be correct, that is, preserve the meaning of the
compiled program.
The optimization must improve the performance of many programs.
The compilation time must be kept reasonable.
7
TODAY’S LESSON
8
Contents
Syntax Director Translator
Introduction
Syntax Definition
Context Free Grammars
Derivations
Parse Trees
Ambiguity
Associativity of Operators
Operator Precedence
9
Syntax Directed Translator
This section illustrates the compiling techniques by developing a
program that translates representative programming language
statements into three-address code(Assembly language), an
intermediate representation.
We will focus on
Front end of a compiler
Lexical analysis
Parsing
Intermediate code generation.
10
ROUGH
Background : Parser uses a CFG(Context-free-Grammer) to
validate the input string and produce output for next phase of
the compiler.
Output could be either a parse tree or abstract syntax tree.
Now to interleave semantic analysis with syntax analysis
phase of the compiler, we use Syntax Directed Translation.
11
Syntax Directed Translator..
12
Introduction
Analysis is organized around the "syntax" of the language to be
compiled.
The syntax of a programming language describes the proper form of its
programs.
The semantics of the language defines what its programs mean.
13
ROUGH
Infix : an arithmetic expression such as B * C, In this case we
know that the variable B is being multiplied by the variable C
since the multiplication operator * appears between them in
the expression. This type of notation is referred to as infix since
the operator is in between the two operands that it is working
on.
There are two other very important expression formats that may not
seem obvious to you at first. Consider the infix expression A + B. What
would happen if we moved the operator before the two operands? The
resulting expression would be + A B. Likewise, we could move the
operator to the end. We would get A B +. These look a bit strange.
These changes to the position of the operator with respect to the
operands create two new expression formats, prefix and postfix. Prefix
expression notation requires that all operators precede the two
operands that they work on. Postfix, on the other hand, requires that
its operators come after the corresponding operands.
15
ROUGH
In postfix, A + B * C would be written as A B C * +. the order
of operations is preserved since the * appears immediately
after the B and the C, denoting that * has precedence, with +
coming after. Although the operators moved and now appear
either before or after their respective operands, the order of
the operands stayed exactly the same relative to one another.
16
ROUGH
17
ROUGH
4+5*6 456*+
18
ROUGH
)C + D( * )A + B( AB+CD+* *+AB+CD
19
Syntax Definition
Context Free Grammar is used to specify the syntax of the
language.
Shortly we can say it “Grammar”.
Ex.
if ( expression ) statement else statement
20
Syntax Definition..
This rule can be expressed as production by using the variable expr
to denote an expression and the variable stmt to denote a
statement.
stmt -> if ( expr ) stmt else stmt
In a production
lexical elements like the keyword if, else and the parentheses are
called terminals.
Variables like expr and stmt represent sequences of terminals and are
called nonterminals.
21
Grammars
A context-free grammar has four components
22
Grammars..
Expressions …
9–5+2, 5–4, 8…
Since a plus or minus sign must appear between two digits, we refer
to such expressions as lists of digits separated by plus or minus signs.
23
Grammars..
Terminals
0,1,2,3,4,5,6,7,8,9
Non-Terminals
list , digit
24
Derivations
Given a CF grammar we can determine the set of all strings
(sequences of tokens) generated by the grammar using derivation.
25
Derivations..
Derivation for our example expression.
26
Parse Trees
Parsing is the problem of taking a string of terminals and figuring
out how to derive it from the start symbol of the grammar.
If it cannot be derived from the start symbol of the grammar, then
reporting syntax errors within the string.
27
Parse Trees..
Parse tree of the string 9-5+2 using grammar G
list
list digit
list digit
digit
The sequence of
9 - 5 + 2 leafs is called the
yield of the parse tree
28
Tree Terminology
A tree consists of one or more nodes.
Exactly one is the root.
29
Ambiguity
A grammar can have more than one parse tree generating a given
string of terminals.
Such a grammar is said to be ambiguous.
30
Ambiguity..
Consider the Grammar
G = [ {string}, {+,-,0,1,2,3,4,5,6,7,8,9}, P, string ]
31
Ambiguity…
string string
9 - 5 + 2 9 - 5 + 2
32
Associativity of Operators
Left-associative operators have left-recursive productions
For instance
list list – digit | digit
String 9-5-2 has the same meaning as (9-5)-2
33
Associativity of Operators..
34
Operator Precedence
Consider the expression 9+5*2.
35
Operator Precedence..
Lets see an example of four common arithmetic operators and a
precedence table, showing the operators in order of increasing
precedence.
left-associative: + -
left-associative: */
Now we create two nonterminals expr and term for the two levels of
precedence, and an extra nonterminal factor for generating basic units in
expressions.
36
Operator Precedence..
Now consider the binary operators, * and /, that have the highest
precedence and left associativity.
term - > term * factor | term / factor | factor
Similarly, expr generates lists of terms separated by the additive
operators.
expr -> expr + term I expr – term I term
Final grammar is
expr -> expr + term I expr – term I term
term - > term * factor | term / factor | factor
factor -> digit I ( expr )
37
Operator Precedence..
Ex. String 2+3*5 has the same meaning as 2+(3*5)
expr
expr term
term term factor
factor factor number
number number
2 + 3 * 5
38
Associativity & Precedence Table
39
Thank You