
Lexical analysis

Prof Rajeev Barua


Program Analysis -- Slide Set 4

Lexical analysis: definition and tokens
Lexical analysis is the first task of the compiler. It has the following inputs and outputs:

● Input: The text of a high-level program.


● Output: Individual words or "tokens" in the program representing keywords, identifiers, numbers,
operators, syntax elements, etc.

Examples of tokens: [table of example tokens shown on the slide]

The following are not tokens, because the preprocessor removes them before lexical analysis: [list shown on the slide]
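As a concrete illustration (a minimal sketch, not from the slides; the token names and fields are assumptions), the tokens a lexer outputs could be represented in C like this:

    #include <stdio.h>

    /* Assumed token kinds, for illustration only. */
    enum token_kind { TOK_IF, TOK_ID, TOK_NUM, TOK_PLUS, TOK_SEMI };

    struct token {
        enum token_kind kind;  /* which kind of token this is       */
        const char *text;      /* the matched characters ("lexeme") */
    };

    int main(void) {
        /* A plausible token stream for the input "if x8": */
        struct token toks[] = { { TOK_IF, "if" }, { TOK_ID, "x8" } };
        for (int i = 0; i < 2; i++)
            printf("kind=%d text=%s\n", toks[i].kind, toks[i].text);
        return 0;
    }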
Lexical analysis: example input & output

[Example input program and its output token stream, reproduced from page 17 of the textbook.]


Regular expressions
We need a way to recognize tokens automatically.
● We use a mathematical formulation called regular expressions for that.
● Regular expressions are a way of specifying the legal forms of tokens.

An example of a regular expression: (a|b)*aa(a|ϵ)*


Regular expressions are specified as follows:
● Symbols (e.g., a)
● Alternation (e.g., M|N means M or N)
● Concatenation (e.g., M.N or MN)
● Epsilon (ϵ) denotes the empty string "".
● Repetition: M* means the concatenation of zero or more instances of M.
○ M+ means one or more instances.

A regular expression represents a set of strings: the members of the language that the regular expression specifies.
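To see which strings the example expression accepts, note that (a|ϵ)* is equivalent to a*, so (a|b)*aa(a|ϵ)* accepts exactly the strings of a's and b's that end in at least two a's: "aa", "baa", and "abaaa" are accepted, while "ab" and "ba" are not. A minimal C sketch that checks this with the POSIX regex library (an assumption about available tooling, not something the slides use):

    #include <regex.h>
    #include <stdio.h>

    int main(void) {
        regex_t re;
        /* (a|b)*aa(a|eps)* simplifies to (a|b)*aaa*; anchor to the whole string. */
        if (regcomp(&re, "^(a|b)*aaa*$", REG_EXTENDED | REG_NOSUB) != 0)
            return 1;
        const char *tests[] = { "aa", "baa", "abaaa", "ab", "ba" };
        for (int i = 0; i < 5; i++)
            printf("%-6s %s\n", tests[i],
                   regexec(&re, tests[i], 0, NULL, 0) == 0 ? "matches" : "does not match");
        regfree(&re);
        return 0;
    }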
Examples of regular expressions

In general: [table of example regular expressions shown on the slide]

For programming languages: [table of six token rules shown on the slide]

● Rule 5: -- is for starting comments in this language.
● Rule 6: . matches any single character except newline, and is present to match any single character not matched by any other rule.
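As a hedged reconstruction (in the textbook's style; the slide's actual table is not reproduced here), the six rules likely look something like:

    1. if                                   → reserved word IF
    2. [a-z][a-z0-9]*                       → identifier ID
    3. [0-9]+                               → integer NUM
    4. ([0-9]+"."[0-9]*)|([0-9]*"."[0-9]+)  → real number REAL
    5. ("--"[a-z]*"\n")|(" "|"\n"|"\t")+    → no token (comment or white space)
    6. .                                    → error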
Ambiguity and disambiguation of regular expressions

The above rules are ambiguous. Examples:


● "if" could be a keyword or an identifier.
● "if8" could be a single identifier, or a keyword"if" followed by token "8".

Disambiguation rules:

● Longest match: the longest initial substring of the input that can match any regular expression is taken as the next token.
○ E.g.: "if8" matches as an identifier, since that is the token with the longest match.

● Rule priority: for a particular longest initial substring, the first regular expression that matches is used.
○ Thus the order in which the rules are written is important!
○ E.g.: "if" matches the reserved word, not the identifier, by rule priority.
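A minimal C sketch of both disambiguation rules (the matcher functions, token names, and rule set are assumptions for illustration):

    #include <stdio.h>
    #include <string.h>
    #include <ctype.h>

    /* One hypothetical matcher per rule, listed in priority order.
     * Each returns the length of the longest prefix of s it matches,
     * or 0 if it matches nothing. */
    static int match_if(const char *s) {          /* rule 1: reserved word "if" */
        return strncmp(s, "if", 2) == 0 ? 2 : 0;
    }
    static int match_id(const char *s) {          /* rule 2: [a-z][a-z0-9]* */
        int n = 0;
        if (islower((unsigned char)s[0]))
            for (n = 1; islower((unsigned char)s[n]) || isdigit((unsigned char)s[n]); n++)
                ;
        return n;
    }

    int main(void) {
        int (*rules[])(const char *) = { match_if, match_id };
        const char *names[] = { "IF", "ID" };
        const char *inputs[] = { "if8", "if" };

        for (int i = 0; i < 2; i++) {
            int best_len = 0, best_rule = -1;
            for (int r = 0; r < 2; r++) {
                int len = rules[r](inputs[i]);
                /* Strictly greater: a longer match always wins (longest match);
                 * on a tie, the earlier rule wins (rule priority). */
                if (len > best_len) { best_len = len; best_rule = r; }
            }
            printf("\"%s\" -> token %s, length %d\n", inputs[i], names[best_rule], best_len);
        }
        return 0;
    }

Running this prints ID (length 3) for "if8" and IF (length 2) for "if", matching the two examples above.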
Finite Automaton

A finite automaton (FA) is a graph-based formalism for recognizing regular expressions.
● It contains states and edges.
● Examples of FAs are shown on the slide (figures not reproduced here).
● Plural form: finite automata.
The need to combine finite automata

Next we shall try to combine multiple FAs into one.

To do so, we need the concept of a DFA (deterministic FA):

● In a DFA, there cannot be more than one edge out of any state with the same input
character.
○ So it knows where to go next!
● A combined FA must be a DFA to be useful as an automated recognizer in a computer program!

Combining the FAs on the previous page in the obvious way does not work.

● For example, combining the FAs for IF and ID results in two outgoing edges from state 1 for
the letter “i”.
○ This violates the condition for a DFA above!
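Concretely (with illustrative state numbers, since the slide's figures are not reproduced here), suppose the two FAs are:

IF FA: state 1 --i--> state 2 --f--> state 3 (final)
ID FA: state 1 --[a-z]--> state 2 (final), with a [a-z0-9] self-loop on state 2

Merging the two at a shared start state leaves that state with two different out-edges labeled "i" (one heading toward IF, one toward ID), so the combined graph is not a DFA.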
Combining finite automata

A formal method for combining FAs into a single DFA is used in compilers.
● However, it is outside the syllabus of this course.
● In this course we will look at an informal (intuitive) method, which you should know how to apply.

An example of combining the multiple prior FAs into a single DFA is on the next slide.
Combining finite automata: Example

[Combined DFA figure shown on the slide.]

In this kind of DFA, if a character is missing from the out-edges of a state, and it is a final state, then we end the token and move to the start state.

If it is NOT a final state, then we generate an error and move to the start state.

The rest of chapter 2 is outside the syllabus. It deals with formal methods of combining regular expressions into DFAs.
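A minimal C sketch of this scanning loop, for a hand-built DFA that combines just the IF and ID rules (the rule set and state numbering are assumptions for illustration; in this tiny DFA every non-start state is final, so the error case only arises on a character with no edge out of the start state):

    #include <stdio.h>

    /* Hand-built combined DFA for two assumed rules:
     *   IF = "if"        ID = [a-z][a-z0-9]*
     * States: 0 = start, 1 = saw "i" (final, ID), 2 = saw "if" (final, IF),
     *         3 = some other identifier prefix (final, ID).
     * Returns the next state, or -1 if state has no out-edge for c. */
    static int step(int state, char c) {
        int lower = (c >= 'a' && c <= 'z'), digit = (c >= '0' && c <= '9');
        switch (state) {
        case 0: return c == 'i' ? 1 : (lower ? 3 : -1);
        case 1: return c == 'f' ? 2 : ((lower || digit) ? 3 : -1);
        case 2:                          /* "if" plus more letters/digits is an ID */
        case 3: return (lower || digit) ? 3 : -1;
        }
        return -1;
    }

    int main(void) {
        const char *p = "if8 if x2";
        while (*p) {
            if (*p == ' ') { p++; continue; }   /* skip white space */
            int state = 0, len = 0, last = -1;
            while (p[len] != '\0' && (state = step(state, p[len])) != -1) {
                len++;
                last = state;                   /* last state we reached */
            }
            if (last == -1) {                   /* no edge out of start: error, */
                printf("error at '%c'\n", *p);  /* skip the character and      */
                p++;                            /* move to the start state     */
            } else {                            /* ended in a final state:     */
                printf("%s \"%.*s\"\n", last == 2 ? "IF" : "ID", len, p);
                p += len;                       /* end token, back to start    */
            }
        }
        return 0;
    }

On the input "if8 if x2" this prints ID "if8", IF "if", ID "x2", consistent with the longest-match and rule-priority conventions from earlier.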
There are existing software tools for lexical analysis

E.g., lex on Unix-based systems

File with regular expressions → Lex → C program to recognize tokens

C program → Compile → Executable lexical analyzer for your regular expressions

HLL program → Executable lexical analyzer → Tokens