Intro To Compilers Lecture 2

Lexical analysis partitions a program's source code string into tokens. It defines a set of token types like identifiers, integers, keywords, and whitespace. A lexical analyzer recognizes substrings that correspond to each token type and returns the lexeme (substring) and token type. Regular expressions provide a notation for specifying the patterns that define each token type. A lexical analyzer implementation uses these regular expression patterns to efficiently scan the source code and classify its substrings into tokens.

Uploaded by

fikadu


Compilers

Lexical Analysis
Lexical Analysis
• What is the goal?
if (i ==0)
z=0;
else
z=1;

• The input is just a string of characters:

• if (i ==0)\n\tz=0;\nelse\n\tz=1;
• Goal: Partition input string into substrings
• where the substrings are tokens
Token
• Tokens are the words of a language: the smallest units above individual letters.
• A token is a minimal syntactic category.
• English: noun, verb, adjective …
• Programming language: Identifier, integer, keyword, whitespace, …
• Tokens correspond to sets of strings
• Identifier: strings of letters or digits, starting with a letter
• Integer: a non-empty string of digits
• Keyword: “else” or “if” or …
• Whitespace: a non-empty sequence of blanks, newlines and tabs.
Contd…
• Tokens classify program substrings according to their role.
• The output of lexical analysis is a stream of tokens.
• The parser relies on token distinctions.
• An identifier is treated differently from a keyword.
Designing a lexical analyser
• Define a finite set of tokens
• Tokens describe all items of interest
• Choice of tokens depends on language, design of parser …
• Recall
• \tif (i == j)\n\t\tz = 0;\n\telse\n\t\tz = 1;
• Useful tokens for this expression:
• Integer, Keyword, Relation, Identifier, Whitespace, (, ), =, ;
• N.B., (, ), =, ; are tokens, not characters, here
• The next step is to describe which substrings belong to each token.
Implementation
• An implementation is responsible for two things.
• Recognize substrings corresponding to tokens accurately
• Return the value or lexeme (substring) of the token.
• First, it discards tokens that do not contribute to parsing:
• Whitespace and comments.

if (i ==0) //if clause
z=0;
else /*else clause is located here*/
z=1;

if (i == 0)\n\tz=0;\nelse\n\tz=1;
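The two responsibilities above (recognize the substrings, return each lexeme with its token type) and the discarding of whitespace and comments can be sketched in a few lines of Python. This is only an illustration, not the lecture's implementation: the token names, the `TOKEN_SPEC` table and the `tokenize` function are assumptions made for the example, and it covers only the tokens needed for the slide's if/else fragment.

```python
import re

# Hypothetical token table for the slide's example. Python's `re` tries
# alternatives left to right, so the order encodes priority: keywords come
# before identifiers, and "==" comes before "=".
TOKEN_SPEC = [
    ("COMMENT",    r"//[^\n]*|/\*.*?\*/"),
    ("WHITESPACE", r"[ \t\n]+"),
    ("KEYWORD",    r"\b(?:if|else)\b"),
    ("IDENTIFIER", r"[A-Za-z][A-Za-z0-9]*"),
    ("INTEGER",    r"[0-9]+"),
    ("RELATION",   r"=="),
    ("ASSIGN",     r"="),
    ("LPAREN",     r"\("),
    ("RPAREN",     r"\)"),
    ("SEMICOLON",  r";"),
]
# DOTALL lets the block-comment pattern span several lines.
MASTER = re.compile("|".join(f"(?P<{name}>{pat})" for name, pat in TOKEN_SPEC),
                    re.DOTALL)
SKIPPED = {"COMMENT", "WHITESPACE"}  # recognized, but never passed to the parser


def tokenize(source):
    """Return a list of (token type, lexeme) pairs, scanning left to right."""
    tokens = []
    pos = 0
    while pos < len(source):
        match = MASTER.match(source, pos)
        if match is None:
            raise SyntaxError(f"unexpected character {source[pos]!r} at offset {pos}")
        if match.lastgroup not in SKIPPED:
            tokens.append((match.lastgroup, match.group()))
        pos = match.end()
    return tokens


source = "if (i ==0) //if clause\n\tz=0;\nelse /*else clause is located here*/\n\tz=1;"
for kind, lexeme in tokenize(source):
    print(kind, repr(lexeme))
```

Comments and whitespace are still matched by a pattern, so the scanner knows where they end. They are simply not appended to the output stream.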
Some examples
• C++
• Most tokens are easy to recognize.
• Template syntax: Foo<Bar>
• Stream syntax: cin >> var;
• With nested templates there is a conflict: Foo<Bar<Bazz>> (the closing >> looks like the stream operator)
• Is if two variables i and f?
• Is == two equal signs = =, or a single operator?
Solution
• Left-to-right scan
• Lookahead is sometimes required.
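A hand-written scanner makes the left-to-right scan and the lookahead explicit. The sketch below is an assumption-laden illustration, not code from the lecture: the function name `scan`, the token names and the `KEYWORDS` set are hypothetical. It peeks one character ahead to tell `==` from `=`, and it reads a whole word before deciding whether it is a keyword or an identifier, so `if` is never split into `i` and `f`.

```python
import string

KEYWORDS = {"if", "else"}            # hypothetical keyword set for the example
LETTERS = set(string.ascii_letters)
DIGITS = set(string.digits)


def scan(source):
    """Left-to-right scan returning (token type, lexeme) pairs."""
    tokens = []
    i = 0
    while i < len(source):
        ch = source[i]
        if ch in " \t\n":
            i += 1                                   # skip whitespace
        elif ch == "=":
            # One character of lookahead decides between "==" and "=".
            if i + 1 < len(source) and source[i + 1] == "=":
                tokens.append(("RELATION", "=="))
                i += 2
            else:
                tokens.append(("ASSIGN", "="))
                i += 1
        elif ch in LETTERS:
            # Keep reading while letters or digits follow (longest match),
            # then classify the complete word.
            j = i
            while j < len(source) and (source[j] in LETTERS or source[j] in DIGITS):
                j += 1
            word = source[i:j]
            tokens.append(("KEYWORD" if word in KEYWORDS else "IDENTIFIER", word))
            i = j
        elif ch in DIGITS:
            j = i
            while j < len(source) and source[j] in DIGITS:
                j += 1
            tokens.append(("INTEGER", source[i:j]))
            i = j
        elif ch in "();":
            tokens.append((ch, ch))                  # punctuation is its own token
            i += 1
        else:
            raise SyntaxError(f"unexpected character {ch!r} at offset {i}")
    return tokens
```

For example, `scan("iff = 1")` yields an identifier `iff` followed by an assignment, while `scan("if (i==0)")` yields the keyword `if` and a single `==` relation token.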
Regular languages
• Regular languages are one of several formalisms for specifying tokens.
• Regular languages are a simple and useful theory
• Easy to understand
• Efficient implementation
• Definition: Let Σ be a set of characters. A language over Σ is a set of
strings of characters drawn from Σ.
Examples of languages

English
• Alphabet = characters
• Language = sentences

Programming language
• Alphabet = ASCII
• Language = programs
Notations
• Languages are sets of strings.

• Need some notation for specifying which sets we want

• The standard notation for regular languages is regular expressions.

Regular expressions
• Single character: ‘c’ = {“c”}
• Epsilon: ε = {“”}
• Union: A + B = { s | s ∈ A or s ∈ B }
• Concatenation: AB = { ab | a ∈ A and b ∈ B }
• Iteration: A* = ⋃_{i ≥ 0} A^i, where A^i = A A … A (i times) and A^0 = {“”}
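Because languages are sets of strings, the operations above can be tried out directly on small finite sets. The following is a minimal sketch with hypothetical helper names (`union`, `concat`, `power`, `star_up_to`). Since A* is infinite whenever A contains a non-empty string, the last helper only builds the union of A^0 through A^n for a chosen bound n.

```python
def union(a, b):
    """A + B: strings that are in A or in B."""
    return a | b


def concat(a, b):
    """AB: every string of A followed by every string of B."""
    return {x + y for x in a for y in b}


def power(a, i):
    """A^i: A concatenated with itself i times; A^0 is the set holding only ""."""
    result = {""}
    for _ in range(i):
        result = concat(result, a)
    return result


def star_up_to(a, n):
    """Finite approximation of A*: the union of A^0, A^1, ..., A^n."""
    result = set()
    for i in range(n + 1):
        result |= power(a, i)
    return result


print(sorted(concat({"a", "b"}, {"c"})))   # ['ac', 'bc']
print(sorted(star_up_to({"a"}, 3)))        # ['', 'a', 'aa', 'aaa']
```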
Regular expressions
• Definition: The regular expressions over Σ are the smallest set of
expressions including
• ε
• ‘c’ where c ∈ Σ
• A + B where A, B are regular expressions over Σ
• AB where A, B are regular expressions over Σ
• A* where A is a regular expression over Σ
Examples
• Keywords: “else” or “if” or …
• ‘else’ + ‘if’ + …
• ‘else’ abbreviates ‘e’ ‘l’ ‘s’ ‘e’
• Integer: a non-empty string of digits
• digit = ‘0’ + ‘1’ + ‘2’ + ‘3’ + ‘4’ + ‘5’ + ‘6’ + ‘7’ + ‘8’ + ‘9’
• integer = digit digit*
• Abbreviation: A⁺ = AA*
• Identifier: strings of letters or digits, starting with a letter
• letter = ‘A’ + … + ‘Z’ + ‘a’ + … + ‘z’
• identifier = letter (letter + digit)*
• Whitespace: a non-empty sequence of blanks, newlines, and tabs
• whitespace = (‘ ’ + ‘\n’ + ‘\t’)⁺
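The token definitions above translate almost symbol for symbol into the syntax of Python's `re` module, where union is written `|` instead of `+`. The sketch below is one possible transcription, not part of the lecture: the constant names and the `matches` helper are assumptions, and the whitespace pattern is derived from the slide's verbal description.

```python
import re

DIGIT = r"[0-9]"                                  # '0' + '1' + ... + '9'
LETTER = r"[A-Za-z]"                              # 'A' + ... + 'Z' + 'a' + ... + 'z'
INTEGER = f"{DIGIT}{DIGIT}*"                      # digit digit*
IDENTIFIER = f"{LETTER}(?:{LETTER}|{DIGIT})*"     # letter (letter + digit)*
KEYWORD = r"else|if"                              # 'else' + 'if'
BLANK = r"(?:[ ]|\n|\t)"                          # ' ' + '\n' + '\t'
WHITESPACE = f"{BLANK}{BLANK}*"                   # non-empty sequence: A A*


def matches(pattern, text):
    """True if the whole string belongs to the language of the pattern."""
    return re.fullmatch(pattern, text) is not None


print(matches(INTEGER, "042"))       # True
print(matches(INTEGER, ""))          # False: integers are non-empty
print(matches(IDENTIFIER, "z1"))     # True
print(matches(IDENTIFIER, "1z"))     # False: must start with a letter
print(matches(WHITESPACE, " \n\t"))  # True
```

Note that `"if"` is in the language of both `KEYWORD` and `IDENTIFIER`. The regular expressions alone do not resolve that overlap, so a lexer built on them has to give one token class priority.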
Examples
• Phone Number
• +251-911-00 00 00
• Σ = digits ∪ { -, +, ‘ ’ }
• Email Address
• Abc@abc.com

• There are regular expressions everywhere.

• Everything discussed so far is syntax, not semantics (meaning).
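As a small illustration of the phone number example, the pattern below accepts strings shaped exactly like +251-911-00 00 00. The grouping into 3, 3, 2, 2 and 2 digits is an assumption read off that single example, not a rule stated in the lecture, and the names `PHONE` and `is_phone_number` are hypothetical.

```python
import re

# Assumed shape: '+', 3 digits, '-', 3 digits, '-', then three 2-digit groups
# separated by single blanks. Only characters from Σ (digits, '-', '+', ' ') occur.
PHONE = re.compile(r"\+[0-9]{3}-[0-9]{3}-[0-9]{2} [0-9]{2} [0-9]{2}")


def is_phone_number(text):
    """True if the whole string has the assumed phone number shape."""
    return PHONE.fullmatch(text) is not None


print(is_phone_number("+251-911-00 00 00"))   # True
print(is_phone_number("251-911-00 00 00"))    # False: leading '+' is missing
```

The check is purely syntactic: it says nothing about whether the number exists or whom it belongs to, which is the syntax versus semantics distinction made above.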
