
UNIT I

INTRODUCTION TO COMPILERS

Lexical Analysis
Lex – The Lexical-Analyzer Generator
Introduction
• A recent implementation is Flex
• Lex lets us specify a lexical analyzer by writing
regular expressions to describe the patterns for
tokens
• Tool: the Lex compiler
• Input notation: the Lex language
• The Lex compiler transforms the input patterns into
a transition diagram and generates code, in a file
lex.yy.c, that simulates this transition diagram
Use of Lex
• Creating a lexical analyzer with Lex
Use of Lex
• An input file, lex.l, is written in the Lex language
and describes the lexical analyzer to be generated
• The Lex compiler transforms lex.l to a C program
always named lex.yy.c
• The latter file is compiled by the C compiler into a
file called a.out
• The C-compiler output is a lexical analyzer that
takes a stream of input characters and produces a
stream of tokens
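• As a rough sketch (assuming a POSIX lex or flex installation; the exact
library flag varies between systems), the two steps above can be run as:

lex lex.l                  # or: flex lex.l      -- produces lex.yy.c
cc lex.yy.c -o a.out -ll   # -ll (or -lfl with flex) supplies a default main() and yywrap()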
Use of Lex
• a.out
– A subroutine of the parser
– Returns an integer, a code for one of the possible token
names
– The attribute value, whether a numeric code, a pointer
to the symbol table, or nothing, is placed in a global
variable yylval
– yylval is shared between the lexical analyzer and parser
– Thereby returns both the name and an attribute value
of a token
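• A minimal sketch of how a parser might call this subroutine (the loop and
the use of 0 as an end-of-input code are illustrative, not from the slides):

extern int yylex(void);   /* generated by Lex in lex.yy.c */
extern int yylval;        /* shared attribute value */

void parse(void) {
    int token;
    while ((token = yylex()) != 0) {   /* yylex() conventionally returns 0 at end of input */
        /* use token (the token name) and yylval (its attribute value) here */
    }
}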
Structure of Lex Programs
• A Lex program has the following form:

declarations
%%
translation rules
%%
auxiliary functions

• The declarations section includes declarations of
– variables
– manifest constants
(identifiers declared to stand for a constant, e.g., the name of a token)
– regular definitions
Structure of Lex Programs
• The translation rules have the form
Pattern { Action }
– Each pattern is a regular expression
– The actions are fragments of code written in C
• The third section holds additional functions
used in the actions
– These functions can be compiled separately and
loaded with the lexical analyzer
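• For illustration only (not part of the original slides), a complete Lex
program with all three sections, which counts the lines and characters in
its input, might look like this; it would be built with the commands shown
earlier, linking the Lex library:

%{
#include <stdio.h>
int lines = 0, chars = 0;          /* declarations, copied verbatim into lex.yy.c */
%}
%%
\n      { lines++; chars++; }
.       { chars++; }
%%
int main(void) {                   /* auxiliary functions used by the rules above */
    yylex();
    printf("%d lines, %d chars\n", lines, chars);
    return 0;
}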
Structure of Lex Programs
• The lexical analyzer created by Lex behaves in concert with
the parser as follows
– When called by the parser, the lexical analyzer begins reading its
remaining input, one character at a time, until it finds the longest
prefix of the input that matches one of the patterns Pi
– It then executes the associated action Ai
– Typically, Ai returns control to the parser
• If it does not (e.g., when the lexeme is whitespace or a comment), the
lexical analyzer proceeds to find additional lexemes, until one of the
corresponding actions causes a return to the parser
• The lexical analyzer returns a single value, the token name,
to the parser
• It uses the shared, integer variable yylval to pass additional
information about the lexeme when needed
Example
• Tokens, patterns, and attribute values
Example
%{
/* definitions of manifest constants
LT, LE, EQ, NE, GT, GE, IF, THEN, ELSE, ID, NUMBER, RELOP */
%}

/* regular definitions */
delim [ \t\n]
ws {delim}+
letter [A-Za-z]
digit [0-9]
id {letter}({letter}|{digit})*
number {digit}+(\.{digit}+)?(E[+-]?{digit}+)?
%%
Example
{ws}     {/* no action and no return */}
if       {return (IF);}
then     {return (THEN);}
else     {return (ELSE);}
{id}     {yylval = (int) installID(); return (ID);}
{number} {yylval = (int) installNum(); return (NUMBER);}
"<"      {yylval = LT; return (RELOP);}
"<="     {yylval = LE; return (RELOP);}
"="      {yylval = EQ; return (RELOP);}
"<>"     {yylval = NE; return (RELOP);}
">"      {yylval = GT; return (RELOP);}
">="     {yylval = GE; return (RELOP);}
%%
Example
int installID() { /* function to install the lexeme, whose
                     first character is pointed to by yytext,
                     and whose length is yyleng, into the
                     symbol table and return a pointer
                     thereto */
}

int installNum() { /* similar to installID, but puts numerical
                      constants into a separate table */
}
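• The slides leave these bodies as comments. One possible sketch (assuming a
simple append-only table; the names symtable, numtable, and MAXENTRIES are
illustrative, not from the slides, and yytext is declared with the flex
convention):

#include <string.h>

extern char *yytext;                 /* first character of the current lexeme */
extern int   yyleng;                 /* length of the current lexeme */

#define MAXENTRIES 1024
static char *symtable[MAXENTRIES];   /* hypothetical identifier table */
static int   nsyms = 0;
static char *numtable[MAXENTRIES];   /* hypothetical table for numeric constants */
static int   nnums = 0;

int installID() {
    /* a real implementation would first search for an existing entry */
    symtable[nsyms] = strndup(yytext, yyleng);
    return nsyms++;                  /* the table index stands in for the pointer */
}

int installNum() {
    numtable[nnums] = strndup(yytext, yyleng);
    return nnums++;
}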
Example
• The Lex program recognizes the tokens given
and returns the token found
• Declarations section
– A pair of special brackets, %{ and %}
• Anything within these brackets is copied directly to the
file lex.yy.c, and is not treated as a regular definition
• The definitions of the manifest constants are placed
here using C #define statements
– These associate a unique integer code with each of the manifest
constants
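• For example, the #define statements between %{ and %} might read as follows
(the specific codes are illustrative; any distinct integers outside the
single-character range would do):

#define LT     256
#define LE     257
#define EQ     258
#define NE     259
#define GT     260
#define GE     261
#define IF     262
#define THEN   263
#define ELSE   264
#define ID     265
#define NUMBER 266
#define RELOP  267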
Example
– A sequence of regular definitions
• These use the extended notation for regular
expressions
• Regular definitions that are used in later definitions or
in the patterns of the translation rules are surrounded
by curly braces
• For instance, delim
– defined to be a shorthand for the character class consisting of
the blank, the tab, and the newline
– ws is defined to be one or more delimiters, by the regular
expression {delim}+
Example
• Patterns and rules in the middle section
• Auxiliary-function section
– Everything in the auxiliary section is copied directly to file lex.yy.c,
but may be used in the actions
– Function installID ()
• Called to place the lexeme found in the symbol table
• Returns a pointer to the symbol-table entry, which is placed in the global
variable yylval, where it can be used by the parser or a later component of
the compiler
• yytext is a pointer to the beginning of the lexeme
• yyleng is the length of the lexeme found
• The token name ID is returned to the parser
– Function installNum ()
• A similar action is taken when a lexeme matching the pattern number is
found: the constant is installed in a separate table and the token name
NUMBER is returned to the parser
Conflict Resolution in Lex
• The two rules that Lex uses to decide on the
proper lexeme to select, when several prefixes
of the input match one or more patterns:
– Always prefer a longer prefix to a shorter prefix
– If the longest possible prefix matches two or more
patterns, prefer the pattern listed first in the Lex
program
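• To see both rules at work with the example rules above (a sketch reusing
the patterns already defined):

if       {return (IF);}
{id}     {yylval = (int) installID(); return (ID);}
"<"      {yylval = LT; return (RELOP);}
"<="     {yylval = LE; return (RELOP);}

• On input <=, the longest-prefix rule selects <= (RELOP with attribute LE)
rather than stopping at <. On input if, the longest prefix matches both the
pattern if and the pattern {id}; the rule for if wins because it is listed
first, so the keyword is recognized rather than an identifier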
The Lookahead Operator
• Lex automatically reads one character ahead of the last
character that forms the selected lexeme
• It then retracts the input so that only the lexeme itself is consumed
• Sometimes we want a certain pattern to be matched to the input only
when it is followed by certain other characters
• If so, we may use the slash / in a pattern to indicate the end of the part
of the pattern that matches the lexeme
• What follows the / is additional pattern that must be matched
before we can decide that the token in question was seen
• This second pattern is not part of the lexeme
The Lookahead Operator
• In some languages (e.g., Fortran), keywords are not reserved

IF(I,J) = 3                  here IF is the name of an array
IF( condition ) THEN ...     here IF is a keyword

• A Lex rule that uses the lookahead operator to recognize the keyword IF:

IF / \( .* \) {letter}
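• In an actual Lex file, blanks inside a pattern end the pattern, so the rule
is written without spaces. A sketch of how it might appear among the
translation rules, reusing the earlier definitions and token codes (an
assumption, not from the slides):

IF/\(.*\){letter}    {return (IF);}

• Only IF becomes the lexeme; the part after / is checked but left in the
input for later rules to match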
