0% found this document useful (0 votes)

3 views

slides08-lr-parsing

The document discusses Bottom-Up LR Parsing, contrasting it with Top-Down parsing methods. It explains the mechanics of LR parsing, including the use of LR items and the handling of shift-reduce conflicts, particularly in the context of if-then-else statements. Additionally, it touches on error reporting and recovery strategies, as well as other parsing tools like GLR and PEG parsers.

Uploaded by

Rasha Elsayed Sakr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

slides08-lr-parsing

Uploaded by

Rasha Elsayed Sakr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 25

Bottom-Up LR Parsing

17-363/17-663: Programming Language Pragmatics

Reading: PLP section 2.3

Copyright © 2016 Elsevier

Prof. Jonathan Aldrich
Top-Down vs. Bottom-Up Parsing

• Top-Down/LL Parsing Intuition

program Start trying to parse a program

stmt_list $$$ Based

Start trying
on lookahead,
to parse arefine
program
to stmt_list
then to stmt stmt_list
stmt stmt_list $$$
Stack tracks predicted future parsing
...
• Bottom-Up/LR Parsing Intuition
read A Start by shifting a few tokens

stmt Reduce tokens to a stmt, thentotoa stmt_list

stmt, then a stmt_list

stmt_list Continue
Continue to
tokens
to shift
shift and
and reduce
reduce tokens
tokens
tokens to
to recognize
recognize another
another stmt
stmt
stmt_list read B Stack shows what constructs
stmt_list stmt have been recognized so far
Example Program and SLR(1) Grammar

read A
read B
sum := A + B
write sum
write sum / 2
Modeling a Parse with LR Items

• Initial parse state captured by an item

– includes start symbol, production, and current location

• What we see next might be inside stmt_list

– So we expand stmt_list and get a set of items:
Modeling a Parse with LR Items

• We can likewise expand stmt to get the item set:

• This is an SLR parser state

– We’ll call it state 0
Modeling a Parse with LR Items

• Our starting stack has state 0 on it:

0
• Input: read A read B …

• From state 0, we shift read onto the stack and

move to state 1:
0 read 1

• State 1 represents the following item:

Modeling a Parse with LR Items

• stack / item: 0 read 1

• input: A read B …

• From state 1, we shift id onto the stack

• stack / item: 0 read 1 id 1’
• input: read B …

• Now we reduce to stmt, and put stmt into the input

• stack / item: 0
• input: stmt read B …
Modeling a Parse with LR Items

• stack / item: 0
• input: stmt read B …

• We now shift stmt

• stack / item: 0 stmt 0’
• input: read B …

• Next we reduce to stmt_list

• stack / item: 0
• input: stmt_list read B …
Modeling a Parse with LR Items

• stack / item: 0
• input: stmt_list read B …

• Now we shift stmt_list

• stack / item: 0 stmt_list 2
• input: read B …
The Characteristic Finite State
Machine (CFSM)

There are also shift-reduce actions. So our states 0’, 1’ aren’t shown
here: they are “in between” states within a shift-reduce action
The CFSM as a Table
A Detailed Explanation of the CFSM
A Detailed Explanation of the CFSM
A Detailed Explanation of the CFSM
Exercise: LR Parsing

• Assume you are in parsing state 0

and the token stream is write sum / 2
• Show how the parse stack changes as the token
stream is consumed
• We’ll do the first action together
Parsing if-then-else Statements

• A famous parsing challenge (from Algol) involves if-

then-else, where else is optional:

stmt ::= if exp then stmt

| if exp then stmt else stmt

• Consider the phrase:

if exp then if exp then stmt else stmt

• Which then does the else belong to?

Shift/Reduce Conflicts

• This is a shift-reduce conflict

if exp then if exp then stmt . else stmt

• When the else appears

• This is a shift-reduce conflict

if exp then if exp then stmt . else stmt

• When the else appears

• we can shift, treating it as part of the inner if statement, or
• we can reduce the inner if statement,
treating the else as part of the outer if statement
• How to solve?
– Many existing tools prioritize shift over reduce
– You can declare productions with precedence
• E.g. giving the if-then-else production higher precedence
than the if-then production
Shift/Reduce Conflicts

• This is a shift-reduce conflict

if exp then if exp then stmt . else stmt

• When the else appears

• we can shift, treating it as part of the inner if statement, or
• we can reduce the inner if statement,
treating the else as part of the outer if statement
• How to solve?
– Many existing tools prioritize shift over reduce
– You can declare productions with precedence
– Rewrite the grammar to make it LR(1)
An LR(0) If-Then-Else Grammar
stmt → balanced_stmt | unbalanced_stmt
balanced_stmt → if cond then balanced_stmt
else balanced_stmt
| other_stuff
unbalanced_stmt → if cond then stmt
| if cond then balanced_stmt
else unbalanced_stmt

Invariant: balanced_stmts may be inside unbalanced_stmts

– but not vice versa
Unfortunately this grammar is LR(0) but not LL(0)
– Have to use precedence in LL parsers
or custom code in a recursive-descent parser
Connections to Theory
• A scanner is a Deterministic Finite Automaton (DFA)
– it can be specified with a state diagram

• An LL or LR parser is a Pushdown Automaton (PDA)

– a PDA can be specified with a state diagram and a stack
• the state diagram looks just like a DFA state diagram, except the arcs
are labeled with <input symbol, top-of-stack symbol> pairs, and in
addition to moving to a new state the PDA has the option of pushing
or popping a finite number of symbols onto/off the stack
• For LL(1) parsers the state machine has only two states:
processing and accepted
• All the action is in the input symbol and top of stack
• LR(1) parsers are richer (and more expressive)
Error Reporting
• Error reporting is relatively simple
• If you get a token for which there’s no entry in the
current parsing state / top of stack element, signal an
error
• Can tell the user what tokens would be OK here
Error Recovery
• Nice to report more than one error to the user
• Rather than stopping after the first one
• Simple idea: Panic mode
• In C-like languages, semicolons are good recovery spots
• So on an error:
• read tokens until you get to a semicolon
• discard the parser’s stack (predictions in an LL parser, states in an LR
parser) until you come to a production that has a semicolon
• assume you’ve parsed the semicolon-containing construct,
and continue parsing
• There are ways to do substantially better – see the online
supplement to the textbook
Other Parsing Tools
• Generalized LR (GLR) parser generators
• Accept any grammar – even ambiguous ones!
• This can be good if you have grammars written by nonexperts, as in
SASyLF
• But for a compiler-writer it is dangerous—you may not even know
your grammar is ambiguous, and then your poor users get ambiguity
errors when the parser runs
• Works like an LR parser, but on ambiguity considers all
possible parses in parallel
• Still O(n) if the grammar is LR (or “close”)
Other Parsing Tools
• Parsing Expression Grammar (PEG) parser generators
• Sidestep ambiguity by always favoring the first production
• Same danger as GLR parsers – you may not know your
grammar is ambiguous
• Still used some in practice (e.g. in Python)
• About as efficient as LL or LR in practice
• Like LR, PEG grammars can be cleaner than LL grammars
• Requires extreme care to get right – must think algorithmically
instead of declaratively
• Guido van Rossum, the developer of Python, saw this as an advantage

30days English Learning Plan
100% (5)
30days English Learning Plan
32 pages
The Ophanic Revelation
93% (14)
The Ophanic Revelation
242 pages
EDPM01 Proposal Proforma
0% (1)
EDPM01 Proposal Proforma
4 pages
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
18 Miscellaneous Parsing
No ratings yet
18 Miscellaneous Parsing
8 pages
CS346 Bottom Up Parser
No ratings yet
CS346 Bottom Up Parser
64 pages
Syntax Analyzer 2-up to LR(0)
No ratings yet
Syntax Analyzer 2-up to LR(0)
73 pages
Syntax Analyzer 2-up to LALR
No ratings yet
Syntax Analyzer 2-up to LALR
74 pages
Module 3
No ratings yet
Module 3
29 pages
Bottomupparser
No ratings yet
Bottomupparser
58 pages
Mod 2
No ratings yet
Mod 2
29 pages
Compiler Design 5
No ratings yet
Compiler Design 5
7 pages
Unit 3 21csc304j CD
No ratings yet
Unit 3 21csc304j CD
103 pages
mod3
No ratings yet
mod3
29 pages
CD_Chap3_III_Bottom Up Parsing (2)
No ratings yet
CD_Chap3_III_Bottom Up Parsing (2)
37 pages
Bottom Up Parser
No ratings yet
Bottom Up Parser
75 pages
Syntax Analysis 2
No ratings yet
Syntax Analysis 2
70 pages
Syntax Analysis (Part-II)
No ratings yet
Syntax Analysis (Part-II)
69 pages
CD - R16 - UNIT III - Notes
No ratings yet
CD - R16 - UNIT III - Notes
33 pages
Bottomupparsing
No ratings yet
Bottomupparsing
12 pages
M2 - P4 LR Parser
No ratings yet
M2 - P4 LR Parser
38 pages
CD_Unit3
No ratings yet
CD_Unit3
103 pages
Parsing
No ratings yet
Parsing
33 pages
07 Bottom Up Parsing
No ratings yet
07 Bottom Up Parsing
79 pages
21 SLR Parsing
No ratings yet
21 SLR Parsing
93 pages
General Framework: X X X X: LR Parser
No ratings yet
General Framework: X X X X: LR Parser
6 pages
LR 0 Notes
No ratings yet
LR 0 Notes
14 pages
Bottom Up Parsing
No ratings yet
Bottom Up Parsing
11 pages
Lecture 8
No ratings yet
Lecture 8
13 pages
CC LR Parser
No ratings yet
CC LR Parser
37 pages
LR Parsing
No ratings yet
LR Parsing
21 pages
Introduction To Bottom Up Parser
No ratings yet
Introduction To Bottom Up Parser
75 pages
Chapter 6-1 note
No ratings yet
Chapter 6-1 note
54 pages
Bottom Up Parse
No ratings yet
Bottom Up Parse
14 pages
Sectlrparse S
No ratings yet
Sectlrparse S
19 pages
UNIT-4 Parsing Techniques
No ratings yet
UNIT-4 Parsing Techniques
20 pages
Mehak
No ratings yet
Mehak
23 pages
Bottom Up Parsing
No ratings yet
Bottom Up Parsing
39 pages
CD Unit-3 (1) (R20)
No ratings yet
CD Unit-3 (1) (R20)
29 pages
CH 4 Syntax Analysis - Part2
No ratings yet
CH 4 Syntax Analysis - Part2
31 pages
r20 CD Unit-3 Part 2
No ratings yet
r20 CD Unit-3 Part 2
8 pages
Parsing Notes
No ratings yet
Parsing Notes
96 pages
LR Parsing Methods
No ratings yet
LR Parsing Methods
50 pages
D LR Parsing
No ratings yet
D LR Parsing
41 pages
Lec06 Bottomupparser
83% (6)
Lec06 Bottomupparser
88 pages
LR Parser
No ratings yet
LR Parser
15 pages
CD R19 Unit-2
No ratings yet
CD R19 Unit-2
53 pages
Compiler Design(Unit-II)
No ratings yet
Compiler Design(Unit-II)
89 pages
LR
No ratings yet
LR
4 pages
LR (K) Parsing: CPSC 388 Ellen Walker Hiram College
No ratings yet
LR (K) Parsing: CPSC 388 Ellen Walker Hiram College
30 pages
Compilef Design Unit 2 AKTU As Per 2023-24 Syllabus
No ratings yet
Compilef Design Unit 2 AKTU As Per 2023-24 Syllabus
46 pages
Unit 02 - Part 03
No ratings yet
Unit 02 - Part 03
50 pages
Lecture3 Parser Full
No ratings yet
Lecture3 Parser Full
30 pages
Syntax Analysis: CD: Compiler Design
No ratings yet
Syntax Analysis: CD: Compiler Design
90 pages
CD Unit3 Part1
No ratings yet
CD Unit3 Part1
22 pages
Bottom-Up Parsing: Goal of Parser: Build A Derivation
No ratings yet
Bottom-Up Parsing: Goal of Parser: Build A Derivation
31 pages
Lrparser HaLrparser Handout Ndout
No ratings yet
Lrparser HaLrparser Handout Ndout
16 pages
S2 BottomUpParsing
No ratings yet
S2 BottomUpParsing
59 pages
Compiler Design Unit-2
No ratings yet
Compiler Design Unit-2
29 pages
bottom up
No ratings yet
bottom up
10 pages
Bottom Up Parsing
No ratings yet
Bottom Up Parsing
44 pages
ch2 3
No ratings yet
ch2 3
26 pages
Module 4
No ratings yet
Module 4
53 pages
lect33-textcat (1)
No ratings yet
lect33-textcat (1)
70 pages
reduction proofs
No ratings yet
reduction proofs
9 pages
Syntactic and Dependency Parsing
No ratings yet
Syntactic and Dependency Parsing
159 pages
bag_of_words nlp
No ratings yet
bag_of_words nlp
23 pages
ch07-consistency-replication (1)
No ratings yet
ch07-consistency-replication (1)
30 pages
Tut4_WordEmb nlp
No ratings yet
Tut4_WordEmb nlp
30 pages
2DI90_ch9 (1)
No ratings yet
2DI90_ch9 (1)
83 pages
Primes
No ratings yet
Primes
39 pages
2DI90_ch11 (1)
No ratings yet
2DI90_ch11 (1)
54 pages
new trends for authentication
No ratings yet
new trends for authentication
5 pages
2DI90_chID190-CH5
No ratings yet
2DI90_chID190-CH5
62 pages
3_slides corpus3
No ratings yet
3_slides corpus3
88 pages
10-estimators-pre-lecture
No ratings yet
10-estimators-pre-lecture
109 pages
Jarrar.LectureNotes.Ch1.Introduction
No ratings yet
Jarrar.LectureNotes.Ch1.Introduction
18 pages
NLP-LLM
No ratings yet
NLP-LLM
47 pages
ML4D-L6 nlp2
No ratings yet
ML4D-L6 nlp2
58 pages
13-oo-opolymorphism plc
No ratings yet
13-oo-opolymorphism plc
15 pages
13-neuralcrf pos tagging
No ratings yet
13-neuralcrf pos tagging
40 pages
CSE538 sp25 (4) Lexical and Vector Semantics 2-25 nlp
No ratings yet
CSE538 sp25 (4) Lexical and Vector Semantics 2-25 nlp
126 pages
imc_shift-cipher
No ratings yet
imc_shift-cipher
17 pages
02 Random Vars All Handout
No ratings yet
02 Random Vars All Handout
23 pages
4_slides Regualer expression
No ratings yet
4_slides Regualer expression
75 pages
2.BasicTextProcessing NEW
No ratings yet
2.BasicTextProcessing NEW
39 pages
61799956 POS tagging
No ratings yet
61799956 POS tagging
63 pages
01-introduction plc
No ratings yet
01-introduction plc
53 pages
07-covariance-answers-hidden-lecture
No ratings yet
07-covariance-answers-hidden-lecture
62 pages
04-textcat text class
No ratings yet
04-textcat text class
77 pages
01-bayes-all-handout prob
No ratings yet
01-bayes-all-handout prob
28 pages
Ch. 1 Notes
No ratings yet
Ch. 1 Notes
11 pages
2 Corpora and Smoothing
No ratings yet
2 Corpora and Smoothing
85 pages
1MS Worksheets
No ratings yet
1MS Worksheets
13 pages
PH-06 (KD 3.6) Past Tense (PG30) GForm
100% (1)
PH-06 (KD 3.6) Past Tense (PG30) GForm
4 pages
02 Introduction To Logic
No ratings yet
02 Introduction To Logic
35 pages
Gerund & Infinitive
100% (1)
Gerund & Infinitive
23 pages
Www Educsector Com...
No ratings yet
Www Educsector Com...
17 pages
Chapter 4 Job Application Letter
No ratings yet
Chapter 4 Job Application Letter
3 pages
Structure of The Universe in The Norse and Slavic Beliefs
100% (2)
Structure of The Universe in The Norse and Slavic Beliefs
42 pages
2012 BT2 Timetable
No ratings yet
2012 BT2 Timetable
2 pages
Sentences in Arabic
50% (2)
Sentences in Arabic
7 pages
Materi Ajar
No ratings yet
Materi Ajar
24 pages
A Study of The Classification of Verb Roots and Formation of Infinitive Verbs in Tamil Language
No ratings yet
A Study of The Classification of Verb Roots and Formation of Infinitive Verbs in Tamil Language
13 pages
TEFL - Introduction Chapter 1
No ratings yet
TEFL - Introduction Chapter 1
3 pages
FOL Questionnaire-Eric
No ratings yet
FOL Questionnaire-Eric
2 pages
Prayer Time
No ratings yet
Prayer Time
1 page
De Thi FINAL EXAMINATION - FE 1 (SE - TRI)
No ratings yet
De Thi FINAL EXAMINATION - FE 1 (SE - TRI)
8 pages
Makalah Pidgin and Creoles
100% (1)
Makalah Pidgin and Creoles
10 pages
Report On Second Language Acquisition Hypothesis
100% (1)
Report On Second Language Acquisition Hypothesis
10 pages
Lesson 3 PREP PS
No ratings yet
Lesson 3 PREP PS
7 pages
0500, Paper 2, Section B, Narrative Writing: by Ms Mehala Lesson 1 and 2 Monday 4/5/2020
100% (3)
0500, Paper 2, Section B, Narrative Writing: by Ms Mehala Lesson 1 and 2 Monday 4/5/2020
12 pages
Auto Subtitle Generator Online - Free in 100+ Languages
No ratings yet
Auto Subtitle Generator Online - Free in 100+ Languages
10 pages
Vosa Vakaviti Fijian Language Cards
No ratings yet
Vosa Vakaviti Fijian Language Cards
6 pages
World Englishes Assignment
No ratings yet
World Englishes Assignment
3 pages
Online Banking System: A Project Report On
100% (2)
Online Banking System: A Project Report On
58 pages
Fight Manju Fight 1698679748987
No ratings yet
Fight Manju Fight 1698679748987
2 pages
200+ Basic Oromo Language Words & Phrases You Should Know
No ratings yet
200+ Basic Oromo Language Words & Phrases You Should Know
8 pages
New List of Verbs
No ratings yet
New List of Verbs
1 page
Frederick George Bailey - Tribe, Caste, and Nation - A Study of Political Activity and Political Change in Highland Orissa - Manchester University Press (1960) PDF
No ratings yet
Frederick George Bailey - Tribe, Caste, and Nation - A Study of Political Activity and Political Change in Highland Orissa - Manchester University Press (1960) PDF
317 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

slides08-lr-parsing

Uploaded by

slides08-lr-parsing

Uploaded by

Bottom-Up LR Parsing

17-363/17-663: Programming Language Pragmatics

Reading: PLP section 2.3

Copyright © 2016 Elsevier

• Top-Down/LL Parsing Intuition

stmt_list $$$ Based

stmt Reduce tokens to a stmt, thentotoa stmt_list

• Initial parse state captured by an item

– includes start symbol, production, and current location

• What we see next might be inside stmt_list

• We can likewise expand stmt to get the item set:

• This is an SLR parser state

• Our starting stack has state 0 on it:

• From state 0, we shift read onto the stack and

• State 1 represents the following item:

• stack / item: 0 read 1

• From state 1, we shift id onto the stack

• Now we reduce to stmt, and put stmt into the input

• We now shift stmt

• Next we reduce to stmt_list

• Now we shift stmt_list

• Assume you are in parsing state 0

• A famous parsing challenge (from Algol) involves if-

stmt ::= if exp then stmt

• Consider the phrase:

if exp then if exp then stmt else stmt

• Which then does the else belong to?

• This is a shift-reduce conflict

if exp then if exp then stmt . else stmt

• When the else appears

• This is a shift-reduce conflict

if exp then if exp then stmt . else stmt

• When the else appears

• This is a shift-reduce conflict

if exp then if exp then stmt . else stmt

• When the else appears

Invariant: balanced_stmts may be inside unbalanced_stmts

• An LL or LR parser is a Pushdown Automaton (PDA)

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.