CH06

This document discusses intermediate code generation and optimization in compilers. It describes how producing an intermediate representation facilitates retargeting a compiler to different machines and allows for machine-independent optimizations. Common intermediate representations include graphs, postfix notation, and three-address code. The document outlines various machine-independent optimizations that can improve the intermediate code, such as peephole, local, global, loop, and inter-procedural optimizations. It also discusses basic blocks and how they are constructed from three-address instructions.

CHAPTER SIX

Intermediate Code Generation and


Optimization

Outline
 Introduction
 Intermediate-Code Generation
 Machine-Independent Optimizations
6.1 Introduction: Structure of a Compiler
6.2 Intermediate Code Generation

 Although a compiler can directly produce code in a target language
(i.e., machine code or assembly for the target machine),
producing a machine-independent intermediate representation
has the following benefits:
 Retargeting to another machine is facilitated.
 The intermediate representation is neutral with respect to the
target machine, so the same intermediate code generator can be
shared across all target machines.
 A compiler for a new machine can be built by attaching a new
code generator to an existing front end.
 Machine-independent code optimization can be applied to the
intermediate code.
Compiling Process without
Intermediate Representation

[Figure: four source languages (C, Pascal, FORTRAN, C++), each compiled
directly to four targets (SPARC, HP PA, x86, IBM PPC), requiring a
separate compiler for every language/machine pair.]

Compiling Process with Intermediate
Representation

[Figure: the same four source languages are each translated to a common
IR, and the IR is then translated to each of the four targets; one front
end per language plus one back end per machine suffices.]
Methods of Intermediate Code (IC) Generation

The intermediate language can be any of many different languages;
the designer of the compiler decides which one to use. Common IRs:
 Graphical representations: such as syntax trees, ASTs
(Abstract Syntax Trees), and DAGs
 Postfix notation: the abstract syntax tree is linearized as a
sequence of data references and operations.
 For instance, the tree for a * (9 + d) can be mapped to the
equivalent postfix notation: a9d+*
 Three-address code: every operation is represented as a 4-
part list, a quadruple:
 (op, arg1, arg2, result). E.g., x := y + z -> (+, y, z, x)
Directed Acyclic Graph (DAG) Representation

 Example: F = ((A + B*C) * (A*B*C)) + C

[Figure: the syntax tree and the DAG for this expression; in the DAG the
common subexpression B*C, and the shared leaves A and C, each appear
only once.]

A syntax tree depicts the natural hierarchical structure of a
source program. A DAG gives the same information but in a more
compact way because common subexpressions are identified.
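One standard way to identify common subexpressions while building the DAG (a sketch, not from the slides) is hash-consing, also called value numbering: keep a table keyed by (op, left, right) and reuse the existing node whenever the same key reappears.

```python
# Sketch of DAG construction by hash-consing ("value numbering").
# A dict maps (op, left_id, right_id) -> node id, so a repeated
# subexpression such as B*C gets the same node instead of a copy.
nodes = []      # nodes[i] = ('leaf', name) or (op, left_id, right_id)
table = {}      # structural key -> node id

def leaf(name):
    key = ('leaf', name)
    if key not in table:
        table[key] = len(nodes)
        nodes.append(key)
    return table[key]

def op(o, left, right):
    key = (o, left, right)
    if key not in table:          # only create a node for a new key
        table[key] = len(nodes)
        nodes.append(key)
    return table[key]

# F = ((A + B*C) * (A * B*C)) + C
a, b, c = leaf('A'), leaf('B'), leaf('C')
bc1 = op('*', b, c)               # first occurrence of B*C
bc2 = op('*', b, c)               # same key -> same node, no duplicate
f = op('+', op('*', op('+', a, bc1), op('*', a, bc2)), c)
print(bc1 == bc2, len(nodes))     # True 8
```

The DAG needs only 8 nodes here, while the full syntax tree would repeat A, B, C, and the B*C subtree.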
Postfix Notation: PN

 A mathematical notation wherein every operator follows all
of its operands.
 Equivalently, a listing of the nodes of a tree in which a node
appears immediately after its children.
Example: the PN of the expression a * (b + c) is abc+*
How about (a + b) / (c - d)?
 Formation rules:
 If E is a variable/constant, the PN of E is E itself.
 If E is an expression of the form E1 op E2, the PN of E is
E1′ E2′ op (where E1′ and E2′ are the PN of E1 and E2,
respectively).
 If E is a parenthesized expression of the form (E1), the PN
of E is the same as the PN of E1.
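The formation rules translate directly into a recursive function. A sketch, assuming expressions are given as nested tuples (op, E1, E2) — a representation chosen here for illustration:

```python
# Postfix notation by the formation rules: a variable is its own PN;
# E1 op E2 becomes PN(E1) PN(E2) op; parentheses simply disappear
# because the tuple structure already encodes grouping.
def pn(e):
    if isinstance(e, str):      # rule 1: variable/constant
        return e
    op, e1, e2 = e              # rule 2: E1 op E2
    return pn(e1) + pn(e2) + op

print(pn(('*', 'a', ('+', 'b', 'c'))))                # a*(b+c) -> abc+*
print(pn(('/', ('+', 'a', 'b'), ('-', 'c', 'd'))))    # (a+b)/(c-d) -> ab+cd-/
```

This also answers the exercise above: the PN of (a+b)/(c-d) is ab+cd-/.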
Three-Address Code
 The general form: x = y op z
 x, y, and z are names, constants, or compiler-generated temporaries
 op stands for any operator, such as +, -, …

 We use the term “three-address code” because each statement
usually contains three addresses (two for the operands, one for the
result).
 Three-address statements are a popular form of intermediate code
in optimizing compilers.
 It is a linearized representation of the syntax tree with explicit
names given to interior nodes.
 There is only one operator on the right-hand side. Thus a source-
language expression like a + b*c might be translated into a sequence
with temporaries t1 and t2:
t1 = b * c
t2 = a + t1
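The linearization can be sketched as a bottom-up walk that gives every interior node a fresh temporary (the tuple-based AST and naming scheme are assumptions for illustration):

```python
# Emit three-address statements bottom-up; each interior node of the
# tree gets a fresh temporary name t1, t2, ...
def gen(e, code, counter):
    if isinstance(e, str):                  # leaf: a name or constant
        return e
    op, e1, e2 = e
    left = gen(e1, code, counter)
    right = gen(e2, code, counter)
    counter[0] += 1
    t = f"t{counter[0]}"
    code.append(f"{t} = {left} {op} {right}")
    return t

code = []
gen(('+', 'a', ('*', 'b', 'c')), code, [0])   # a + b*c
print(code)   # ['t1 = b * c', 't2 = a + t1']
```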
DAG vs. Three-Address Code
 Three-address code is a linearized representation of
a syntax tree (or a DAG) in which explicit names
(temporaries) correspond to the interior nodes of the
graph.
Expression: F = ((A+B*C) * (A*B*C))+C

From the syntax tree:      From the DAG:
T1 := A                    T1 := B * C
T2 := C                    T2 := A + T1
T3 := B * T2               T3 := A * T1
T4 := T1 + T3              T4 := T2 * T3
T5 := T1 * T3              T5 := C
T6 := T4 * T5              T6 := T4 + T5
T7 := T6 + T2              F := T6
F := T7

[Figure: the syntax tree and the DAG for the expression.]

Question: Which IR code sequence is better?
Implementation of Three-Address Code

• Quadruples
 Four fields: op, arg1, arg2, result
 Array of struct {op, *arg1, *arg2, *result}
 x := y op z is represented as op y, z, x
 arg1, arg2, and result are usually pointers to symbol table
entries.
 May need to use many temporary names.
 Many assembly instructions look like quadruples, but with arg1,
arg2, and result being real registers.
• Triples
 Three fields: op, arg1, and arg2. The result becomes implicit: a
triple is referred to by its position (index).
 arg1 and arg2 can be pointers to the symbol table.
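The two encodings can be sketched side by side. This uses the classic textbook expression a = b * -c + b * -c (not from the slides); the Python tuples stand in for the struct fields:

```python
# Quadruples: the result field is explicit (a name or temporary).
quads = [
    ('uminus', 'c',  None, 't1'),
    ('*',      'b',  't1', 't2'),
    ('uminus', 'c',  None, 't3'),
    ('*',      'b',  't3', 't4'),
    ('+',      't2', 't4', 't5'),
    ('=',      't5', None, 'a'),
]

# Triples: no result field -- other triples refer to a triple by its
# index in the list, so no temporary names are needed at all.
triples = [
    ('uminus', 'c', None),   # (0)
    ('*',      'b', 0),      # (1)  arg2 refers to triple (0)
    ('uminus', 'c', None),   # (2)
    ('*',      'b', 2),      # (3)
    ('+',      1,   3),      # (4)
    ('=',      'a', 4),
]
```

The trade-off: quadruples are easier to reorder during optimization (the names move with the instruction), while triples save space but must be renumbered whenever instructions move.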
5/31/2015 \course\cpeg621-10F\Topic-1a.ppt 11
Types of Three-Address Statements

 Assignment statements:
 x := y op z, where op is a binary operator add a, b, c
 x := op y, where op is a unary operator not a, , c or inttoreal a, , c
 Copy statements:
 x := y mov a, , c
 Unconditional jumps:
 goto L jump , , L
 Conditional jumps:
 if y relop z goto L jmprelop y, z, L
 param x, call p, n, and return y, relating to procedure calls.
E.g., f(x+1, y)  add x, 1, t1
param t1, ,
param y, ,
call f, 2,
 Indexed assignments:
 x := y[i]
 x[i] := y
 Address and pointer assignments:
 x := &y, x := *y, and *x := y
6.3 Code Optimization:
Summary of Front End

Lexical Analyzer (Scanner)
→ Syntax Analyzer (Parser)
→ Semantic Analyzer
→ Abstract Syntax Tree w/ Attributes
→ Intermediate-code Generator
→ Non-optimized Intermediate Code

All of these phases form the front end; each can emit error messages.
Code Optimization

• The machine-independent code-optimization phase attempts to


improve the intermediate code so that better target code will
result.
• Usually better means faster, but other objectives may be
desired, such as shorter code, or target code that consumes less
power.
• A simple intermediate code generation algorithm followed by
code optimization is a reasonable way to generate good target
code.
How Compiler Improves Performance
• Execution time = Operation count * Machine cycles per
operation
• Minimize the number of operations
• Arithmetic operations, memory accesses
• Replace expensive operations with simpler ones
• E.g., replace a 4-cycle multiplication with a 1-cycle shift
• Minimize cache misses
• Both data and instruction accesses
• Perform work in parallel
• Instruction scheduling within a thread
• Parallel execution across multiple threads
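The multiplication-to-shift replacement above (known as strength reduction) can be sketched as a rewrite on three-address statements; the string format is an assumption for illustration:

```python
# Replace x = y * 2^k with x = y << k (power-of-two multiply -> shift).
def strength_reduce(inst):
    dst, expr = inst.split(" = ")
    parts = expr.split(" * ")
    if len(parts) == 2 and parts[1].isdigit():
        n = int(parts[1])
        if n > 0 and n & (n - 1) == 0:      # n is a power of two
            return f"{dst} = {parts[0]} << {n.bit_length() - 1}"
    return inst

print(strength_reduce("t1 = x * 4"))   # t1 = x << 2
print(strength_reduce("t2 = x * 3"))   # unchanged: t2 = x * 3
```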
Code Optimization

• There is great variation in the amount of code optimization
different compilers perform.
• Those that do the most, the so-called “optimizing compilers”,
spend significant time in this phase.
• There is a trade-off between compilation time and degree of
optimization.
Why use optimization:
• There are simple optimizations that significantly improve the
running time of the target program without slowing down
compilation too much.
Types of Optimization

• Peephole
• Local
• Global
• Loop
• Inter-procedural, whole-program or link-time
• Machine code
• ….
Basic Blocks

 Basic blocks are maximal sequences of consecutive three-
address instructions such that:
 The flow of control can only enter the basic block through the
first instruction in the block (no jumps into the middle of the
block).
 Control will leave the block without halting or
branching, except possibly at the last instruction in the
block.
 The basic blocks become the nodes of a flow graph,
whose edges indicate which blocks can follow which
other blocks.
Construction of Basic Blocks
 Input: A sequence of three-address instructions
 Output: A list of the basic blocks for that sequence in
which each instruction is assigned to exactly one basic
block
 Method: Determine instructions in the intermediate code that
are leaders:
 The rules for finding leaders are:
1. The first three-address instruction in the intermediate code
is a leader.
2. Any instruction that is the target of a conditional or
unconditional jump is a leader.
3. Any instruction that immediately follows a conditional or
unconditional jump is a leader.
Partitioning Three-Address
Instructions into Basic Blocks
1. i = 1
2. j = 1
3. t1 = 10 * i
4. t2 = t1 + j
5. j = j + 1
6. if j <= 10 goto (3)
7. i = i + 1
8. if i <= 10 goto (2)
9. i = 1
10. t3 = i - 1
11. if i <= 10 goto (10)
 First, instruction 1 is a leader by rule (1).
 Jumps are at instructions 6, 8, and 11. By rule (2), the targets
of these jumps are leaders (instructions 3, 2, and 10,
respectively).
 By rule (3), each instruction following a jump is a leader:
instructions 7 and 9.
 The leaders are therefore instructions 1, 2, 3, 7, 9, and 10.
The basic block of each leader contains all the instructions
from the leader itself until just before the next leader.
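The leader-finding rules can be sketched directly; this assumes 1-based instruction numbering and jump targets written goto (n), as in the example above:

```python
import re

# Find leaders by the three rules. code is a list of instruction
# strings, numbered 1..len(code).
def find_leaders(code):
    leaders = {1}                                   # rule 1: first instruction
    for i, inst in enumerate(code, start=1):
        m = re.search(r"goto \((\d+)\)", inst)
        if m:                                       # this instruction is a jump
            leaders.add(int(m.group(1)))            # rule 2: the jump target
            if i + 1 <= len(code):
                leaders.add(i + 1)                  # rule 3: the next instruction
    return sorted(leaders)

code = [
    "i = 1", "j = 1", "t1 = 10 * i", "t2 = t1 + j", "j = j + 1",
    "if j <= 10 goto (3)", "i = i + 1", "if i <= 10 goto (2)",
    "i = 1", "t3 = i - 1", "if i <= 10 goto (10)",
]
print(find_leaders(code))   # [1, 2, 3, 7, 9, 10]
```

Each basic block then runs from one leader up to, but not including, the next.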
Flow Graphs
 A flow graph is a representation of the control flow between
basic blocks. The nodes of the flow graph are the basic blocks.
 There is an edge from block B to block C if and only if it is
possible for the first instruction in block C to immediately
follow the last instruction in block B. There are two ways that
such an edge could be justified:
1. There is a conditional or unconditional jump from the end

of B to the beginning of C.
2. C immediately follows B in the original order of the three-
address instructions, and B does not end in an
unconditional jump.
 B is a predecessor of C, and C is a successor of B.
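The two edge rules can be sketched as follows; the sketch assumes blocks are numbered 1..n in original order, with jump targets already resolved to block numbers:

```python
# Edges of the flow graph, by the two rules: (1) an explicit jump from
# the end of B to the leader of C, and (2) fall-through when C follows
# B and B does not end in an unconditional jump.
def flow_edges(n, jumps, uncond):
    # jumps:  dict block -> block its last instruction targets (if any)
    # uncond: set of blocks ending in an unconditional jump
    edges = set()
    for b in range(1, n + 1):
        if b in jumps:
            edges.add((b, jumps[b]))       # rule 1: explicit jump
        if b < n and b not in uncond:
            edges.add((b, b + 1))          # rule 2: fall-through
    return sorted(edges)

# The six blocks of the running example: B3 jumps to itself, B4 back to
# B2, B6 to itself; no block ends in an unconditional jump.
print(flow_edges(6, {3: 3, 4: 2, 6: 6}, set()))
# [(1, 2), (2, 3), (3, 3), (3, 4), (4, 2), (4, 5), (5, 6), (6, 6)]
```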
Flow Graphs: Example
Flow graph of the program partitioned in the previous example.
The block led by the first statement of the program is the
start, or entry node.

Blocks:
B1: i = 1
B2: j = 1
B3: t1 = 10 * i; t2 = t1 + j; j = j + 1; if j <= 10 goto (3)
B4: i = i + 1; if i <= 10 goto (2)
B5: i = 1
B6: t3 = i - 1; if i <= 10 goto (10)

Edges: Entry → B1 → B2 → B3; B3 → B3 (the inner loop); B3 → B4;
B4 → B2 (the outer loop); B4 → B5; B5 → B6; B6 → B6; B6 → Exit.
Representation of Basic Blocks

• Each basic block is represented by a record


consisting of
– a count of the number of statements
– a pointer to the leader
– a list of predecessors
– a list of successors

Peephole Optimization
• Improve the performance of the target program by
examining and transforming a short sequence of
target instructions
• Depends on the window size
• May need repeated passes over the code
Examples:
• Redundant loads and stores
MOV R0, a
MOV a, R0
• Algebraic simplification
x := x + 0
x := x * 1
• Constant folding
x := 2 + 3  x := 5
y := x + 3  y := 8
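A minimal peephole pass over single statements might fold constants and drop algebraic identities. A sketch; the statement format is an assumption:

```python
# One peephole rule each for constant folding and algebraic
# simplification, applied to one "dst := a op b" statement.
def peephole(inst):
    dst, expr = inst.split(" := ")
    toks = expr.split()
    if len(toks) == 3:
        a, op, b = toks
        if a.isdigit() and b.isdigit() and op in ('+', '*'):
            val = {'+': int(a) + int(b), '*': int(a) * int(b)}[op]
            return f"{dst} := {val}"            # constant folding
        if (op == '+' and b == '0') or (op == '*' and b == '1'):
            return f"{dst} := {a}"              # x+0 -> x, x*1 -> x
    return inst

print(peephole("x := 2 + 3"))   # x := 5
print(peephole("x := x + 0"))   # x := x
print(peephole("y := x * 1"))   # y := x
```

Note that folding y := x + 3 into y := 8 additionally requires propagating the constant from x := 5, which a single-statement window cannot see; that is why peephole optimizers make repeated passes or use wider windows.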
Local Optimizations
 Analysis and transformation performed within a basic block
 No control flow information is considered
 Examples of local optimizations:
 Local common subexpression elimination
analysis: the same expression is evaluated more than once.
transformation: replace with a single calculation.
 Local constant folding or elimination
analysis: the expression can be evaluated at compile time.
transformation: replace it by the constant, compile-time value.
 Dead code elimination
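Local common-subexpression elimination can be sketched with a table mapping each expression to the temporary that already holds it (the statement format is an assumption, and a real pass must also invalidate table entries when an operand is reassigned):

```python
# Within one basic block, remember which temporary holds each
# right-hand-side expression and reuse it instead of recomputing.
def local_cse(block):
    available, out = {}, []
    for inst in block:
        dst, expr = inst.split(" = ", 1)
        if expr in available:
            out.append(f"{dst} = {available[expr]}")   # reuse earlier result
        else:
            available[expr] = dst
            out.append(inst)
    return out

block = ["t1 = b * c", "t2 = a + t1", "t3 = b * c", "t4 = t3 + t2"]
print(local_cse(block))
# ['t1 = b * c', 't2 = a + t1', 't3 = t1', 't4 = t3 + t2']
```

The copy t3 = t1 left behind is exactly what copy propagation and dead code elimination then clean up.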

Global Optimizations:

Intraprocedural
 Global versions of local optimizations
 Global common subexpression elimination
 Global constant propagation
 Dead code elimination

 Loop optimizations
 Reduce code to be executed in each iteration

Examples

• Unreachable code
#define debug 0
if (debug) { print debugging information }

    if 0 <> 1 goto L1
    print debugging information
L1:

Since 0 <> 1 always holds, this becomes:

    if 1 goto L1
    print debugging information
L1:

The jump is always taken, so the print statement is unreachable
and can be eliminated.
Examples

• Flow-of-control optimization: a jump whose target is itself a
jump can be replaced by a jump to the final destination.

goto L1
…
L1: goto L2

becomes

goto L2
…
L1: goto L2

Similarly,

goto L1
…
L1: if a < b goto L2

becomes

if a < b goto L2
…
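The goto-to-goto replacement ("jump threading") can be sketched as a pass that follows each jump through chains of forwarding labels to its final destination; the label/goto string format is an assumption:

```python
# If label L1 is immediately followed by "goto L2", retarget every
# "goto L1" straight to L2, following chains of such forwards.
def thread_jumps(code):
    forwards = {}
    for i, inst in enumerate(code):
        if inst.endswith(":") and i + 1 < len(code) \
                and code[i + 1].startswith("goto "):
            forwards[inst[:-1]] = code[i + 1].split()[1]

    def final(lbl, seen=()):
        # follow the chain; 'seen' guards against goto cycles
        if lbl in forwards and lbl not in seen:
            return final(forwards[lbl], seen + (lbl,))
        return lbl

    return [f"goto {final(inst.split()[1])}" if inst.startswith("goto ")
            else inst for inst in code]

code = ["goto L1", "x = 1", "L1:", "goto L2", "L2:", "y = 2"]
print(thread_jumps(code))
# ['goto L2', 'x = 1', 'L1:', 'goto L2', 'L2:', 'y = 2']
```

After threading, the block at L1 may become unreachable and can be removed by dead code elimination.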
