0% found this document useful (0 votes)

12 views19 pages

Unit 5 Code Generation

The document discusses the code generation phase of a compiler, detailing the input, output, and various design issues such as memory management, instruction selection, and register allocation. It explains the importance of understanding the target machine's architecture and instruction set for effective code generation, as well as the concepts of basic blocks and flow graphs in optimizing code. Additionally, it covers run-time storage management strategies and transformations applicable to basic blocks to enhance efficiency.

Uploaded by

sathwikrama4

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views19 pages

Unit 5 Code Generation

Uploaded by

sathwikrama4

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

UNIT V - CODE GENERATION

The final phase in compiler model is the code generator. It takes as input an intermediate
representation of the source program and produces as output an equivalent target program. The
code generation techniques presented below can be used whether or not an optimizing phase
occurs before code generation.

Position of code generator

source intermediate intermediate target

front end code code
program code optimizer code generator program

symbol
table

ISSUES IN THE DESIGN OF A CODE GENERATOR

The following issues arise during the code generation phase :

1. Input to code generator

2. Target program
3. Memory management
4. Instruction selection
5. Register allocation
6. Evaluation order

1. Input to code generator:

The input to the code generation consists of the intermediate representation of the source
program produced by front end , together with information in the symbol table to
determine run-time addresses of the data objects denoted by the names in the
intermediate representation.

 Intermediate representation can be :

a. Linear representation such as postfix notation
b. Three address representation such as quadruples
c. Virtual machine representation such as stack machine code
d. Graphical representations such as syntax trees and dags.

 Prior to code generation, the front end must be scanned, parsed and translated into
intermediate representation along with necessary type checking. Therefore, input to code
generation is assumed to be error-free.

2. Target program:
 The output of the code generator is the target program. The output may be :
a. Absolute machine language
- It can be placed in a fixed memory location and can be executed immediately.
b. Relocatable machine language
- It allows subprograms to be compiled separately.

c. Assembly language
- Code generation is made easier.

3. Memory management:
 Names in the source program are mapped to addresses of data objects in run-time
memory by the front end and code generator.

 It makes use of symbol table, that is, a name in a three-address statement refers to a
symbol-table entry for the name.

 Labels in three-address statements have to be converted to addresses of instructions.
For example,
j : goto i generates jump instruction as follows :
 if i < j, a backward jump instruction with target address equal to location of
code for quadruple i is generated.
if i > j, the jump is forward. We must store on a list for quadruple i the
location of the first machine instruction generated for quadruple j. When i is
processed, the machine locations for all instructions that forward jumps to i
are filled.

4. Instruction selection:
The instructions of target machine should be complete and uniform.

Instruction speeds and machine idioms are important factors when efficiency of target
program is considered.

The quality of the generated code is determined by its speed and size.

 The former statement can be translated into the latter statement as shown below:

















5. Register allocation
 Instructions involving register operands are shorter and faster than those involving
operands in memory.

 The use of registers is subdivided into two subproblems :
 Register allocation – the set of variables that will reside in registers at a point in
the program is selected.

Register assignment – the specific register that a variable will reside in is
picked.

 Certain machine requires even-odd register pairs for some operands and results.
For example , consider the division instruction of the form :
D x, y

where, x – dividend even register in even/odd register pair

y – divisor
even register holds the remainder
odd register holds the quotient

6. Evaluation order
 The order in which the computations are performed can affect the efficiency of the
target code. Some computation orders require fewer registers to hold intermediate
results than others.

TARGET MACHINE

Familiarity with the target machine and its instruction set is a prerequisite for designing a
good code generator.
The target computer is a byte-addressable machine with 4 bytes to a word.
It has n general-purpose registers, R0, R1, . . . , Rn-1.
 It has two-address instructions of the form:
 op source, destination
where, op is an op-code, and source and destination are data fields.

It has the following op-codes :
MOV (move source to destination)
 ADD (add source to destination)
SUB (subtract source from destination)

The source and destination of an instruction are specified by combining registers
and memory locations with address modes.

 Address modes with their assembly-language forms

MODE FORM ADDRESS ADDED COST

absolute M M 1

indexed c(R) c+contents(R) 1

indirect register *R contents (R) 0

indirect indexed *c(R) contents(c+ 1

contents(R))

literal #c c 1



For example : MOV R0, M stores contents of Register R0 into memory location M ;
MOV 4(R0), M stores the value contents(4+contents(R0)) into M.

Instruction costs :

 Instruction cost = 1+cost for source and destination address modes. This cost corresponds
 to the length of the instruction.
 Address modes involving registers have cost zero.
 Address modes involving memory location or literal have cost one.
 Instruction length should be minimized if space is important. Doing so also minimizes
the time taken to fetch and perform the instruction.
For example : MOV R0, R1 copies the contents of register R0 into R1. It has cost one,
since it occupies only one word of memory.
 The three-address statement a : = b + c can be implemented by many different
instruction sequences :

 i) MOV b, R0
ADD c, R0 cost = 6
MOV R0, a

ii) MOV b, a
ADD c, a cost = 6

iii) Assuming R0, R1 and R2 contain the addresses of a, b, and c :

MOV *R1, *R0
ADD *R2, *R0 cost = 2

In order to generate good code for target machine, we must utilize its
addressing capabilities efficiently.

RUN-TIME STORAGE MANAGEMENT

Information needed during an execution of a procedure is kept in a block of storage
called an activation record, which includes storage for names local to the procedure.
The two standard storage allocation strategies are:
1. Static allocation
2. Stack allocation
 In static allocation, the position of an activation record in memory is fixed at compile
 time.
 In stack allocation, a new activation record is pushed onto the stack for each execution of
 a procedure. The record is popped when the activation ends.
 The following three-address statements are associated with the run-time allocation and
deallocation of activation records:
1. Call,
2. Return,
3. Halt, and
4. Action, a placeholder for other statements.
 We assume that the run-time memory is divided into areas for:
1. Code
2. Static data
3. Stack
Static allocation

Implementation of call statement:

The codes needed to implement static allocation are as follows:

MOV #here + 20, callee.static_area /It saves return address/

GOTO callee.code_area /*It transfers control to the target code for the called procedure */

where,
callee.static_area – Address of the activation record
callee.code_area – Address of the first instruction for called procedure
#here + 20 – Literal return address which is the address of the instruction following GOTO.

Implementation of return statement:

A return from procedure callee is implemented by :

GOTO *callee.static_area

This transfers control to the address saved at the beginning of the activation record.

Implementation of action statement:

The instruction ACTION is used to implement action statement.

Implementation of halt statement:

The statement HALT is the final instruction that returns control to the operating

system. Stack allocation

Static allocation can become stack allocation by using relative addresses for storage in
activation records. In stack allocation, the position of activation record is stored in register so
words in activation records can be accessed as offsets from the value in this register.

The codes needed to implement stack allocation are as follows:

Initialization of stack:

MOV #stackstart , SP /* initializes stack */

Code for the first procedure

HALT /* terminate execution */

Implementation of Call statement:

ADD #caller.recordsize, SP /* increment stack pointer */

MOV #here + 16, SP /Save return address */

GOTO callee.code_area
where,
caller.recordsize – size of the activation record
#here + 16 – address of the instruction following the GOTO

Implementation of Return statement:

GOTO 0 ( SP ) /return to the caller */

SUB #caller.recordsize, SP /* decrement SP and restore to previous value */

BASIC BLOCKS AND FLOW GRAPHS

Basic Blocks

 A basic block is a sequence of consecutive statements in which flow of control enters at

the beginning and leaves at the end without any halt or possibility of branching except at
 the end.
The following sequence of three-address statements forms a basic
block: t1 : = a * a
t2 : = a * b
t3 : = 2 * t2
t 4 : = t 1 + t3
t5 : = b * b
t 6 : = t 4 + t5

Basic Block Construction:

Algorithm: Partition into basic blocks

Input: A sequence of three-address statements

Output: A list of basic blocks with each three-address statement in exactly one block

Method:

1. We first determine the set of leaders, the first statements of basic blocks. The rules
we use are of the following:
a. The first statement is a leader.
b. Any statement that is the target of a conditional or unconditional goto is a
leader.
c. Any statement that immediately follows a goto or conditional goto statement
is a leader.
2. For each leader, its basic block consists of the leader and all statements up to but not
including the next leader or the end of the program.
Consider the following source code for dot product of two vectors a and b of length 20

begin

prod :=0;

i:=1;

do begin

prod :=prod+ a[i] * b[i];

i :=i+1;

end

while i <= 20

end

The three-address code for the above source program is given as :

(1) prod := 0

(2) i := 1

(3) t1 := 4* i

(4) t2 := a[t1] /compute a[i] /

(5) t3 := 4* i

(6) t4 := b[t3] /compute b[i] /

(7) t5 := t2*t4

(8) t6 := prod+t5

(9) prod := t6

(10) t7 := i+1

(11) i := t7

(12) if i<=20 goto (3)

Basic block 1: Statement (1) to (2)

Basic block 2: Statement (3) to (12)

Transformations on Basic Blocks:

A number of transformations can be applied to a basic block without changing the set of
expressions computed by the block. Two important classes of transformation are :

 Structure-preserving transformations

 Algebraic transformations

1. Structure preserving transformations:

a) Common subexpression elimination:

a:=b+c a:=b+c
b:=a–d b:=a-d
c:=b+c c:=b+c
d:=a–d d:=b

Since the second and fourth expressions compute the same expression, the basic block can
be transformed as above.

b) Dead-code elimination:

Suppose x is dead, that is, never subsequently used, at the point where the statement x :
= y + z appears in a basic block. Then this statement may be safely removed without
changing the value of the basic block.

c) Renaming temporary variables:

A statement t : = b + c ( t is a temporary ) can be changed to u : = b + c (u is a new

temporary) and all uses of this instance of t can be changed to u without changing the value
of the basic block.
Such a block is called a normal-form block.

d) Interchange of statements:

Suppose a block has the following two adjacent statements:

t1 : = b + c
t2 : = x + y

We can interchange the two statements without affecting the value of the block if
and only if neither x nor y is t1 and neither b nor c is t2.
2. Algebraic transformations:

Algebraic transformations can be used to change the set of expressions computed by a

basic block into an algebraically equivalent set.
Examples:
i) x : = x + 0 or x : = x * 1 can be eliminated from a basic block without changing the set of
expressions it computes.
ii) The exponential statement x : = y * * 2 can be replaced by x : = y * y.
Flow Graphs

 Flow graph is a directed graph containing the flow-of-control information for the set of
 basic blocks making up a program.
 The nodes of the flow graph are basic blocks. It has a distinguished initial node.
 E.g.: Flow graph for the vector dot product is given as follows:

prod : = 0 B1
i:=1

t1 : = 4 * i
t2 : = a [ t1 ]
t3 : = 4 * i B2
t4 : = b [ t3 ]
t5 : = t2 * t4
t6 : = prod + t5
prod : = t6
t7 : = i + 1
i : = t7
if i <= 20 goto B2

B1 is the initial node. B2 immediately follows B1, so there is an edge from B1 to B2. The
target of jump from last statement of B1 is the first statement B2, so there is an edge from
B1 (last statement) to B2 (first statement).
B1 is the predecessor of B2, and B2 is a successor of B1.

Loops

 A loop is a collection of nodes in a flow graph such that

1. All nodes in the collection are strongly connected.
2. The collection of nodes has a unique entry.
 A loop that contains no other loops is called an inner loop.

NEXT-USE INFORMATION

 If the name in a register is no longer needed, then we remove the name from the register
and the register can be used to store some other names.
Input: Basic block B of three-address statements

Output: At each statement i: x= y op z, we attach to i the liveliness and next-uses of x,

y and z.

Method: We start at the last statement of B and scan backwards.

1. Attach to statement i the information currently found in the symbol table

regarding the next-use and liveliness of x, y and z.
2. In the symbol table, set x to “not live” and “no next use”.
3. In the symbol table, set y and z to “live”, and next-uses of y and z to i.

Symbol Table:

Names Liveliness Next-use

x not live no next-use

y Live i

z Live i

A SIMPLE CODE GENERATOR

A code generator generates target code for a sequence of three- address statements and
effectively uses registers to store operands of the statements.

For example: consider the three-address statement a := b+c

It can have the following sequence of codes:

ADD Rj, Ri Cost = 1 // if Ri contains b and Rj contains c

(or)

ADD c, Ri Cost = 2 // if c is in a memory location

(or)

MOV c, Rj Cost = 3 // move c from memory to Rj and add

ADD Rj, Ri

Register and Address Descriptors:

 A register descriptor is used to keep track of what is currently in each registers. The
 register descriptors show that initially all the registers are empty.
 An address descriptor stores the location where the current value of the name can be
 found at run time.
A code-generation algorithm:

The algorithm takes as input a sequence of three-address statements constituting a basic block.
For each three-address statement of the form x : = y op z, perform the following actions:

1. Invoke a function getreg to determine the location L where the result of the computation y op
z should be stored.

2. Consult the address descriptor for y to determine y’, the current location of y. Prefer the
register for y’ if the value of y is currently both in memory and a register. If the value of y
is not already in L, generate the instruction MOV y’ , L to place a copy of y in L.

3. Generate the instruction OP z’ , L where z’ is a current location of z. Prefer a register to

a memory location if z is in both. Update the address descriptor of x to indicate that x is
in location L. If x is in L, update its descriptor and remove x from all other descriptors.

4. If the current values of y or z have no next uses, are not live on exit from the block, and are
in registers, alter the register descriptor to indicate that, after execution of x : = y op z , those
registers will no longer contain y or z.

Generating Code for Assignment Statements:

The assignment d : = (a-b) + (a-c) + (a-c) might be translated into the following three-
address code sequence:
t:=a–b
u:=a–c
v:=t+u
d:=v+u
with d live at the end.

Code sequence for the example is:

Statements Code Generated Register descriptor Address descriptor

t:=a-b MOV a, R0 R0 contains t t in R0

SUB b, R0

u:=a-c MOV a , R1 R0 contains t t in R0

SUB c , R1 R1 contains u u in R1

v:=t+u ADD R1, R0 R0 contains v u in R1

R1 contains u v in R0

d:=v+u ADD R1, R0 R0 contains d d in R0

d in R0 and memory
MOV R0, d
Generating Code for Indexed Assignments

The table shows the code sequences generated for the indexed assignment statements
a : = b [ i ] and a [ i ] : = b

Statements Code Generated Cost

a : = b[i] MOV b(Ri), R 2

a[i] : = b MOV b, a(Ri) 3

Generating Code for Pointer Assignments

The table shows the code sequences generated for the pointer assignments
a : = *p and *p : = a

Statements Code Generated Cost

a : = *p MOV *Rp, a 2

*p : = a MOV a, *Rp 2

Generating Code for Conditional Statements

Statement Code

if x < y goto z CMP x, y

CJ< z /* jump to z if condition code
is negative */

x : = y +z MOV y, R0
if x < 0 goto z ADD z, R0
MOV R0,x
CJ< z

THE DAG REPRESENTATION FOR BASIC BLOCKS

 A DAG for a basic block is a directed acyclic graph with the following labels on nodes:
1. Leaves are labeled by unique identifiers, either variable names or constants.
2. Interior nodes are labeled by an operator symbol.
3. Nodes are also optionally given a sequence of identifiers for labels to store the
computed values.
 DAGs are useful data structures for implementing transformations on basic blocks.
 It gives a picture of how the value computed by a statement is used in subsequent
 statements.
 It provides a good way of determining common sub - expressions.
Algorithm for construction of DAG

Input: A basic block

Output: A DAG for the basic block containing the following information:

1. A label for each node. For leaves, the label is an identifier. For interior nodes, an
operator symbol.
2. For each node a list of attached identifiers to hold the computed values.
Case (i) x : = y OP z

Case (ii) x : = OP y

Case (iii) x : = y

Method:

Step 1: If y is undefined then create node(y).

If z is undefined, create node(z) for case(i).

Step 2: For the case(i), create a node(OP) whose left child is node(y) and right child is

node(z). ( Checking for common sub expression). Let n be this node.

For case(ii), determine whether there is node(OP) with one child node(y). If not create such
a node.

For case(iii), node n will be node(y).

Step 3: Delete x from the list of identifiers for node(x). Append x to the list of attached

identifiers for the node n found in step 2 and set node(x) to n.

Example: Consider the block of three- address statements:

1. t1 := 4* i
2. t2 := a[t1]
3. t3 := 4* i
4. t4 := b[t3]
5. t5 := t2*t4
6. t6 := prod+t5
7. prod := t6
8. t7 := i+1
9. i := t7
10. if i<=20 goto (1)
Stages in DAG Construction
Application of DAGs:

1. We can automatically detect common sub expressions.

2. We can determine which identifiers have their values used in the block.
3. We can determine which statements compute values that could be used outside the block.
GENERATING CODE FROM DAGs

The advantage of generating code for a basic block from its dag representation is that,
from a dag we can easily see how to rearrange the order of the final computation sequence than
we can starting from a linear sequence of three-address statements or quadruples.

Rearranging the order

The order in which computations are done can affect the cost of resulting object code.

For example, consider the following basic block:

t1 : = a + b
t2 : = c + d
t3 : = e – t2
t 4 : = t 1 – t3

Generated code sequence for basic block:

MOV a , R0
ADD b , R0
MOV c , R1
ADD d , R1
MOV R0 , t1
MOV e , R0
SUB R1 , R0
MOV t1 , R1
SUB R0 , R1
MOV R1 , t4

Rearranged basic block:

Now t1 occurs immediately before t4.

t2 : = c + d
t3 : = e – t2
t1 : = a + b
t 4 : = t 1 – t3

Revised code sequence:

MOV c , R0
ADD d , R0
MOV a , R0
SUB R0 , R1
MOV a , R0
ADD b , R0
SUB R1 , R0
MOV R0 , t4

In this order, two instructions MOV R0 , t1 and MOV t1 , R1 have been saved.
A Heuristic ordering for Dags

The heuristic ordering algorithm attempts to make the evaluation of a node immediately
follow the evaluation of its leftmost argument.

The algorithm shown below produces the ordering in reverse.

Algorithm:

1) while unlisted interior nodes remain do begin

2) select an unlisted node n, all of whose parents have been listed;
3) list n;
4) while the leftmost child m of n has no unlisted parents and is not a leaf do
begin
5) list m;
6) n:=m
end
end

Example: Consider the DAG shown below:

1 *

2 + - 3

4
*

5 - + 8

6 + 7 c d 11 e 12

a b
9 10

Initially, the only node with no unlisted parents is 1 so set n=1 at line (2) and list 1 at line (3).

Now, the left argument of 1, which is 2, has its parents listed, so we list 2 and set n=2 at line (6).

Now, at line (4) we find the leftmost child of 2, which is 6, has an unlisted parent 5. Thus we
select a new n at line (2), and node 3 is the only candidate. We list 3 and proceed down its
left chain, listing 4, 5 and 6. This leaves only 8 among the interior nodes so we list that.
The resulting list is 1234568 and the order of evaluation is 8654321.
Code sequence:

t8 : = d + e
t6 : = a + b
t5 : = t6 – c
t 4 : = t 5 * t8
t3 : = t4 – e
t 2 : = t 6 + t4
t 1 : = t 2 * t3

This will yield an optimal code for the DAG on machine whatever be the number of registers.

mod 4-5
No ratings yet
mod 4-5
40 pages
BCS 324 TOPIC 6
No ratings yet
BCS 324 TOPIC 6
56 pages
Unit-5-Issues in Code Generation
No ratings yet
Unit-5-Issues in Code Generation
20 pages
Unit 5 contd final
No ratings yet
Unit 5 contd final
12 pages
UNIT V cd print
No ratings yet
UNIT V cd print
9 pages
Unit 4 Compiler
No ratings yet
Unit 4 Compiler
19 pages
REDO%20-2%20CD.pdf 2
No ratings yet
REDO%20-2%20CD.pdf 2
2 pages
5.1 Issues in Code Generation
No ratings yet
5.1 Issues in Code Generation
16 pages
Code Generation & Optimization
No ratings yet
Code Generation & Optimization
17 pages
CDU5
No ratings yet
CDU5
15 pages
Compiler Design (Unit-5)
No ratings yet
Compiler Design (Unit-5)
22 pages
Compiler Design Lec-8Code Generation and Optimization (2)
No ratings yet
Compiler Design Lec-8Code Generation and Optimization (2)
46 pages
Code Generation
No ratings yet
Code Generation
21 pages
Codegeneration Final Ppt
No ratings yet
Codegeneration Final Ppt
31 pages
Unit 5 part 1_CD
No ratings yet
Unit 5 part 1_CD
14 pages
Lec 12 - Code Generator
No ratings yet
Lec 12 - Code Generator
23 pages
CS 346: Code Generation: Resource
No ratings yet
CS 346: Code Generation: Resource
52 pages
CODE GENERATION and Issues
No ratings yet
CODE GENERATION and Issues
3 pages
Code Generation and Optimization
No ratings yet
Code Generation and Optimization
42 pages
Acd 5
No ratings yet
Acd 5
9 pages
CC 7
No ratings yet
CC 7
20 pages
CD Unit 5
No ratings yet
CD Unit 5
26 pages
C Ompiler Theory: (Intermediate C Ode Generation - Abstract S Yntax + 3 Address C Ode)
No ratings yet
C Ompiler Theory: (Intermediate C Ode Generation - Abstract S Yntax + 3 Address C Ode)
32 pages
CS602PC - Compiler - Design - Lecture Notes - Unit - 4
No ratings yet
CS602PC - Compiler - Design - Lecture Notes - Unit - 4
19 pages
UNIT-5 Compiler Design
No ratings yet
UNIT-5 Compiler Design
15 pages
CH5 2
No ratings yet
CH5 2
23 pages
CD Unit 6.1
No ratings yet
CD Unit 6.1
20 pages
Experiment No 6 - DONE
No ratings yet
Experiment No 6 - DONE
8 pages
Code Generation
No ratings yet
Code Generation
9 pages
Code Generation 5th Year Computer Science Course
No ratings yet
Code Generation 5th Year Computer Science Course
20 pages
CD Unit 5
No ratings yet
CD Unit 5
26 pages
Unit V
No ratings yet
Unit V
42 pages
UNIT 4 - Chapter 1 in Compiler Design
No ratings yet
UNIT 4 - Chapter 1 in Compiler Design
51 pages
Pcs - 2m
No ratings yet
Pcs - 2m
6 pages
Unit 5
No ratings yet
Unit 5
13 pages
CH5 2
No ratings yet
CH5 2
24 pages
Position of Code Generator: Principles of Compiler Design Lecture Notes
No ratings yet
Position of Code Generator: Principles of Compiler Design Lecture Notes
16 pages
2018 - Lecture - 14 - Code Generation - 2 PDF
No ratings yet
2018 - Lecture - 14 - Code Generation - 2 PDF
96 pages
Code Geneartion
No ratings yet
Code Geneartion
13 pages
Input To The Code Generator
No ratings yet
Input To The Code Generator
62 pages
Code Generation
No ratings yet
Code Generation
22 pages
34-Issues in the design of a code generator_Target Machine-25-10-2024
No ratings yet
34-Issues in the design of a code generator_Target Machine-25-10-2024
29 pages
CD UNIT-6 LM
No ratings yet
CD UNIT-6 LM
17 pages
Code Generation
No ratings yet
Code Generation
25 pages
Unit4 Compiler PDF
No ratings yet
Unit4 Compiler PDF
73 pages
15Cs314J - Compiler Design: Unit 4
No ratings yet
15Cs314J - Compiler Design: Unit 4
71 pages
Unit 5
No ratings yet
Unit 5
10 pages
Unit 4 Compiler
No ratings yet
Unit 4 Compiler
19 pages
Compiler Design and Construction Lecture Notes
No ratings yet
Compiler Design and Construction Lecture Notes
28 pages
13-Issues in the Design of a Code Generator--22!10!2024
No ratings yet
13-Issues in the Design of a Code Generator--22!10!2024
54 pages
Issues in Code Generator-Pages-2
No ratings yet
Issues in Code Generator-Pages-2
3 pages
Code Generation I
No ratings yet
Code Generation I
32 pages
Code Generation: Issues in The Design of A Code Generator
No ratings yet
Code Generation: Issues in The Design of A Code Generator
33 pages
Unit-4-5
No ratings yet
Unit-4-5
36 pages
Peralatan TIK
No ratings yet
Peralatan TIK
9 pages
Unit - V: Study Material 1/11
No ratings yet
Unit - V: Study Material 1/11
11 pages
Unit 4 PCD
No ratings yet
Unit 4 PCD
15 pages
Code Generation
No ratings yet
Code Generation
49 pages
Code Generation and Instruction Selection Unit-8
No ratings yet
Code Generation and Instruction Selection Unit-8
6 pages
Code Generation PDF
No ratings yet
Code Generation PDF
19 pages
lecture1
No ratings yet
lecture1
46 pages
"C Programming for Beginners: A Step-by-Step Guide"
From Everand
"C Programming for Beginners: A Step-by-Step Guide"
Lov kush
No ratings yet
SE Lab Record
No ratings yet
SE Lab Record
41 pages
ALC5631Q
No ratings yet
ALC5631Q
91 pages
Brush Up Session on Core Java
No ratings yet
Brush Up Session on Core Java
20 pages
Dataman 48 XPManual
No ratings yet
Dataman 48 XPManual
63 pages
Brkcol 2610
No ratings yet
Brkcol 2610
109 pages
PLT Test Center Technical Requirements - CCT
No ratings yet
PLT Test Center Technical Requirements - CCT
16 pages
Open Souce Lidar Software
No ratings yet
Open Souce Lidar Software
29 pages
Hdve 50 Nu
No ratings yet
Hdve 50 Nu
10 pages
Ben Shneiderman Eight Golden Rules of Interface
No ratings yet
Ben Shneiderman Eight Golden Rules of Interface
3 pages
Heart Rate Animation With C
No ratings yet
Heart Rate Animation With C
7 pages
Cryptography & Network Security
No ratings yet
Cryptography & Network Security
10 pages
PPS 2023-24 Even Sem_45385173_2025_01_30_11_52
No ratings yet
PPS 2023-24 Even Sem_45385173_2025_01_30_11_52
2 pages
2.8inch E32R28T E32N28T ESP32-32E Demo Instructions
No ratings yet
2.8inch E32R28T E32N28T ESP32-32E Demo Instructions
29 pages
Discrete Wavelet Transform (DWT) : Sharon Shen
No ratings yet
Discrete Wavelet Transform (DWT) : Sharon Shen
44 pages
7.1.3 Organizational Unit Facts
No ratings yet
7.1.3 Organizational Unit Facts
1 page
Second Semester Prelims Class 11
No ratings yet
Second Semester Prelims Class 11
3 pages
APM211+ +Introduction+to+AppDynamics+ +Student+Guide
No ratings yet
APM211+ +Introduction+to+AppDynamics+ +Student+Guide
58 pages
JavaScript in Hindi GLA
No ratings yet
JavaScript in Hindi GLA
41 pages
Medecom BC MedArchive and Clipper EN
No ratings yet
Medecom BC MedArchive and Clipper EN
2 pages
Jun Han Jeff Xiang Resume
No ratings yet
Jun Han Jeff Xiang Resume
1 page
SAP ABAP On HANA - Top Interview Questions
No ratings yet
SAP ABAP On HANA - Top Interview Questions
19 pages
Hyder Khan M365
No ratings yet
Hyder Khan M365
5 pages
JAVA Oracle Notes
No ratings yet
JAVA Oracle Notes
2 pages
Instruction Sheet: Installation Instructions For Inpower™ 8.0
No ratings yet
Instruction Sheet: Installation Instructions For Inpower™ 8.0
5 pages
Module 2 Hebb Net
No ratings yet
Module 2 Hebb Net
12 pages
Introduction To AI Unit 1
No ratings yet
Introduction To AI Unit 1
10 pages
C.S Project
No ratings yet
C.S Project
24 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Unit 5 Code Generation

Uploaded by

Unit 5 Code Generation

Uploaded by

UNIT V - CODE GENERATION

Position of code generator

source intermediate intermediate target

ISSUES IN THE DESIGN OF A CODE GENERATOR

The following issues arise during the code generation phase :

1. Input to code generator

1. Input to code generator:

 Intermediate representation can be :

where, x – dividend even register in even/odd register pair

indexed c(R) c+contents(R) 1

indirect register *R contents (R) 0

indirect indexed *c(R) contents(c+ 1

iii) Assuming R0, R1 and R2 contain the addresses of a, b, and c :

Implementation of call statement:

The codes needed to implement static allocation are as follows:

MOV #here + 20, callee.static_area /*It saves return address*/

Implementation of return statement:

A return from procedure callee is implemented by :

Implementation of action statement:

The instruction ACTION is used to implement action statement.

Implementation of halt statement:

system. Stack allocation

The codes needed to implement stack allocation are as follows:

MOV #stackstart , SP /* initializes stack */

Code for the first procedure

HALT /* terminate execution */

Implementation of Call statement:

ADD #caller.recordsize, SP /* increment stack pointer */

MOV #here + 16, *SP /*Save return address */

Implementation of Return statement:

GOTO *0 ( SP ) /*return to the caller */

SUB #caller.recordsize, SP /* decrement SP and restore to previous value */

BASIC BLOCKS AND FLOW GRAPHS

 A basic block is a sequence of consecutive statements in which flow of control enters at

Basic Block Construction:

Algorithm: Partition into basic blocks

Input: A sequence of three-address statements

prod :=prod+ a[i] * b[i];

The three-address code for the above source program is given as :

(4) t2 := a[t1] /*compute a[i] */

(6) t4 := b[t3] /*compute b[i] */

(12) if i<=20 goto (3)

Basic block 1: Statement (1) to (2)

Basic block 2: Statement (3) to (12)

a) Common subexpression elimination:

c) Renaming temporary variables:

A statement t : = b + c ( t is a temporary ) can be changed to u : = b + c (u is a new

Suppose a block has the following two adjacent statements:

Algebraic transformations can be used to change the set of expressions computed by a

 A loop is a collection of nodes in a flow graph such that

Output: At each statement i: x= y op z, we attach to i the liveliness and next-uses of x,

Method: We start at the last statement of B and scan backwards.

1. Attach to statement i the information currently found in the symbol table

Names Liveliness Next-use

x not live no next-use

A SIMPLE CODE GENERATOR

For example: consider the three-address statement a := b+c

3. Generate the instruction OP z’ , L where z’ is a current location of z. Prefer a register to

Generating Code for Assignment Statements:

Code sequence for the example is:

Statements Code Generated Register descriptor Address descriptor

t:=a-b MOV a, R0 R0 contains t t in R0

u:=a-c MOV a , R1 R0 contains t t in R0

v:=t+u ADD R1, R0 R0 contains v u in R1

d:=v+u ADD R1, R0 R0 contains d d in R0

Statements Code Generated Cost

a : = b[i] MOV b(Ri), R 2

a[i] : = b MOV b, a(Ri) 3

Generating Code for Pointer Assignments

Statements Code Generated Cost

Generating Code for Conditional Statements

if x < y goto z CMP x, y

THE DAG REPRESENTATION FOR BASIC BLOCKS

Input: A basic block

Step 1: If y is undefined then create node(y).

If z is undefined, create node(z) for case(i).

MOV #here + 20, callee.static_area /It saves return address/

MOV #here + 16, SP /Save return address */

GOTO 0 ( SP ) /return to the caller */

(4) t2 := a[t1] /compute a[i] /

(6) t4 := b[t3] /compute b[i] /