Ca CT2

The document discusses various techniques for dealing with pipeline stalls caused by branch hazards in a processor pipeline, including branch prediction, branch target buffer, delayed branching, out-of-order execution, speculative execution, loop buffer, and software pipelining. It also analyzes the dependencies and hazards for different instruction sequences in a 5-stage pipeline with and without forwarding.

Uploaded by

krahul74714


Tuesday, October 31, 2023 10:53 PM

2. a. Consider the following instructions executed in a five-stage pipeline:

lw R1, 0(R2)
add R3, R4, R5
sub R6, R1, R3

Explain the flow of execution in the pipeline using a simple diagram.

b. For the instruction mix above, on which instruction results does the last add instruction depend?
3. Indicate the type of hazards and data dependences for the following instructions: (5)
b. lw R5, -16(R5)
sw R5, -16(R5)
add R5, R5, R5

4. For the above sequence, add nop instructions to eliminate the hazards. Assume that there is no
forwarding in this pipelined processor.
5. Discuss the schemes for dealing with the pipeline stalls caused by branch hazards. (3)

Discuss the schemes for dealing with the pipeline stalls caused by branch hazards.
ChatGPT
Sure, let's dive into the strategies for mitigating pipeline stalls caused by branch hazards.
1. Branch Prediction:
• Static Prediction: The branch direction is fixed before run time, for example always predict
not taken, or predict backward (loop) branches taken. It cannot adapt to data-dependent branches.
• Dynamic Prediction: Here, the processor predicts the outcome of a branch during runtime.
Two-level predictors, tournament predictors, and neural branch predictors are examples of
dynamic prediction techniques.
2. Branch Target Buffer (BTB):
• A BTB is a cache that stores target addresses of recent branches. When a branch instruction
is encountered, the BTB is checked to fetch the predicted target address. This can help in
reducing stalls as the pipeline can continue with the predicted instruction.
3. Delayed Branching:
• The instruction in the delay slot right after a branch is always executed, whether or not the
branch is taken. The compiler fills the slot with a useful instruction that is safe on either path,
or with a nop when none can be found, so no flush is needed and a cycle is wasted only in the nop case.
4. Out-of-Order Execution:
• Modern processors often use out-of-order execution, allowing instructions whose operands are
ready to execute ahead of program order. Combined with branch prediction, instructions after a
branch can run before its outcome is known; their results are committed in order if the prediction
was correct, and discarded otherwise.
5. Speculative Execution:
• Similar to out-of-order execution, speculative execution involves executing instructions based
on predicted branch outcomes. If the prediction is correct, the results are committed;
otherwise, the speculatively executed instructions are discarded.
6. Loop Buffer:
• A loop buffer is a small, fast buffer that holds the most recently fetched instructions. When a
small loop fits in it, the loop body is supplied from the buffer, so a taken loop branch does not
have to wait for a fetch from the instruction cache or memory.
7. Software Pipelining:
• This is a compiler optimization technique that reorganizes the code to minimize pipeline stalls.
It can be used to overlap the execution of multiple iterations of a loop, reducing the impact of
branch hazards.
Each of these techniques comes with its advantages and limitations. The effectiveness of a
particular scheme may depend on the characteristics of the code being executed and the
architecture of the processor.
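
As an illustration of dynamic prediction, here is a minimal sketch of a table of 2-bit saturating counters, a simpler scheme than the two-level and tournament predictors named above. The class name, the table size of 16 entries, and the indexing by low PC bits are assumptions made for this example, not details from the notes.

```python
class TwoBitPredictor:
    """Table of 2-bit saturating counters indexed by low PC bits (illustrative)."""

    def __init__(self, entries=16):
        self.entries = entries
        # Counter values 0 and 1 predict not taken; 2 and 3 predict taken.
        # Starting at 1 (weakly not taken) is an arbitrary choice.
        self.counters = [1] * entries

    def _index(self, pc):
        # Drop the two byte-offset bits of a word-aligned PC, then wrap.
        return (pc >> 2) % self.entries

    def predict(self, pc):
        return self.counters[self._index(pc)] >= 2

    def update(self, pc, taken):
        i = self._index(pc)
        if taken:
            self.counters[i] = min(3, self.counters[i] + 1)
        else:
            self.counters[i] = max(0, self.counters[i] - 1)


def mispredictions(predictor, pc, outcomes):
    """Count wrong predictions for one branch over a list of actual outcomes."""
    misses = 0
    for taken in outcomes:
        if predictor.predict(pc) != taken:
            misses += 1
        predictor.update(pc, taken)
    return misses


# A loop branch taken 9 times and then not taken, run twice.
loop_outcomes = ([True] * 9 + [False]) * 2
print(mispredictions(TwoBitPredictor(), 0x40, loop_outcomes))
```

Because the counter needs two wrong outcomes in a row to flip its prediction, the single not-taken exit of the loop does not disturb the prediction for the next run of the loop.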

From <https://chat.openai.com/c/cb22d345-9d92-4035-ad45-9cbc0fb32bb6>

4.13.1: lw loads a word from memory into a register, sw stores a register into memory, and add
adds its two operands and writes the sum to a register.

a. Instruction sequence:
I1: lw $1,40($6)
I2: add $6,$2,$2
I3: sw $6,50($1)

Dependences:
RAW on $1 from I1 to I3
RAW on $6 from I2 to I3
WAR on $6 from I1 to I2

b. Instruction sequence:
I1: lw $5,-16($5)
I2: sw $5,-16($5)
I3: add $5,$5,$5

Dependences:
RAW on $5 from I1 to I2 and I3
WAR on $5 from I1 and I2 to I3
WAW on $5 from I1 to I3
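
The dependence analysis above can be sketched in a few lines of Python. This is only an illustration: the `Instr` record and the `dependences` function are hypothetical names, and the destination and source registers are filled in by hand rather than parsed from the assembly text.

```python
from typing import NamedTuple, Optional, Tuple


class Instr(NamedTuple):
    text: str
    dest: Optional[str]    # register written, or None (sw writes no register)
    srcs: Tuple[str, ...]  # registers read


def dependences(seq):
    """Return a set of (kind, register, from_index, to_index), 1-based indices."""
    found = set()
    for j, later in enumerate(seq):
        for i in range(j):
            earlier = seq[i]
            if earlier.dest is not None:
                # RAW only counts if no instruction in between rewrites the register.
                overwritten = any(m.dest == earlier.dest for m in seq[i + 1:j])
                if earlier.dest in later.srcs and not overwritten:
                    found.add(("RAW", earlier.dest, i + 1, j + 1))
                if later.dest == earlier.dest:
                    found.add(("WAW", earlier.dest, i + 1, j + 1))
            if later.dest is not None and later.dest in earlier.srcs:
                found.add(("WAR", later.dest, i + 1, j + 1))
    return found


SEQ_B = [
    Instr("lw $5,-16($5)", "$5", ("$5",)),
    Instr("sw $5,-16($5)", None, ("$5",)),
    Instr("add $5,$5,$5", "$5", ("$5",)),
]

for kind, reg, src, dst in sorted(dependences(SEQ_B)):
    print(f"{kind} on {reg} from I{src} to I{dst}")
```

WAR and WAW pairs are reported for every earlier/later pair without checking for intervening writes, which is enough for three-instruction sequences like these.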

4.13.2: In the basic five-stage pipeline, WAR and WAW dependences do not cause any hazards. Without
forwarding, any RAW dependence between an instruction and either of the next two instructions
causes a hazard (assuming the register write happens in the first half of the clock cycle and the
register read happens in the second half). The code that eliminates these hazards by inserting nop
instructions is:

Instruction sequence

a.
lw $1,40($6)
add $6,$2,$2
nop
sw $6,50($1)

Delay I3 to avoid RAW hazard on $1 from I1

b.
lw $5,-16($5)
nop
nop
sw $5,-16($5)
add $5,$5,$5

Delay I2 to avoid RAW hazard on $5 from I1

Note: no RAW hazard on $5 from I1 now

4.13.3: With full forwarding, an ALU instruction can forward a value to the EX stage of the next
instruction without a hazard. However, a load cannot forward to the EX stage of the next instruction
(but it can to the instruction after that). The code that eliminates these hazards by inserting nop
instructions is:

Instruction sequence

a.
lw $1,40($6)
add $6,$2,$2
sw $6,50($1)

No RAW hazard on $1 from I1 (forwarded)

b.
lw $5,-16($5)
nop
sw $5,-16($5)
add $5,$5,$5

Delay I2 to avoid RAW hazard on $5 from I1

Value for $5 is forwarded from I2 now
Note: no RAW hazard on $5 from I1 now
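
The nop placement rules stated in 4.13.2 and 4.13.3 can be sketched as a small padding routine. This is an illustration under simplifying assumptions: `pad_with_nops` is a hypothetical helper, registers are listed by hand, and every source register is treated as needed in the EX stage (so the model does not distinguish a store's data operand from its base register).

```python
def pad_with_nops(seq, forwarding):
    """seq: list of (text, dest, srcs, is_load). Returns the padded list of texts.

    Minimum distance (in issue slots) from producer to consumer:
      no forwarding            -> 3 (write in first half, read in second half)
      full forwarding, ALU op  -> 1 (next instruction is fine)
      full forwarding, load    -> 2 (one slot between load and its user)
    """
    out = []
    last_write = {}  # register -> (slot of last writer in out, writer is a load)
    for text, dest, srcs, is_load in seq:
        need = 0
        for reg in srcs:
            if reg in last_write:
                slot, from_load = last_write[reg]
                if forwarding:
                    min_gap = 2 if from_load else 1
                else:
                    min_gap = 3
                gap = len(out) - slot
                need = max(need, min_gap - gap)
        out.extend(["nop"] * need)
        if dest is not None:
            last_write[dest] = (len(out), is_load)
        out.append(text)
    return out


SEQ_B = [
    ("lw $5,-16($5)", "$5", ("$5",), True),
    ("sw $5,-16($5)", None, ("$5",), False),
    ("add $5,$5,$5", "$5", ("$5",), False),
]

print(pad_with_nops(SEQ_B, forwarding=False))
print(pad_with_nops(SEQ_B, forwarding=True))
```

For sequence b this gives two nops after the lw without forwarding and one nop with full forwarding.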

4.13.4: The total execution time is the clock cycle time times the number of cycles. Without any stalls, a
three-instruction sequence executes in 7 cycles (5 to complete the first instruction, then one per
additional instruction). The execution without forwarding must add a stall cycle for every nop we had
in 4.13.2, and execution with forwarding must add a stall cycle for every nop we had in 4.13.3.
Overall, we get:

a.
No forwarding: (7 + 1) × 300ps = 2400ps
With forwarding: 7 × 400ps = 2800ps
Speed-up due to forwarding: 0.86 (This is really a slowdown)

b.
No forwarding: (7 + 2) × 200ps = 1800ps
With forwarding: (7 + 1) × 250ps = 2000ps
Speed-up due to forwarding: 0.90 (This is really a slowdown)
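
The arithmetic behind these figures can be checked with a short sketch. The function names are made up for this example; the cycle times and stall counts are the ones used in the figures above.

```python
def execution_time_ps(n_instr, stalls, cycle_ps, stages=5):
    """Cycles = stages for the first instruction, one per later instruction, plus stalls."""
    cycles = stages + (n_instr - 1) + stalls
    return cycles * cycle_ps


def speedup(old_ps, new_ps):
    """Ratio of old to new execution time; a value below 1 means a slowdown."""
    return old_ps / new_ps


# Sequence a: one stall at 300 ps without forwarding, no stall at 400 ps with it.
no_fwd_a = execution_time_ps(3, stalls=1, cycle_ps=300)
fwd_a = execution_time_ps(3, stalls=0, cycle_ps=400)
print(no_fwd_a, fwd_a, round(speedup(no_fwd_a, fwd_a), 2))

# Sequence b: two stalls at 200 ps without forwarding, one stall at 250 ps with it.
no_fwd_b = execution_time_ps(3, stalls=2, cycle_ps=200)
fwd_b = execution_time_ps(3, stalls=1, cycle_ps=250)
print(no_fwd_b, fwd_b, round(speedup(no_fwd_b, fwd_b), 2))
```

For such short sequences the longer clock cycle costs more than the removed stalls save, which is why both ratios come out below 1.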

4.13.5 With ALU-ALU-only forwarding, an ALU instruction can forward to the next instruction, but not to
the second-next instruction (because that would be forwarding from MEM to EX). A load cannot forward
at all, because it determines the data value in MEM stage, when it is too late for ALU-ALU forwarding.
We have:

Instruction sequence

a.
lw $1,40($6)
add $6,$2,$2
nop
sw $6,50($1)

Can't use ALU-ALU forwarding ($1 loaded in MEM)

b.
lw $5,-16($5)
nop
nop
sw $5,-16($5)
add $5,$5,$5

Can't use ALU-ALU forwarding ($5 loaded in MEM)

4.13.6: The total execution times of these instruction sequences with no forwarding and with
ALU-ALU forwarding only are given below.

a.
No forwarding: (7 + 1) × 300ps = 2400ps
With ALU-ALU forwarding only: (7 + 1) × 360ps = 2880ps
Speed-up with ALU-ALU forwarding: 0.83 (This is really a slowdown)

b.
No forwarding: (7 + 2) × 200ps = 1800ps
With ALU-ALU forwarding only: (7 + 2) × 220ps = 1980ps
Speed-up with ALU-ALU forwarding: 0.91 (This is really a slowdown)
