0% found this document useful (0 votes)

31 views53 pages

Pipeline Hazards

Pipeline hazards are situations that prevent the next instruction from executing during its designated clock cycle, reducing performance. There are three classes of hazards: structural, data, and control hazards, each arising from different conflicts in instruction execution. Solutions to these hazards include stalling the pipeline, forwarding results, and implementing branch prediction techniques.

Uploaded by

jekitoc589

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views53 pages

Pipeline Hazards

Uploaded by

jekitoc589

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 53

Pipeline Hazards

Pipeline Hazards
• There are situations, called hazards, that prevent the next instruction in the
instruction stream from executing during its designated clock cycle.
• Hazards reduce the performance from the ideal speedup gained by pipelining.
• There are three classes of hazards:
• 1. Structural hazards arise from resource conflicts when the hardware cannot
support all possible combinations of instructions simultaneously in overlapped
execution.
• 2. Data hazards arise when an instruction depends on the results of a previous
instruction in a way that is exposed by the overlapping of instructions in the pipeline.
• 3. Control hazards arise from the pipelining of branches and other instructions that
change the PC
• Hazards in pipelines can make it necessary to stall the pipeline.
• Avoiding a hazard often requires that some instructions in the
pipeline be allowed to proceed while others are delayed
Structural Hazards
• When a processor is pipelined, the overlapped execution of
instructions requires pipelining of functional units and duplication of
resources to allow all possible combinations of instructions in the
pipeline.
• If some combination of instructions cannot be accommodated
because of resource conflicts, the processor is said to have a
structural hazard.
Structural Hazards
• The most common instances of structural hazards arise
when some functional unit is not fully pipelined.
• Then a sequence of instructions using that unpipelined unit
cannot proceed at the rate of one per clock cycle.
• Another common way that structural hazards appear is
when some resource has not been duplicated enough to
allow all combinations of instructions in the pipeline to
execute.
• For example, a processor may have only one register-file
write port, but under certain circumstances, the pipeline
might want to perform two writes in a clock cycle. This will
generate a structural hazard
Structural Hazards
• When a sequence of instructions encounters this hazard, the pipeline
will stall one of the instructions until the required unit is available.
Solution 1-Structural Hazard: stall

Stall Instr i+3

till CC 5
Solution 2-Structural Hazard
Data Hazards

• A major effect of pipelining is to change the relative timing of

instructions by overlapping their execution.
• This overlap introduces data and control hazards.
• Data hazards occur when the pipeline changes the order of read/write
accesses to operands so that the order differs from the order seen by
sequentially executing instructions on an unpipelined processor
•C
• The DADD instruction writes the value of R1 in the WB pipe stage, but
the DSUB instruction reads the value during its ID stage. This problem
is called a data hazard.
SOLUTIONS 1: Forwarding
• directly feed back EX/MEM&MEM/WB pipeline registers’ results to
the ALU inputs;

• if forwarding hardware detects that previous ALU has written the reg
corresponding to a source for the current ALU,control logic selects the
forwarded result as the ALU input
• Generalized forwarding
-pass a result directly to the functional unit that requires it;

-forward results to not only ALU inputs but also other types of
functional units;
Data Hazards Requiring Stalls
• Unfortunately, not all potential data hazards can be handled by
bypassing.
• Consider the following sequence of instructions:
LD R1,0(R2)
DSUB R4,R1,R5
AND R6,R1,R7
OR R8,R1,R9
The LD instruction does not have the data until the end of clock cycle 4 (its MEM
cycle), while the DSUB instruction needs to have the data by the beginning of that
clock cycle.
Thus, the data hazard from using the result of a load instruction cannot be
completely eliminated with simple hardware
Solution 2 – STALL
ADAS.MCNSAKLchlk/jn

ADD R1,R2,R3
SUB R4,R1,R5
AND R6,R1,R7
OR R8,R1,R9
• ADD R1,R2,R3
• LOAD R4,8 (R1)
• STR R4 ,12(R1)
• ADD R1,R4 R3
• LOAD R1,0 (R3)
• SUB R4,R1,R5
• AND R6 R4 R1
• OR R8 R4 R1
• The IF, ID and WB stages take one clock cycle each to complete the
operation. The number of clock cycles for the EX stage depends on
the instruction. The ADD and SUB instructions need 1 clock cycle and
the MUL instruction needs 3 clock cycles in the EX stage. Operand
forwarding is used in the pipelined processor. What is the number of
clock cycles taken to complete the following sequence of instructions?
• ADD R2, R1, R0 R2 <- R0 + R1
• MUL R4, R3, R2 R4 <- R3 * R2
• SUB R6, R5, R4 R6 <- R5 - R4
• A 5-stage pipelined processor has Instruction Fetch(IF),Instruction
Decode(ID),Operand Fetch(OF),Execution (EXE)and Write
Operand(WO)stages.The IF,ID,OF and WO stages take 1 clock cycle each for any
instruction.The EXE stage takes 1 clock cycle for ADD and SUB instructions,3
clock cycles for MUL instruction,and 6 clock cycles for DIV instruction
respectively.Operand forwarding is used in the pipeline.What is the number of
clock cycles needed to execute the following sequence of instructions?

• Instruction Meaning of instruction

• I0 :MUL R2 ,R0 ,R1 R2 ¬ R0 *R1
• I1 :DIV R5 ,R3 ,R4 R5 ¬ R3/R4
• I2 :ADD R2 ,R5 ,R2 R2 ¬ R5+R2
• I3 :SUB R5 ,R2 ,R6 R5 ¬ R2-R6
Branch Hazards
• Control hazards are called Branch hazards and caused by
Branch Instructions.
• Branch instructions control the flow of program/
instructions execution
• Control hazards are caused by branches in the code.
• During the IF stage remember that the PC is incremented by 4 in
preparation for the next IF cycle of the next instruction.
• What happens if there is a branch performed and we aren’t simply
incrementing the PC by 4.
• The easiest way to deal with the occurrence of a branch is to perform
the IF stage again once the branch occurs.
The instruction after the branch is fetched, but the instruction is
ignored, and the fetch is restarted once the branch target is
known.
It is probably obvious that if the branch is not taken, the second
IF for branch successor is redundant. This will be addressed
shortly.
Reducing Pipeline Branch Penalties
• First solution
• The simplest scheme to handle branches is to freeze or flush the pipeline, holding
or deleting any instructions after the branch until the branch destination is known.
• Second Solution
• . In the simple five-stage pipeline, this predicted-not-taken or predicted untaken
scheme is implemented by continuing to fetch instructions as if the branch were a
normal instruction.
• The pipeline looks as if nothing out of the ordinary is happening. If the branch is
taken, however, we need to turn the fetched instruction into a no-op and restart the
fetch at the target address
• Another scheme in use in some processors is called delayed branch. This
technique was heavily used in early RISC processors and works reasonably well
in the five-stage pipeline.
• In a delayed branch, the execution cycle with a branch delay of one is
• branch instruction
• sequential successor1
• branch target if taken
• The sequential successor is in the branch delay slot.
• This instruction is executed whether or not the branch is taken.
Reducing the Cost of Branches
through Prediction
• Static Branch Prediction
• A key way to improve compile-time branch prediction is to use
profile information collected from earlier runs.
• The key observation that makes this worthwhile is that the behavior
of branches is often bimodally distributed; that is, an individual
branch is often highly biased toward taken or untaken
Dynamic Branch Prediction and
Branch-Prediction Buffers
• The simplest dynamic branch-prediction scheme is a branch-
prediction buffer or branch history table.
• A branch-prediction buffer is a small memory indexed by the lower
portion of the address of the branch instruction.
• The memory contains a bit that says whether the branch was recently
taken or not.
• This scheme is the simplest sort of buffer; it has no tags and is useful
only to reduce the branch delay when it is longer than the time to
compute the possible target PCs
Dynamic Branch Prediction and
Branch-Prediction Buffers
• If the branch is taken the bit is set to 1. The next time the branch
instruction is fetched we will know that the branch occurred and we
can assume that the branch will be taken.
• This scheme adds some “history” to our previous discussion on
“branch taken” and “branch not taken” control hazard avoidance
2-bit Prediction Scheme
• This method is more reliable than using a single bit to represent
whether the branch was recently taken or not.
• The use of a 2-bit predictor will allow branches that favor taken (or
not taken) to be mispredicted less often than the one-bit case.

ENGR9861 Winter 2005 JPR

ENGR9861 Winter 2005 JPR
Datapath and control considerations

1.There are separate instruction and data caches that use

separate address and data connections to the processor. This
requires two versions of the MAR register, IMAR for accessing
tile instruction cache and DMAR for accessing the data cache.
2.The PC is connected directly to the IMAR, so that the contents
of the PC can be transferred to IMAR at the same time that an
independent ALU operation is taking place.
3.The data address in DMAR can be obtained directly from the
register file or from the ALU to support the register indirect and
indexed addressing modes.
Datapath and control considerations
4.Separate MDR registers are provided for read and write
operations. Data can be transferred directly between these
registers and the register file during load and store operations
without the need to pass through the ALU.
5.Buffer registers have been introduced at the inputs and output
of the ALU. These are registers SRCl, SRC2, and RSLT.
Forwarding connections may be added if desired.
6.The instruction register has been replaced with an instruction
queue, which is loaded from the instruction cache.
Load / Store Architecture
• RISC is referred to as Load/Store architecture.
• Alternatively the operations in its instruction set are defined as Register-to-Register
operations.
• The reason is that all the RISC machine operations are between the operands that reside in
the General Purpose Register File (GPR).
• The result of the operation is also written back to GPR. Restricting the locations of the
operands to the GPR only, allows for determinism in the RISC operation.
• In the other words, a potentially multi-cycle and unpredictable access to memory has
been separated from the operation. Once the operands are available in the GPR the
operation can proceed in a deterministic fashion.
• It is almost certain that once commenced the operation will be completed in the number of
cycled determined by the pipeline depth and the result will be written back into the GPR.
• Memory Access is accomplished through Load and Store instructions
only, thus the term “Load/Store Architecture” is often used when
referring to RISC.
• The RISC pipeline is specified in a way in which it must accommodate
both: operation and memory access with equal efficiency.

Unit 3
No ratings yet
Unit 3
94 pages
05 Risc V Pipeline
No ratings yet
05 Risc V Pipeline
31 pages
Ch#16 (CPU Structure and Function)
No ratings yet
Ch#16 (CPU Structure and Function)
48 pages
CH14-WS - 10thed - Pipeline
No ratings yet
CH14-WS - 10thed - Pipeline
16 pages
Unit 5.2 Processor
No ratings yet
Unit 5.2 Processor
40 pages
Pipeline - Instr - Super Branch
No ratings yet
Pipeline - Instr - Super Branch
48 pages
Pipelining 2019
No ratings yet
Pipelining 2019
82 pages
Slides Chapter 6 Pipelining
No ratings yet
Slides Chapter 6 Pipelining
60 pages
Moduel 5
No ratings yet
Moduel 5
46 pages
Presentation 1
No ratings yet
Presentation 1
22 pages
Pipelining (All Slides)
No ratings yet
Pipelining (All Slides)
45 pages
CH 6
No ratings yet
CH 6
29 pages
Pipeline Part 2
No ratings yet
Pipeline Part 2
7 pages
Unit-V: Performance Enhancement Techinques
No ratings yet
Unit-V: Performance Enhancement Techinques
61 pages
4-Pipeline
No ratings yet
4-Pipeline
30 pages
10 Pipelining
No ratings yet
10 Pipelining
44 pages
CAP EndSem Unit 5
No ratings yet
CAP EndSem Unit 5
8 pages
Unit 6
No ratings yet
Unit 6
20 pages
Pipelining New
No ratings yet
Pipelining New
33 pages
CoA Batch13
No ratings yet
CoA Batch13
30 pages
COA Unit - V Notes
No ratings yet
COA Unit - V Notes
21 pages
SIMD Machines:: Pipeline System
No ratings yet
SIMD Machines:: Pipeline System
35 pages
Pipeline Hazards: Structural Hazards: Resource Conflict
No ratings yet
Pipeline Hazards: Structural Hazards: Resource Conflict
49 pages
Pipeline Hazard
No ratings yet
Pipeline Hazard
8 pages
Co - Unit Ii - Ii
No ratings yet
Co - Unit Ii - Ii
34 pages
Pipelining
No ratings yet
Pipelining
5 pages
Pipelining: Basic Concepts
No ratings yet
Pipelining: Basic Concepts
20 pages
Lec5 PDF
No ratings yet
Lec5 PDF
23 pages
Dpco Unit 4
No ratings yet
Dpco Unit 4
21 pages
31 Pipeline Hazards 25-04-2024
No ratings yet
31 Pipeline Hazards 25-04-2024
35 pages
Computer Architecture M2 (Part 3)
No ratings yet
Computer Architecture M2 (Part 3)
34 pages
Kuliah 14 Pipeliningg
No ratings yet
Kuliah 14 Pipeliningg
28 pages
Enhancing Performance With Pipelining
No ratings yet
Enhancing Performance With Pipelining
85 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
4 pages
CA Unit-2 Chapter-2
No ratings yet
CA Unit-2 Chapter-2
36 pages
Pipelining
No ratings yet
Pipelining
44 pages
CA Unit 3 Answers
No ratings yet
CA Unit 3 Answers
10 pages
Pipelinehazard 160823134502
No ratings yet
Pipelinehazard 160823134502
61 pages
Coa Unit 4
No ratings yet
Coa Unit 4
10 pages
CH14 COA9e Processor Structure and Function
No ratings yet
CH14 COA9e Processor Structure and Function
40 pages
Pipeline Hazards. Presentation
100% (2)
Pipeline Hazards. Presentation
20 pages
Lecture 13-14: Pipelines Hazards": Suggested Reading:" (HP Chapter 4.5-4.7) "
No ratings yet
Lecture 13-14: Pipelines Hazards": Suggested Reading:" (HP Chapter 4.5-4.7) "
51 pages
Pipelinehazard For Class
No ratings yet
Pipelinehazard For Class
61 pages
DLCO Module 6 Sem 3
No ratings yet
DLCO Module 6 Sem 3
40 pages
CA-unit 4-Material
No ratings yet
CA-unit 4-Material
31 pages
Pipeline Hazards Detailed Notes
No ratings yet
Pipeline Hazards Detailed Notes
49 pages
Lect3 Pipeline
No ratings yet
Lect3 Pipeline
4 pages
CO Pipelining PDF Notes
No ratings yet
CO Pipelining PDF Notes
10 pages
Pipelining
No ratings yet
Pipelining
29 pages
Tuesday, October 31, 2023 10:53 PM: Discuss, The Schemes For Dealing With The Pipeline Stalls Caused by Branch Hazards
No ratings yet
Tuesday, October 31, 2023 10:53 PM: Discuss, The Schemes For Dealing With The Pipeline Stalls Caused by Branch Hazards
7 pages
Week 4 - Pipelining
No ratings yet
Week 4 - Pipelining
44 pages
2.1: Advanced Processor Technology: Qn:Explain Design Space of Processor?
No ratings yet
2.1: Advanced Processor Technology: Qn:Explain Design Space of Processor?
29 pages
Instruction Pipelining
No ratings yet
Instruction Pipelining
32 pages
L10-L11-Instruction Pipelining
No ratings yet
L10-L11-Instruction Pipelining
38 pages
Chapter 8 - Pipelining
No ratings yet
Chapter 8 - Pipelining
38 pages
COA Unit 3
No ratings yet
COA Unit 3
89 pages
Content: - Introduction To Pipeline Hazard - Structural Hazard - Data Hazard - Control Hazard
No ratings yet
Content: - Introduction To Pipeline Hazard - Structural Hazard - Data Hazard - Control Hazard
27 pages
Pipelining PDF
No ratings yet
Pipelining PDF
70 pages
CS17303 Computer Architecture Notes On Lesson Unit IV - Sumathi
No ratings yet
CS17303 Computer Architecture Notes On Lesson Unit IV - Sumathi
24 pages
Homework 2
No ratings yet
Homework 2
8 pages
Test 1
No ratings yet
Test 1
51 pages
COA Question Bank PDF
No ratings yet
COA Question Bank PDF
27 pages
Single and Multi Cycle Pipelined Units
No ratings yet
Single and Multi Cycle Pipelined Units
18 pages
Heterogeneous Soc Design and Verification HWSW Coexploration Codesign Coverification and Codebugging Khaled Salah Mohamed PDF Download
No ratings yet
Heterogeneous Soc Design and Verification HWSW Coexploration Codesign Coverification and Codebugging Khaled Salah Mohamed PDF Download
59 pages
Ca Notes (Chatgpt)
No ratings yet
Ca Notes (Chatgpt)
245 pages
Chap. 9 Pipeline and Vector Processing
No ratings yet
Chap. 9 Pipeline and Vector Processing
16 pages
UNIT-5: Pipeline and Vector Processing
No ratings yet
UNIT-5: Pipeline and Vector Processing
63 pages
Unit Iv Coa - PPT
No ratings yet
Unit Iv Coa - PPT
99 pages
PCC-CS402
No ratings yet
PCC-CS402
7 pages
Solution 2
No ratings yet
Solution 2
3 pages
Unit 3
No ratings yet
Unit 3
55 pages
7COA Slides-1
No ratings yet
7COA Slides-1
26 pages
How Data Hazards Can Be Removed Effectively
No ratings yet
How Data Hazards Can Be Removed Effectively
6 pages
Computer Network - Lab Manuals
No ratings yet
Computer Network - Lab Manuals
29 pages
Pipelining Basic and Intermediate Concepts
No ratings yet
Pipelining Basic and Intermediate Concepts
75 pages
Courseproject - Computers Assignment Design Compilers .
No ratings yet
Courseproject - Computers Assignment Design Compilers .
6 pages
8 - RISCV - Pipelined - Arch2
No ratings yet
8 - RISCV - Pipelined - Arch2
57 pages
Coa Unit - 5 Notes
No ratings yet
Coa Unit - 5 Notes
6 pages
Co Unit3
No ratings yet
Co Unit3
41 pages
Mips Processor Using Cisc Architecture
No ratings yet
Mips Processor Using Cisc Architecture
23 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
37 pages
Computer ArchitectureT4
No ratings yet
Computer ArchitectureT4
7 pages
Internal Structure of CPU - PDF - Central Processing Unit - Integrated Circuit - 1635563291339
No ratings yet
Internal Structure of CPU - PDF - Central Processing Unit - Integrated Circuit - 1635563291339
7 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
94 pages
Classic RISC Pipeline
No ratings yet
Classic RISC Pipeline
10 pages
RISC Instruction Set:: I) Data Manipulation Instructions
No ratings yet
RISC Instruction Set:: I) Data Manipulation Instructions
8 pages
Homework1 PDF
No ratings yet
Homework1 PDF
4 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Pipeline Hazards

Uploaded by

Pipeline Hazards

Uploaded by

Pipeline Hazards

Stall Instr i+3

• A major effect of pipelining is to change the relative timing of

• Instruction Meaning of instruction

ENGR9861 Winter 2005 JPR

1.There are separate instruction and data caches that use

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.