
UNIT 5

Pipeline Hazards
Earlier we mentioned that memory limits the speed of the CPU. Pipelining adds one more case. In a pipelined design, several instructions are simultaneously in some stage of execution, and there can be dependencies among this set of instructions that limit the speed of the pipeline. These dependencies arise for a few reasons, which we discuss shortly. Dependencies in the pipeline are called hazards because they endanger the execution; the words dependency and hazard are used interchangeably in computer architecture. Essentially, the occurrence of a hazard prevents an instruction in the pipe from being executed in its designated clock cycle. We say clock cycle because each of these instructions may be in a different machine cycle of its own.

There are three kinds of hazards:

 Structural Hazards
 Data Hazards
 Control Hazards

There are many specific solutions to dependencies. The simplest is introducing a bubble, which stalls the pipeline and reduces throughput. The bubble makes the next instruction wait until the earlier instruction has finished with the conflicting stage.
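The effect of a bubble is easiest to see on a timing chart. Below is a minimal Python sketch, not taken from the text, that prints stage occupancy cycle by cycle for a hypothetical 4-stage pipeline (the stage names IF, ID, IE, RW are assumed from the stages mentioned later in this unit); starting one instruction a cycle late shows how every instruction behind it slips as well.

# Minimal sketch (assumed 4-stage pipeline): print which stage each
# instruction occupies in each clock cycle.
STAGES = ["IF", "ID", "IE", "RW"]

def timetable(start_cycles):
    """start_cycles[i] = clock cycle in which instruction i enters IF."""
    rows = []
    for i, start in enumerate(start_cycles):
        rows.append((f"I{i+1}", {start + s: STAGES[s] for s in range(len(STAGES))}))
    last = max(max(row) for _, row in rows)
    print("     " + " ".join(f"t{t:<3}" for t in range(1, last + 1)))
    for name, row in rows:
        print(f"{name}:  " + " ".join(f"{row.get(t, '--'):<4}" for t in range(1, last + 1)))

timetable([1, 2, 3])   # ideal overlap: one instruction enters the pipe per cycle
print()
timetable([1, 3, 4])   # I2 held back by one bubble; I3 and everything after it slip too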

Structural Hazards
Structural hazards arise from hardware resource conflicts among the instructions in the pipeline. A resource here could be memory, a register in the GPR file, or the ALU. A resource conflict is said to occur when more than one instruction in the pipe requires access to the same resource in the same clock cycle; this is a situation the hardware cannot handle for every possible combination of instructions in overlapped pipelined execution. Rather than always stalling, a better solution is to increase the structural resources in the system using one of the choices below:

 The pipeline may be extended to 5 or more stages, with the functionality of the stages suitably redefined and the clock frequency adjusted. This eliminates the hazard that otherwise recurs at every 4th instruction in a 4-stage pipeline.
 The memory may be physically separated into Instruction memory and Data memory. A better choice is to design these as cache memories inside the CPU rather than dealing with main memory. IF uses the Instruction memory and result writing uses the Data memory, so the two become separate resources and the dependency is avoided.
 It is also possible to have multiple levels of cache in the CPU.
 The ALU can itself be the contended resource. One instruction may require the ALU in its IE machine cycle while another requires it in the IF stage to calculate an effective address, depending on the addressing mode. The solution is either stalling or providing an exclusive ALU for address calculation.
 Register files are used in place of GPRs. Register files have multiport access with exclusive read and write ports, which enables simultaneous access to one write register and one read register.

The last two methods are implemented in modern CPUs. Beyond these, if a dependency still arises, stalling is the only option. Keep in mind that increasing resources increases cost, so the trade-off is the designer's choice.
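To make the notion of a resource conflict concrete, here is a minimal Python sketch, not from the text, assuming a single shared memory, one register file and one ALU in a 4-stage pipeline; it simply counts how many overlapped instructions touch each resource in each clock cycle. Splitting instruction and data memory, or giving the register file separate read and write ports as described above, removes the corresponding conflicts.

# Minimal sketch: count competing uses of each resource per clock cycle.
# The stage-to-resource mapping below is an illustrative assumption.
NEEDS = {"IF": {"memory"}, "ID": {"registers"}, "IE": {"alu"}, "RW": {"memory", "registers"}}
STAGES = ["IF", "ID", "IE", "RW"]

def conflicts(n_instructions):
    """Return (cycle, resource) pairs where more than one instruction competes."""
    usage = {}                                    # cycle -> {resource: count}
    for i in range(n_instructions):
        for s, stage in enumerate(STAGES):
            cycle = i + s + 1                     # instruction i enters IF in cycle i+1
            for res in NEEDS[stage]:
                usage.setdefault(cycle, {}).setdefault(res, 0)
                usage[cycle][res] += 1
    return sorted((c, r) for c, by_res in usage.items() for r, n in by_res.items() if n > 1)

print(conflicts(4))   # with one shared memory, the IF of the 4th instruction collides
                      # with the result write of the 1st in the same cycle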

Data Hazards
Data hazards occur when an instruction's execution depends on the result of a previous instruction that is still being processed in the pipeline; in other words, an instruction depends on data from an instruction ahead of it in the pipe. Consider, as an example, an ADD that writes a register followed immediately by a SUB that reads that register (the same pattern as the RAW example later in this unit).

Solution 1: Introduce three bubbles at the SUB instruction's IF stage. This allows SUB's ID stage to function at t6. Subsequently, all the following instructions in the pipe are delayed as well.

Solution 2: Data forwarding – Forwarding passes a result directly to the functional unit that requires it: the result is forwarded from the output of one unit to the input of another. The purpose is to make the result available to the next instruction early.
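A minimal Python sketch of the idea follows (the register values and the two-instruction program are assumptions made for the illustration, and the single-entry bypass models only the adjacent-instruction case): the result produced by ADD is handed to the following SUB through a bypass rather than through the register file.

# Minimal sketch of result forwarding: the previous instruction's ALU output
# is bypassed to the next instruction's ALU input instead of waiting for
# the register write-back.
regs = {"R0": 0, "R1": 7, "R2": 5, "R3": 20, "R4": 0}

def execute(program):
    bypass = {}                               # register -> value still in the pipeline
    for op, dst, src1, src2 in program:
        a = bypass.get(src1, regs[src1])      # take the forwarded value if one exists
        b = bypass.get(src2, regs[src2])
        result = a + b if op == "ADD" else a - b
        bypass = {dst: result}                # forward this result to the next instruction
        regs[dst] = result                    # write-back (happens later in real hardware)

execute([("ADD", "R0", "R1", "R2"),           # R0 = R1 + R2 = 12
         ("SUB", "R4", "R3", "R0")])          # R4 = R3 - R0, using the forwarded R0
print(regs["R4"])                             # 20 - 12 = 8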
Solution 3: The compiler can detect the data dependency and reorder (resequence) the instructions suitably while generating the executable code. This eases the burden on the hardware.

Solution 4: In the event that such reordering is infeasible, the compiler may detect the dependency and introduce NOP (no-operation) instruction(s). A NOP is a dummy instruction, equivalent to a bubble, introduced by the software.

The compiler looks into data dependencies in the code-optimisation stage of the compilation process.
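A minimal Python sketch of Solution 4 (the instruction encoding and the three-cycle delay, chosen to match the three bubbles of Solution 1, are assumptions for the illustration): the "compiler" scans the instruction list and pads a dependent instruction with enough NOPs to cover the delay.

# Minimal sketch: insert NOPs when an instruction reads a register written by
# an instruction issued fewer than DELAY positions earlier.  Real compilers
# try to reorder useful instructions first and fall back to NOPs.
DELAY = 3                                 # assumed result latency, matching Solution 1

def insert_nops(program):                 # program: list of (op, dst, src1, src2)
    out = []
    for op, dst, s1, s2 in program:
        gap = 0
        for prev in reversed(out):
            gap += 1
            if prev[0] != "NOP" and prev[1] in (s1, s2) and gap <= DELAY:
                out.extend([("NOP", None, None, None)] * (DELAY - gap + 1))
                break
        out.append((op, dst, s1, s2))
    return out

code = [("ADD", "R0", "R1", "R2"), ("SUB", "R4", "R3", "R0")]
print([ins[0] for ins in insert_nops(code)])   # ['ADD', 'NOP', 'NOP', 'NOP', 'SUB']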

Data Hazards classification

Data hazards are classified into three categories based on the order of READ and WRITE operations on the register, as follows:

1. RAW (Read after Write) [Flow/True data dependency]

Given two instructions I and J, where I comes before J, instruction J should read an operand only after it has been written by I. This is called a true data dependence in compiler terminology. It is the case where an instruction uses data produced by a previous one. Example:

ADD R0, R1, R2
SUB R4, R3, R0

2. WAR (Write after Read) [Anti data dependency]

This is the case where the second instruction writes to a register before the first instruction reads it. This is rare in a simple pipeline structure; however, in some machines with complex and special instructions, WAR can happen.

ADD R2, R1, R0
SUB R0, R3, R4

3. WAW (Write after Write) [Output data dependency]

This is the case where two instructions executing in parallel write to the same register and must do so in the order in which they were issued.

ADD R0, R1, R2
SUB R0, R4, R5

WAW and WAR hazards can occur only when instructions are executed in parallel or out of order. They arise because the compiler has allotted the same register numbers even though this is avoidable. The situation is fixed either by the compiler renaming one of the registers or by delaying the update of the register until the appropriate value has been produced. Modern CPUs have incorporated not only parallel execution with multiple ALUs but also out-of-order issue and execution of instructions, along with pipelines of many stages.
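The three cases can be checked mechanically. The following Python sketch (the tuple encoding of instructions is an assumption for the illustration) applies the definitions above to the three register examples just given.

# Minimal sketch: classify the hazard between two instructions I and J,
# where I is issued before J.  Each instruction is (op, destination, src1, src2).
def classify(i, j):
    hazards = []
    if i[1] in (j[2], j[3]):
        hazards.append("RAW")        # J reads what I writes  (true dependence)
    if j[1] in (i[2], i[3]):
        hazards.append("WAR")        # J writes what I reads  (anti-dependence)
    if j[1] == i[1]:
        hazards.append("WAW")        # J writes what I writes (output dependence)
    return hazards or ["none"]

print(classify(("ADD", "R0", "R1", "R2"), ("SUB", "R4", "R3", "R0")))  # ['RAW']
print(classify(("ADD", "R2", "R1", "R0"), ("SUB", "R0", "R3", "R4")))  # ['WAR']
print(classify(("ADD", "R0", "R1", "R2"), ("SUB", "R0", "R4", "R5")))  # ['WAW']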

Control Hazards
Control hazards, also called branch hazards, are caused by branch instructions. Branch instructions control the flow of program execution. Recall that we use conditional statements in a higher-level language either for iterative loops or for condition checking (correlate with for, while, if and case statements). These are transformed into one of the variants of BRANCH instructions. The value of the condition being checked must be known before the program flow is known. Life gets complicated for you; so it does for the CPU!

Thus a control hazard occurs when the decision to execute an instruction depends on the result of another instruction, such as a conditional branch, which checks the condition's resulting value.

The branch and jump instructions decide the program flow by loading the appropriate location into the Program Counter (PC). The PC holds the address of the next instruction to be fetched and executed by the CPU. Consider what happens when a conditional branch enters the pipe: until the condition is resolved, the CPU does not know which address belongs in the PC.
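A minimal Python sketch of the PC update (the 4-byte instruction size and the parameter names are assumptions for the illustration): the two candidate values below are what the fetch stage has to choose between, and the correct choice is unknown until the condition is evaluated.

# Minimal sketch: what goes into the PC after a (possibly conditional) branch.
def next_pc(pc, is_branch, condition_true, target):
    if is_branch and condition_true:
        return target                 # branch taken: the PC gets the branch target
    return pc + 4                     # otherwise: fall through to the next instruction

print(hex(next_pc(0x100, is_branch=True, condition_true=True,  target=0x200)))  # 0x200
print(hex(next_pc(0x100, is_branch=True, condition_true=False, target=0x200)))  # 0x104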

Solutions for Conditional Hazards


1. Stall the pipeline as soon as any kind of branch instruction is decoded – simply do not allow any further IF. As always, stalling reduces throughput. Statistics say that in a program at least 30% of the instructions are branches, so with stalling the pipeline essentially operates at around 50% capacity.
2. Prediction – Imagine a for or while loop executing 100 times. We know that for 100 iterations the program flows on without the branch condition being met; only on the 101st time does the program come out of the loop. So it is wiser to let the pipeline proceed and undo/flush only when the branch condition is met. This does not hurt the throughput of the pipeline as much as stalling.
3. Dynamic Branch Prediction – A history record is maintained with the help of a Branch Target Buffer (BTB). The BTB is a kind of cache with a set of entries, each holding the PC address of a branch instruction and the corresponding effective branch (target) address. An entry is maintained for every branch instruction encountered. Whenever a conditional branch instruction is encountered, a lookup of the BTB is made for a matching branch instruction address. On a hit, the corresponding target address is used for fetching the next instruction. This is called dynamic branch prediction (a minimal sketch appears after this list).

Figure 16.6
Branch Target Buffer

This method is successful to the extent of the temporal locality of reference in the program. When the prediction fails, flushing has to take place.

4. Reordering instructions – Delayed branch, i.e. reordering the instructions so that the branch instruction sits later in the order, with safe and useful instructions that are not affected by the result of the branch brought in earlier in the sequence, thus delaying the branch instruction fetch. If no such instructions are available, a NOP is introduced. This delayed branch is applied with the help of the compiler.
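For solution 3, here is a minimal Python sketch of the BTB idea (a dictionary keyed by the branch instruction's PC; the 4-byte instruction size and the addresses are assumptions, and a real BTB is a small hardware cache, usually with extra prediction bits): on a hit the stored target is used for the next fetch, and a wrong guess costs a flush, which is why the scheme works only to the extent that a branch behaves as it did on earlier encounters.

# Minimal sketch of a branch target buffer.
btb = {}                              # branch PC -> last observed target address

def next_fetch_pc(pc):
    """Predict the next PC: the BTB target on a hit, the fall-through otherwise."""
    return btb.get(pc, pc + 4)

def on_branch_resolved(pc, taken, target):
    """Update the BTB once the outcome is known; flush on a misprediction."""
    predicted = next_fetch_pc(pc)
    actual = target if taken else pc + 4
    if taken:
        btb[pc] = target              # remember the target for the next encounter
    else:
        btb.pop(pc, None)
    if predicted != actual:
        print(f"misprediction at {pc:#x}: flush and refetch from {actual:#x}")

# A loop-closing branch at 0x100 that is taken twice and then falls through:
for taken in (True, True, False):
    on_branch_resolved(0x100, taken, target=0x080)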

------------------------------------------------------------------------------------

Instruction Level Parallelism:-

Instruction Level Parallelism (ILP) refers to architectures in which multiple operations can be performed in parallel within a particular process, with its own set of resources – address space, registers, identifiers, state and program counter. It covers the compiler-design techniques and processor designs that execute operations such as memory load and store, integer addition and floating-point multiplication in parallel to improve processor performance. Examples of architectures that exploit ILP are VLIW and superscalar architectures.
ILP processors have much the same execution hardware as RISC processors, whereas machines without ILP rely on complex hardware that is hard to implement. A typical ILP processor allows multiple-cycle operations to be pipelined.

Architecture:

Instruction-level parallelism is achieved when multiple operations are performed in a single cycle, either by executing them simultaneously or by utilising the gaps between two successive operations that arise from their latencies.
The decision of when to execute an operation now depends largely on the compiler rather than the hardware. However, the extent of the compiler's control depends on the type of ILP architecture, since the amount of information about parallelism that the compiler conveys to the hardware through the program varies. ILP architectures can be classified in the following ways –
1. Sequential Architecture:
Here, the program is not expected to convey any explicit information regarding parallelism to the hardware, as in a superscalar architecture.
2. Dependence Architectures:
Here, the program explicitly states the dependencies between operations, as in a dataflow architecture.
3. Independence Architecture:
Here, the program states which operations are independent of one another, so that they can be executed in place of the 'nop's.
In order to apply ILP, the compiler and hardware must determine the data dependencies, identify the independent operations, and handle the scheduling of these operations, the assignment of functional units, and the registers used to store data.
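A minimal Python sketch of an independence-style decision (the operation encoding, the two-slot issue width and the greedy in-order grouping are illustrative assumptions, not a real VLIW scheduler): mutually independent operations are packed into the same issue group, and a dependency or a full group forces the start of a new one.

# Minimal sketch: pack mutually independent operations into the same issue group.
# An operation is (name, destination, set_of_source_registers).
def independent(a, b):
    return a[1] not in b[2] and b[1] not in a[2] and a[1] != b[1]

def schedule(ops, width=2):               # 'width' issue slots per cycle (assumed)
    groups = [[]]
    for op in ops:
        group = groups[-1]
        if len(group) < width and all(independent(op, other) for other in group):
            group.append(op)              # issue alongside the current group
        else:
            groups.append([op])           # dependency or full group: new cycle
    return groups

ops = [("load",  "r1", {"r0"}),
       ("add",   "r2", {"r1", "r3"}),     # depends on the load
       ("fmul",  "r4", {"r5", "r6"}),     # independent of both
       ("store", None, {"r2", "r7"})]     # depends on the add
for cycle, group in enumerate(schedule(ops), 1):
    print(cycle, [name for name, _, _ in group])   # 1 ['load']  2 ['add', 'fmul']  3 ['store']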

-----------------------------------------------------------------------------------------------------------------
What is the indirect instruction cycle? Explain the data flow in it.

• The execution of an instruction may involve one or more operands in memory, each of which requires a memory access.
• Further, if indirect addressing is used, then additional memory accesses are required.
• We can think of the fetching of indirect addresses as one more instruction stage.
• The main line of activity consists of alternating instruction fetch and instruction execution activities.
• After an instruction is fetched, it is examined to determine whether any indirect addressing is involved.
• If so, the required operands are fetched using indirect addressing.
• Following execution, an interrupt may be processed before the next instruction fetch.

DATA Flow:-
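The data flow is a sequence of register transfers, sketched below in minimal Python based on the bullet points above (MAR, MBR, IR, the accumulator, the tiny memory image and the ADD_INDIRECT encoding are all illustrative assumptions): the extra step in the indirect cycle is the additional memory access that turns the instruction's address field into the operand's real address.

# Minimal sketch of the data flow for one instruction with indirect addressing.
memory = {0x10: ("ADD_INDIRECT", 0x20),   # instruction: its address field points to 0x20
          0x20: 0x30,                     # that location holds the operand's real address
          0x30: 42}                       # the operand itself
acc, pc = 5, 0x10

# Fetch: PC -> MAR, memory read -> MBR -> IR, PC incremented
mar = pc; mbr = memory[mar]; ir = mbr; pc += 1
opcode, address = ir

# Indirect cycle: one extra memory access to resolve the operand's address
if "INDIRECT" in opcode:
    mar = address
    address = memory[mar]

# Execute: fetch the operand and perform the operation
mar = address
acc += memory[mar]
print(acc)                                # 5 + 42 = 47
# (If an interrupt were pending, it would be serviced here, before the next fetch.)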
