0% found this document useful (0 votes)

15 views12 pages

Pipelining Basics

The document discusses instruction-level parallelism and how it can be exploited in pipelined processors by executing independent instructions simultaneously. It describes different types of dependences that limit instruction-level parallelism and explains hazards that can occur in pipelined processors due to dependences. It also discusses techniques for handling hazards like forwarding and stalling.

Uploaded by

ssmukherjee2013

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views12 pages

Pipelining Basics

Uploaded by

ssmukherjee2013

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

Instruction-Level Parallelism (ILP)

Fine-grained parallelism
Obtained by:
• instruction overlap in a pipeline
• executing instructions in parallel (later, with multiple instruction
issue)
In contrast to:
• loop-level parallelism (medium-grained)
• process-level or task-level or thread-level parallelism (coarse-
grained)

Autumn 2006 CSE P548 - Basics of Pipelining 1

Instruction-Level Parallelism (ILP)

Can be exploited when instruction operands are independent of each

other, for example,
• two instructions are independent if their operands are different
• an example of independent instructions

ld R1, 0(R2)
or R7, R3, R8

Each thread (program) has a fair amount of potential ILP

• very little can be exploited on today’s computers
• researchers trying to increase it

Autumn 2006 CSE P548 - Basics of Pipelining 2

1
Dependences

data dependence: arises from the flow of values through programs

• consumer instruction gets a value from a producer instruction
• determines the order in which instructions can be executed

ld R1, 32(R3)
add R3, R1, R8

name dependence: instructions use the same register but no flow of data
between them
• antidependence ld R1, 32(R3)

• output dependence add R3, R1, R8

ld R1, 16 (R3)

Autumn 2006 CSE P548 - Basics of Pipelining 3

Dependences

control dependence
• arises from the flow of control
• instructions after a branch depend on the value of the branch’s
condition variable

beqz R2, target

lw r1, 0(r3)
target: add r1, ...

Dependences inhibit ILP

Autumn 2006 CSE P548 - Basics of Pipelining 4

2
Pipelining

Implementation technique (but it is visible to the architecture)

• overlaps execution of different instructions
• execute all steps in the execution cycle simultaneously, but on
different instructions
Exploits ILP by executing several instructions “in parallel”
Goal is to increase instruction throughput

Autumn 2006 CSE P548 - Basics of Pipelining 5

Pipelining

Autumn 2006 CSE P548 - Basics of Pipelining 6

3
Pipelining

Not that simple!

• pipeline hazards (structural, data, control)
• place a soft “limit” on the number of stages
• increase instruction latency (a little)
• write & read pipeline registers for data that is computed in a
stage
• information produced in a stage travels down the pipeline
with the instruction
• time for clock & control lines to reach all stages
• all stages are the same length which is determined by the
longest stage
• stage length determines clock cycle time

IBM Stretch (1961): the first general-purpose pipelined computer

Autumn 2006 CSE P548 - Basics of Pipelining 7

Hazards

Structural hazards
Data hazards
Control hazards
What happens on a hazard
• instruction that caused the hazard & previous instructions complete
• all subsequent instructions stall until the hazard is removed
(in-order execution)
• only instructions that depend on that instruction stall
(out-of-order execution)
• hazard removed
• instructions continue execution

Autumn 2006 CSE P548 - Basics of Pipelining 8

4
Structural Hazards

Cause: instructions in different stages want to use the same hardware

resource in the same cycle
e.g., 4 FP instructions ready to execute & only 2 FP units
Solutions:
• more hardware (eliminate the hazard)
• stall (tolerate the hazard)
• less hardware, lower performance
• only for big hardware components

Autumn 2006 CSE P548 - Basics of Pipelining 9

Autumn 2006 CSE P548 - Basics of Pipelining 10

5
Data Hazards

Cause:
• an instruction early in the pipeline needs the result produced by an
instruction farther down the pipeline before it is written to a register
• would not have occurred if the implementation was not pipelined
Types
RAW (data), WAR (name: antidependence), WAW (name: output)
HW solutions
• forwarding hardware (eliminate the hazard)
• stall via pipelined interlocks
Compiler solution
• code scheduling (for loads)

Autumn 2006 CSE P548 - Basics of Pipelining 11

Dependences vs. Hazards

Autumn 2006 CSE P548 - Basics of Pipelining 12

6
Forwarding

Forwarding (also called bypassing):

• output of one stage (the result in that stage’s pipeline register) is
bused (bypassed) to the input of a previous stage
• why forwarding is possible
• results are computed 1 or more stages before they are written
to a register
• at the end of the EX stage for computational instructions
• at the end of MEM for a load
• results are used 1 or more stages after registers are read
• if you forward a result to an ALU input as soon as it has been
computed, you can eliminate the hazard or reduce stalling

Autumn 2006 CSE P548 - Basics of Pipelining 13

Forwarding Example

Autumn 2006 CSE P548 - Basics of Pipelining 14

7
Forwarding Implementation

Forwarding unit checks whether forwarded values should be used:

• between instructions in ID and EX
• compare the R-type destination register number in EX/MEM
pipeline register to each source register number in ID/EX
• between instructions in ID and MEM
• compare the R-type destination register number in MEM/WB
to each source register number in ID/EX
If a match, set MUX to choose bussed values from EX/MEM or MEM/WB

Autumn 2006 CSE P548 - Basics of Pipelining 15

consumer producer producer

Autumn 2006 CSE P548 - Basics of Pipelining 16

8
Forwarding Hardware

Hardware to implement forwarding:

• destination register number in pipeline registers
(but might need it anyway because we need to know which register
to write when storing an ALU or load result)
• source register numbers
(probably only one, e.g., rs on MIPS R2/3000) is extra)
• a comparator for each source-destination register pair
• buses to ship data and register numbers − the BIG cost
• larger ALU MUXes for 2 bypass values

Autumn 2006 CSE P548 - Basics of Pipelining 17

Loads

Loads
• data hazard caused by a load instruction & an immediate use of the
loaded value
• forwarding won’t eliminate the hazard
why? data not back from memory until the end of the MEM stage
• 2 solutions used together
• stall via pipelined interlocks
• schedule independent instructions into the load delay slot
(a pipeline hazard that is exposed to the compiler) so that there
will be no stall

Autumn 2006 CSE P548 - Basics of Pipelining 18

9
Loads

Autumn 2006 CSE P548 - Basics of Pipelining 19

Implementing Pipelined Interlocks

How a stall situation is detected:

Hazard detection unit stalls the use after a load
• is the instruction in EX a load?
• does the destination register number of the load = either source
register number in the next instruction?
• compare the load write register number in ID/EX to each read
register number in IF/ID
⇒ if both yes, stall the pipe 1 cycle

Autumn 2006 CSE P548 - Basics of Pipelining 20

10
Implementing Pipelined Interlocks

How stalling is implemented:

• nullify the instruction in the ID stage, the one that uses the
loaded value
• change EX, MEM, WB control signals in ID/EX pipeline register
to 0
• the instruction in the ID stage will have no side effects as it
passes down the pipeline
• restart the instructions that were stalled in ID & IF stages
• disable writing the PC --- the same instruction will be fetched
again
• disable writing the IF/ID pipeline register --- the load use
instruction will be decoded & its registers read again

Autumn 2006 CSE P548 - Basics of Pipelining 21

Loads

hazard detection

decode again

fetch again

Autumn 2006 CSE P548 - Basics of Pipelining 22

11
Implementing Pipelined Interlocks

Hardware to implement stalling:

• rt register number in ID/EX pipeline register
(but need it anyway because we need to know what register to write
when storing load data)
• both source register numbers in IF/ID pipeline register
(already there)
• a comparator for each source-destination register pair
• buses to ship register numbers
• write enable/disable for PC
• write enable/disable for the IF/ID pipeline register
• a MUX to the ID/EX pipeline register (+ 0s)
Trivial amount of hardware & needed for cache misses anyway

Autumn 2006 CSE P548 - Basics of Pipelining 23

Control Hazards

Cause: condition & target determined after the next fetch has already been
done
Early HW solutions
• stall
• assume no branch & flush the pipeline if wrong
• move branch resolution hardware forward in the pipeline
Compiler solutions
• code scheduling
• static branch prediction
Today’s HW solutions
• dynamic branch prediction
Today’s architectural solutions
• predicated execution

Autumn 2006 CSE P548 - Basics of Pipelining 24

Lecture 1 Introduction To Information Technology
100% (10)
Lecture 1 Introduction To Information Technology
40 pages
NSE7 - Enterprise Firewall FortiOS 7.0 - Study Guide
100% (1)
NSE7 - Enterprise Firewall FortiOS 7.0 - Study Guide
528 pages
User Manual: Jadoogar
No ratings yet
User Manual: Jadoogar
7 pages
Pipelinehazard 160823134502
No ratings yet
Pipelinehazard 160823134502
61 pages
Pipelinehazard For Class
No ratings yet
Pipelinehazard For Class
61 pages
Ca07 2014 PDF
No ratings yet
Ca07 2014 PDF
56 pages
Pipelining Lecture
No ratings yet
Pipelining Lecture
39 pages
Computer Architecture: Appendix A Pipelining Prof. Jerry Breecher CSCI 240 Fall 2003
No ratings yet
Computer Architecture: Appendix A Pipelining Prof. Jerry Breecher CSCI 240 Fall 2003
58 pages
LECTURE 5
No ratings yet
LECTURE 5
50 pages
CS530-Fall2015-Lecture9
No ratings yet
CS530-Fall2015-Lecture9
5 pages
Topic 10: Pipelining: Cos / Ele 375 Computer Architecture and Organization
No ratings yet
Topic 10: Pipelining: Cos / Ele 375 Computer Architecture and Organization
64 pages
Week 11
No ratings yet
Week 11
33 pages
Pipelining. Pipeline Hazards: Sabina Batyrkhanovna
No ratings yet
Pipelining. Pipeline Hazards: Sabina Batyrkhanovna
19 pages
Computer Architecture: Nguyễn Trí Thành
No ratings yet
Computer Architecture: Nguyễn Trí Thành
77 pages
3 Pipeline
No ratings yet
3 Pipeline
38 pages
06 Pipeline PDF
No ratings yet
06 Pipeline PDF
17 pages
Lecture 13-14: Pipelines Hazards": Suggested Reading:" (HP Chapter 4.5-4.7) "
No ratings yet
Lecture 13-14: Pipelines Hazards": Suggested Reading:" (HP Chapter 4.5-4.7) "
51 pages
Pipe Lining
No ratings yet
Pipe Lining
66 pages
Advanced Linux Programming
No ratings yet
Advanced Linux Programming
31 pages
Pipelining Unit 3
No ratings yet
Pipelining Unit 3
19 pages
Week 4 - Pipelining
No ratings yet
Week 4 - Pipelining
44 pages
The Big Picture: Requirements Algorithms Prog. Lang./Os Isa Uarch Circuit Device
No ratings yet
The Big Picture: Requirements Algorithms Prog. Lang./Os Isa Uarch Circuit Device
60 pages
Lect8 Pipelined DP Control
No ratings yet
Lect8 Pipelined DP Control
59 pages
Pipelining
No ratings yet
Pipelining
44 pages
L15 MipsPipeline
No ratings yet
L15 MipsPipeline
26 pages
COA Unit 3
No ratings yet
COA Unit 3
89 pages
SRM Pipelining 05.Pptx
No ratings yet
SRM Pipelining 05.Pptx
42 pages
CA unit-2 Chapter-2
No ratings yet
CA unit-2 Chapter-2
36 pages
Pipe Lining
No ratings yet
Pipe Lining
16 pages
Enhancing Performance With Pipelining
No ratings yet
Enhancing Performance With Pipelining
71 pages
The Big Picture: Requirements Algorithms Prog. Lang./Os Isa Uarch Circuit Device
No ratings yet
The Big Picture: Requirements Algorithms Prog. Lang./Os Isa Uarch Circuit Device
60 pages
Pipeline
No ratings yet
Pipeline
33 pages
Pipelining Lecture
No ratings yet
Pipelining Lecture
60 pages
1. Lecture 13 Pipelining
No ratings yet
1. Lecture 13 Pipelining
12 pages
Chapter 10 Principles of Pipelining
No ratings yet
Chapter 10 Principles of Pipelining
124 pages
Pipelining
No ratings yet
Pipelining
29 pages
CS 6461: Computer Architecture Instruction Level Parallelism
No ratings yet
CS 6461: Computer Architecture Instruction Level Parallelism
41 pages
ch4-3
No ratings yet
ch4-3
61 pages
Computer Architecture and Organization
No ratings yet
Computer Architecture and Organization
49 pages
1.Pipelining & ILP
No ratings yet
1.Pipelining & ILP
37 pages
Pipelining Basic Concepts: Instruction Fetch Execute Operand Fetch IF OF EX
No ratings yet
Pipelining Basic Concepts: Instruction Fetch Execute Operand Fetch IF OF EX
28 pages
Pipeline and Vector
No ratings yet
Pipeline and Vector
29 pages
Pipelining2019_(1)[1]
No ratings yet
Pipelining2019_(1)[1]
82 pages
Cse410 10 Pipelining A
No ratings yet
Cse410 10 Pipelining A
7 pages
Computer Architecture Pipe Line
No ratings yet
Computer Architecture Pipe Line
28 pages
Helping Slides Pipelining Hazards Solutions
No ratings yet
Helping Slides Pipelining Hazards Solutions
55 pages
Pipeline Hazards Detailed Notes
No ratings yet
Pipeline Hazards Detailed Notes
49 pages
Module 5 Part2 pipelining
No ratings yet
Module 5 Part2 pipelining
36 pages
A-pipelining
No ratings yet
A-pipelining
16 pages
L14 MipsPipeline Ovw
No ratings yet
L14 MipsPipeline Ovw
17 pages
Lec 06
No ratings yet
Lec 06
18 pages
Arch4 Pipelined Processor Design Afterlecture
No ratings yet
Arch4 Pipelined Processor Design Afterlecture
130 pages
Computer System Organization
No ratings yet
Computer System Organization
26 pages
COA Pipelining
No ratings yet
COA Pipelining
35 pages
Chapter 17_Pipelining Hazards
No ratings yet
Chapter 17_Pipelining Hazards
33 pages
1.Pipelining & ILP
No ratings yet
1.Pipelining & ILP
38 pages
Pipelining and Parallelism
No ratings yet
Pipelining and Parallelism
41 pages
16900123131_PCC-CS402
No ratings yet
16900123131_PCC-CS402
10 pages
Module 4 - Parallel & Pipeline Processing - Final
No ratings yet
Module 4 - Parallel & Pipeline Processing - Final
31 pages
Chapter4 Pipelining END FA11
No ratings yet
Chapter4 Pipelining END FA11
84 pages
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
From Everand
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
MARIO FRANCO
No ratings yet
Advanced Backend Code Optimization
From Everand
Advanced Backend Code Optimization
Sid Touati
No ratings yet
WAN TECHNOLOGY FRAME-RELAY: An Expert's Handbook of Navigating Frame Relay Networks
From Everand
WAN TECHNOLOGY FRAME-RELAY: An Expert's Handbook of Navigating Frame Relay Networks
Mamta Devi
No ratings yet
15 Branch N Bound
No ratings yet
15 Branch N Bound
10 pages
DEBOTTAM MUKHERJEE Offer Letter Java Dev
No ratings yet
DEBOTTAM MUKHERJEE Offer Letter Java Dev
1 page
03 Sorting Algorithms
No ratings yet
03 Sorting Algorithms
60 pages
Compiler Design
100% (1)
Compiler Design
130 pages
Sample Questions On Polarization of Light
No ratings yet
Sample Questions On Polarization of Light
3 pages
AstroShock PPT by Debottam Mukherjee Final
No ratings yet
AstroShock PPT by Debottam Mukherjee Final
9 pages
Computer Organization
No ratings yet
Computer Organization
159 pages
SWOT Analysis of ChatGPT by Debottam Mukherjee
No ratings yet
SWOT Analysis of ChatGPT by Debottam Mukherjee
12 pages
Assignment 5 On Numpy
No ratings yet
Assignment 5 On Numpy
6 pages
Routine Bvoc Even 2022-23
No ratings yet
Routine Bvoc Even 2022-23
3 pages
Stack Queue
No ratings yet
Stack Queue
19 pages
Sorting and Hashing
No ratings yet
Sorting and Hashing
33 pages
Introduction
No ratings yet
Introduction
11 pages
GRAPHS
No ratings yet
GRAPHS
15 pages
Mathematics
No ratings yet
Mathematics
192 pages
Address Mapping
No ratings yet
Address Mapping
28 pages
A+ Dumps: Single Ended SCSI Systems Don't Work If The Total Cable Length Exceeds 6 Meters
100% (2)
A+ Dumps: Single Ended SCSI Systems Don't Work If The Total Cable Length Exceeds 6 Meters
25 pages
LinuxFormatUK261 - 2020-04
100% (1)
LinuxFormatUK261 - 2020-04
100 pages
Ospf DR BDR
No ratings yet
Ospf DR BDR
5 pages
Check List Active Directory
100% (2)
Check List Active Directory
6 pages
AspenONE Engineering Cloud V12 Using Windows Virtual Desktop
No ratings yet
AspenONE Engineering Cloud V12 Using Windows Virtual Desktop
22 pages
User Manual of IVMS-4200 - V2.6.1
No ratings yet
User Manual of IVMS-4200 - V2.6.1
250 pages
Connecting Modbus/TCP IO-Link Masters (AL134x Models) To MELSEC iQ-F FX5U
No ratings yet
Connecting Modbus/TCP IO-Link Masters (AL134x Models) To MELSEC iQ-F FX5U
26 pages
SMAPI Latest
No ratings yet
SMAPI Latest
19 pages
Intel Ethernet Controller Products - Release Notes - 29.1
No ratings yet
Intel Ethernet Controller Products - Release Notes - 29.1
22 pages
Trac Download
No ratings yet
Trac Download
5 pages
M100754 Fanuc 16 18 Memory Reference Chart (Loc)
No ratings yet
M100754 Fanuc 16 18 Memory Reference Chart (Loc)
3 pages
Logcat 1581365240126
No ratings yet
Logcat 1581365240126
15 pages
Ee445M: Embedded and Real Time Systems: Study Guide Set #01
No ratings yet
Ee445M: Embedded and Real Time Systems: Study Guide Set #01
4 pages
Preparation Before Using DJI Terra
No ratings yet
Preparation Before Using DJI Terra
39 pages
What Is RFC in SAP
100% (2)
What Is RFC in SAP
11 pages
CICS Web Services As A Provider and Requestor
No ratings yet
CICS Web Services As A Provider and Requestor
75 pages
Exam Study Guide PDF
No ratings yet
Exam Study Guide PDF
4 pages
HP All in One 123
No ratings yet
HP All in One 123
4 pages
INTERNSHIP REPORT 22
No ratings yet
INTERNSHIP REPORT 22
6 pages
Opencpu Server
No ratings yet
Opencpu Server
14 pages
Finals Cheat Sheet
No ratings yet
Finals Cheat Sheet
2 pages
Vsat Installation Guide Connexstar
No ratings yet
Vsat Installation Guide Connexstar
45 pages
B Implement MP BGP Control Plane v2
No ratings yet
B Implement MP BGP Control Plane v2
32 pages
Datasheet CH3MNAS English
No ratings yet
Datasheet CH3MNAS English
3 pages
CDKR Web v0.2rc
No ratings yet
CDKR Web v0.2rc
3 pages
G11-Ps-Computer
No ratings yet
G11-Ps-Computer
126 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Pipelining Basics

Uploaded by

Pipelining Basics

Uploaded by

Instruction-Level Parallelism (ILP)

Autumn 2006 CSE P548 - Basics of Pipelining 1

Instruction-Level Parallelism (ILP)

Can be exploited when instruction operands are independent of each

Each thread (program) has a fair amount of potential ILP

Autumn 2006 CSE P548 - Basics of Pipelining 2

data dependence: arises from the flow of values through programs

• output dependence add R3, R1, R8

Autumn 2006 CSE P548 - Basics of Pipelining 3

beqz R2, target

Dependences inhibit ILP

Autumn 2006 CSE P548 - Basics of Pipelining 4

Implementation technique (but it is visible to the architecture)

Autumn 2006 CSE P548 - Basics of Pipelining 5

Autumn 2006 CSE P548 - Basics of Pipelining 6

Not that simple!

IBM Stretch (1961): the first general-purpose pipelined computer

Autumn 2006 CSE P548 - Basics of Pipelining 7

Autumn 2006 CSE P548 - Basics of Pipelining 8

Cause: instructions in different stages want to use the same hardware

Autumn 2006 CSE P548 - Basics of Pipelining 9

Autumn 2006 CSE P548 - Basics of Pipelining 10

Autumn 2006 CSE P548 - Basics of Pipelining 11

Dependences vs. Hazards

Autumn 2006 CSE P548 - Basics of Pipelining 12

Forwarding (also called bypassing):

Autumn 2006 CSE P548 - Basics of Pipelining 13

Autumn 2006 CSE P548 - Basics of Pipelining 14

Forwarding unit checks whether forwarded values should be used:

Autumn 2006 CSE P548 - Basics of Pipelining 15

consumer producer producer

Autumn 2006 CSE P548 - Basics of Pipelining 16

Hardware to implement forwarding:

Autumn 2006 CSE P548 - Basics of Pipelining 17

Autumn 2006 CSE P548 - Basics of Pipelining 18

Autumn 2006 CSE P548 - Basics of Pipelining 19

Implementing Pipelined Interlocks

How a stall situation is detected:

Autumn 2006 CSE P548 - Basics of Pipelining 20

How stalling is implemented:

Autumn 2006 CSE P548 - Basics of Pipelining 21

Autumn 2006 CSE P548 - Basics of Pipelining 22

Hardware to implement stalling:

Autumn 2006 CSE P548 - Basics of Pipelining 23

Autumn 2006 CSE P548 - Basics of Pipelining 24

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.