Question 1 (50 Points) Pipelining

This document contains Question 1 of an online final exam (3 independent questions, 50 points in total, 40 minutes). Question 1 covers pipelining and has 3 independent parts: Part 1 is a speedup calculation for a pipelined processor with stalls, Part 2 compares a 5-stage and a 6-stage pipeline implementation by maximum clock rate and execution time, and Part 3 traces a loop through a 6-stage pipeline, showing stalls and forwarding.

Uploaded by

Muhammad Zahid iqbal


Zoom/Online Final - question Q1

There are 3 independent questions. Total: 50 points. Duration: 40 minutes.

GOOD LUCK!

Question 1 (50 points) Pipelining

The following 3 parts are independent; answer each as if it were a separate question. Do not
forget to write your name on every page.

PART 1 (15 points) Assume you have a single cycle processor operating at 1 GHz. You are going to
make a 5-stage pipeline out of this processor. Although the processor can potentially operate at a
higher frequency, overheads associated with pipelining force you to operate the pipelined processor at
3 GHz. In a given program, assume that 40% of the instructions are memory instructions, 50% are ALU instructions and
the rest are branch instructions. 10% of the memory instructions cause stalls of 20 clock cycles each
due to cache misses and 50% of the branch instructions cause stalls of 4 cycles each. Assume that
there are no stalls associated with the execution of ALU instructions. For this program, what is the
speedup achieved by the pipelined processor over the single cycle processor?

Answer:

time_single_cycle = IC x CPI x t_clock
                  = IC x 1 x 1/(1 GHz)

CPI_pipeline = 1 + overhead due to memory instructions + overhead due to branch instructions
             = 1 + 0.4 x 0.1 x 20 + 0.1 x 0.5 x 4
             = 1 + 0.8 + 0.2 = 2

time_pipeline = IC x CPI_pipeline x t_clock
              = IC x 2 x 1/(3 GHz)

Speedup = time_single_cycle / time_pipeline
        = (IC x 1 x 1/(1 GHz)) / (IC x 2 x 1/(3 GHz))
        = 3/2 = 1.5
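
The Part 1 arithmetic can be double-checked with a short script. This is only a sketch of the calculation above: the function names `pipeline_cpi` and `speedup` are made up for illustration, and the base CPI of 1 for both processors is the assumption used in the answer.

```python
def pipeline_cpi(base_cpi, hazards):
    """Return base CPI plus the average stall cycles per instruction.

    hazards: iterable of (instruction_fraction, stalling_fraction, stall_cycles).
    """
    return base_cpi + sum(frac * stall_frac * cycles
                          for frac, stall_frac, cycles in hazards)


def speedup(cpi_old, freq_old_hz, cpi_new, freq_new_hz):
    """Ratio of execution times; the instruction count IC cancels out."""
    time_old = cpi_old / freq_old_hz   # time per instruction, old design
    time_new = cpi_new / freq_new_hz   # time per instruction, new design
    return time_old / time_new


# Values from Part 1: 40% memory instructions (10% stall for 20 cycles),
# 10% branch instructions (50% stall for 4 cycles), ALU instructions never stall.
cpi_pipe = pipeline_cpi(1.0, [(0.4, 0.1, 20), (0.1, 0.5, 4)])
result = speedup(1.0, 1e9, cpi_pipe, 3e9)

print(f"CPI_pipeline = {cpi_pipe:.2f}")  # prints CPI_pipeline = 2.00
print(f"Speedup      = {result:.2f}")    # prints Speedup      = 1.50
```

Because the instruction count cancels, the script only needs the per-instruction time of each design.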
PART 2 (15 points) Compare two pipeline implementations, A and B, with 5 and 6 stages,
respectively. The logic delays of the pipeline stages are as follows:

Stage   1       2       3       4       5       6
A       250ps   180ps   400ps   200ps   150ps   -
B       200ps   150ps   250ps   250ps   150ps   180ps

a) What are the maximum clock rates for the two implementations? Note that 1 ps = 10^-12 seconds.

Option A: f_max = (include the unit with your result)

Tc = 400 ps, so f_max = 1/Tc = 1/(400 * 10^-12 s) = 2.5 * 10^9 Hz = 2.5 GHz

Option B: f_max = (include the unit with your result)

Tc = 250 ps, so f_max = 1/Tc = 1/(250 * 10^-12 s) = 4 * 10^9 Hz = 4 GHz

b) Consider a program that requires 2 billion instructions to execute on pipeline A with a CPI
of 1.5, and 1.5 billion instructions to execute on pipeline B with a CPI of 4. Which
implementation would you prefer for this program?

T = IC * CPI * Tc
T_A = 2 * 10^9 * 1.5 * 400 * 10^-12 s = 1.2 s
T_B = 1.5 * 10^9 * 4 * 250 * 10^-12 s = 1.5 s
Therefore, A is faster for this program and should be chosen.
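
Both steps of Part 2 follow the same pattern: the slowest stage sets the cycle time, and execution time is IC * CPI * Tc. The sketch below reproduces the numbers above. The names `STAGE_DELAYS_PS`, `max_clock_ghz` and `exec_time_s` are illustrative, and, as in the answer, pipeline register overhead is assumed to be zero.

```python
# Stage logic delays in picoseconds, copied from the table in Part 2.
STAGE_DELAYS_PS = {
    "A": [250, 180, 400, 200, 150],
    "B": [200, 150, 250, 250, 150, 180],
}


def cycle_time_ps(delays_ps):
    """The clock period is bounded by the slowest stage."""
    return max(delays_ps)


def max_clock_ghz(delays_ps):
    """1 / (Tc in ps) gives THz, so multiply by 1000 to get GHz."""
    return 1000.0 / cycle_time_ps(delays_ps)


def exec_time_s(instruction_count, cpi, tc_ps):
    """T = IC * CPI * Tc, with Tc converted from picoseconds to seconds."""
    return instruction_count * cpi * tc_ps * 1e-12


f_a = max_clock_ghz(STAGE_DELAYS_PS["A"])
f_b = max_clock_ghz(STAGE_DELAYS_PS["B"])
t_a = exec_time_s(2e9, 1.5, cycle_time_ps(STAGE_DELAYS_PS["A"]))
t_b = exec_time_s(1.5e9, 4, cycle_time_ps(STAGE_DELAYS_PS["B"]))

print(f"f_max: A = {f_a:.1f} GHz, B = {f_b:.1f} GHz")
print(f"T:     A = {t_a:.2f} s,   B = {t_b:.2f} s")
print("Preferred:", "A" if t_a < t_b else "B")
```

Note that B has the higher clock rate but still loses here, because its CPI on this program more than offsets the shorter cycle.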
PART 3 (20 points) Assume you have a 6-stage pipeline which is composed of the following stages:

F                    D         X1  X2    M             W
Instruction Memory   RegFile   ALU       Data Memory   RegFile

Note that the execute stage requires two clock cycles (X1 and X2). Also, the register file is designed
so that there is NO early write and late read. Assuming that the execute stage is designed so that
a new execution can begin while the previous one is still in progress, we have
a pipeline which can theoretically start (and complete) one instruction per clock cycle. But hazards
complicate things, and unavoidable stalls will result in a CPI greater than 1. Assume that
branch decisions are made in the X1 stage. The following code needs to be run:

I1: Loop: add $t0, $t1, $t2
I2:       lw  $t3, 0($t0)
I3:       beq $t3, $t0, Loop
I4: Exit: ...

Consider only 2 iterations of the loop, that is, for a total of 3x2=6 instructions:
a) How many clock cycles does this code take in an ideal world if there were no control dependencies
or data dependencies?

b) In a table like the one below, show which stage of each instruction is executed in each cycle
(F, D, X1, X2, M, W), using the information given above and assuming that the pipeline has
forwarding hardware. Also, clearly show forwarding with arrows between stages (if any). Make sure
that you explicitly show stalls (if any).

Clock Cycle No. (use as many as needed)

Instr    1  2  3  4  5  6  7  8  9  10 11 12 13 14 15 16 17 18 19
I1 add   F  D  X1 X2 M  W
I2 lw       F  D  -  X1 X2 M  W
I3 beq         F  -  D  -  -  X1 X2 M  W
I1 add               -  -  -  -  F  D  X1 X2 M  W
I2 lw                               F  D  -  X1 X2 M  W
I3 beq                                 F  -  D  -  -  X1 X2 M  W
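
For part a), an ideal k-stage pipeline finishes n instructions in k + n - 1 cycles. The sketch below evaluates that formula and, for comparison, counts the cycles used by the diagram in part b). The stage sequences are copied from the diagram rows; the starting cycle of each row is an assumption (one instruction fetched per cycle, and the second `add` fetched in the cycle after `beq` reaches X1, where the branch is decided). The names `ideal_cycles` and `DIAGRAM` are illustrative.

```python
def ideal_cycles(n_instructions, n_stages):
    """Cycles for a pipeline with no stalls: fill time plus one per instruction."""
    return n_stages + n_instructions - 1


# (assumed first cycle of the row, stage sequence from the diagram).
# A "-" marks a stall cycle.
DIAGRAM = [
    (1,  "F D X1 X2 M W"),           # I1 add, iteration 1
    (2,  "F D - X1 X2 M W"),         # I2 lw, waits for add's result
    (3,  "F - D - - X1 X2 M W"),     # I3 beq, waits for lw's data
    (5,  "- - - - F D X1 X2 M W"),   # I1 add, iteration 2, waits for the branch
    (10, "F D - X1 X2 M W"),         # I2 lw, iteration 2
    (11, "F - D - - X1 X2 M W"),     # I3 beq, iteration 2
]


def last_cycle(first_cycle, row):
    """Cycle in which the final entry (W) of a diagram row occurs."""
    return first_cycle + len(row.split()) - 1


ideal = ideal_cycles(n_instructions=6, n_stages=6)
actual = max(last_cycle(start, row) for start, row in DIAGRAM)

print("Ideal cycles:       ", ideal)           # prints 11
print("Cycles with hazards:", actual)          # prints 19
print("Extra cycles:       ", actual - ideal)  # prints 8
```

Under these assumptions the two iterations take 19 cycles instead of the ideal 11, so the data and control hazards cost 8 cycles.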
