0% found this document useful (0 votes)

355 views9 pages

MEL G642-Compre Solution - 2 2016-17

The document contains a question paper for the subject VLSI Architecture with 4 questions. Question 1 has multiple parts asking about instruction coding format, pipelined processor design with and without forwarding, hazards in a code sequence, and effect of adding load/store instructions. Question 2 asks about branch penalty, name dependence, instruction level parallelism. Question 3 contrasts DSP and GPP, discusses multiply accumulate operation in DSPs, distinctive DSP addressing modes and functional blocks. Question 4 describes a CISC instruction and asks for its flowchart, exception states, and control word generation logic.

Uploaded by

Gaurav Patil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

355 views9 pages

MEL G642-Compre Solution - 2 2016-17

Uploaded by

Gaurav Patil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

BIRLA INSTITUTE OF TECHNOLOGY & SCIENCE, PILANI

(PILANI, K.K.BIRLA GOA & HYDERABAD CAMPUSES)

II SEMESTER 2016-17
MEL G642 VLSI ARCHITECTURE 13th May 2017
(CLOSED BOOK) MM: 40 Duration 3 Hours
_____________________________________________________________________________________

Q1. Assuming that a 32-bit RISC processor ( with a register file containing 32 registers) that has only the
following three instructions in its instruction set: (i) ADD Rd, Rs1, Rs2 (ii) SUB Rd, Rs1, Rs2 (iii) BEQ Di,
Rs1, Rs2. (Here Rs1 and Rs2 are source registers and Rd is the destination register. ADD and SUB
instructions perform addition and subtraction operations. Instruction BEQ is a conditional branch
instruction which causes branching when the contents of its two source registers are equal. The 8-bit
branching distance Di (relative to the current value of program counter) is provided by a bit-field in the
binary code of BEQ instruction.

(a) Suggest an instruction coding format for the above instruction set and also binary codes for the
three instructions in view of ease of implementation. (1.5)
(b) Design the architectural schematic diagrams of a 4-stage (FETCH, DECODE-OPERAND READ,
EXECUTE, WRITE-BACK) pipelined implementation of this instruction set (i) without internal
forwarding of operands and (ii) with internal forwarding of operands clearly depicting different
fields of the pipieline registers (and what they contain), different functional blocks used in the
pipeline stages and the control circuit. (3+3)
(c) Following code is to be executed on this processor:
ADD R10, R6, R4
SUB R10, R10, R5
BEQ 40, R10, R5
(i) Enumerate all the hazards and their types in the above code. (2)
(ii) Give a clock cycle-by-clock cycle account of execution of this code on your 4-stage
pipelined implementations of the processor without and with internal forwarding of
operands (1.5+1.5)
(d) Now LOAD and STORE instructions are added to the instruction set, and data memory access
(for reading or writing) is organized through the addition of two pipeline stages MEM1 and
MEM2 between the EXECUTE and WRITEBACK stages. How will execution time of the code in
part (c) get effected in the case when there is no internal forwarding of operands ?
Give a cycle-by-cycle description of execution of the code. (1.5)

Q2. What is branch penalty? How can it be reduced / minimized? Give example. What is name
dependence or anti-dependence? Give an example. How is it tackled to gain execution efficiency? What
is Instruction Level Parallelism (ILP) ? How is it exploited in computer architecture? (6)
Q3.

(a) Contrast the design objectives of DSP processors and General Purpose Processors (GPPs). (2)
(b) What is the single most important DSP operation that influences the micro-architecture of DSP
processors. How is it accelerated in DSP processors? (2)
(c) Name and describe two distinctive data addressing modes that are supported only by DSP
Processors and not by GPPs and why ? (2)
(d) Name and briefly describe (functionally) the distinctive functional blocks of a DSP data path and
DSP address path that are typically not found in GPPs. Also draw the overall architectural
diagram of a DSP processor. (3)
(e) What is fractional data type? Why is it used in DSP processors? How do you convert a 16-bit
integer multiplier to a 16-bit fractional multiplier? (2)
(f) What special variants of commonly used arithmetic operations are supported by a DSP
processor? How are they implemented by the main functional blocks of the data path ? (2)

Q4.A CISC processor features an instruction CMX Rx Ry. This instruction compares the magnitudes
(absolute values) of integer data (assume 2’s complement representation) stored in registers Rx and
Ry. The instruction exchanges the stored data values in the registers if the magnitude of data stored
in register Ry happens to be smaller than the magnitude of data stored in register Rx.

(a) Write level II flowcharts for this instruction using the data path diagram given at the end of the
question paper (4)
(b) Assuming that no external interrupts of any kind occur during the execution of the above
instruction (including program or data memory access related interrupts), name the flowchart
states that can potentially cause exception processing to initiate immediately upon their
completion and why ? (1)
(c) Draw the schematic diagram of the next control word address generation logic of a CISC
processor which can handle deferred external interrupts and immediate external interrupts.
(2)

Execution Unit Block Diagram for Q4. Part (a)

ALU: Arithmetic and Logic Unit DO : Data Out buffer
IRF: Instruction Register for Fetch K: Constant generator
IRE: Instruction Register for Execution DI: Data Input register
T1, T2 : Temporary registers PC : Program Counter
R0 - Rn : Programmer’s registers AO : Address Out buffer

Rules of Operation for the Execution Unit:

1. A transfer from source to bus to destination takes one state time
2. A source can drive up to three destination loads
3. Inputs to the ALU are from the A internal bus and either K (values 0, +1, -1) or the B
internal bus
4. When the ALU is a destination, T1 is automatically loaded from the ALU output
5. A transfer to AO activates the on-chip external bus controller
6. ALU supports addition and subtraction (B input – A input) operations on 2’s complement
binary integers, and can set condition codes reflecting condition of the ALU result :
V(arithmetic overflow), N(ALU result negative), Z(ALU result zero) when desired, or
leave the condition codes unaltered if so desired.
7. All memory addresses are represented as positive integers in 2’s complement binary
representation
Q3.

(a) Contrast the design objectives of DSP processors and General Purpose Processors (GPPs). (2)

Solution:
GPP
 The GPP designers think of ultimate performance and ultimate flexibility as well as the compiler-
friendly instruction set.
 The instruction set must be general because the application is unknown and the programmers
behavior is unknown.
DSP
 The DSP designers think of application and cost first, and the challenge is to be efficient.
 Flexibility should be sufficient instead of ultimate.
 The goal of DSP designer is to reach the highest performance over silicon, the highest
performance over power consumption, the highest performance over the design cost.

(b) What is the single most important DSP operation that influences the micro-architecture of DSP
processors. How is it accelerated in DSP processors? (2)
Solution:
The most important DSP operation is Multiply and Accumulate (MAC) operation. The
enhancements in the architecture to support MAC operation are:
1. MAC Instruction supported by MAC unit-performing multiply and accumulate operation
2. Multiple data memories
3. Direct memory access capability for the MAC unit
4. Auto-increment addressing mode
5. Modulo/circular addressing mode
6. Hardware loop control
7. Guarding and saturation arithmetic in MAC to handle iterative loops and avoid exception
(which affects the real time constraints)

(c) Name and describe two distinctive data addressing modes that are supported only by DSP
Processors and not by GPPs and why ? (2)
Solution:
1. Modulo/circular addressing mode
Most of the DSP operation is carried out by convolution (FIR filter, IIR, Filter,
Autocorrelation, Cross correlation etc.). example: ( ) ∑ ( )
Since these are data shifting algorithms, shifting the sample for every output sample
computation is expensive in terms of time. In order to avoid this overhead, modulo
addressing has been proposed. In DSP processors, modulo addressing is implemented in
hardware and is present in the AGU. [Refer Lecture-DSP_Introduction for more details.]
2. Bit reversed addressing mode.
DFT is one of the most widely used operations in DSP. DFT can be computed using FFT which
requires less computational steps than the normal method.
 The Discrete Fourier Transform (DFT) allows for spectral analysis in the frequency domain.

– It is computed as

o for k = 0, 1, … , N-1, where

o x is the input sequence in the time domain

o y is an output sequence in the frequency domain

 The Inverse Discrete Fourier Transform (IDFT) is computed as

 The Fast Fourier Transform (FFT) provides an efficient method for computing the DFT.

FFT can be computed by DIT or DIF methods

If we look at DIT FFT, the data sample has to be preordered and supplied where as in DIF FFT, the input
sample is supplied in order but the output sample has to be preordered. In order to speedup this
process, hardwired bit -reversed addressing mode is supported by DSP.

(d) Name and briefly describe (functionally) the distinctive functional blocks of a DSP data path and
DSP address path that are typically not found in GPPs. Also draw the overall architectural
diagram of a DSP processor. (3)

Solution:
DSP data path has
1. Register File
a. Multiple registers present generally more than 64 registers are present. Some of the
special DSP processors have 512 registers
2. ALU
a. Perform special operations with and without saturation arithmetic, absolute value
finding, Select larger value, Select smaller value, Difference of two absolute values,
Absolute of the difference etc.
b. Have Guard bit (generally one guard bit) and saturation arithmetic units.
3. MAC
a. Performs iterative computing, have guard bits and saturation arithmetic units
b. Performs multiplication (integer, fractional, signed, unsigned, double and single
precision) and MAC operation
c. Performs scaling operation also
4. Other accelerated instruction execution units

DSP address path

1. Multiple address generating units (AGUs)

a. Special registers and multiple address pointers
b. Address calculation units

Figure shows the data path and address path components

Figure shows the overall architecture of a DSP

(e) What is fractional data type? Why is it used in DSP processors? How do you convert a 16-bit
integer multiplier to a 16-bit fractional multiplier? (2)

Solution:
Fractional: between -1 and 1-2-n+1 or [-1, 1-2-n+1]
Why Important?
 For computationally intensive application (like DSP), without taking exceptions,
fractional data type favors faster execution.
 Easy to implement data path HW
 Short physical critical path
 Low hardware (memory) costs, low power, But, it must be the acceptable precision

Steps:

1. Supply the 16-bit fractional input to the integer multiplier

2. Shift the result to the left (discard the additional sign bit) and the newly introduced LSB
is filled with zero.
3. One special case: before shifting the result to the left, check the two MSB bits. If they
are same, no overflow has occurred and proceeds to step 2. If they are not the same,
saturation arithmetic has to be performed. This is the function of saturation arithmetic
unit.

[Refer Lecture-ASIP_DSP_Implementation slide from 16-24 for more details]

(f) What special variants of commonly used arithmetic operations are supported by a DSP
processor? How are they implemented by the main functional blocks of the data path ? (2)

Solution:

Operations generally supported by DSP

1. Addition and subtraction with and without saturation arithmetic
2. absolute value finding
3. Select larger value
4. Select smaller value
5. Difference of two absolute values
6. Absolute of the difference etc.
7. Performs multiplication (integer, fractional, signed, unsigned, double and single precision)
8. MAC operation
9. Double precision arithmetic
10. Data format conversion
11. scaling operation
12. Guarding and saturation arithmetic.

[Implementation examples are given in the lecture slide]

Computer Organization and Architecture. Designing For Performance. 11 Global Edition Edition William Stallings - Ebook PDF PDF Download
100% (1)
Computer Organization and Architecture. Designing For Performance. 11 Global Edition Edition William Stallings - Ebook PDF PDF Download
47 pages
Getdb PDF
100% (1)
Getdb PDF
25 pages
Getdb PDF
100% (1)
Getdb PDF
25 pages
Unit 1 Notes JSW
No ratings yet
Unit 1 Notes JSW
8 pages
Floorplanning181 Lab
No ratings yet
Floorplanning181 Lab
58 pages
Test Bank For Testbank Computer Organization and Architecture Designing For Performance 11 Global Edition Download
No ratings yet
Test Bank For Testbank Computer Organization and Architecture Designing For Performance 11 Global Edition Download
405 pages
Application-Specific Integrated Circuit ASIC A Complete Guide
From Everand
Application-Specific Integrated Circuit ASIC A Complete Guide
Gerardus Blokdyk
No ratings yet
Mca Mcan-103 Computer Organization and Architecture r21
No ratings yet
Mca Mcan-103 Computer Organization and Architecture r21
2 pages
COOS Questions
No ratings yet
COOS Questions
3 pages
Csis Csg524 Midsem Q
No ratings yet
Csis Csg524 Midsem Q
3 pages
Module 8 - Performance Measurement - Analysis
No ratings yet
Module 8 - Performance Measurement - Analysis
38 pages
Get File
No ratings yet
Get File
2 pages
Placement Stage
No ratings yet
Placement Stage
20 pages
APR v1
No ratings yet
APR v1
119 pages
4.new Computer Organization and Assembly Language MCQs
No ratings yet
4.new Computer Organization and Assembly Language MCQs
40 pages
Semester-7 MCA Integrated IIPS DAVV Syllabus
No ratings yet
Semester-7 MCA Integrated IIPS DAVV Syllabus
9 pages
Computer Organization & Architecture Course Code: 4350701: Page 1 of 9
No ratings yet
Computer Organization & Architecture Course Code: 4350701: Page 1 of 9
9 pages
Computer Organization: Lec #2: Introduction Bnar Mustafa
No ratings yet
Computer Organization: Lec #2: Introduction Bnar Mustafa
19 pages
Clock
No ratings yet
Clock
20 pages
PD Interview Questions
No ratings yet
PD Interview Questions
16 pages
XuanTie C910 C920 UserManual
No ratings yet
XuanTie C910 C920 UserManual
415 pages
Coa Important Questions
No ratings yet
Coa Important Questions
7 pages
COA Course File For Data Science
No ratings yet
COA Course File For Data Science
50 pages
Legacy2CUI PDF
100% (1)
Legacy2CUI PDF
17 pages
Homework3 Solution v2
No ratings yet
Homework3 Solution v2
41 pages
DSP - Presentation - Sumit 5
No ratings yet
DSP - Presentation - Sumit 5
45 pages
BCA Syllabus (I, II, & III Year)
No ratings yet
BCA Syllabus (I, II, & III Year)
29 pages
Onur 447 Spring15 Lecture2 Isa Afterlecture
No ratings yet
Onur 447 Spring15 Lecture2 Isa Afterlecture
57 pages
PrimeTime 16FF Webinar Solvnet
No ratings yet
PrimeTime 16FF Webinar Solvnet
21 pages
innovusDBAref PDF
100% (2)
innovusDBAref PDF
1,373 pages
Computer Fundamental Complete I 1
No ratings yet
Computer Fundamental Complete I 1
335 pages
Real Time Issues and Process of Fixing
No ratings yet
Real Time Issues and Process of Fixing
7 pages
FRM Course Syllabus IPDownload
No ratings yet
FRM Course Syllabus IPDownload
1 page
CAO EST Solution 2022
No ratings yet
CAO EST Solution 2022
8 pages
15-740/18-740 Computer Architecture Lecture 3: Performance: Carnegie Mellon University
No ratings yet
15-740/18-740 Computer Architecture Lecture 3: Performance: Carnegie Mellon University
20 pages
Icc2 Lab Manual
No ratings yet
Icc2 Lab Manual
21 pages
1151CS110 Computer Organization and Architecture
No ratings yet
1151CS110 Computer Organization and Architecture
2 pages
MCASyllabusR 20
No ratings yet
MCASyllabusR 20
82 pages
Reserved: Basic Structure of Computers and Instruction Set
No ratings yet
Reserved: Basic Structure of Computers and Instruction Set
20 pages
Lecture-ASIP DSP Implementation
No ratings yet
Lecture-ASIP DSP Implementation
49 pages
Computer
No ratings yet
Computer
25 pages
Pentium Processor Family: Benjamin Nicomedes For-Ian Sandoval
No ratings yet
Pentium Processor Family: Benjamin Nicomedes For-Ian Sandoval
32 pages
MEL G642-MidSem-Questions 2017-18 PDF
No ratings yet
MEL G642-MidSem-Questions 2017-18 PDF
1 page
MEL G642-MidSem-Questions 2017-18 PDF
No ratings yet
MEL G642-MidSem-Questions 2017-18 PDF
1 page
PDF
No ratings yet
PDF
16 pages
PrimeTime PowerAnalysisPX
No ratings yet
PrimeTime PowerAnalysisPX
168 pages
Flow Map
No ratings yet
Flow Map
12 pages
Lec01 Verilog Combinational Circuits Design
No ratings yet
Lec01 Verilog Combinational Circuits Design
61 pages
Flipchip Appnote INV181-2
No ratings yet
Flipchip Appnote INV181-2
93 pages
EC527 Spring 2014
No ratings yet
EC527 Spring 2014
6 pages
COA511S SUPP TEST Memo
No ratings yet
COA511S SUPP TEST Memo
6 pages
Electronicdesign 10390 Jesd204bsimplified
No ratings yet
Electronicdesign 10390 Jesd204bsimplified
5 pages
Lec11 Apri
No ratings yet
Lec11 Apri
62 pages
Question Paper (Unit-Test-1) Analog IC Design (MEL G 632) Date: 21-02-2017 Time: 12:00 Hours To 13:00 Hours Closed Book Full-Marks: 15
No ratings yet
Question Paper (Unit-Test-1) Analog IC Design (MEL G 632) Date: 21-02-2017 Time: 12:00 Hours To 13:00 Hours Closed Book Full-Marks: 15
2 pages
PD RO The NG Lecture-9-Routing
No ratings yet
PD RO The NG Lecture-9-Routing
35 pages
Xbox 360 System Architecture
100% (1)
Xbox 360 System Architecture
13 pages
Service Culture Syllabus Guidelines and Topics
No ratings yet
Service Culture Syllabus Guidelines and Topics
7 pages
Client 2 - Synopsys - ATS Speaker Slide - Thomas Li (Synopsys)
No ratings yet
Client 2 - Synopsys - ATS Speaker Slide - Thomas Li (Synopsys)
29 pages
Advance Computer Architecture - CS501 Handouts PDF
No ratings yet
Advance Computer Architecture - CS501 Handouts PDF
396 pages
Syllabus of VTU 2016
No ratings yet
Syllabus of VTU 2016
19 pages
Unit 1
No ratings yet
Unit 1
105 pages
RAK Clarity3DLayout Cut-and-Stitch Flow
No ratings yet
RAK Clarity3DLayout Cut-and-Stitch Flow
36 pages
GGG
No ratings yet
GGG
82 pages
Module 8 Running The ECO Flow PDF
No ratings yet
Module 8 Running The ECO Flow PDF
2 pages
Labcxfb
No ratings yet
Labcxfb
15 pages
Lab3 New PDF
No ratings yet
Lab3 New PDF
17 pages
Power Optimization (Part 2) : Xuan Silvia' Zhang
No ratings yet
Power Optimization (Part 2) : Xuan Silvia' Zhang
26 pages
Image Processing To Manipulate RGB Values Using Verilog.
No ratings yet
Image Processing To Manipulate RGB Values Using Verilog.
5 pages
Metal Fil
No ratings yet
Metal Fil
15 pages
Understanding Timing in The Back-End Design Flow: Vlsi Ii: Design of Very Large Scale Integration Circuits
No ratings yet
Understanding Timing in The Back-End Design Flow: Vlsi Ii: Design of Very Large Scale Integration Circuits
23 pages
Clock Gating Lab Notes
No ratings yet
Clock Gating Lab Notes
7 pages
XG Mode: User Guide
No ratings yet
XG Mode: User Guide
176 pages
Sigma Delta Adc
No ratings yet
Sigma Delta Adc
3 pages
Introduction To Asic Design
No ratings yet
Introduction To Asic Design
53 pages
Setnanoroutemode PDF
No ratings yet
Setnanoroutemode PDF
9 pages
Title Description: (/S/) Cases (/S/Case-List) Stars (/S/Star-List) Articles (/S/Knowledge) Help (/S/Help-Info)
No ratings yet
Title Description: (/S/) Cases (/S/Case-List) Stars (/S/Star-List) Articles (/S/Knowledge) Help (/S/Help-Info)
2 pages
Digital Design Flow
No ratings yet
Digital Design Flow
71 pages
Formality Formality Ultra Functional Safety Manual: March 2018, Revision 1.4
No ratings yet
Formality Formality Ultra Functional Safety Manual: March 2018, Revision 1.4
34 pages
Digital Soc Synthesis, Sta, FV and Eco
No ratings yet
Digital Soc Synthesis, Sta, FV and Eco
2 pages
Cadence SOC Encounter PDF
No ratings yet
Cadence SOC Encounter PDF
222 pages
CDesigner
100% (2)
CDesigner
506 pages
Sta Lab3
No ratings yet
Sta Lab3
5 pages
Lecture 1 Introduction 2018 19 PDF
No ratings yet
Lecture 1 Introduction 2018 19 PDF
36 pages
Graduate Project-ASIC Design.v2
No ratings yet
Graduate Project-ASIC Design.v2
197 pages
Interfacing and Some Common Building Blocks: Coe 111: Advanced Digital Design
No ratings yet
Interfacing and Some Common Building Blocks: Coe 111: Advanced Digital Design
35 pages
Tutorial LSI
No ratings yet
Tutorial LSI
64 pages
Asic Design Flow Tutorial 3228gl
No ratings yet
Asic Design Flow Tutorial 3228gl
138 pages
MACDONALD TIMINGCLOSURE FINAle
No ratings yet
MACDONALD TIMINGCLOSURE FINAle
18 pages
IC-Project I-Synthesis
No ratings yet
IC-Project I-Synthesis
7 pages
Sta Lab2
No ratings yet
Sta Lab2
5 pages
Hw4 Solution
No ratings yet
Hw4 Solution
14 pages
Ex 11
No ratings yet
Ex 11
10 pages
Cadence Encounter Tutorial
No ratings yet
Cadence Encounter Tutorial
10 pages
Ccopt Lab 2 (Ccopt Rak)
100% (1)
Ccopt Lab 2 (Ccopt Rak)
3 pages
DTMF Chip Flow Picture
No ratings yet
DTMF Chip Flow Picture
1 page
VHDL Coding Tips and Tricks
No ratings yet
VHDL Coding Tips and Tricks
209 pages
Sram Low Power Decoder
No ratings yet
Sram Low Power Decoder
7 pages
Understanding The Basics of Setup and Hold Time - EDN
No ratings yet
Understanding The Basics of Setup and Hold Time - EDN
8 pages
Encounter Workshop 2: What You Will Learn - Partitioning A Design
No ratings yet
Encounter Workshop 2: What You Will Learn - Partitioning A Design
32 pages
Icc
No ratings yet
Icc
28 pages
Pic16f84a PDF
No ratings yet
Pic16f84a PDF
88 pages
Focal - Opt - Icc.: DRC Violations That Remains After The Post-Route Optimization Performed by The
No ratings yet
Focal - Opt - Icc.: DRC Violations That Remains After The Post-Route Optimization Performed by The
4 pages
PD
No ratings yet
PD
76 pages
Level 54 Bsim4
No ratings yet
Level 54 Bsim4
13 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

MEL G642-Compre Solution - 2 2016-17

Uploaded by

MEL G642-Compre Solution - 2 2016-17

Uploaded by

BIRLA INSTITUTE OF TECHNOLOGY & SCIENCE, PILANI

(PILANI, K.K.BIRLA GOA & HYDERABAD CAMPUSES)

Execution Unit Block Diagram for Q4. Part (a)

Rules of Operation for the Execution Unit:

o for k = 0, 1, … , N-1, where

o x is the input sequence in the time domain

o y is an output sequence in the frequency domain

 The Inverse Discrete Fourier Transform (IDFT) is computed as

FFT can be computed by DIT or DIF methods

DSP address path

1. Multiple address generating units (AGUs)

Figure shows the data path and address path components

1. Supply the 16-bit fractional input to the integer multiplier

[Refer Lecture-ASIP_DSP_Implementation slide from 16-24 for more details]

Operations generally supported by DSP

[Implementation examples are given in the lecture slide]

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.