EC8095-VLSI Design - 01 - by WWW - LearnEngineering.in
EC8095-VLSI Design - 01 - by WWW - LearnEngineering.in
in
.in
IMPORTANT QUESTIONS AND ANSWERS
ng
Subject Name: VLSI DESIGN
eri
Sem / Year: VI/III
GANDHI
2. Mr. BECTEL BRABI AP/ECE CET
arn
Verified by DLI, CLI and Approved by the Centralised Monitoring Team dated .
Le
w.
ww
.in
UNIT II COMBINATIONAL LOGIC CIRCUITS 9
Examples of Combinational Logic Design, Elmore’s constant, Pass transistor Logic,
Transmission gates, static and dynamic CMOS design, Power dissipation – Low power
ng
design principles
eri
Static and Dynamic Latches and Registers, Timing issues, pipelines, clock strategies,
Memory architecture and memory control circuits, Low power memory circuits,
Synchronous and Asynchronous design
e
gin
UNIT IV DESIGNING ARITHMETIC BUILDING BLOCKS 9
Data path circuits, Architectures for ripple carry adders, carry look ahead adders, High
speed adders, accumulators, Multipliers, dividers, Barrel shifters, speed and area
tradeoff
En
REFERENCES:
1. N.Weste, K.Eshraghian, “Principles of CMOS VLSI Design”, Second Edition, Addision
Wesley
ww
1993
2. R.Jacob Baker, Harry W.LI., David E.Boyee, “CMOS Circuit Design, Layout and
Simulation”,
Prentice Hall of India 2005
3. A.Pucknell, Kamran Eshraghian, “BASIC VLSI Design”, Third Edition, Prentice Hall of
India, 2007.
.in
In this course, the MOS circuit realization of the various building blocks that is
common to any microprocessor or digital VLSI circuit is studied.
Architectural choices and performance tradeoffs involved in designing and realizing
ng
the circuits in CMOS technology are discussed.
The main focus in this course is on the transistor circuit level design and realization
for digital operation and the issues involved as well as the topics covered are quite
e
2. Need and Importance for Study of the Subject
eri
distinct from those encountered in courses on CMOS Analog IC design.
gin
This subject involves packing more and more logic devices into smaller and smaller
areas. Via VLSI, circuits that would have taken boardful of space can now be fitted into a
small space of few millimeters. This has opened avenues to do things that were not
En
possible before.
In simple words, VLSI circuits are everywhere from your computer to your car, your
arn
brand new state-of-the-art digital camera, cellphones, and whatever electronics item you
have. By studying this subject student can able to design digital logic circuits with reduced
area, high speed, and low power.
Le
companies and electronic design automation (EDA) companies like Virtuso, Keltron,
Cadence, CDAC, etc.
ww
Product and application domains of VLSI include mobile and consumer electronics,
computing, telecommunications and networking, data processing, automotive, healthcare
and industrial applications. Being a fast-changing technology area, VLSI design is an
extremely challenging and creative sector that offers exciting opportunities and fast
growth for engineers. GSI, ULSI, FPAA, 10 nm process technology(by INTEL) are the
latest development in this field.
Group of Institutions
Department of Electronics and Communication
Engineering Detailed Lesson Plan
Name of the Subject& Code: EC8095 & VLSI DESIGN
TEXTBOOKS:
1. Jan Rabaey, Anantha Chandrakasan, B.Nikolic, “Digital Integrated Circuits: A
Design Perspective”, Second Edition, Prentice Hall of India, 2003. (Copies not
Available in Library)
2. M.J. Smith, “Application Specific Integrated Circuits”, Addisson Wesley,
.in
1997(Copies Available in Library: YES)
REFERENCES:
1. N.Weste, K.Eshraghian, “Principles of CMOS VLSI Design”, Second Edition,
ng
Addision Wesley 1993(Copies not Available in Library)
2. R.Jacob Baker, Harry W.LI., David E.Boyee, “CMOS Circuit Design, Layout
eri
and Simulation”, Prentice Hall of India 2005(Copies Available in Library: YES)
3. A.Pucknell, Kamran Eshraghian, “BASIC VLSI Design”, Third Edition, Prentice
Hall of India, 2007. (Copies Available in Library: YES)
e
gin
Hours Cumulativ
Sl Books
Unit Topic / Portions to be CoveredRequired e Hrs
. Referred
/
N
En
Planned
UNIT I- MOS TRANSISTOR PRINCIPLE
o
1 1 NMOS and PMOS transistors 1 1 R1
2 1 Process parameters for MOS and 1 2 Notes
arn
5 1 Fundamental limits 1 6 R3
6 1 CMOS inverter scaling 1 7 T1
7 1 Propagation delays 1 8 T1
w.
8 1 Stick diagram 1 9 R3
9 1 Layout diagrams 1 10 R1,R3
ww
.in
UNIT IV- DESIGNING ARITHMETIC BUILDING BLOCKS
26 4 Data Path Circuits 1 31 T1
ng
27 4 Architectures For Ripple Carry Adders 1 32 T1
ri
29 4 High Speed Adders 1 34 R4
30 4 Accumulators
ee 1 35 NOTES
31 4 Multipliers 2 37 T1
gin
32 4 Dividers 2 39 R1
33 4 Barrel Shifters 1 40 T1
En
38 5 1 46 T2
Cell Libraries
39 5 2 48 T2
FPGA building block architectures
40 5 2 50 T2
w
.in
UNIT 2 - COMBINATIONAL LOGIC CIRCUITS
PART A 31
PART B
ng
1. STATIC CMOS DESIGN 35
2. DYNAMIC CMOS DESIGN 40
ri
3. a. TRANSMISSION GATE 48
3. b. PASS TRANSISTOR ee 51
4. a. POWER DISSIPATION 54
4. b. LOW POWER DESIGN 57
5 STATIC LOGIC DESIGN 59
gin
UNIT 3 - SEQUENTIAL LOGIC CIRCUITS
PART A 60
PART B
En
1. STATIC LATCHES 63
2. PIPELINING 66
3. DYNAMIC LATCHES 70
4. MEMORY ARCHITECTURE 74
arn
PART B
1. RIPPLE CARRY ADDER 84
2. BARREL SHIFTER 86
3. CARRY LOOKAHEAD ADDER 88
w
4. MULTIPLIERS 90
5. DIVIDERS 96
ww
UNIT I
MOS TRANSISTOR PRINCIPLE
PART A
The resultant effect increases the channel substrate junction potential. This
.in
increases the rate-channel voltage drop. The overall effect is an increase in
threshold voltage. This effort is called body effect.
3. What is body effect coefficient? (Apr/May 2011)
ng
The potential difference between the source and body affects the threshold
voltage. The threshold voltage can be modeled as
ri
Vt=Vt0+γ((Φs+Vsb)1/2-(Φs)1/2
Where, Φs= surface potential at threshold
ee
γ= body effect coefficient
4. What is the influence of voltage scaling on power and delay? (AprMay 2011)
gin
Constant voltage scaling increased the electric field in devices. By the 1μm
generation velocity saturation was severe enough that decreasing feature size no
longer improved device current. Aggressive process achieve delays in the short end
En
6. Write down the equation for describing the channel length modulation effect in
ww
Ids=β(Vgs-Vt)2/2
7. Write the expression for the logical effort and parasitic delay of an input NOR
gate. (Nov/Dec 2011)
.in
The wire capacitance adds loading to each gate.
The long wire contributes RC delay or flight time.
ng
Circuit delay can be increased by interconnect
ri
9. Draw the IV characteristics of Mos transistors.(MayJun 2012)
ee
gin
En
arn
10. Brief the different operating regions of Mos system.(May/ Jun 2012)
11. Why the tunneling current is higher for NMos transistor than Pmos transistor
ww
.in
14. Compare NMOS and PMOS ?
NMOS PMO
The majority carriers are electron S are holes
The majority carriers
ng
Positive voltage is applied at the gate Negative voltage is applied at the gate
terminal
NMOS conducts at logic 1 terminal
PMOS conducts at logic 0
ri
Mobility of electron is high Mobility of electron is low
Switching speed is high
ee
Switching speed is low
gin
15. What is latch up? How to prevent latch up? (MAY/JUN 2016)
Latch up is a condition in which the parasitic components give rise to the
establishment of low resistance conducting paths between V DD and VSS with
En
PART-B
1. Explain about nMOS Transistor. (MAY’11)
NMos transistors are built on a p-type substrate of moderate doping. Source
and drain are formed by diffusing heavily doped n-type impurities (n+)adjacent to the
gate. A layer of silicon dioxide (SiO2) or glass is place over the substrate in between
the source and drain. Over SiO2, a layer of polycrystalline silicon or polysilicon is
formed, from which the gate terminal is taken.
The following figure shows the structure and symbol of nMOS transistor.
.in
ri ng
ee
gin
En
3. Inversion mode
a. Accumulation Mode
.in
ng
b. Depletion Mode:
ri
In this mode a low positive voltage is applied to the gate. This results in some
positive charge on the gate. The holes in the body are repelled from the region directly
ee
beneath the gate.
gin
En
c. Inversion Mode:
arn
a small number of free electrons in the body are attracted to the region beneath the
gate. This conductive layer of electrons in the p-type body is called the inversion layer.
w
ww
iii.Saturation region
a. Cut-off region:-
In this region Vgs < Vt .The source and drain have free electrons. The body has
free holes but no free electrons. The junction between the body and the source or drain
is reverse biased. So no current will flow. This mode of operation is called cut-off.
.in
ng
Linear region:-
In this region Vgs >Vt .Now an inversion region of electrons called the channel connects
ri
the source and drain. This creates a conductive path between source and drain. The
ee
number of carriers and the conductivity increases with the gate voltage. The potential
gs – Vgd. If V ds=0,there is no electric
gin
difference between drain and source is V ds =V
field tending to push current from drain to source.
En
arn
w .Le
b. Saturation region:-
ww
In this region Vds becomes sufficiently larger than Vgd < Vt, the channel is no
longer inverted near the drain and becomes pinched off .Above this drain voltage, the I
ds is controlled only by the gate voltage. This mode is called saturation mode.
.in
The Ids value of an ideal I- v model neglects many effects that are important to
modern devices.
ng
I ds
ri
1 A
100 nA
10 nA
1 nA
100 pA
Sub-
threshold
Slope
ee
10 pA Vt
gin
0 0.3 0.6 0.9 1.2 1.5 1.8
V gs
While compared to the ideal devices, the saturation current increases less
than a quarter with increasing Vgs. This is caused by two effects.
arn
1) Velocity Saturation
2) Mobility degradation.
.Le
𝛽
Ids = (Vgs - V t ) 2 for V ds > V dsat.
2
.in
In saturation, I dsat is
Vgs = V ds+ V DD
ng
I dsat = (V DD - V t ) 2
ri
Summarizing the three regions we get.
Ids = o
ee ; Vgs <- V t ; cut off
At high vertical field strengths Vgs / tor the carrier scatlers more often. This is
arn
called mobility degradation and this leads to less current than expected at high
Vgs
The threshold voltage itself is influenced by the voltage difference between the
.Le
I ds = 𝜇 С₀ x W (Vgs - V ds)2
L 2
.in
ng
If the transistor is completely Velou saturated V = Vsat and saturation current
become .
ri
Ids = С
Drain ₀ x Wis quodratically
current (Vgs - V t) Vdependent
sat
ee
on voltage without velocity saturation
and linearly dependent when fully velocity saturated.
gin
Where
𝛽
I dsat = Pc 2 (Vgs - V t) 2
V dsat = Pv (Vgs - V t) ∝/2 .
w
As channel length becomes shorter, the lateral field increases and transistors
ww
become more velocity saturated, and the supply voltage is held constant.
Imagine that the source voltage is close to the body voltage. Increasing V ds
decreases the effective channel length. Shorter length results in higher
current. Thus Ids increases with Vds in saturation as shown below.
Vgs = 1.8
300
.in
Vgs = 1.5
200
Vgs = 1.2
ng
100
Vgs = 0.9
Vgs = 0.6
0
0 0.3 0.6 0.9 1.2 1.5 1.8 Vds
ri
Body Effect:
ee
Transistor has four terminals named gate, source, drain and body. The potential difference
between the source and body Vsb affects the threshold voltage.
gin
Vt = Vto + ᴕ ( ps + Vsb - 𝜑 s )
Where
Vto = Threshold Voltage when the source is at the body potential
En
exponentially as
Ids = I dso e Vgs - V t [ 1- e V ds]
w
nvt Vt .
This is also called as leakage and often thias results in underired current when a
ww
transistor is normally OFF. Idso is the current at thresholo and is dependent on process and
device geometry
Applications:-
This is used in very low power analog circui
This is used in dynamic circuits and OR AM
Advantage:
1) Leakage increases exponentially as Vt decreases or as temperature rises.
Disadvantages:
10
Junction Leakage:
The P-n junction between diffusion and the substrate or well form diodes are
shown below.
The substrate and well are tied to GND or VDD to ensure that these diodes
remain reverse biased.
.in
The reverse biased diodes still conduct a small amount of current I o.
ID = Is [ e VD -1]
ng
VT
Where
ri
ID = diode current
Is = diode reverse- biased saturation current that depends on doping levels and on
ee
the area and perimeter of the diffusion region.
gin
p+ n+ n+ p+ p+ n+
En
n well
p substrate
arn
Tunneling :
.Le
Based on quantum mechanics, we see that the is a finite probability that carriers will
tunnel through the gate oxide. This results in gate leakage current flowing into the gate.
The probability of tunnelling drops off exponentially with oxide thickness.
w
Large tunnelling currents impact not only dynamic nodes but also quiescent
power consumption and thus may limit oxide thickness tor.
ww
Temperature Dependance:
Temperature influences the characteristics of transistors. Carrier mobility decreases
with temperature.
𝜇 (T) = 𝜇 (Tr ) ( T ) -k 𝜇
Tr
11
.in
I ds
increasing
temperature
ng
Vgs
ri
Advantages of Operating at low temperature:
ee
1) velocity saturation occurs at higher fields providing more current.
2) For high mobility , power is saved.
gin
3) Wider depletion region results in less junction capacitance.
Geometry Dependance:
En
The layout designer draws transistors with width and length W drawn and L
drawn. The actual gate dimensions may differ by factors Xw and XL.
arn
The source and drain trends to diffuse later under the gate by LDi producing a
shorter effective between source and drain.
Leff = L drawn + XL -2LP
.Le
12
2. b.Explain in detail about the ideal I-V characteristics of a nMOS and pMOS
device (NOV/DEC 2013)(MAY/JUN 2013)(NOV/DEC 2014)
.in
model assumes that the channel length is long enough that the lateral electric field (the field
between source and drain) is relatively low, which is no longer the case in nanometer
ng
devices. This model is variously known as the long-channel, ideal, first-order, or Shockley
model. Subsequent sections will refine the model to reflect high fields, leakage, and other
ri
nonidealities. The long-channel model assumes that the current through an OFF transistor
ee
is 0. When a transistor turns ON (Vgs > Vt), the gate attracts carriers (electrons) to form a
channel. The electrons drift from source to drain at a rate proportional to the electric field
gin
between these regions. Thus, we can compute currents if we know the amount of charge in
the channel and the rate at which it moves.
En
We know that the charge on each plate of a capacitor is Q = CV. Thus, the charge in the
channel Qchannel is
arn
Qchannel=Cg(Vgs-Vt)
where Cg is the capacitance of the gate to the channel and Vgc - Vt is the amount of
voltage attracting charge to the channel beyond the minimum required to invert from p to n.
.Le
The gate voltage is referenced to the channel, which is not grounded. If the source is at Vs
and the drain is at Vd , the average is Vc = (Vs + Vd)/2 = Vs + Vds /2. Therefore, the mean
w
difference between the gate and channel potentials Vgc is Vg – Vc = Vgs – Vds /2, as
shown in Figure.
ww
13
We can model the gate as a parallel plate capacitor with capacitance proportional to area
over thickness. If the gate has length L and width W and the oxide thickness is tox, as
shown in below Figure, the capacitance is
Cg=εox(WL/tox)=CoxWL
where εox is the permittivity of free space, 8.85 × 10–14 F/cm, and the permittivity of SiO2
is kox = 3.9 times as great. Often, the ox/tox term is called Cox, the capacitance per unit
area of the gate oxide.
Each carrier in the channel is accelerated to an average velocity, v, proportional to the
.in
lateral electric field, i.e., the field between source and drain. The constant of proportionality
μ is called the mobility.
ng
v = μE
The time required for carriers to cross the channel is the channel length divided by
ri
the carrier velocity: L/v. Therefore, the current between source and drain is the total amount
ee
of charge in the channel divided by the time required to cross
gin
En
arn
If Vds > Vdsat VGT, the channel is no longer inverted in the vicinity of the drain;
we say it is pinched off. Beyond this point, called the drain saturation voltage, increasing the
w
drain voltage has no further effect on current. Substituting Vds = Vdsat at this point of
ww
maximum current in above eqn, we find an expression for the saturation current that is
independent of Vds.
Ids=(β/2)V2GT
This expression is valid for Vgs > Vt and Vds > Vdsat . Thus, long-channel MOS
transistors are said to exhibit square-law behavior in saturation. Two key figures of merit for
a transistor are Ion and Ioff. Ion (also called Idsat) is the ON current, Ids, when Vgs = Vds =
VDD. Ioff is the OFF current when Vgs = 0 and Vds = VDD. According to the long-channel
model, Ioff = 0 and
14
Ion=(β/2)(Vdd-Vt)
𝛽
(V gs - V t ) 2 ; V ds > V dsat ; saturation.
.in
2
ng
Below fig shows the I-V characteristics for the transistor. According to the first-order
model, the current is zero for gate voltages below Vt. For higher gate voltages, current
ri
increases linearly with Vds for small Vds . As Vds reaches the saturation point Vdsat =
ee
VGT, current rolls off and eventually becomes independent of Vds when the transistor is
gin
saturated. We will later see that the Shockley model overestimates current at high voltage
because it does not account for mobility degradation and velocity saturation caused by the
high electric fields.
En
arn
w .Le
ww
15
.in
ri ng
ee
gin
En
• The MOS device first order Shockley equations describing the transistors in cut-off,
linear and saturation modes can be used to generate the transfer characteristics of a
arn
CMOS inverter.
• Plotting these equations for both the n- and p-type devices produces the traces
below.
w .Le
ww
16
them about the x-axis and superimposing them on the n-device IV curves.
• We basically solve for Vin(n-type) = Vin(p-type) and Ids(n-type)=Ids(p-type)
• The desired switching point must be designed to be 50 % of magnitude of the
supply voltage i.e. VDD/2.
• Analysis of the superimposed n-type and p-type IV curves results in five regions in
which the inverter operates.
.in
– The n-device is in cut-off (Idsn =0).
– p-device is in linear region,
ng
– Idsn = 0 therefore -Idsp = 0
– Vdsp = Vout – VDD, but Vdsp =0 leading to an output of Vout = VDD.
ri
• Region B occurs when the condition Vtn leq Vin le VDD/2 is met.
– Here p-device is in its non-saturated region Vds neq 0.
ee
– n-device is in saturation
gin
Saturation current Idsn is obtained by setting Vgs = Vin resulting in the equation:
n
I dsn Vun Vtn 2
En
•
arn
V V 2
I dsp p Vin VDD Vtp Vout VDD out DD
2
w
p
I dsp V
in VDD Vtp ; Vin Vtp VDD
2
2
AND
n
I dsn Vin Vtn 2 ; Vin Vtn
2
17
VDD
Vin VDD Vtp
2
p
I dsp Vin VDD Vtp ; Vin Vtp VDD
2
.in
AND
V
2
ng
• Equating the drain currents allows us to solve for Vout. (See supplemental notes for
ri
algebraic manipulations).
ee
• In Region E the input condition satisfies:
gin
are recognized by a timing analyzer or circuit simulator. Critical paths are affected by
the following four levels.
i. Architectural level
ii. Logic level
iii. Circuit level
iv. Layout level
18
Propagation delay time (tpd) or max time is the maximum time from the input
crossing 50%to the output crossing 50%.The delay can be estimated by the following
ways,
i. RC delay models
ii. Linear delay models
iii. Logic efforts
iv. Parasitic delay
.in
1. RC delay models:
The delay of logic gate is computed as the product of RC,where R is the effective
ng
driver resistance and C the load capacitance. Logic gates use minimum –length devices
for least delay, area and power consumption. The delay of a logic gate depends on the
ri
transistor width in the gate and the capacitance of the load.
ee
Effective Resistance and Capacitance:
gin
An NMOS transistor with width of one unit has effective esistance R An PMOS
transistor with width of one unit has effective resistance 2R Capacitance consists of gate
capacitance cg and source/diffusion capacitance c diff .in most processes cg is equal
En
To reduce the diffusion capacitance in the layout, diffusion nodes are shared.
Uncontacted nodes have less capacitance. Diffusion capacitance depends on the layout.
.Le
Elmore delay model estimates the delay of an RC ladder .this is equal to the sum
w
over each node in the ladder of the resistance between the node and supply
multiplied by capacitance on the node.
ww
19
d= f + p
F= effort delay or state effort, which depends on the complexity and fan-out of the
gate. P=parasitic delay
.in
ri ng
ee
Fig: Normalized delay vs. fan-out
gin
Logical effort:
Logical effort is defined as the ratio of the input capacitance of the gate to the input
En
Parasitic delay is defined as the delay of the gate when it drives zero load. This
can be estimated with RC delay models. The inverter has 3 units of diffusion
.Le
INVERTER 1
ww
NAND 2 3 4 N
NOR 2 3 4 n
TRISTATE,MULTIPLEX 2 4 6 8 2
ER n
Logical effort and transistor sizing:
Logical effort provides a simple method to choose the best topology and number
of stages of logic for a function. This quickly estimates the minimum possible delay for
the given topology and to choose gate sizes that achieve this delay.
20
3. path effort
.in
4. branching effort
ng
6. path delay
ri
7. minimum possible delay
between complexity and accuracy. Level 1 and Level 3 models were historically important,
but they are no longer adequate to accurately model very small modern transistors. BSIM
.Le
models are more accurate and are presently the most widely used. Some companies use
their own proprietary models. This section briefly describes the main features of each of
these models. It also describes how to model diffusion capacitance and how to run
w
simulations in various process corners. The model descriptions are intended only as an
ww
overview of the capabilities and limitations of the models; refer to a SPICE manual for a
much more detailed description if one is necessary.
Level 1 Models
The SPICE Level 1, or Shichman-Hodges Model [Shichman68] is closely related to the
Shockley model described in EQ (2.10), enhanced with channel length modulation and the
body effect. The basic current model is:
21
The parameters from the SPICE model are given in ALL CAPS. Notice that is
written instead as KP(Weff /Leff ), where KP is a model parameter playing the role of k.
.in
Weff and Leff are the effective width and length). The LAMBDA term (LAMBDA = 1/VA)
models channel length modulation The threshold voltage is modulated by the source-to-
ng
body voltage Vsb through the body effect.
The gate capacitance is calculated from the oxide thickness TOX. The default gate
ri
capacitance model in HSPICE is adequate for finding the transient response of digital
ee
circuits. More elaborate models exist that capture nonreciprocal effects that are important
for analog design. Level 1 models are useful for teaching because they are easy to
gin
correlate with hand analysis, but are too simplistic for modern design.
Level 2 and 3 Models
The SPICE Level 2 and 3 models add effects of velocity saturation, mobility
En
empirical equations that provide similar accuracy, faster simulation times, and better
convergence. However, these models still do not provide good fits to the measured I-V
.Le
now widely used in circuit simulation. The models are derived from the underlying device
ww
physics but use an enormous number of parameters to fit the behavior of modern
transistors. BSIM versions 1, 2, 3v3, and 4 are implemented as SPICE levels 13, 39, 49,
and 54, respectively.
Features of the model include:
Continuous and differentiable I-V characteristics across subthreshold, linear, and
saturation regions for good convergence
Sensitivity of parameters such as Vt to transistor length and width
22
Detailed threshold voltage model including body effect and drain-induced barrier
lowering
Velocity saturation, mobility degradation, and other short-channel effects
Multiple gate capacitance models
Diffusion capacitance and resistance models
Gate leakage models
As the BSIM models are so complicated, it is impractical to derive closed-form equations
for propagation delay, switching threshold, noise margins, etc., from the underlying
.in
equations. However, it is not difficult to find these properties through circuit simulation.
Device characterisation will show simple simulations to plot the device characteristics over
ng
the regions of operation that are interesting to most digital designers and to extract effective
capacitance and resistance averaged across the switching transition. The simple RC model
ri
continues to give the designer important insight about the characteristics of logic gates.
ee
Diffusion Capacitance Models
The p–n junction between the source or drain diffusion and the body forms a diode.
gin
We depends on the area and perimeter of the diffusion. HSPICE provides a number of
methods to specify this geometry, controlled by the ACM (Area Calculation Method)
En
parameter, which is part of the transistor model. have seen that the diffusion capacitance
determines the parasitic delay of a gate and The diffusion capacitance model is common
arn
across most device models including Levels 1–3 and BSIM. By default, HSPICE models
use ACM = 0. In this method, the designer must specify the area and perimeter of the
source and drain of each transistor.
.Le
The SPICE models also should contain parameters CJ, CJSW, PB, PHP, MJ, and MJSW.
Assuming the diffusion is reverse-biased and the area and perimeter are specified, the
w
The drain equations are analogous, with S replaced by D in the model parameters.
The BSIM3 models offer a similar area calculation model (ACM = 10) that takes into
account the different sidewall capacitance on the edge adjacent to the gate. Note that the
PHP parameter is renamed to PBSW to be more consistent.
23
If the area and perimeter are not specified, they default to 0 in ACM = 0 or 10,
grossly underestimating the parasitic delay of the gate. HSPICE also supports ACM = 1, 2,
3, and 12 that provide nonzero default values when the area and perimeter are not
specified. Check your models and read the HSPICE documentation carefully. The diffusion
.in
area and perimeter are also used to compute the junction leakage current. However, this
current is generally negligible compared to subthreshold leakage in modern devices.
ng
Design Corners
Engineers often simulate circuits in multiple design corners to verify operation across
ri
variations in device characteristics and environment. HSPICE includes the .lib statement
ee
that makes changing libraries easy. The deck first sets SUP to the nominal supply voltage
of 1.0 V. It then invokes .lib to read in the library specifying the TT conditions. In the
gin
stimulus, the .alter statement is used to repeat the simulation with changes. In this case, the
design corner is changed. Altogether, three simulations are performed and three sets of
En
Scaling:
As the transistors become smaller, they switch faster, dissipate less power and
w
24
Transistor Scaling:
The characteristics of an MOS device can be maintained and the basic operational
characteris. Can be preserved if the critical parameters of a device are scaled by a
dimensionless factor . These parameters include.
º All dimensions (x,y, z directions)
º Device voltages
º Doping concentration densities.
Another approach is lateral Scaling , in which only the gate length is scaled. This is
.in
commonly called a gate shrink because it can done easily to an existing mask database for
a design.
ng
For constant field scaling, all devices dimension including channel length L, width
W and oxide thickness tor are reduced by a factor of 1/s. The supply voltage VDD and the
ri
threshold voltages are also reduced by1/s.
model.
The gate delay improvement is closer to linear because velocity saturation
.Le
25
Example: A wire across 64 bits ALU is local because it becomes shorter as the
ALU is migrated to finer process. A wire across a particular micro processer is
scaled because when the microprocessor is shrunk to the new process the
wire will also shrink.
Un repeated interconnect delay is remaining about constant for local
interconnect and increasing for global interconnect . This presents a problem
because transistor are getting faster, So the ratio if interconnect to gate delay
interconnect with scaling .
.in
In moders process with aspect ratios 1-5-22 fringing capacitance accounts for
the majority of the total capacitance.
ng
Scaling spacing but not height interconnect the fringing capacitance enough
that the extra thickness scarcely improves delay.
ri
Observe that when wire thickness is called the capacitance per unit length
ee
remains constant. Hence, a reasonable initial estimate of the capacitance of a
minimum-pitch were is about 0.2fF/ 𝜇m, independent of the process.
gin
Wire capacitance is roughly 1/10-1/6 of gate capacitance per unit length.
Impacts on Design:
En
One of the limitations of first order scaling is that it gives the wrong impression
of being able to scale proportionally to zero dimensions and zero voltage.
arn
.Le
The most positive impact of scaling is that performance and cost are steadily
improving. System architects need to understand the scaling of CMOS
technologies and predict the capabilities of the process several years into the
w
Interconnect :
Scaling transistors are steadily improving in delay but scaled wires are holding
constant or getting worse.
26
First the gate delay is shown for a single unloaded transistor rather than a
realistically loaded gate. |Second, the wire delay shown for fixed lengi but as
𝜇 technology scales, most local wires connecting gates within a unit also
become shorter.
Power:
In classical constant field scaling, power density remains constant and overall chip
power increases only slowly with die size.
.in
Power density has sky rocketed because clock frequencies have increased
much faster to classical scaling would predict and V DD is some what higher
ng
than constant field scaling would demand.
Dynamic power consumption will not continue to increase at such rates
ri
because it will become uneconomical to cool the chips.
ee
The static power consumption caused by sub threshold leakage was
historically negligible but becomes important for threshold voltage below
gin
about 0.3 to 04v.
5. Explain the stick diagram and layout diagram with examples. May 11, May13,
En
Nov/Dec10
arn
Stick diagrams:
Stick diagrams are used to convey layer information through the use of a colour code
for example in NMOS design.
.Le
27
The stick diagram represents the rectangles with lines which represents wires
are component symbols.
The colour cooling has been complemented by monochrome encoding of the lies
so the black and white copies of stick diagrams do not lose the layer information.
The colour and monochrome encoding scheme used has been evolved to
cover NMOS and CMOS processes.
To illustrate the stick diagram inverter circuits are presented below in NMOS,
and in P well CMOS technology.
.in
ri ng
ee
gin
Having conveyed layer information and topology by using stick or symbolic
diagrams. These diagrams relatively easily turned into mask layouts.
En
The below diagram stressing the ready translation into mask layout form. In
order that the mask layout produced during design will be compactible with the
arn
fabrication process.
Aser of design rules are set out for layouts.
.Le
involves.
ww
A transistor is formed wherever poly silicon crosses n-diffusion and all diffusion wires
are n-type. The various steps involved in the design style are.
28
Step1: Draw the metal VDD and GND rails in parallel allowing enough space
between them for the other circuit element which will be required.
Step 2: Draw the thinox paths between the rails for inverters and inverter based
logic.
Step 3: Draw the pull up structure which comprises a depletion mode transistor
interconnected between the output point and VDD.
Step 4:
Draw the pull down structure comprising an enhancement mode structure
.in
interconnected between the output point and GNO.
Step 5: Signal paths may be switched by pass transistor, and along signal paths
ng
often require metal buses.
Design Rules and layout:
ri
The design rules primarily address two issue
ee
1) The geometrical reproduction of features that can be reproduced by the mask-
making and lithographical process.
gin
2) The interactions between different layers. There are several approaches that can
be taken in describing the design rules. These include
En
- Usually given as a list of minimum feature sizes and spacings for all masks
required in a given process.
- Normal style for industry.
.Le
- They have been widely used, particularly in the educational context and in
the design of multi project chips.
29
All paths in all layers will be dimensioned in 𝜆 units and sub-sequently 𝜆 can
be allocated an appropriate value compactible with the feature size of the
fabricalion process.
Design rules can be conveniently set out in diagrammatic form as shown
below.
Contact cuts:
The contacts between layers are set out as shown below. Here it will be obserred
that connection can be made between two or, in the case of NMOs design, three layers.
.in
1) Metal to poly silicon or to diffusion
ng
There are three possible approaches for making contacts between poly silicon and
diffusion in NMOS circuits. There are
ri
i) Poly silicon to metal then metal to diffusion
ee
ii) Buried contact poly silicon to diffusion
iii) Butting contact.
gin
The 2 𝜆 x2 𝜆 contac cut indicates and area in which the oxide is to be removed
down to the underlying polysilicon or diffusion surface.
En
When the deposition of the metal layer takes place, the metal is deposited
through the contact cut areas on to the underlying areas so that contact is
arn
30
UNIT II
COMBINATIONAL LOGIC CIRCUITS
PART A
.in
to an AND gate of inverter inputs. The same relationship applies to gates with
more inputs switching between these representation is easy to do and is often
ng
called bubble pushing.
ri
2. Draw XOR gate and XNOR gate using transmission gates.
XOR gate
ee
gin
En
arn
.Le
XNOR
w
ww
31
.in
ri ng
ee
gin
4. What are the factors that cause static power dissipation in CMoS circuits?
En
(Nov/Dec 2012)
Static power dissipation due to:
arn
32
7. Give the expression for Elmore delay and state the various parameters
associated with it.(NOV/DEC 2014) (MAY/JUN 2016)
.in
ri ng
ee
gin
8. Define power dissipation.(NOV/DEC 2013)
The instantaneous power p(t) drawn from the power supply is proportional to the
supply current iDD(t) and the supply voltage Vdd.
En
instantaneous power.
2015).
When an nMoS or pMoS is used alone as an imperfect switch, it is called as a
w
passed in an
33
10. Design a 1-bit dynamic register using pass transistor.( NOV/DEC 2013)
The fig 1 shows a very simple transparent latch built from a single transistor it is
.in
compact and fast but suffers four limitations.
Fig 2 uses a CMoS transmission gate in place of the sinlge nMoS pass transistor to
ng
offer rail-rail output swings.
ri
11. Why single phase dynamic logic structure cannot be caed. justify(MAY/JUN
ee
2016)
gin
In dynamic logic, a problem arises when caing one gate to the next. The
precharge "1" state of the first gate may cause the second gate to discharge prematurely,
En
before the first gate has reached its correct state. This uses up the "precharge" of the
second gate, which cannot be restored until the next clock cycle, so there is no recovery
arn
34
PART-B
The most widely used logic style is static complementary CMOS. The static CMOS
style is really an extension of the static CMOS inverter to multiple inputs. The primary
advantage of the CMOS structure is robustness (i.e., low sensitivity to noise), good
performance, and low power consumption with no static power dissipation. Most of those
properties are carried over to large fan-in logic gates implemented using a similar circuit
.in
topology.
The complementary CMOS circuit style falls under a broad class of logic circuits called
ng
static circuits in which at every point in time (except during the switching transients), each
gate output is connected to either VDD or Vss via a low-resistance path. Also, the
ri
outputs of the gates assume at all times the value of the Boolean function implemented
ee
by the circuit (ignoring, once again, the transient effects during switching periods). This is
in contrast to the dynamic circuit class, which relies on temporary storage of signal values
gin
on the capacitance of high-impedance circuit nodes. The latter approach has the
advantage that the resulting gate is simpler and faster. Its design and operation are
En
however more involved and prone to failure due to an increased sensitivity to noise. The
design of various static circuit flavors includes complementary CMOS, ratioed logic
(pseudo-NMOS and DCVSL), and pass transistor logic.
arn
a. Complementary CMOS
.Le
A static CMOS gate is a combination of two networks, called the pull-up network
(PUN)and the pull-down network (PDN) (Figure 1). The figure shows a generic N input
w
logic gate where all inputs are distributed to both the pull-up and pull-down networks.
ww
The function of the PUN is to provide a connection between the output and VDD anytime
the output of the logic gate is meant to be 1 (based on the inputs). Similarly, the function
of the PDN is to connect the output to VSS when the output of the logic gate is meant to
be 0. The PUN and PDN networks are constructed in a mutually exclusive fashion such
that one and only one of the networks is conducting in steady state. In this way, once the
transients have settled, a path always exists between VDD and the output F, realizing a
high output (“one”), or, alternatively, between VSS and F for a low output (“zero”). This is
equivalent to stating that the output node is always a low- impedance node in steady
35
state.
.in
ng
Figure 1: Complementary logic gate as a combination of a PUN (pull-up
network) and a PDN (pull-down network).
ri
ee
In constructing the PDN and PUN networks, the following observations should bekept in
mind:
gin
• A transistor can be thought of as a switch controlled by its gate signal. An NMOS
switch is on when the controlling signal is high and is off when the controlling
En
signal is low. A PMOS transistor acts as an inverse switch that is on when the
controlling signal is low and off when the controlling signal is high.
arn
• The PDN is constructed using NMOS devices, while PMOS transistors are used in
the PUN. The primary reason for this choice is that NMOS transistors produce
“strong zeros,” and PMOS devices generate “strong ones”. To illustrate this,
.Le
consider the examples shown in Figure 2. In Figure 2.a, the output capacitance is
initially charged to VDD. Two possible discharge scenarios are shown. An NMOS
w
device pulls the output all the way down to GND, while a PMOS lowers the output
no further than |VTp| — the PMOS turns off at that point, and stops contributing
ww
discharge current. NMOS transistors are hence the preferred devices in the PDN.
Similarly, two alternative approaches to charging up a capacitor are shown in
Figure 2.b, with the output initially at GND. A PMOS switch succeeds in charging
the output all the way to VDD, while the NMOS device fails to raise the output
above VDD-VTn. This explains why PMOS transistors are preferentially used in a
PUN.
36
.in
ng
Figure 2 Simple examples illustrate why an NMOS should be used as a
pull-down, and a PMOS should be used as a pull-up device.
ri
ee
gin
En
Figure 3 NMOS logic rules — series devices implement an AND, and parallel devices
implement an OR.
arn
A set of construction rules can be derived to construct logic functions (Figure 4).
.Le
NMOS devices connected in series corresponds to an AND function. With all the inputs
high, the series combination conducts and the value at one end of the chain is transferred
to the other end. Similarly, NMOS transistors connected in parallel represent an OR
w
function. A conducting path exists between the output and input terminal if at least one of
ww
the inputs is high. Using similar arguments, construction rules for PMOS networks can be
formulated. A series connection of PMOS conducts if both inputs are low, representing a
NOR function (A.B = A+B), while PMOS transistors in parallel implement a NAND (A+B =
A· B.
• Using De Morgan’s theorems ((A + B) = A· B and A· B = A + B), it can be shown
that the pull-up and pull-down networks of a complementary CMOS structure are
dual networks. This means that a parallel connection of transistors in the pull-up
network corresponds to a series connection of the corresponding devices in the
37
pull-down network, and vice versa. Therefore, to construct a CMOS gate, one of
the networks (e.g., PDN) is implemented using combinations of series and
parallel devices. The other network (i.e.,PUN) is obtained using duality principle
by walking the hierarchy, replacing series sub- nets with parallel sub-nets, and
parallel sub-nets with series sub-nets. The complete CMOS gate is constructed
by combining the PDN with the PUN.
• The complementary gate is naturally inverting, implementing only functions such as
NAND, NOR, and XNOR. The realization of a non-inverting Boolean function (such
.in
as AND OR, or XOR) in a single stage is not possible, and requires the addition of
an extra inverter stage.
ng
• The number of transistors required to implement an N-input logic gate is 2N.
ri
b. II Ratioed Logic
ee
Ratioed logic is an attempt to reduce the number of transistors required to
implement a given logic function, at the cost of reduced robustness and extra power
gin
dissipation. The purpose of the PUN in complementary CMOS is to provide a conditional
path between VDD and the output when the PDN is turned off. In ratioed logic, the entire
En
PUN is replaced with a single unconditional load device that pulls up the output for a high
output (Figure 5.a). Instead of a combination of active pull-down and pull-up networks,
arn
such a gate consists of an NMOS pull- down network that realizes the logic function, and
a simple load device. Figure 5.b shows an example of ratioed logic, which uses a
grounded PMOS load and is referred to as a pseudo- NMOS gate.
w .Le
ww
38
(assuming that VOL is below VTn). On the other hand, the nominal low output voltage
is not 0 V since there is a fight between the devices in the PDN and the grounded PMOS
load device. This results in reduced noise margins and more importantly static power
dissipation.
The sizing of the load device relative to the pull-down devices can be used to
trade-off parameters such a noise margin, propagation delay and power dissipation.
Since the voltage swing on the output and the overall functionality of the gate depends
upon the ratio between the NMOS and PMOS sizes, the circuit is called ratioed. This is in
.in
contrast to the ratioless logic styles, such as complementary CMOS, where the low and
high levels do not depend upon transistor sizes.
ng
Computing the dc-transfer characteristic of the pseudo-NMOS proceeds along
paths similar to those used for its complementary CMOS counterpart. The value of VOL
ri
is obtained by equating the currents through the driver and load devices for Vin = VDD. At
ee
this operation point, it is reasonable to assume that the NMOS device resides in linear
mode (since the output should ideally be close to 0V), while the PMOS load is saturated.
gin
En
Assuming that VOL is small relative to the gate drive (VDD-VT) and that VTn is equal to
VTp in magnitude, VOL can be approximated as:
arn
.Le
In order to make VOL as small as possible, the PMOS device should be sized much
smaller than the NMOS pull-down devices. Unfortunately, this has a negative impact on the
propagation delay for charging up the output node since the current provided by the
w
39
.in
ri ng
ee
gin
If the input A is 1 during Precharge, contention will take place because both the
pMOS and nMOS transistors will be ON. When the input cannot be guaranteed to be 0
En
during Precharge, an extra clocked evaluation transistor can be added to the bottom of
the nMOS stack to avoid contention as shown in Figure 9.23. The extra transistor is
sometimes called a foot. Figure 9.2 shows generic footed and unfooted gates.
arn
w .Le
ww
40
Figure 9.25 estimates the falling logical effort of both footed and unfooted dynamic
gates. Footed gates have higher logical effort than their unfooted counterparts but are still
an improvement over static logic.
.in
ri ng
ee
gin
the input can start LOW and remain LOW, start LOW and rise HIGH, start HIGH and
remain HIGH, but not start HIGH and fall LOW. Figure
arn
w .Le
ww
shows waveforms for a footed dynamic inverter in which the input violates
monotonicity.
The output of a dynamic gate begins HIGH and monotonically falls LOW during
evaluation. This monotonically falling output X is not a suitable input to a second dynamic
41
gate expecting monotonically rising signals, as shown in Figure 9.27. Dynamic gates
sharing the same clock cannot be directly connected.
.in
Advantages
ng
Lower input capacitance
Sensitive to noise
arn
Applications
Used in multiplexers
w
Domino logic
ww
Keepers
42
a. Domino Logic
.in
Precharge and evaluate all the logic gates within the chain. The dynamic output is
monotonically falling during evaluation, so the static inverter output is monotonically
ng
rising. Therefore, the static inverter is usually a HI-skew gate to favor this rising
output. Observe that Precharge occurs in parallel, but evaluation occurs
ri
sequentially. The symbols for the dynamic NAND, HI-skew inverter, and domino AND are
ee
shown in Figure 9.28(c).
gin
En
arn
w .Le
ww
43
Dual-rail domino gates encode each signal with a pair of wires. The input and
output signal pairs are denoted with _h and _l, respectively. Table 9.2 summarizes the
encoding. The _h wire is asserted to indicate that the output of the gate is “high” or 1. The
_l wire is asserted to indicate that the output of the gate is “low” or 0. When the gate is
Precharge, neither _h nor _l is asserted. The pair of lines should never be both asserted
.in
simultaneously during correct operation. Dual-rail domino gates accept both true and
complementary inputs and compute both true and complementary outputs, as shown in
ng
Figure 9.30(a). Observe that this is identical to static CVSL circuits from Figure 9.20
except that the cross-coupled pMOS transistors are instead connected to the Precharge
ri
clock. Therefore, dual-rail domino can be eeviewed as a dynamic form of CVSL,
sometimes called DCVS. Figure 9.30(b) shows a dual-rail AND/NAND gate and Figure
9.30(c) shows a dual-rail XOR/XNOR gate.
gin
En
arn
w .Le
ww
44
Dual-rail structures also neither lose the efficiency of wide dynamic NOR gates
because they require complementary tall dynamic NAND stacks. Dual-rail domino signals
not only the result of a computation but also indicates when the computation is done.
Before computation completes, both rails are Precharge. When the computation
completes, one rail will be asserted. A NAND gate can be used for completion detection,
as shown in Figure 9.31. Coupling can be reduced in dual-rail signal busses by inter
digitating the bits of the bus, as shown in Figure 9.32. Each wire will never see more than
one aggressor switching at a time because only one of the two rails switches in each
.in
cycle.
ri ng
ee
gin
En
arn
w .Le
c. Keepers
ww
Dynamic circuits also suffer from charge leakage on the dynamic node. If a
dynamic node is precharged high and then left floating, the voltage on the dynamic node
will drift over time due to sub threshold, gate, and junction leakage. The time constants
tend to be in the millisecond to nanosecond range, depending on process and
temperature. This problem is analogous to leakage in dynamic RAMs. Moreover, dynamic
circuits have poor input noise margins. If the input rises above Vt while the gate is in
evaluation, the input transistors will turn on weakly and can incorrectly discharge the
45
output. Both leakage and noise margin problems can be addressed by adding a keeper
circuit. Figure 9.33 shows a conventional keeper on a domino buffer. The keeper is a
weak transistor that holds, or staticizes, the output at the correct level when it would
otherwise float. When the dynamic node X is high, the output Y is low and the keeper is
ON to pre- vent X from floating. When X falls, the keeper initially opposes the transition so
it must be much weaker than the pull down network. Eventually Y rises, turning the
keeper OFF and avoiding static power dissipation.
.in
ri ng
ee
gin
46
9.44(a). Notice that each output is a function of the less significant outputs. The more
compact MODL design shown in Figure 9.44(b) is often called a Manchester carry
chain.
.in
ri ng
ee
gin
En
arn
inverting static gates are replaced with predischarged dynamic gates using pMOS
logic. For example, a footed dynamic p-logic NAND gate is shown in Figure
w
9.46(b). When K is 0, the first and third stages precharge high while the second
stage pre- discharges low. When K rises, all the stages evaluate. Domino
ww
connections are possible, as shown in Figure 9.46(c). The design style is called
NP Domino or NORA Domino.
Disadvantages
Susceptible to noise
47
.in
ri ng
ee
gin
-----------------------------*************----------------------------------
En
arn
3. a. Write a brief note on pass Transistor circuits also explain about CMOS with
Transmission gates. (may 2011,2013) (MAY/JUN 2016)
.Le
In pass transistor circuits, inputs are also applied to the source/drain diffusion
w
terminals.
ww
These circuits build switches using either n MOS pass transistor or parallel
pairs of nM\OS and p MOS transistors called transmission gates.
For example pass transistors are essential to the design of efficient 6 transistor
static RAM cells used in most modern systems.
Full address and other circuits rich in XOR s also can b efficiently constructed
with pass transistors.
48
.in
range of circuits. The examination shows that the topology is almost identical
to static CMOS.
ng
If multiple stages of logic are cae they can be viewed as alternating
ri
transmission gates an inverters.
ee
gin
En
arn
The above figure redraws the multiplexes to include the inverters from the
previous that drive the diffusion input but to exclude in output inverter.
.Le
The intermediate modes in the pull up and pull-down networks are shorted
together as N1 and N2.
w
ww
49
.in
It the inverter is on the output rather than the input, the delay of the gate
depends on what is driving the input as well as the capacitance drivar by the
ng
output.
The second drawback is that diffuse inputs to tristate invertors are susceptible
ri
to noise that may incorrectly turn on the inverter.
ee
Finally the contacts slightly increases are and their capacitance increases
power consumption.
gin
The logical effort of circuits involving transmission gates is computed by
drawing stage that begin at gate inputs rather than diffusion inputs.
En
arn
CVSI is slow because one side of the gate pulls down, and then the cross
coupled PMOs transistor pulls the other side up.
The size of the cross coupled device is an inherent compromise between a
.Le
large transistor that fights the pull down excessively and a small transistor that
is slow pulling up.
w
CPL resolves this problem by making on half of the gate pull up while the other
half pulls down.
ww
50
The implementation of the AND function constructed that way, using only NMOS
.in
transistors is shown in Figure 6.33. In this gate, if the B input is high, the top transistor is
turned on and copies the input A to the output F. When B is low, the bottom pass
ng
transistor is turned on and passes a 0. The switch driven by B seems to be redundant at
first glance. Its presence is essential to ensure that the gate is static; this is that a low-
ri
impedance path exists to the supply rails under all circumstances, or, in this particular
case, when B is low.
ee
gin
En
arn
.Le
For high performance design, a differential pass-transistor logic family, called CPL
or DPL, is commonly used. The basic idea (similar to DCVSL) is to accept true and
ww
complementary inputs and produce true and complementary outputs. These gates
possess a number of interesting properties:
XOR’s and adders can be realized efficiently with small number of
transistors.
Modular design
51
Advantages
Conceptually simple
.in
Disadvantages
ng
Has routing overhead
ri
Suffers static power dissipation
from static power dissipation and reduced noise margins, since the high input to the
signal-restoring inverter only charges up to VDD-VTn. There are several solutions
arn
the use of a level restorer, which is a single PMOS configured in a feedback path
(Figure 6.39). The gate of the PMOS device is connected to the output of the inverter, its
drain connected to the input of the inverter and
w
the source to VDD. Assume that node X is at 0V (out is at VDD and the Mr is turned off)
ww
52
.in
voltage-drop problem associated with pass-transistor logic is the use of multiple-threshold
devices. Using zero threshold devices for the NMOS pass-transistors eliminates most of
ng
the threshold drop, and passes a signal close to VDD. Notice that even if the devices
ri
threshold was implanted to be exactly equal to zero, the body effect of the device
prevents a swing to VDD. All devices other than the pass transistors (i.e., the inverters)
ee
are implemented using standard high-threshold devices.
gin
En
arn
.Le
Solution 3: Transmission Gate Logic: The most widely-used solution to deal with
the voltage- drop problem is the use of transmission gates. It builds on the
w
complementary properties of NMOS and PMOS transistors: NMOS devices pass a strong
ww
0 but a weak 1, while PMOS transistors pass a strong 1 but a weak 0. The ideal
approach is to use an NMOS to pull-down and a PMOS to pull-up. This gate either
selects input A or B based on the value of the control signal S, which is equivalent to
implementing the following Boolean function:
53
.in
A complementary implementation of the gate requires eight transistors instead of six.
ri ng
4.a Explain the power dissipation present in VLSI circuits(APR/MAY 2010)(MAYJUN
2014)(APR/MAY 2015) (MAY/JUN 2016)
ee
Static Power Consumption
gin
Typically, all low-voltage devices have a CMOS inverter in the input and output stage.
En
Therefore, for a clear understanding of static power consumption, refer to the CMOS
inverter modes shown in Figure 1.
arn
w .Le
ww
As shown in Figure 1, if the input is at logic 0, the n-MOS device is OFF, and the p-MOS
device is ON (Case 1). The output voltage is VCC, or logic 1. Similarly, when the input is at
54
logic 1, the associated n-MOS device is biased ON and the p-MOS device is OFF. The
output voltage is GND, or logic 0. Note that one of the transistors is always OFF when the
gate is in either of these logic states. Since no current flows into the gate terminal, and
there is no dc current path from VCC to GND, the resultant quiescent (steady-state)
current is zero, hence, static power consumption (Pq) is zero.
.in
explained with a simple model that describes the parasitic diodes of a CMOS inverter, as
shown in Figure 2.
ng
VCC
VO
ri
GND
P+ N+ N+
ee
P+ P+ N+
gin
N-Well
En
P-Substrate
arn
.Le
diodes are reverse biased, only their leakage currents contribute to static power
ww
consumption. The leakage current (Ilkg)of the diode is described by the following
equation:
Ilkg is eqV kT 1
Static power consumption is the product of the device leakage current and the supply
voltage. Total static power consumption, PS, can be obtained as shown in equation 2.
55
Most CMOS data sheets specify an ICC maximum in the 10- A to 40- A range,
encompassing total leakage current and other circuit features that may require some
static current not considered in the simple inverter model.
The leakage current ICC (current into a device), along with the supply voltage, causes
static power consumption in the CMOS devices. This static power consumption is defined
.in
as quiescent, or PS, and can be calculated by equation 3.
ng
VCC =supply voltage
ri
ICC = current into a device (sum of leakage currents as in equation 2)
Transient power consumption is due to the current that flows only when the transistors of the
devices are switching from one logic state to another. This is a result of the current
.Le
required to charge the internal nodes (switching current) plus the through current (current
that flows from VCC to GND when the p-channel transistor and n-channel transistor turn
on briefly at the same time during the logic transition). The frequency at which the device is
w
switching, plus the rise and fall times of the input signal, as well as the internal nodes of the
ww
device, have a direct effect on the duration of the current spike. For fast input transition
rates, the through current of the gate is negligible compared to the switching current. For
this reason, the dynamic supply current is governed by the internal capacitance of the IC
and the charge and discharge current of the load capacitance.
Dynamic supply current is dominant in CMOS circuits because most of the power is
consumed in moving charges in the parasitic capacitor in the CMOS gates. As a result, the
56
simplified model of a CMOS circuit consisting of several gates can be viewed as one large
capacitor that is charged and discharged between the power-supply rails. Therefore, the
power–dissipation capacitance (Cpd) is often specified as a measure of this equivalent
capacitance and is used to approximate the dynamic power consumption. Cpd is defined
as the internal equivalent capacitance of a device calculated by measuring operating
current without load capacitance. Depending on the output switching capability, Cpd can
be measured with no output switching (output disabled) or with any of the outputs
switching (output enabled). Cpd is discussed in greater detail in the next section.
.in
4.b. Explain the various ways to minimize static and dynamic power dissipation.
ng
(NOV/DEC 2014) (NOV/DEC 2013) (APR/MAY 2010) (MAY/JUN 2016)
Figure a shows a plot of the (VT, VDD) ratio required to maintain a given performance
level (assuming that other device characteristics remain identical). This trade-off is not
without penalty. Reducing the threshold voltage, increases the subthreshold leakage
arn
current exponentially .
w .Le
ww
Figure: Voltage Scaling (VDD/VT on delay and leakage) (a) VDD/VT for fixed performance (b)
Leakage as a function of VT
with S the slope factor of the device. The subthreshold leakage of an inverter is the current
of the NMOS for Vin = 0V and Vout = VDD (or the PMOS current for Vin = VDD and Vout = 0).
57
The exponential increase in inverter leakage for decreasing thresholds illustrated in Figure
b.
These leakage currents are particularly a concern for designs that feature
intermittent computational activity separated by long periods of inactivity. For example, the
processor in a cellular phone remains in idle mode for a majority of the time. While the
processor is shutdown mode, the system should ideally consume zero or near-zero power.
This is only possible if leakage is low—this is, the devices have a high threshold voltage.
This is in contradictory to the scaling scenario that we just depicted, where high
.in
performance under low supply voltage means reduced thresholds. To satisfy the
contradicting requirements of high-performance during active periods, and low leakage
ng
during standby, several process modifications or leakage-control techniques have been
introduced in CMOS processes. Most processes with feature sizes at and below 0.18 mm
ri
CMOS support devices with different thresholds—typically a device with low threshold for
ee
high performance circuits, and a transistor with high threshold for leakage control. Another
approach that is gaining ground is the dynamic control of the threshold voltage of a device
gin
by exploiting the body effect of the transistor. To use this approach for the control of
individual devices requires a dual-well process.
En
Clever circuit design can also help to reduce the leakage current, which is a function
of the circuit topology and the value of the inputs applied to the gate. Since VT depends on
arn
body bias (VBS), the sub-threshold leakage of an MOS transistor depends not only on the
gate drive (VGS), but also on the body bias. In an inverter with In = 0, the sub-threshold
leakage of the inverter is set by the NMOS transistor with its VGS = VBS = 0 V. In more
.Le
complex CMOS gates, the leakage current depends upon the input vector. For example,
the sub-threshold leakage current of a two-input NAND gate is the least when A=B=0.
w
The NAND gate sub-threshold leakage is then set by the top-most NMOS transistor
with VGS=VBS=-VX. Clearly, the sub-threshold leakage under this condition is slightly
smaller than that of the inverter. This reduction in sub-threshold leakage due to stacked
transistors is called
58
5. Draw the static CMoS logic circuit for the given expression. Y = (A.B + C.D)’
(MAY/JUN 2016)
A C A C
B D B D
(a) (b)
C D
A B C D
A B
(c)
(d)
.in
C D
A
A B
B
Y Y
ng
C
A C
D
B D
(f)
ri
(e)
ee
gin
Step 1: Fig a shows the logic design for A.B and C.D using nMoS transistors.
Step 2: Fig b shows the combination of both the AND gates using nMoS transistors. ie,
En
Y=A.B+C.D
arn
Step 3: Fig c shows the logic design for A.B and C.D using pMoS transistors.
Step 4: Fig d shows the combination of both the AND gates using pMoS transistors. ie,
.Le
Y=A.B+C.D
Step 5: Fig e denotes the inverted operation for the expression Y=A.B+C.D. This is
w
59
UNIT III
SEQUENTIAL LOGIC CIRCUITS
PART A
If all the registers are controlled by clock signal, then the circuit is called
synchronous sequential logic circuit.
2. What is bistability principle?
.in
Bistable state has two stable states. The two stable states are o and one.
ng
3. What is metastable?
If the cross coupled inverter pair is biased at point C and small deviation at this
ri
point caused by noise is amplified and regenerated around the circuit loop. This small
ee
deviation is amplified by both the inverters and the bias point C moves the operation
points A and B. so the bias point is unstable. This property is called metastable.
gin
4. List the timing parameters of registers.
1. Set up time
En
2. Propagation delay
arn
3. Hold time
During the 0-0 overlap period, NMOS of t1 and PMOS t2 are simultaneously ON.
This creates a direct path for data to flow from D input of the register to the Q output.
w
2. Complexity
60
Clock skew is defined as the spatial variation in arrival time of clock transition on
an integrated circuit. The clock skew between two points i and j on an IC.
9. What is clock jitter?
.in
Clock jitter is defined as the temporal variation of the clock period at a given point
on the chip. The clock period can reduce or expand on a cycle –by- cycle basis.
ng
10. Define pipelining.
ri
Pipelining is a designing technique used to increase the operation of datapaths in
digital processor. ee
11. Define Propagation delay (tpd)?
gin
This value indicates the amount of time needed for a change in a logic input to
result in a permanent change at an output. Combinational logic is guaranteed not to
En
show any further output changes in response to an input change after tpd time units
have passed.
12. Define Contamination delay (tcd)?
arn
This value indicates the amount of time needed for a change in a logic input to
result in an initial change at an output. Combinational logic is guaranteed not to
.Le
show any output change in response to an input change before tcd time units have
passed.
w
This value indicates the amount of time before the clock edge that data input D
must be stable. As shown in Figure 4, D is stable ts time units before the rising clock
edge.
14. What do you mean by Hold time (th)?
This value indicates the amount of time after the clock edge that data input D must
be held stable. As shown in Figure 4, the hold time is always measured from the
rising clock edge (for positive edge-triggered) to a point after the edge.
61
If one half-cycle or stage of a pipeline has too much logic, it can borrow time into
the next half-cycle or stage. Time borrowing can accumulate across multiple cycles.
16. What is clocked CMoS register? (MAY/JUN 2016)
In integrated circuit design, dynamic logic (or sometimes clocked logic) is a design
methodology in combinatory logic circuits, particularly those implemented in MOS
technology.
17. Draw the switch level schematic of multiplexer based nMoS latch using nMoS
.in
only pass
transistors for multiplexers. (MAY/JUN 2016)
ri ng
ee
gin
En
arn
w .Le
ww
62
PART B
.in
latch.
Static Latches Principle:
ng
Static memories use positive feedback to create a bistable circuit — a circuit
having two stable states that represent 0 and 1. The basic idea is shown in below figure
ri
which shows two inverters connected in
ee cae along with a voltage-transfer
characteristic typical of such a circuit. Also plotted are the VTCs of the first inverter, that
is,V o1 versus Vi1, and the second inverter (Vo2 versus Vo1).
gin
En
arn
.Le
The latter plot is rotated to accentuate thatVi2 = Vo1. Assume now that the output
w
of the second inverter Vo2 is connected to the input of the first Vi1, as shown by the
ww
dotted lines in Figure 7.4a. The resulting circuit has only three possible operation points
(A, B, and C), as demonstrated on the combined VTC.
There are many approaches for constructing latches. One very common
technique involves the use of transmission gate multiplexers. Multiplexer based latches
can provide similar functionality to the SR latch, but has the important added advantage
that the sizing of devices only affects performance and is not critical to the functionality.
63
Figure 7.11 shows an implementation of static positive and negative latches based on
multiplexers.
For a negative latch, when the clock signal is low, the input 0 of the multiplexer is
selected, and the D input is passed to the output. When the clock signal is high, the input
1 of the
multiplexer, which connects to the output of the latch, is selected. The feedback holds the
output stable while the clock signal is high. Similarly in the positive latch, the D input is
selected when clock is high, and the output is held (using feedback) when clock is low.
.in
ri ng
ee
gin
Fig: Negative and Positive latches based on multiplexer
in figure. When CLK is high, the bottom transmission gate ison and the latch is
transparent - that is, the D input is copied to the Q output. During this phase, the
feedback loop is open since the top transmission gate is off.
arn
w .Le
ww
Unlike the SR FF, the feedback does not have to be overridden to write the
memory and hence sizing of transistors is not critical for realizing correct functionality.
The number of transistors that the clock touches is important since it has an activity factor
of 1. This particular latch implementation is not particularly efficient from this metric as it
presents a load of 4 transistors to the CLK signal.
It is possible to reduce the clock load to two transistors by using implement
64
multiplexers using NMOS only pass transistor as shown in Figure 7.13. The advantage of
this approach is the reduced clock load of only two NMOS devices. When CLK is high,
the latch samples the D input, while a low clock-signal enables the feedback-loop, and
puts the latch in the hold mode.
The scaling of supply voltages is critical for low power operation. Unfortunately,
certain latch structures don’t function at reduced supply voltages. For example, without
.in
the scaling of device thresholds, NMOS only pass transistors don’t scale well with
supply voltage due to its inherent threshold drop. At very low power supply voltages, the
ng
input to the inverter cannot be raised above the switching threshold, resulting in incorrect
evaluation. Even with the use of transmission gates, performance degrades significantly
ri
at reduced supply voltages.
ee
gin
En
arn
.Le
Fig: one solution for the leakage problem in low-voltage operation using
MTCMOS
w
Scaling to low supply voltages hence requires the use of reduced threshold
ww
devices. However, this has the negative effect of exponentially increasing the sub-
threshold leakage Power. age energy is typically insignificant compared to the switching
power. However, with the use of conditional clocks, it is possible that registers are idle for
extended periods and the leakage energy expended by registers can be quite
significant.
Many solutions are being explored to address the problem of high leakage during
idle periods. One approach for this involves the use of Multiple Threshold devices as
shown in above figure only the negative latch is shown here. The shaded inverters and
65
transmission gates are implemented in low-threshold devices. The low threshold inverters
are gated using high threshold devices to eliminate leakage.
During normal mode of operation, the sleep devices are tuned on. When clock is
low, the D input is sampled and propagates to the output. When clock is high, the latch is
in the hold mode. The feedback transmission gate conducts and the cross-coupled
feedback is enabled. Note there is an extra inverter, needed for storage of state when the
latch is in the sleep state. During idle mode, the high threshold devices in series with the
low threshold inverter are turned off (the SLEEP signal is high), eliminating leakage. It is
.in
assumed that clock is in the high state when the latch is in the sleep state. The feedback
low-threshold transmission gate is turned on and the cross-coupled high-threshold device
ng
maintains the state of the latch.
ri
2. Explain about the concept of pipelining in detail (Dec-2012, May2014) (MAY/JUN
ee
2016)
gin
Pipelining is a popular design technique often used to accelerate the operation of
the datapaths in digital processors. The idea is easily explained with the example of
below figure.
En
The goal of the presented circuit is to compute log (|a - b|), where both a and b represent
streams of numbers, that is, the computation must be performed on a large set of input
arn
values.
The minimal clock period Tmin necessary to ensure correct evaluation is given as:
.Le
where tc-q and t su are the propagation delay and the set-up time of the register,
w
respectively.
ww
66
We assume that the registers are edge-triggered D registers. The term tpd, logic
stands
for the worst-case delay path through the combinatorial network, this consists of the
adder, absolute value, and logarithm functions. In conventional systems (that don’t push
the edge of technology), the latter delay is generally much larger than the delays
associated with the registers and dominates the circuit performance. Assume that each
logic module has an equal propagation delay. We note that each logic module is then
active for only 1/3 of the clock period (if the delay of the register is ignored).
.in
For example, the adder unit is active during the first third of the period and
remains idle— this is, it does no useful computation— during the other 2/3 of the period.
ng
Pipelining is a technique to improve the resource utilization, and increase the functional
throughput. Assume that we introduce registers between the logic blocks.
ri
The result for the data set (a1, b1) only appears at the output after three clock-
ee
periods. At that time, the circuit has already performed parts of the computations for the
gin
next data sets, (a2, b2) and (a3,b3). The computation is performed in an assembly-line
fashion, hence the name pipeline.
The advantage of pipelined operation becomes apparent when examining the
En
minimum clock period of the modified circuit. The combinational circuit block has been
partitioned into three sections, each of which has a smaller propagation delay than the
arn
original function. This effectively reduces the value of the minimum allowable clock
period.
.Le
triggered registers. Consider the pipelined circuit of below figure. The pipeline system is
implemented based on pass-transistor-based positive and negative latches instead of
ww
edge triggered registers. That is, logic is introduced between the master and slave
latches of a Master- slave system.
67
.in
In the following discussion, we use without loss of generality the CLK-CLK notation
to denote a two-phase clock system. Latch-based systems give significantly more
ng
flexibility in implementing a pipelined system, and often offer higher performance.
When the clocks CLK and CLK are non-overlapping, correct pipeline operation is
ri
obtained. Input data is sampled on C1 at the negative edge of CLK and the computation
ee
of logic block F starts; the result of the logic block F is stored on C2 on the falling edge of
CLK, and the computation of logic block G starts. The non-overlapping of the clocks
gin
ensures correct operation. The value stored on C2 at the end of the CLK low phase is the
result of passing the previous input (stored on the falling edge of C LK on C1) through
En
applied to F, and its effect might propagate to C2 before CLK goes low (assuming that
the contamination delay of F is small). Which value wins depends upon the logic
functionF , the overlap time, and the value of the inputs since the propagation delay is
.Le
The latch-based pipeline circuit can also be implemented using CMOS latches, as
ww
shown in Figure. The operation is similar to the one discussed above. This topology has
one additional, important property:
68
.in
ri ng
ee
Fig : Potential race condition during (0-0) overlap in CMOS-based design.
gin
Similar considerations are valid for the (1-1) overlap. Based on this concept, a
logic circuit style called NORA-CMOS; it combines CMOS pipeline registers and NORA
En
dynamic logic function blocks. Each module consists of a block of combinational logic
that can be a mixture of static and dynamic logic, followed by a CMOS latch. Logic and
arn
latch are clocked in such a way that both are simultaneously in either evaluation, or hold
(precharge) mode. A block that is in evaluation during CLK = 1 is called a CLK-module,
while the inverse is called a CLK- module.
.Le
A NORA datapath consists of a chain of alternating CLK and CLK modules. While
one class of modules is precharging with its output latch in hold mode, preserving
w
the previous output value, the other class is evaluating. Data is passed in a pipelined
fashion from module to module.
ww
NORA offers designers a wide range of design choices. Dynamic and static logic
can be mixed freely, and both CLKp and CLKn dynamic blocks can be used in caed
or in pipelined form. With this freedom of design, extra inverter stages, as required in
DOMINO-CMOS, are most often avoided.
In order to ensure correct operation, two important rules should always be
followed:
The dynamic-logic rule: Inputs to a dynamic CLKn (CLKp) block are only allowed
69
.in
is shown in figure. When CLK = 0, the input data is sampled on storage node 1, which
has an equivalent capacitance of C1 consisting of the gate capacitance of I1, the junction
ng
capacitance of T1, and the overlap gate capacitance of T1.
ri
ee
gin
During this period, the slave stage is in a hold mode, with node 2 in a high-
impedance (floating) state. On the rising edge of clock, the transmission gate T2 turns on,
arn
and the value sampled on node 1 right before the rising edge propagates to the output Q
(note that node 1 is stable during the high phase of the clock since the first transmission
gate is turned off).
.Le
Node 2 now stores the inverted version of node 1. This implementation of an edge-
triggered register is very efficient as it requires only 8 transistors.
w
70
(i.e., the state) has to be refreshed at periodic intervals to prevent a loss due to charge
leakage, due to diode leakage as well as sub-threshold currents. In datapath circuits,
the refresh rate is not an issue since the registers are periodically clocked, and the
storage nodes are constantly updated.
Clock overlap is an important concern for this register. Consider the clock
waveforms shown in below figure. During the 0-0 overlap period, the NMOS of T1 and the
PMOS of T2 are simultaneously on, creating a direct path for data to flow from the D
input of the register to the Q output. This is known as race condition. The output Q
.in
can change on the falling edge if the overlap period is large
ri ng
ee
gin
En
arn
71
.in
ng
Fig: master slave edge-triggered register
It can be stated that the C2MOS latch is insensitive to clock overlaps because
ri
those overlaps activate either the pull-up or the pull-down networks of the latches, but
ee
never both of them simultaneously. If the rise and fall times of the clock are sufficiently
slow, however, there exists a time slot where both the NMOS and PMOS transistors are
gin
conducting. This creates a path between input and output that can destroy the state of
the circuit.
En
So far, we have focused on edge-triggered registers that sample the input data on
arn
only one of the clock edges (rising or falling). It is also possible to design sequential
circuits that sample the input on both edges. The advantage of this scheme is that a
.Le
lower frequency clock (half of the original rate) is distributed for the same functional
throughput, resulting in power savings in the clock distribution network.
w
ww
72
.in
ri ng
Fig: CMOS based dual-edge triggered register.
ee
The above figure shows a modification of the C2MOS register to enable sampling
on both edges.
gin
True Single-Phase Clocked Register (TSPCR)
In the two-phase clocking schemes described above, care must be taken in routing
En
the two clock signals to ensure that overlap is minimized. While the C2MOS provides a
skew- tolerant solution, it is possible to design registers that only use a single phase
arn
clock. The basic single-phase positive and negative latches are shown in figure.
w .Le
ww
For the positive latch, when CLK is high, the latch is in the transparent mode and
corresponds to two caed inverters; the latch is non-inverting, and propagates the
input to the output. On the other hand, when C LK = 0, both inverters are disabled, and
73
the latch is in hold- mode. Only the pull-up networks are still active, while the pull-down
circuits are deactivated. As a result of the dual-stage approach, no signal can ever
propagate from the input of the latch to the output in this mode. A register can be
constructed by caing positive and negative latches.
.in
ri ng
ee
gin
STATIC (SRAM):
En
Large (6 transistors/cell)
Fast
.Le
Differential
DYNAMIC (DRAM):
w
Slower
Single Ended
74
.in
ri ng
ee
gin
En
arn
w .Le
ww
75
.in
ng
Array-Structured Memory Architecture:
ri
ee
gin
En
arn
w .Le
ww
Advantages:
76
Synchronous Timing:
CLK
In
R1 Combinational R2
Logic
.in
ri ng
The following timing parameters characterize the timing of the sequential circuit.
ee
• The contamination (minimum) delay tc-q,cd, and maximum propagation delay of
the register tc-q.
gin
• The set-up (tsu) and hold time (thold) for the registers.
• The contamination delay tlogic,cd and maximum delay tlogic of the combinational
En
logic.
• tclk1 and tclk2, corresponding to the position of the rising edge of the clock relative
arn
to a global reference.
.Le
Clock jitter
77
.in
Both skew and jitter affect the effective cycle time
ng
Only skew affects the race margin
ri
Positive and Negative Skew:
ee
gin
R1 R2 R3
In Combinational Combinational
D Q D Q
••• Logic Logic D Q
delay delay
(a) Positive
.Le
skew
R1 R2 R3
w
n
I D Q
Combinational
Logic D Q Combinational
Logic
D Q
(b) Negative
skew
78
Clock-Signal Generation- The generation of the clock signal itself causes jitter
.in
significant and primarily contribute to skew and jitter. The two major sources of
environmental variations are temperature and power supply. Power supply
ng
variations is the major source of jitter in clock distribution networks.
Capacitive Coupling-The variation in capacitive load also contributes to timing
ri
uncertainty. There are two major sources of capacitive load variations: coupling
ee
between the clock lines and adjacent signal wires and variation in gate
capacitance.
gin
Clock-Distribution Techniques:
En
It is necessary to design a clock network that minimizes skew and jitter. Another
arn
paths, that include both matched interconnect as well as buffers, are used to distribute
the reference to various leaf nodes. Ideally, if each path is balanced, the clock skew is
ww
zero.
That is, though it might take multiple clock cycles for a signal to propagate from the
central point to each leaf node, the arrival times are equal at every leaf node.
79
.in
Fig: Example of H-tree clock
distribution
ri ng
ee
gin
En
arn
.Le
w
ww
80
UNIT IV:
DESIGNING ARITHMETIC BILDING BLOCKS
PART-A
1. Define datapath circuits.
Datapath circuits use N identical circuits to process N-bit data. Related data
operators are placed physically adjacent to each other to reduce wire length and
.in
delay.
2. What is ripple carry adder?
ng
An N-bit adder can be constructed by caing N full adders. This is called
a carry-ripple adder (or ripple-carry adder). The carry-out of bit i, Ci is the carry-in
ri
to bit i ee
+ 1. This carry has twice the weight of the sum Si. The delay of the adder is set
by the time for the carries to ripple through the N stages, so the delay is
gin
minimized.
3. What is the need of carry lookahead adder?
En
The carry-look ahead adder (CLA) computes group generate signals as well
as group propagate signals to avoid waiting for a ripple to determine if the first
arn
Carry-skip adder
Carry-select adder
w
Carry-save adder
ww
81
.in
6. Define Booth encoding.
ng
encoding called booth encoding of the multiplier word that reduces the number of
required addition stages. Instead of traditional binary encoding the multiplier word
ri
ee
is recoded into radix-4 scheme.
gin
En
arn
Serial divider
Parallel divider
.Le
82
In ripple carry adder, every full-adder cell has to wait for the incoming carry
before an outgoing carry is generated. This creates a linear dependency.
11. How to overcome the disadvantage of ripple-carry adder?
.in
The disadvantage of ripple-carry adder is eliminated by the use of carry-
select adder. It anticipates both possible values of the carry input and evaluates
ng
the result for both possibilities in advance. Once the real value of the incoming
carry is known, the correct result is easily selected with a multiplexer.
ri
12. What is meant by bit sliced data path organization? (MAY/JUN 2016)
ee
A data path circuit is a circuit that combines two functions to a single logic cell. For
gin
instance, consider to design a full adder: ADD is a function that combines two inputs.
Therefore in general, the layout of buswide logic that operates on data signals is called
as a data path. The module add in a full adder is a data path.
En
13. Determine the propagation delay of n-bit carry select adder. (MAY/JUN 2016)
arn
tsetup = Initial time taken to create the propagate and generate signals.
tcarry = defines the propagation delay through the single bit
w
83
PART-B
.in
the delay should be minimized.
ri ng
ee
Fig.4.1 (a) 4-bit carry-ripple adder
gin
In carry-ripple adders, the critical path goes from C to Cout through many full
adders, so the extra delay computing S is unimportant. This delay can be reduced by
En
omitting the inverters on the outputs. Fig.4.1 (b) shows the adder with transistor sizes
optimized to favor the critical path using a number of techniques:
arn
Feed the carry-in signal (C) to the inner inputs so the internal capacitance is
already discharged.
.Le
Make all transistors in the sum logic whose gate signals are connected to
the carry-in and carry logic minimum size (1 unit, e.g., 4λ). This minimizes
the branching effort on the critical path. Keep routing on this signal as short
w
84
.in
Fig.4.1 (b) Full adder for carry-ripple
operation
ng
This delay can be reduced by omitting the inverters on the outputs, as was done in
Fig.4.1 (b). Because addition is a self-dual function (i.e., the function of complementary
ri
inputs is the complement of the function), an inverting full adder receiving complementary
ee
inputs produces true outputs. Fig.4.1 (c) shows a carry ripple adder built from inverting
full adders. Every other stage operates on complementary data. The delay inverting the
gin
adder inputs or sum outputs is off the critical ripple-carry path.
En
arn
.Le
The critical path of the carry-ripple adder passes from carry-in to carry-out along the
carry chain majority gates. As the P and G signals will have already stabilized by the time
the carry arrives, we can use them to simplify the majority function into an AND-OR gate:
Because Ci= Gi:0, carry-ripple addition can now be viewed as the extreme case ofgroup
85
PG logic in which a 1-bit group is combined with an i-bit group to form an (i+1) bit group
.in
ri ng
ee
Fig.4.1 (d) 4-bit ripple carry adder
gin
Fig.4.1 (d) shows a 4-bit carry-ripple adder. The critical carry path now proceeds
through a chain of AND-OR gates rather than a chain of majority gates.
En
-------------------------**********-------------------------
arn
A barrel shifter performs a right rotate operation. It handles left rotations using the
complementary shift amount. Barrel shifters can also perform shifts when suitable
.Le
masking hardware is included. Barrel shifters come in array and logarithmic forms. The
logarithmic barrel shifters are most useful because they are better suited for large shifts.
w
Fig.4.2 (a) shows a simple 4-bit barrel shifter that performs right rotations. Unlike funnel
shifters, barrel shifters contain long wrap-around wires.
ww
In a large shifter, it is necessary to upsize or buffer the drivers for these wires.
Fig.4.2 (b) shows an enhanced version that can rotate left by prerotating right by 1, then
rotating right by k. Performing logical or arithmetic shifts on a barrel shifter requires a way
to mask out the bits that are rotated off the end of the shifter, as shown in Fig.4.2 (c).
86
.in
ri ng
ee
gin
Fig.4.2 Barrel shifters: (a) rotate right, (b) rotate left or right, (c) rotates and shifts
En
Fig.4.2 (d) shows a 32-bit barrel shifter using a 5:1 multiplexer and an 8:1 multiplexer.
The first stage rotates right by 0, 1, 2, 3, or 4 bits to handle a pre-rotate of 1 bit and a fine
arn
rotate of up to 3 bits combined into one stage. The second stage rotates right by 0, 4, 8,
12, 16, 20, 24, or 28 bits. The critical path starts with decoding the shift amount for the
first stage. If the shift amount is available early, the delay from A to Y improves
.Le
substantially.
w
ww
87
While the rotation is taking place, the masking unit generates an N-bit mask with
ones where the kill value should be inserted for right shifts. For a right shift by m, the m
most significant bits are ones. This is called a thermometer code. When the
rotation result X is complete, the masking unit replaces the masked bits with the kill
value. For left shifts, the mask is reversed.
.in
ri ng
ee
gin
En
\
arn
Fig.4.2 (e) shows masking logic. If only certain shifts are supported, the unit can be
.Le
simplified, and if only rotates are supported, the masking unit can be eliminated, saving
substantial hardware, power, and delay.
w
ww
3. Explain in detail about the operation of carry lookahead adder with necessary
diagrams. (Nov-2010)
The carry-lookahead adder (CLA) computes group generate signals as well as
group propagate signals to avoid waiting for a ripple to determine if the first group
generates a carry.
88
In expanded form,
---- (1)
Here . For every bit, the carry and sum outputs are independent of the previous
bits. The ripple effect has thus been effectively eliminated and therefore the addition
.in
time should be independent of number of bits. Fig.4.3 (a) shows the carry-lookahead
adder.
ri ng
ee
gin
The possible circuit implementation of equation (1) is shown in Fig.4.3 (b) for N=4. The
En
large fan-in of the circuit makes it slow for larger values of N. Implementing it with simpler
gates requires multiple-logic levels. In both cases, the propagation delay increases.
arn
Furthermore, the fan-out of some signals tend to grow excessively, slowing down the
adder more since the propagation delay of a gate is proportional to its load. Finally the
.Le
89
.in
ng
Fig.4.3 (c) Improved CLA group PG network
ri
4.Explain multiplication with an example and discuss the different types of
multipliers. (Nov-2010) (MAY/JUN 2016)
ee
gin
multiplication consists of forming the product of two unsigned (positive) binary numbers.
For example, the multiplication of two positive 6-bitbinary integers, 2510 and 3910,
arn
products is added and the carry values are passed to the next column.
ww
90
----- (1)
.in
ri ng
ee
gin
Fig.4.4 (a). The generation of partial product requires a multiplication by 1 or 0 (i.e.) AND
operation. Generating the N partial products requires N M-bit AND gates. The shifting of
.Le
the partial products is performed by simple routing and does not require any active logic.
The overall structure can be compacted into a rectangle, resulting in a very efficient
layout.
w
ww
91
.in
ng
.
ri
ee
Due to array organization, determining the propagation delay is difficult. Partial sum
adders are implemented as ripple-carry adders. Performance optimization requires critical
gin
timing path to be identified. Two such paths are highlighted in Fig.4.4 (d). The
propagation delay is given as,
----- (2)
En
where is the propagation delay between input and output carry, is the delay
arn
between the input carry and sum bit of the full adder and is the delay of the AND
w .Le
ww
gate.
Fig.4.4 (d) Ripple carry based 4x4 multiplier with two critical paths
highlighted
92
Since all critical paths have the same length, speeding up one of them does not make
much difference. All the critical paths have to be speeded up at the same time. From
equation (2), it is deduced that the minimization of requires the minimization of both
and .
Due to large number of identical critical paths, increasing the performance of the
structure shown in Fig.4.4 (d) is achieved with careful transistor sizing. A more efficient
multiplier structure is obtained by noticing that the multiplication result does not change
when the output carry bits are passed diagonally downwards instead of to the right. An
.in
extra adder called as a vector-merging adder, is added to generate the final result. Such
multiplier is called as carry-save multiplier, because the carry bits are not immediately
ng
added but are rather saved for the next adder stage. This structure has a slightly
increased area cost but it has the advantage that its worst-case-critical path is uniquely
ri
defined as shown in fig.4.4 (e) and expressed in equation
ee
(3).
gin
-------- (3)
assuming that .
En
93
Booth Encoder:
.in
The radix-4 multiplier produces N/2 partial products. Each partial product is 0, Y, 2Y, or
ng
3Y, depending on a pair of bits of X. Computing 2Y is a simple shift, but 3Y is a hard
multiple requiring a slow carry propagate addition of Y + 2Y before partial product
ri
generation begins. The advantage of the recoding is that the number of partial products
ee
and hence the number of additions is halved, which results in a speed-up as well as area
gin
reduction. The only expense is somewhat more involved multiplier cell. While
multiplication with {0,1} is equivalent to an AND operation, multiplying with {-2,-1,0,1,2}
requires a combination of inversion and shift logic.
En
partial products are chosen by considering a pair of bits along with the most significant bit
from the previous pair. If the most significant bit from the previous pair is true, Y must be
added to the current partial product. If the most significant bit of the current pair is true,
.Le
the current partial product is selected to be negative and the next partial product is
incremented. Table 1 shows how the partial products are selected based on bits of the
w
multiplier. Negative partial products are generated by taking the two’s complement of the
multiplicand (possibly left-shifted by one column for –2Y). An unsigned radix-4 Booth
ww
encoded multiplier requires partial products rather than N. Each partial product is M+ 1
bits to accommodate the 2Y and –2Y multiples. Even though X and Y are unsigned, the
partial products can be negative and must be sign extended properly.
94
.in
Table 1: Radix-4 modified Booth encoding values
ng
with the most significant bit of the previous pair) is encoded into several select
lines(SINGLEi, DOUBLEi, and NEGi, given in the rightmost columns of Table 1) and
ri
driven across the partial product row as shown in Fig.4.4 (f).
ee
gin
En
arn
The multiplier Y is distributed to all the rows. The select lines control Booth
selectors that choose the appropriate multiple of Y for each partial product. The Booth
w
selectors substitute for the AND gates of a simple array multiplier to determine the i th
ww
partial product. Fig.4.4 (f) shows a conventional Booth encoder and selector design. Y is
zero-extended to M + 1 bit. Depending on SINGLEi and DOUBLEi, the gate selects either
0, Y, or 2Y. Negative partial products should be two’s-complemented (i.e., invert and add
1). If NEGi is asserted, the partial product is inverted. The extra 1 can be added in the
least significant column of the next row to avoid needing a CPA.
Wallace-Tree Multiplier
ripples from top to bottom similar to what happens in ripple-carry adder. The number of
stages equals the number of bits in the multiplier word minus 2. Now the linear chain is
translated into a tree structure as shown in Fig.4.4 (h). This topology which has an
multiplication time, is called the
Wallace multiplier. It is faster than the carry-save structure but has the disadvantage of
being irregular. This complicates the task of coming up with a dense and efficient layout.
Wallace multipliers are used only in designs where performance is critical and design
time is only a secondary consideration.
.in
ri ng
ee
gin
(g)Vertical slice of 6-bit carry-save multiplier (h) Wallace tree organization
Fig.4.4 Wallace-tree multiplier
En
5.What are the two types of dividers? Explain them with example and schematic
sketches.
arn
.Le
Serial divider
Parallel divider
w
Serial divider
ww
96
Here when the difference is positive final carry is 1 which is end around and added
to get the actual difference. When difference is negative, carry is zero and true result is
obtained by one’s complement of the sum output. So, repeated subtraction is done till
final carry is one. Since subtraction is for three times, when the carry is 1, the quotient is
3 and remainder is the final difference which is 0001. The implementation of binary
divider by means of repeated subtraction of two 4-bit unsigned binary numbers is shown
in fig.4.5 (a). Here the divisor Y3Y2Y1Y0 is subtracted from X3X2X1X0 by one’s
complement method of subtraction. The basic building blocks used are,
.in
Adder ADD4 to add to 4-bit binary number
ng
4 set of 2:1 MUX and D FLIPFLOPS
ri
ee
gin
En
arn
w .Le
ww
Each bit of divisor is complemented and fed to one set of adder inputs. Dividend is
initially loaded in a register comprising of 4 D Flipflops by putting LOAD input high, which
97
is common select input of all the MUX and also to the CLR input of the counter. So
initially the counter is also reset to zero. Output of D Flipflop is fed to another set of inputs
of the adder. The final carry output of the adder block is fed to the clock enable input CE
of the counter and also to an OR gate whose other input is LOAD and output goes to
clock enable of register.
Parallel divider
Parallel divider performs parallel division using array of full subtractor blocks.
Implementation of a parallel divider, which is also called as array divider, to divide an
.in
unsigned 4-bit number A3A2A1A0 by B3B2B1B0 is shown in fig.4.5 (b). The relation
between dividend A3A2A1A0, divisor B3B2B1B0, quotient Q3Q2Q1Q0 and remainder
ng
R3R2R1R0 is as follows:
ri
ee
gin
En
arn
w .Le
ww
98
.in
ri ng
ee
gin
To find whether Q3 is 0 or 1 divisor is multiplied by 23, i.e., left shifted by 3 bit and
arn
subtracted from dividend if difference is zero or positive. The final carryout (CO) of the
full subtractor block in the first array is 0, and Q3 is made 1 by complementing CO.
Otherwise subtraction is not done and dividend is passed to the next array and Q 3 is
.Le
is multiplied by 22, i.e., left shifted by 2 bit and subtracted from dividend or previous
w
difference,
ww
if the difference is zero or positive. If final carry out (CO) of this array of full subtractor
block is 0, and Q2 is made 1 by complementing CO. Similarly Q1 and Q0 are obtained.
The remainder R3R2R1R0is the last positive difference output. Example of binary
division of 1011 (A3A2A1A0) by 0010 (B3B2B1B0) is shown below:
99
So dividing 1011 (decimal 11) by 0010 (decimal 2) results in quotient 0101 (decimal 5)
and remainder 0001 (decimal 1). In the circuit, there are four rows of arrays for the
computation of each bit of quotient. For the above example, for the computation of Q3 we
have to subtract B0 from A3 only when A3 is 1 and B0 is 0 or 1 and rest of the bits in B is
0 and that logic is implemented in top row. For the computation of Q2 only B1 and B0 is
needed for subtraction, so there are two full subtractor blocks in second row. Foe
.in
computation of Q1 and Q0 three and four subtractor blocks are needed respectively.
Each of the full subtractor block is connected to a 2:1 MUX in order to selectively pass the
ng
dividend bit or the difference bit, depending on final carry is present in that row or not. If
the divisor is zero, quotient is zero and remainder is equal to dividend. The maximum
ri
combinational delay is equal to delay in difference generation of two full subtractor
ee
blocks plus carry propagation time in four blocks. So array divider is much faster than
serial divider with increased hardware requirement.
gin
-------------------------**********-------------------------
En
6.Design a 16 bit carry by pass and carry select addersand discuss their Features.
(MAYJUN 2016)
arn
speed up carry chain called a skip chain. This chain defines the distribution of ripple carry
blocks, which compose the skip adder. The addition of two binary digits at stage i, where i is
ww
not equal to 0, of the ripple carry adder depends on the carry in, Ci , which in reality is the
carry out, Ci-1, of the previous stage. Therefore, in order to calculate the sum and the carry
out, Ci+1 , of stage i, it is imperative that the carry in, Ci, be known in advance. It is
interesting to note that in some cases Ci+1 can be calculated without knowledge of Ci.
100
.in
provided equation 4 holds. These would enable to build an adder whose average time of
computation would be proportional to the longest chains of zeros and of different digits of A
ng
and B.
ri
ee
gin
En
arn
w .Le
101
.in
Fig: Concept of carry select adder
ng
sum is correctly output as soon as the carry-in gets there. The time taken to compute the
ri
sum is then avoided which results in a good improvement in speed.
ee
Carry-select adders can be divided into equal or unequal sections. For each section,
the calculation of two sums is accomplished using two 4-bit ripple-carry adders. One of
gin
these adders is fed with a 0 as carry-in whereas the other is fed a 1. Then using a
multiplexer, depending on the real carryout of the previous section, the correct sum is
En
chosen. Similarly, the carryout of the section is computed twice and chosen depending of
the carryout of the previous section. The concept can be expanded to any length for
example a 16-bits carry-select adder can be composed of four sections. Each of these
arn
sections is composed of two 4-bits ripple-carry adders. This is referred as linear expansion.
w .Le
ww
102
.in
ri ng
ee
Fig: Carry save adder for 4 bit number
gin
In this method, for the first 3 numbers a row of full adders are used. Then a row of full
adders is added for each additional number. The final results, in the form of two numbers
En
SUM and CARRY, are then summed up with a carry propagate adder or any other adder
arn
w .Le
ww
103
UNIT V
IMPLEMENTATION STRATEGIES
PART A
1. What is an interconnect?
The last half dozen or so layers define metal wires between the transistors are
called interconnect.
.in
2. Define Manufacturing lead time
It is defined as the time it takes to make am IC not including the design time.
ng
3. Define Flexible blocks.
ri
The predefined logic cells are known as standard cells. The standard cell areas are
ee
called flexible blocks.
gin
4. Define Mega cells
The flexible blocks used in combination with larger predesigned cells, like micro
En
Less cost
Less time
.Le
Reduced Risk
The predefined pattern of transistors on a gate array is the base array. The
base array is made up of a smallest element called primitive cell.
If an ASIC design is completed using cell library we own the mask that are used
to manufacture the ASIC. This is called Customer owned tooling.
104
2. Long lines run across the entire chip to form internal buses
3. Direction connection bypasses the switch matrices and directly connects
adjacent CLB
.in
9. Write the advantages of altera max 5000 and 7000?
ng
1. It uses a fixed no. of connections.
ri
2. Fixed routing delay
ee
3. Simple and improved speed in placement and routing software
gin
10. Write about FPGA routing techniques.
2. Semi-custom ASICs
3. Programmable ASICs
105
12. What is the full custom ASIC design? (May 2008,May 2009)
In a full custom ASIC, an engineer designs some or all of the logic cells,
circuits or layout specifically for one ASIC. It makes sense to take this approach only
if there are no suitable existing cell libraries available that can be used for the entire
design.
.in
13. What is the standard cell-based ASIC design? (May 2008)
ng
A cell-based ASIC (CBIC) uses predesigned logic cells known as standard
cells. The standard cell areas also called fle4xible blocks in a CBIC are built of rows of
ri
standard cells. The ASIC designer defines only the placement of standard cells and
the interconnect in a CBIC. All
ee
the mask layers of a CBIC are customized and are
unique to a particular customer.
gin
14. Differentiate between channeled & channel less gate array.
En
2. The
customized. interconnect uses No predefined areas are set
predefined spaces between rows aside For routing between
.Le
3. Routing
of base cells.is done using the Routing
cells. is done using the area
spaces transist unused. of
4. Logic density is less Logic density is higher.
w
106
Interconnect
.in
A method for programming the basic logic cells and the interconnect.
The core is a regular array of programmable basic logic cells that can
ng
implement combinational as well as sequential logic (flip-flops).
ri
A matrix of programmable interconnect surrounds the basic logic cells.
19. State the features of full custom ASIC Design. (MAY/JUN 2016)
w
Full custom includes all possible logic cells and mask layers that are customized.
Example is microprocessor.
In full custom ASIC an engineer design some or all logic cells ,circuits, or layout
specifically for one ASIC.
107
.in
ri ng
ee
gin
En
arn
w .Le
ww
108
PART B
1. Explain about the classification of ASIC. (Nov 2007, Nov 2008, May 2008, May
2009, May 2010) (MAY/JUN2016)
An ASIC is classified into
.in
Full custom includes all possible logic cells and mask layers that are customized.
ng
These are very expensive to manufacture and design.
Example is microprocessor.
ri
In full custom ASIC an engineer design some or all logic cells ,circuits, or layout
specifically for one ASIC.
ee
gin
Semi Custom ASIC:
In semicustom asic all the logic cells are predesigned and some of the mask layers
En
A cell based ASIC or cell based IC (CBIC) uses predesigned logic cells like AND
gates, OR gates, multiplexers, Flipflops.
The predefined logic cells are known as standard cells. The standard cell areas
w
The flexible blocks used in combination with larger predesigned cells, like micro
controllers and micro processors, these are called mega cells.
109
.in
Advantages:
Less cost
ng
Less time
ri
Reduced Risk ee
Transistor operates at maximum speed.
gin
Disadvantages:
En
Time needed to fabricate all layers for each new design is high.
arn
Gate array (GA) based ASIC has predefined transistors on the silicon wafer. The
predefined pattern of transistors on a gate array is the base array. The base array
is made up of a smallest element called primitive cell.
w
To distinguish this type of gate array from other types of gate array ,this is often
ww
110
Channeled gate array has space between the rows of transistor for wiring.
Features:
.in
ri ng
ee
Fig: Channeled Gate Array
gin
B. Channel less Gate Array
The routing on a channel less gate array uses rows of unused transistors.
arn
Features:
111
This embedded area either contains a different base cell that is more suitable for
building memory cells.
Features:
.in
ri ng
ee
gin
Fig: Structured Gate Array
Advantages:
En
2. Increased performance
arn
3. Lower cost
Disadvantage:
Programmable ASIC:
ww
In which all the all the logic cells are predesigned and none of the mask layers are
customized.
The two types are
112
.in
4. Matrix of large macro cells
ng
Field programmable gate array: (FPGA)
ri
Complex PLD’s are called FPGA.
ee
FPGA are growing rapidly and replace TTL in microelectronic system
gin
Characteristics:
3. Core with regular array of programmable basic logic cells that implement
combinational and sequential logic.
4. Matrix of programmable interconnect surrounds the basic logic cells.
.Le
113
Enter the design into an ASIC design systems ,either using a HDL
orschematic entry
.in
Step 2.Logic synthesis
ng
Step 3: System portioning
ri
Divide a large system into an ASIC sized pieces.
Step 6: Placement:
Step 7: Routing:
ww
Step 8.Extraction:
114
.in
ri ng
-------------------------**********-------------------------
ee
2. b. Explain about ASIC cell library in detail.
gin
Cell library is very important in ASIC design.
En
For MGA and CBIC there are three choices to have the cell library.
arn
If an ASIC design is completed using cell library we own the mask that is used to
manufacture the ASIC. This is called Customer owned tooling. Each cell in an
ASIC cell library contain the following,
1. Physical layout
2. Behavioral model
3. verilog/VHDL model
115
4. Timing model
5. Test strategy
6. Circuit schematic
7. Cell icon
9. Routing model
.in
3. Explain in detail about FPGA Interconnecting Procedure.(MAY/JUN 2016)
ng
complexity of interconnect is determined by programming technology and architecture of
basic logic cell. The raw material used to build interconnect is aluminum based
ri
metallization with sheet resistance ee
.programmable ASIC comes with two layers, three layers or more layers of metal
interconnect.
gin
ACTEL ACT:
En
The channel routing uses dedicated rectangular areas of fixed size within chip
called wiring channel. The horizontal channels run across the chip in the horizontal
.Le
direction.In vertical direction, vertical channels run over the top of the basic logic cells or
logic modules. Capacity of fixed wire channel is equal to the number of tacks it contains.
w
into various length or wire segments. The designer then programs the interconnections
by blowing antiques and making connections between wire segments. The unwanted
connections are left unprogrammed.
116
ACT 1 INTERCONNECT:
.in
ACT 1 routing resource interconnection architecture uses 22 horizontal tracks per
ng
channel for signal routing with three tracks dedicated to VDD ,GND, and the global
clock(GCLK).Four logic module input are available to the channel below the logic module
ri
and four input to the channel above the logic module.
Input stub:
ee
Eight vertical tracks per logic module are available for inputs. This is the input stub.
gin
Output stub:
En
Single logic module output connect to vertical track extends across the two
channel above the module and across the two channels below the module. This is the
output stub.One vertical track per column is a long vertical track (LVT) that spans the
arn
w .Le
ww
117
.in
The ACT 2/3 logic modules can accept five input ,rather than four input for the
ACT1 modules.
ng
The number of tracks per column increases from 13 to 15 in the ACT 2
architecture.
ri
ee
The greatest challenge facing the ACTEL FPGA architecture is the resistance
of polysilicon antifuses.
gin
-------------------------**********-------------------------
XILINX LCA:
XILINX LCA basic logic cells are called the configurable logic block or CLB.
arn
CLB’s are bigger and more complex than the ACTEL logic cells. Xilinx LCA uses
coarse gain architecture. Xilinx CLB contain both combinational logic and flip flops.
.Le
XC 3000 CLB:
XC 3000 CLB which has five logic inputs., a common clock input ,an asynchronous
w
address 32=25
XC 4000 LOGIC BLOCK:
This is a complicated basic logic cell containing 2 four input LUT’S that feed a
118
three input LUT. This has special fast carry logic hardwired between CLB’S
MUX control logic maps four control inputs( c1-c4) into the following four
inputs 1.H1 –LUT input 3. .EC –enable clock
2.DIN –DIRECT IN 4. S/R-set/reset control.
The control inputs( c1-c4 ) is used to control the use of F and G LUT as 32 bits of
SRAM.
.in
ri ng
ee
gin
En
XC 5200 LC contains four input LUT, a flip flop ,and MUX to handle signal
ww
119
All programmable ASIC or FPGA contain a basis logic cell.the basic logi8c cell is
REPLICATED IN A regular array across the chip.
ACTEL ACT has three logic family
1. ACT 1
2. ACT 2
3. ACT 3
.in
ACT 1 logic module:
ng
Logic cells in ACTEL ACT 1 logic family are called logic modules.
ri
ACT 1 Family uses one type of logic modules .logic function is build using an actel
ee
logic module by connecting logic signals to some or all the logic module input and
by connecting any remaining logic module input to VDD AND ground.
gin
En
arn
w .Le
ww
120
A Flipflop with two ACT 1 logic modules require added interconnect and
associated parasitic capacitance to connect the two logic modules. For better
efficiency extra antifuses in the logic module is used to cut down the parasitic
capacitance.
Another way is to use a separate Flipflop module, which reduces flexibility and
reduces layout complexity.
.in
The ACT 2 and ACT 3 architectures uses two different types of logic modules, in
ng
which one is an equivalent of D flip flop.
ri
The ACT 2 C module is similar to the ACT 1 logic module, but is capable of
ee
implementing five input logic function. ACTEL calls its C module a combinational
gin
module even though the module implements combinational logic.
En
arn
.Le
121
.in
ri ng
Fig: Sequential Element configured as positive edge triggered D flip flop
ee
gin
En
arn
w .Le
ww
122
.in
PART A-(10 x 2= 20 marks)
1. State channel length modulation. write down the equation for describing the channel
ng
length modulation effect in NMOS transistor.
2. What is latch up? How to prevent latch up.?
ri
3. Give Elmore delay expression for propagation delay of an inverter..
ee
4. Why single phase dynamic logic structure cannot be caed? Justify
5. Draw the switch level schematic of multiplexer base NMOS latch using NMOS only pass
gin
transistor for multiplexer.
6. What is clocked CMOS register?
En
11. a)i. Describe the equation for source to drain current in the three region of operation
of a MOS transistor and draw the V-I characteristics. (8)
w
ii. Explain in detail about the body effect and its effect in MOS device. (8)
ww
(OR)
ii. Discuss the principle of constant field and lateral scaling. Write the effects of the
b).Y=(D(A+BC))’
(OR)
b. What are the sources of power dissipation in CMOS and discuss various design
technique to reduce power dissipation in CMOS?
(16)
13.a. Explain the operation of master slave based edge triggered register?
.in
(16)
ng
(OR)
ri
(16)
ee
14.a. Design a 16 nbit carry bypass and carry select adder and discuss their features.
gin
(16)
(OR)
En
b. Design 4x4 array multiplier and write the equation for delay. (16)
15 a. With neat sketch explain the CLB ,IOB and programmable interconnect of an FPGA
arn
Device. (16)
(OR)
.Le
124
PART-A
1. State any two differences between CMOS and Bipolar technology.
.in
2. Draw the stick diagram for an n-type enhancement mode transistor.
ng
3. What is latch up problem in CMOS circuits?
4. Give the expressions for rise time and fall time in CMOS inverter circuit.
ri
5. Define the syntax for Architecture in Verilog HDL.
10. List the design steps required for testing in CMOS chip design.
arn
PART-B
11. (a) (i) With neat diagrams explain the steps involved in the p-well process of
.Le
(b) (i) Describe in detail with neat sketches the Twin Tub method of CMOS
ww
fabrication. (8)
(ii) With neat diagram of Latch-up effect in p-well structure, explain Latch-up
problem and the steps involved to overcome it. (8)
12. (a) (i) Derive the pull-up to pull-down ratio for an NMOS inverter driven by another
NMOS inverter. (8)
(ii) Explain in detail the MOS transistor Figure of merit. Obtain an expression for it.
(Or)
(b) (i) Explain Pass Transistor and Transmission gates with neat sketches.
125
(ii) Draw the stick and layout diagrams of an NMOS inverter. (8)
13. (a)(i) With a neat flow chart explain the VLSI design flow. (8)
(ii) Explain the syntax of conditional statements in Verilog HDL with examples. (8)
(Or)
(b)(i) Explain in detail Behavioural and RTL modeling. (8)
(ii) Write the program using Verilog HDL to implement a full adder circuit. (8)
14. (a)(i) Explain Gate Array based ASICs with diagrams. (8)
.in
(ii) With a neat flow chart explain ASIC design flow and the steps involved in the
design. (8)
ng
(Or)
(b)(i) With a block diagram describe Xilinx I/O cell. (8)
ri
(ii) Explain the Actel ACT family interconnect and its routing resources. (8)
ee
15. (a) (i) Explain in detail Boundary-Scan Test. (8)
gin
(ii) Enumerate on physical faults with examples. (8)
(Or)
(b) (i) Explain Built-in Self Test. (8)
En
(ii) Describe the testing techniques at chip level and at system level. (8)
arn
w .Le
ww
126
.in
PART-A
ng
1. Define SSI, MSI, LSI and VLSI.
2. What are the different tools available in a typical CAD tool set?
ri
3. What is meant by “body effect”? ee
4. Draw the schematic diagram of the tristate inverter.
gin
5. What are the different phases of VLSI Design flow?
10. State all the test vectors to test3 input NAND gate.
w
PART-B
(Or)
(Or)
(b) Derive the equation for threshold voltage in PMOS Enhancement transistor.
(16)
127
13. (a) Explain various features of gate level modeling and switch level modeling.
(16)
(Or)
.in
(b) Discuss the features of Channeled Gate Array, Channel less Gate Array and
Structured Gate Array. (16)
ng
15. (a) Write briefly about different test strategies of testing digital circuits. (16)
ri
(Or)
(b) Explain the importance of system level testing techniques.
ee (16)
gin
En
arn
w .Le
ww
128