VLSI
VLSI
6/3/2015 1
Acronym of VLSI
• V -> Very
• L -> Large
• S -> Scale
• I -> Integration
3
Types of Field Effect Transistors
(The Classification)
n-Channel JFET
FET p-Channel
» JFET
JFET
MOSFET (IGFET)
Enhancement Depletion
MOSFET MOSFET
5
Switch Model of NMOS Transistor
| VGS | Gate
Source Drain
(of carriers) (of carriers)
Ron
6
Switch Model of PMOS Transistor
| VGS | Gate
Source Drain
(of carriers) (of carriers)
7
MOS transistors Symbols
D D
G G
S S
NMOS Enhancement NMOS Depletion Chann
el
D D
G G B
S S
8
MOSFET Circuit Symbols
• (g) and (i) are the most
commonly used symbols
in VLSI logic design.
• MOS devices are
symmetric.
• In NMOS, n+ region at
higher voltage is the
drain.
• In PMOS p+ region at
lower voltage is the drain
9
pMOS are 2.5 time slower than
nMOS due to electron and
hole mobilities
50
UNIT 1
Basic processes involved in
fabricating Monolithic ICs
1. Silicon wafer (substrate) preparation
2. Epitaxial growth
3. Oxidation
4. Photolithography
5. Diffusion
6. Ion implantation
7. Metallization
8. Testing
9. Assembly processing & packaging
51
Oxidation
Formation of silicon dioxide layer on the surface of Si wafer
1. protects surface from contaminants
2. forms insulating layer between conductors
3. form barrier to dopants during diffusion or ion implantation
4. grows above and into silicon surface
Dry oxidation
Wet oxidation
52
Oxidation
The silicon wafers are stacked up in a quartz boat & then
inserted into quartz furnace tube. The Si wafers are raised
to a high temperature in the range of 950 to 1150oC & at
the same time, exposed to a gas containing O2 or H2O or
both. The chemical action is
53
Photolithograph
y
• Coat wafer with photoresist
(PR)
• Shine UV light through mask
to selectively expose PR
UV Light
• Use acid to dissolve Mask
exposed
PR Photoresist
– Selective doping
– Selective removal of material
under exposed PR
54
Adding Materials
• Add materials on top of
silicon Added Material
– Polysilicon (e.g. Polysilicon)
– Metal
– Oxide (SiO2) - Insulator
• Methods Silicon
– Chemical deposition
– Sputtering (Metal ions)
– Oxidation
55
Oxide (Si02) - The Key Insulator
• Thin Oxide
– Add using chemical deposition
– Used to form gate insulator & block active areas
• Field Oxide (FOX) - formed by oxidation
– Wet (H20 at 900oC - 1000oC) or Dry (O2 at 1200oC)
– Used to insulate non-active areas
SiO2 Thin Oxide FOX SiN / SiO2
FOX
56
Patterning Materials using
Photolithography
• Add material to wafer
• Coat with photoresist Added Material
(e.g. Polysilicon)
• Selectively remove
photoresist
• Remove exposed
Silicon
material
• Remove remaining
PR
57
Diffusion
• Introduce dopant via epitaxy or
ion implant e.g. Arsenic (N),
Boron (P) Blocking Material
• Allow dopants to diffuse at (Oxide)
high
temperature Diffusion
• Block diffusion in selective areas
using oxide or PR
• Diffusion spreads both vertically, Silicon
horizontally
58
N MOS Fabrication steps
1. Substrate
p
3. ………………………………………
………………………………………
Photoresist
62
UV light
Mask
……………………………………
4. …………………………………………
5. …………………
…………………… Window in
oxide
63
……………………………………………………
……
………… ……
…………………
6. …… …… … … Patterned
………… … … … Poly. (1-2 m)
………… p
On thin oxide
( 800-1000A0 )
7. ………… n+ diffusion
……………………………
……
…… …………
…………
(1 m deep)
64
………… …………
8. …… … … … ……
……
…… … … …… ……
…… …
……………….
…………
… …
………… …… ………… Contact holes
………… ………… ……
……………… …… …… (cuts)
…… …… …………
p
………… …………
9. … … …
……
…… …
…… … …… ……
…… …
…… Patterned
……………….
…
…………
…… …
………… …………
………… ………… …………
……………… …… Metallization
(aluminum
…… ……
p ………… 1 m)
65
CMOS FABRICATION
66
The p-well CMOS fabrication
In all other respects-masking, patterning, and diffusion-the process is similar to
nMOS fabrication. In summary, typical processing steps are:
• Mask 1 - defines the areas in which the deep p-well diffusions are to take place.
•Mask 2 - defines the thinox regions, namely those areas where the thick oxide is to
be stripped and thin oxide grown to accommodate p- and n-transistors and wires.
• Mask 3 - used to pattern the polysilicon layer which is deposited after the thin
oxide.
•Mask 4 - A p-plus mask is now used (to be in effect "Anded" with Mask 2) to define
all areas where p-diffusion is to take place.
•Mask 5 - This is usually performed using the negative form of the p-plus mask and
defines those areas where n-type diffusion is to take place.
• Mask 6 - Contact cuts are now defined.
• Mask 7 - The metal layer pattern is defined by this mask.
Mask 8 - An overall passivation (overglass) layer is now applied and Mask 8 is
needed to define the openings for access to bonding pads.
68
SiO2
1. …………………… … p-well
p
…………n……… (4-5 m)
Polysilicon
… …… … Thin oxide
2.
p and
n polysilicon
69
p-diffusion
P+ mask
… …… … (positive)
3.
p
P+ mask n-diffusion
(negative)
… …… …
4.
p
70
Polysilicon
Oxide
n-diffusion
P-
diffusion
Vin
Vout
VDD VSS
71
Polysilicon
Oxide
n-diffusion
P-
diffusion
Vin
Vout
VDD VSS
72
The n-well Process
• As indicated earlier, although the p-well process is widely used, n-well fabrication
has also gained wide acceptance, initially as a retrofit to nMOS lines.
6/3/201 73
5
The twin-tub-Tub Process
Vout
VDD VSS
Epitaxial
n well p well layer
n substrate
Twin-tub structure
( A logical extension of the p-well and n-
well)
6/3/2015 75
Bi-CMOS
Bipolar compatible CMOS(Bi-CMOS) technology:
Introduced in early 1980s
Combines Bipolar and CMOS logic
6/3/2015 76
Features
The objective of the Bi-CMOS is to combine bipolar and CMOS so as to
exploit the advantages of both the technologies.
Today Bi-CMOS has become one of the dominant technologies used for high
speed, low power and highly functional VLSI circuits.
The process step required for both CMOS and bipolar are almost similar The
The Bi-CMOS gates could be used as an effective way of speeding up the VLSI
circuits.
• Low output drive current (issue when driving large capacitive loads)
79
Bi-CMOS FABRICATION PROCESS
80
npn-BJT Fabrication
81
BJT Processing
1. Implantation of the buried n+
layer
6. p+ ohmic contact
p-substrate
7. Contact etching
3. p+ isolation diffusion
5. Emitter n+ diffusion
6. p+ ohmic contact
7. Contact etching
3. p+ isolation diffusion
5. Emitter n+ diffusion
6. p+ ohmic contact
n epi layer
7. Contact etching
n+ buried layer
8. Metal deposition and etching
p+ isolation layer
p+ isolation layer
1. Implantation of the buried n+ layer
3. p+ isolation diffusion
5. Emitter n+ diffusion
6. p+ ohmic contact
7. Contact etching
n+ buried layer
8. Metal deposition and etching
p+ isolation layer
p+ isolation layer
1. Implantation of the buried n+ layer
3. p+ isolation diffusion
5. Emitter n+ diffusion
6. p+ ohmic contact
p-base laye r
7. Contact etching
n+ buried layer
8. Metal deposition and etching
p+ isolation layer
p+ isolation layer
1. Implantation of the buried n+ layer
3. p+ isolation diffusion
5. Emitter n+ diffusion
n+ layer n+ layer
6. p+ ohmic contact
p-base layer
7. Contact etching
n+ buried layer
8. Metal deposition and etching
p+ isolation layer
p+ isolation layer
1. Implantation of the buried n+ layer
3. p+ isolation diffusion
5. Emitter n+ diffusion
n+ layer p+ layer n+ layer
6. p+ ohmic contact
p-base layer
7. Contact etching
n+ buried layer
8. Metal deposition and etching
5. Emitter n+ diffusion
n+ layer p+ layer n+ layer
6. p+ ohmic contact
p-base layer
7. Contact etching
n+ buried layer
8. Metal deposition and etching
5. Emitter n+ diffusion
n+ layer p+ layer n+ layer
6. p+ ohmic contact
p-base layer
7. Contact etching
n+ buried layer
8. Metal deposition and etching
5. Emitter n+ diffusion
n+ layer p+ layer n+ layer
6. p+ ohmic contact
p-base layer
7. Contact etching
n+ buried layer
8. Metal deposition and etching
6/3/2015 92
Doping Profiles in a BJT
6/3/2015 94
6/3/2015 95
BICMOS STRUCTURE
S G D S G D C B E
N-Well (Collector)
P-
EPITAXY N Plus Buried Layer
P-SUBSTRATE
6/3/2015 96
P-SUBSTRATE IS TAKEN
P-SUBSTRATE
P-SUBSTRATE
6/3/2015 97
A WINDOW IS OPENED THROUGH OXIDE LAYER
P-SUBSTRATE
P-SUBSTRATE
6/3/2015 98
P-EPITAXY LAYER IS GROWN ON THE ENTIRE SURFACE
P-
EPITAXY
N Plus Buried Layer
P-SUBSTRATE
6/3/2015
THE ENTIRE SURFACE IS COVERED WITH OXIDE LAYER AND TWO WINDOWS
ARE OPENED THROUGH THE OXIDE LAYER
P-
EPITAXY
N Plus Buried Layer
P-SUBSTRATE
6/3/2015
THROUGH THE TWO WINDOWS N-TYPE IMPURITIES ARE DIFFUSED TO
FORM N-WELLS
N-Well
N-Well (Collector)
P-
EPITAXY
N Plus Buried Layer
P-SUBSTRATE
6/3/2015
THREE WINDOWS ARE OPENED THROUGH THE OXIDE LAYER , IN THESE
THREE WINDOWS THREE ACTIVE DEVICES NMOS,PMOS AND NPN BJT
ARE FORMED
N-Well
N-Well (Collector)
P-
EPITAXY
N Plus Buried Layer
P-SUBSTRATE
6/3/2015
THE ENTIRE SURFACE IS COVERED WITH THINOX AND POLYSILICON
AND ARE PATTERNED TO FORM THE GATE TERMINALS OF THE NMOS
AND PMOS
N-Well
N-Well (Collector)
P-
EPITAXY
N Plus Buried Layer
P-SUBSTRATE
6/3/2015
THROUGH THE 3RD WINDOW THE P-IMPURITIES ARE MODERATELY
DOPED TO FORM THE BASE TERMINAL OF BJT
N-WELL ACTS LIKE THE COLLECTOR TERMINAL
N-Well P-Base
N-Well (Collector)
P-
EPITAXY
N Plus Buried Layer
P-SUBSTRATE
6/3/2015
THE N-TYPE IMPURITES ARE HEAVILY DOPED TO FORM
N- N- N-Plus
Diff Diff
Emitter
N-Well P-Base
N-Well (Collector)
P-
EPITAXY
N Plus Buried Layer
P-SUBSTRATE
6/3/2015
THE P-TYPE IMPURITES ARE HEAVILY DOPED TO FORM
S G D S G D C B E
N-Well (Collector)
P-
EPITAXY
N Plus Buried Layer
P-SUBSTRATE
6/3/2015
Resistors & Capacitors
fabrication
110
Ohm’s Law
V IR
• Current I in terms of Jn I JA
• Voltage V in terms of electric field JtW
I JA JtW
tW E
EV/L
t
I JA JtW V
WL
– Result for R L 1 L
R R
Wt t
W
6/3/2015 111
Sheet Resistance (Rs)
• IC resistors have a specified thickness – not
under the control of the circuit designer
• Eliminate t by absorbing it into a new
parameter: the sheet resistance
(Rs)
R
L L
R
L
Wt sq
t W W
“Number of “quares”
6/3/2015 112
ELECTRON AND HOLE MOBILITY
Carrier Mobilities versus Doping Concentration
1.60E+03
1.40E+03
Carrier Mobility (cm2/V-sec)
1.20E+03
1.00E+03
mu_n
8.00E+02
6.00E+02 mu_p
4.00E+02
2.00E+02
0.00E+00
1.0
0E
+1
4
1.0
6/3/2015
0E 113
DIFFUSED RESISTOR
Aluminum contacts
Silicon dioxide
The n-type wafer is always biased positive with respect to the p-type
diffused region. This ensures that the pn junction that is formed is
in reverse bias, and there is no current leaking to the substrate.
Current will flow through the diffused resistor from one contact to
the other. The I-V characteristic follows Ohm’s Law: I = V/R
6/3/2015 114
Layout/Mask Layer 1 - Diffusion
(green)
Top View
W
Resistor
L termination
Side View
P type Diffusion
N wafer
The resistance, R = rhos (L/W)
The sheet resistance rhos, is the resistance of each square If rhos is 100 ohms per
L/W is the number of ‘squares’ long the resistor is said to square,
be. R = 500 ohms
5 squares in this case
6/3/2015 115
IC Fabrication: Ion Implantation
oxide
• Si substrate (p-type)
P-type Si Substrate
6/3/2015 117
Poly Film Resistor
Polysilicon Film (N+ or P+ type) Oxide
(heavily
doped) material on top of the oxide
• The poly will have a certain resistance (say 10 118
Diffused Resistor
-Dope a region of the silicon (n-type or p-type) to an acceptable
NA or ND.
-Then place a contact at each end of the diffusion region.
-The diffusion region will have a given resistivity specified in
"Ohms / “quare“
-Then alter the geometry (L/W ration) to get the desired
resistance - typically these have a sheet resistance between 100
to 200 ohms/sq - to save space
-These are laid out using a serpentine geometry
6/3/2015 119
-The interesting thing about the l/W ratio is that if l=W, then the
shape is a square and R=Rs, this is true no matter how big the
square is.
-In fact, the l/W ratio is actually the number of squares in a given
trace geometry - We typically just count the squares and use:
R= Rs*(no.of.squares)
6/3/2015
-Another waPyotolfyabsriiclaitce oa rnesi Rtoer iss
-Polysilicon has a high resistivity prior to Ion Implantation
-Usei undoped
s t o r
Polysilicon to create a high value resistor
t o uBefore
s e Ion
P olysilicon.
Implantation : Rs = 10M Ohms/Square
After Ion Implantation : Rs = 20 to 40 Ohms/Square
-Typically don't even need 1 square to get our resistively so we
don't need to do a serpentine layout
-One drawback is that the resistance can vary widely with
process when using less than 1 square to get a resistor in the k-
Ohms range.
- These are typically used when we just want a BIG resistor and
don't care about the exact value
6/3/2015
Metal Resistor
-Metal can also be used for very small resistors
6/3/2015
Cross sections of resistors of various types available from a typical n-well CMOS process.
6/3/2015
Capacitors
• Composed of two conductive plates separated
by an insulator (or dielectric).
– Commonly illustrated as two parallel metal plates
separated by a distance, d.
– C = e A/d
– where e = er eo
– er is the relative dielectric constant
– eo is the vacuum permittivity
6/3/2015
CMOS Capacitors
-There are 3 common ways to make a capacitor
1) MOS Capacitor:
-simply create a MOS structure where the Gate (Metal) terminal
is one terminal and the Body (Semiconductor) terminal is
Ground
- while this is easy to implement, the capacitance changes with
the bias voltage (i.e., VG) due to the depletion and inversion
which occurs
6/3/2015
MIM Capacitor
-"Metal Insulator Metal"-this is simply a parallel plate capacitor
using two metals and an insulator
-This type of capacitor is created using an extra process step that
puts in an additional metal layer that can be very close to one of
the other metal layers to get a smaller plate-to-plate separation
-Since the plates are made of metal, the capacitance doesn't
change with bias voltage-these capacitors are not as large as MOS
capacitors
6/3/2015
Interpoly and MOS capacitors in an n-well CMOS process.
6/3/2015
6/3/2015
UNIT 1B
BASIC ELECTRICAL PROPERTIES
Topics
• Basic electrical properties of MOS and BiCMOS
circuits:
• Ids-Vds relationships
• MOS transistor threshold voltage, gm, gds
• figure of merit wo
• pass transistor
• NMOS inverter
• Various pull-ups
• CMOS inverter analysis and design
• BiCMOS inverters
6/3/2015
MOSFET I-V Characteristics
I-V Plots, Channel Length Modulation
-4
x 10
– Saturation 6
ID (A)
3 Quadratic
VDS. Not sure! So V
Relationshi
VDS = VGS - VT
we consider the 2 VGS= 1.5
p
effect of channel 1 V
VGS= 1.0
length 0
V
modulation.
0 0.5 1 1.5
VDS(V) 2
2.5
130
MOSFET I-V Characteristics
Channel Length Modulation
• Channel Length
Modulation
– With pinch-off the VS=0
VDS>VDSAT
channel at the point VGS>VT0 n C ox W 2
2 L(V GS V T 0 )
y such that ID
( SAT )
1 1 1 1 1 1 1 1 1 1 VDS
L ΔL L
– λ: channel length L' L ΔL LLL
ΔL
1
L L 1 V
DS
modulation coefficient
– ID(SAT) can be rewritten as
G
S
M2 D
VSB
G
S
–
–
– V“B(M1) = VD“(M2) ≠ 0. Thus, VT0 in the M1 equation is replaced
by VT = VT(VSB) as developed in the threshold voltage section.
MOSFET I-V Characteristics
Substrate Bias Effect (Cont.)
• The general form of ID can be written as
• ID = f (VGS,VDS,VSB)
• which due to the body effect term is
non- linear and more difficult to handle in
manual calculations
MOSFET I-V Characteristics
Summary of Analytical Equations
– The voltage directions and relationships for the three
modes of pMOS are in contrast to those of nMOS.
D nMOS
G B Mode ID Voltage Range
IDVSB VDS
Cut-off 0 VGS<VT
VGS
S
Linear (µnCox/2)(W/L)[2(VGS- VGSVT , VDS< VGS
VT)VDS-VDS2] -VT
Saturatio (µnCox/2)(W/L)(VGS- VGS VT , VDS
S
VGS
n VT)2(1+hVDS) VGS -VT
VSB
G
B
VDS pMOS
ID
D
Cut-off 0 VGS>VT
Linear (µnCox/2)(W/L)[2(VGS- VGS VT , VDS>
VT)VDS-VDS2] VGS -VT
Saturatio (µ C /2)(W/L)(V -
Pass-Transistor Logic Circuits (1)
A simple approach for implementing logic functions utilizes series and
parallel combinations of switches that are controlled by input variables to
connect the input and output nodes.
136
Pass-Transistor Logic Circuits (2)
An essential requirement in the design of pass-transistor logic is
ensuring that every circuit node has at all times a low-resistance path to
VDD or to ground.
If B is high, S1 closes and Y=A.
Y will be VDD if A is high or ground if A
is low.
6/3/2015 137
(b) through switch S2.
Pass-Transistor Logic Circuits (3)
The problem can be easily solved by establishing for node Y a low-
resistance path that is activated when B goes low.
low-resistance
140
nMOS Cutoff
• No channel
• Ids = 0
Vgs = 0 Vgd
+ g +
- -
s d
n+ n+
p-type body
b
nMOS Linear
• Channel forms
• Current flows from d to Vgs
+ g +
Vgd = Vgs
>
s– e- from s to d
Vt
-
s
-
d
n+ n+ Vds = 0
• V= gate
Vg
polysilicon + +
gate Cg
source Vgs Vgd drain
W
Vs - - Vd
channel
tox n+ - + n+
Vds
L SiO2 gate oxide
n+ n+ (good insulator, ox
= 3.9) p-type body
p-type body
Channel Charge
• MOS structure looks like parallel plate
capacitor while operating in inversion
– Gate – oxide – channel
• Qchannel = CV
• C = Cg = eoxWL/tox = CoxWL Cox = ox / tox
150
Carrier velocity
• Charge is carried by e-
• Carrier velocity v proportional to lateral E-field
between source and drain
• v = mE m called mobility
• E = Vds/L
• Time for carrier to cross channel:
–t=
Carrier velocity
• Charge is carried by e-
• Carrier velocity v proportional to lateral E-field
between source and drain
• v = mE m called mobility
• E = Vds/L
• Time for carrier to cross channel:
–t=L/v
nMOS Linear I-V
• Now we know
– How much charge Qchannel is in the channel
– How much time t each carrier takes to cross
I ds
nMOS Linear I-V
• Now we know
– How much charge Qchannel is in the channel
– How much time t each carrier takes to cross
I ds Qchannel
t
nMOS Linear I-V
• Now we know
– How much charge Qchannel is in the channel
– HoQchannel
w much time t each carrier takes to
I ds
cross t
W Vgs t Vds
ox
L V ds
C 2
W
V V =
V V
gs t ds
V ds ox
2 C
L
nMOS Saturation I-V
• If Vgd < Vt, channel pinches off near drain
– When Vds > Vdsat = Vgs – Vt
• Now drain voltage no longer increases current
I ds
nMOS Saturation I-V
• If Vgd < Vt, channel pinches off near drain
– When Vds > Vdsat = Vgs – Vt
• Now drain voltage no longer increases current
I ds gs V t Vdsat dsat
2
V
V
nMOS Saturation I-V
• If Vgd < Vt, channel pinches off near drain
– When Vds > Vdsat = Vgs – Vt
• Now drain voltage no longer increases current
I ds gs V t Vdsat dsat
V 2
t
2
V
gs
2 V V
nMOS I-V Summary
• Shockley 1st order transistor models
0 cutoff
Vgs Vt
2
– m = 350 1.5 Vgs = 4
cm2/V*s
Ids (mA)
1
– Vt = 0.7 V 0.5
Vgs = 3
Vgs = 2
• Plot Ids vs. Vds 0
0 1 2
Vgs = 1
3 4 5
– Vgs = 0, 1, 2, 3, 4, 5 Vds
100 L 120
6/3/2015 C 108
8.8510 A /V
L
160
L
MOS Transistor Basics
Two Terminal Structure
• Two terminal structure (p-substrate): The MOS
capacitor VG
A D I
VB
G(ate)
S(ource) D(rain)
n+ L n+
B
G G G G G
• S S D
N-channel (fSor
arrow or add P-channel, reversSe
bubbles)
•
P-
channel
• Enhancement mode: no conducting
channel exists at VGS = 0
• Depletion mode: a conducting
channel exists at VGS = 0
MOS Transistor Basics
Four Terminal Structure (Continued)
• Source and drain identification
D
VDS
B
G
VSB
VGS S
Threshold Voltage Components
• Consider the prior 3-D drawing: Set VS=0, VDS=0,
and VSB=0.
– Increase VGS until the channel is inverted. Then a
conducting channel is formed and the depletion
region thickness (depth) is maximum as is the surface
potential.
– The value of VGS needed to cause surface inversion
(channel creation) is the threshold voltage VT0. The 0
refers to VSB=0.
– VGS< VT0: no channel implies no current flow possible.
With VGS> VT0, existence the channel implies possible
current flow.
Threshold Voltage Components (Cont.)
• GC work function difference between gate and
channel material which is the built-in voltage that
must be offset by voltage applied to flatten the
bands at the surface.
• Apply voltage to achieve surface inversion -2F
• Additional voltage must be applied to offset the
depletion region charge due to the acceptor ions. At Q
2q N A S i 2 F V SB
inversion, this charge with VSB=0 is QB0= Q0.
• For VSB non-zero,
Q
ox
C ox
Threshold Voltage Components (Cont.)
• These components together give:
V T GC 2F QB Qox
• For VSB=0, VT0 has QB replaced by QB0. This
Cox ox
gives a relationship between VT and VT0
which is:
C VV T
0
Q Q
T
C
B
ox
B0
6/3/2015 170
Threshold Voltage Components (Cont.)
• The final expression for VT0 and VT
are 2q N A S i
C ox
1 0 Vo
0 1
Vss
Vdd
Basic Inverter: Transistor with source
connected to ground and a load resistor
connected from the drain to the positive
Pull-Up Supply rail
R
V0 Vt
Vdd D
Vin
S
Non-zero output
Vss
Vi
Ids
Vgs=0.2VDD
Ids
Vgs=0
Vgs=-0.2 VDD
Vgs=-0.4 VDD
Vgs=-0.6VDD
Vds VDD –Vds
Vin
Vgs=VDD
VDD
Ids Vgs=0.8VDD
Vgs=0.6 VDD
Vgs=0.4 VDD
Vgs=0.2VDD
Vds Vo
VDD VDD
Decreasing
Vin Zpu/Zpd
VDD
Increasing
Zpu/Zpd
Vo
VDD
Vinv
V0
Vdd D
Vt (pull up) Vin
S
Non zero output
Vss
180
Assume equal margins around inverter; Vinv = 0.5 Vdd
Assume both transistors in saturation, therefore: Ids = K (W/L) (Vgs – Vt)2/2
Convention Z = L/W
A B C
Vin1 Vout2
It is often the case that two inverters are connected via a series of switches (Pass Transistors)
We are concerned that connection of transistors in series will degrade the logic levels into
Inverter 2. The driven inverter can be designed to deal with this. (Zpu/Zpd >= 8/1)
Complimentary Transistor Pull – Up (CMOS)
Vdd
P on N on
Vin Vo
N off P off
Both On
Vin
Vss Vdd
Vss
Logic 0 Logic 1
Vout Vtn Vtp
1: Logic 0 : p on ; n off
P on N on
N off P 5: Logic 1: p off ; n on
Both On off
2: Vin > Vtn.
Vdsn large – n in saturation
Vdsp small – p in resistive
Small current from Vdd to Vss
Vin
4: same as 2 except reversed p and n
Vss Vdd
3: Both transistors are in saturation
Large instantaneous current flows
1 2 3 4 5
CMOS INVERTER CHARACTERISTICS
p
If n = p and Vtp = –Vtn
I p 2 Vin VDD
At logicVthreshold,
tp In = Ip 2 Vin VDD
2
n p
2 in
V
Vtn
2
2
Vin VDD Vtp
pW p W
n n
2 Lp Ln
n
Vin Vtn
2 2
p
Vin V DD tpV
Mobilities are unequal : µn = 2.5 µp
n
Vin VDD Vtp
p Vin Vtn Z = L/W
n n
Vin 1 Vtn VDD Vtp
p p Zpu /Z = 2.5:1 for a symmetrical CMOS inverter
pd
CMOS Inverter Characteristics
• No current flow for either logical 1 or logical 0
inputs
• Full logical 1 and 0 levels are presented at the
output
• For devices of similar dimensions the p –
channel is slower than the n – channel device
CMOS Inverter VTC
NMOS off
PMOS res
N
M
O
S
2.
s NMOS sat
(V)
52
a PMOS sat
1.
Vout
t
5
1 P
0. M
5 O NMOS res
PMOS sat NMOS res
0 0 S P2M.5OS
0. 1 1.5 2
5 off
r
e
s
Vin (V)
Cutoff Linear Saturation
pMOS Vin -VDD= VGS> VT Vin -VDD=VGS< VT Vin -VDD=VGS> VT
VDD
G S
Regions of operations D
For nMOS and pMOS
In CMOS inverter Vin D Vout
G
CL
S
Impact of Process Variation
Good PMOS
(V)
2. Bad NMOS
Vout
5
2 Nominal
1.
1Bad PMOS
5
0.5Good NMOS
0
0 1 1. 2 2.
0.5 5 5
Vin (V)
190
NMOS Inverter
5V
• When VIN change When s V to logic
is logic
IN 1, 0, itsransistor gets
OUT
5V
V
logic 0.
cutoff.
R ID goesConstant
to 0. nonzero current R
• ‘esistor voltaglfeowgs ohrteousghtoartznsesirot o.r.VOUT
D V Power is used even though
“pulled
uI
D 5 V. no
OUT
new computation is being D I =0 D V OUT
VIN 0V performed.
+ VIN 5V
+
5 Vp
=”5/tR
VDS 0V VDS
_
o _
PMOS Inverter
• When V5INV changes to logic 1, transistor
cutoff.
gets5 V ID goes to 0.
V• ‘ esisto r voltage goes to zero.VOUV T
IN
- I N
-
down”Vt o 0V
“pulled D S
OUT
VDS
0V + When V is logic 0, V is + VOUT
IN OUT
5V
logic 1.
V.
ID = -5/R 5V
Constant nonzero current ID = 0 0V
flows through transistor.
R Power is used even though R
no new computation is being
performed.
Analysis of CMOS Inverter
• We can foVloDDwL(otghci e1)same procedure to solve
currents aSnd voltages in the CMOS inverter as
for
we did for the single NMOS and PMOS
circuits. D V OUT
• Remember, D
now we have two transistors so
VIN we write two I-V relationships and have twice
the number of variables.
• We can roSughly analyze the CMOS inverter
NMOS si g“rpaulp-ldhowc
i nadlelvyci .e”
PMOS is “pull-up device”
Each shuts off when not pulling
CMOS Inverter Analysis
VDD (Logic 1) VGS(n) = VIN
S
VGS(p) = VIN – VDD
VIN = VGS(n) =
0.9 V
VDS(n)
VDD
CMOS Analysis
ID(n)
As VIN goes up, VGS(n) gets bigger NMOS I-V curve
and VGS(p) gets less negative.
PMOS I-V curve
(written in terms of
NMOS variables)
VIN = VGS(n) =
1.5 V
VDS(n)
VDD
CMOS Analysis
ID(n)
As VIN goes up, VGS(n) gets bigger NMOS I-V curve
and VGS(p) gets less negative.
PMOS I-V curve
(written in terms of
NMOS variables)
VIN = VGS(n) =
2.0 V
VDS(n)
VDD
CMOS Analysis
ID(n)
As VIN goes up, VGS(n) gets bigger NMOS I-V curve
and VGS(p) gets less negative.
PMOS I-V curve
(written in terms of
NMOS variables)
VIN = VGS(n) =
2.5 V
VDS(n)
VDD
CMOS Analysis
ID(n)
As VIN goes up, VGS(n) gets bigger NMOS I-V curve
and VGS(p) gets less negative.
PMOS I-V curve
(written in terms of
NMOS variables)
VIN = VGS(n) =
3.0 V
VDS(n)
VDD
CMOS Analysis
ID(n)
As VIN goes up, VGS(n) gets bigger NMOS I-V curve
and VGS(p) gets less negative.
PMOS I-V curve
(written in terms of
NMOS variables)
VIN = VGS(n) =
3.5 V
VDS(n)
VDD
CMOS Analysis
ID(n)
As VIN goes up, VGS(n) gets bigger NMOS I-V curve
and VGS(p) gets less negative.
PMOS I-V curve
(written in terms of
NMOS variables)
VIN = VGS(n) =
4.1 V
VDS(n)
VDD
CMOS Inverter VOUT vs. VIN
VOUT
VIN
A B C D E
VDD
6/3/2015 203
Important Points
• No ID current flow in Regions A and E if nothing attached to
output; current flows only during logic transition
• If another inverter (or other CMOS logic) attached to
V
o utput, transistor gateV t erminals
DD D D
tranV s i t ion S
O U T 2
S
D D
VIN
6/3/2015
S S 204
Impact of Process Variation
Good PMOS
(V)
2. Bad NMOS
Vout
5
2 Nominal
1.
1Bad PMOS
5
0.5Good NMOS
0
0 1 1. 2 2.
0.5 5 5
Vin (V)
0
VDD
Vin
Unit II
VLSI CIRCUIT DESIGN PROCESSES
Topics
• VLSI design flow
• MOS layers
• Stick diagrams
• Design Rules and Layout
• 2 um CMOS design rules for wires
• Contacts and Transistors
• Layout diagrams for NMOS and CMOS inverters
and gates
• Scaling of MOS circuits
VLSI Design of approach of IC
26/038/2
015
MOS Layers
• p-substrate
• n-well
• n+
• p+
• Gate oxide (thin oxide)
• Gate (polycilicon)
• Field Oxide
– Insulated glass
– Provide electrical isolation
209
Stick diagrams
Encodings for a simple single metal nMOS process
Metal 1 N
BLUE
M
219
Stick Diagrams
Metal
poly
ndif
Can also draw
f in shades of
Buried Contact gray/line style.
pdif
f
Contact Cut
Stick Diagrams
• VLSI design aims to translate circuit concepts
onto silicon.
• Stick diagrams are a means of capturing
topography and layer information using simple
diagrams.
• Stick diagrams convey layer information through
colour codes (or monochrome encoding).
• Acts as an interface between symbolic circuit and
the actual layout.
Stick Diagrams
Dep
Vout
Enh Vin
0V
0V
220
NMOS-NAND
NMOS-NOR
NMOS EX-OR
NMOS EX-NOR
PMOS-INVERTER
PMOS NAND
PMOS-NOR
Sticks design CMOS NAND:
• Start with NAND gate:
NAND sticks
VDD
a
out
VSS
Stick Diagram - Example A
OUT
B
NOR Gate
230
Stick Diagram - Example
Power
A Out
Ground
2 I/P OR GATE
2 I/P AND
Y=(AB+CD)’
Y=(AB+CD)’
“TICK
Design Rules for Layout
• Design rules are a set of geometrical
specifications that dictate the design of the layout
masks
• A design rule set provides numerical values
– For minimum dimensions
– For minimum line spacings
• Design rules must be followed to insure
functional structures on the fabricated chip
• Design rules change with technological advances
(www.mosis.org)
Silicon Foundry
• A standard
• A foundry allows designers to submit designs
using a state-of-the-art process
• Each foundry state simpler set of design rules
called lambda design rules
• All widths, spacings, and distances are written in
the form
– Value = m
– TSMC (Thailand Semiconductor Manufacturing
Corporation)
Design Rules Classification
• Minimum width
• Minimum spacing
• Surround
• Extension
240
Physical Limitations
• Line width limitation of an imaging system
– The reticle shadow projected on the photoresist
does not have sharp edges due to optical
diffraction
• Etching process problem
– Undercutting of the resist due to lateral etching
decreases the resolution
Etching Process Problem
Vertical etching
Lateral
Etching
Substrate Substrate
Isotropic etch
Depletion Region
• If depletion regions of adjacent pn junctions
touch, then
– The current blocking characteristics are altered
– Current can flow between the two
Spacing
n+ n+
Substrate
Polysilic Aluminu
on m
Design rules and Layout
• Why we use design rules?
– Interface between designer and process engineer
– Guidelines for constructing process masks
Design Rules
Minimum length or width of a feature on a layer is 2
Why?
To allow for shape contraction
Minimum separation of features on a layer is 2
Why?
To ensure adequate continuity of the intervening
materials.
Design
Rules
Minimum width of PolySi and diffusion line 2
Minimum width of Metal line 3 as metal lines run over a more uneven
surface than other conducting layers to ensure their continuity
Metal
Diffusion
Polysilicon
Design
Rules
PolySi – PolySi space 2
Metal - Metal space 2
Diffusion – Diffusion 3 To avoid the possibility of their associated
regions overlapping and conducting current
Metal
Diffusion
Polysilicon
250
Design
Rules
Diffusion – PolySi To prevent the lines overlapping to form
unwanted capacitor
Metal lines can pass over both diffusion and polySi without
electrical effect. Where no separation is specified, metal lines
can overlap or cross
Metal
Diffusion
Polysilicon
Metal Vs PolySi/Diffusion
Polysilicon
Review
:
poly-poly spacing 2
diff-diff spacing 3
(depletion regions tend to spread outward)
metal-metal spacing 2
diff-poly spacing
Note
• Two Features on different mask layers can be
misaligned by a maximum of 2l on the wafer.
• If the overlap of these two different mask
layers can be catastrophic to the design, they
must be separated by at least 2l
• If the overlap is just undesirable, they must
be separated by at least l
When a transistor is formed?
Design rules
min. line width of polySi and diffusion 2
drain and source have min. length and width of 2
And
PolySi extends in the gate region…
diffusion
short
• Diffusion Problems
no overlap overlap
Depletion Transistor
We need depletion implant
2
Butting Contact
Advantage:
No buried contact mask required and
avoids associated processing.
Butting Contact
Problem: Metal descending the hole has a tendency to
fracture at the polySi corner, causing an open circuit.
Metal
Insulating
Oxide
n+ n+
260
Buried Contact
2
Contact Area
6/3/2015
2 261
Buried Contact
The buried contact window surrounds this contact by in all
directions to avoid any part of this area forming a transistor.
Buried Contact
Here gate length is depend upon the alignment of the
buried contact mask relative to the polySi and
therefore vary by .
PolySi
Channel length
2
Buried contact
2
Diffusion
Contact Cut
Metal connects to polySi/diffusion by contact cut.
Contact area: 22
Metal and polySi or diffusion must overlap this contact area
by so that the two desired conductors encompass the contact
area despite any mis-alignment between conducting layers
and the contact hole
4
Contact Cut
Contact cut – any gate: 2 apart
Why? No contact to any part of the gate.
4
2
Contact Cut
2
Rules for CMOS layout
Similar to those for NMOS except No
1. Depletion implant
2. Buried contact
Additional rules
1. Definition of n-well area
2. Threshold implant of two types of transistor
3. Definition of source and drains regions for the
NMOS and PMOS.
Rules for CMOS layout
2
2
270
Rules for CMOS layout
The p+ diffusion mask
defines the areas to
2
receive a p+ diffusion.
It is coincident with the
threshold mask 2
surrounding the PMOS
transistor but excludes
the n-well region to be
connected to the supply.
Rules for CMOS layout
A p+ diffusion is required to effect the ground connection
to the substrate. Thus mask also defines this substrate
region. It surrounds the conducting material of this
contact by
4
Rules for CMOS layout
A•B A B F
A 0 0 1
0 1 1
B
1 0 1
1 1 0
1 0 0
1 1 0
275
PMOS Gate construction
•PMOS devices in parallel implement a NAND function
A B F
0 0 1
A B
0 1 1
1 0 1
A•B 1 1 0
0 0 1
B
0 1 0
A
1 0 0
A+B 1 1 0
276
Parasitics and Performance
• Consider the
following layout:
a
• What is the impact
on performance
of
parasitics b
c
– At point 'a' (VDD rail)?
– At point 'b' (input)?
– At Point 'c' (output)?
Parasitics and Performance
• a - power supply
connections
– capacitance - no a
effect on delay
– resistance - increases b
delay c
– minimize by reducing
difffusion length
• minimize using
parallel vias
Parasitics and Performance
• b - gate input
– capacitance
increases delay on a
previous stage (often
transistor gates
b
dominate) c
– resistance increases
delay on previous
stage
Parasitics and Performance
• c - gate output
– resistance, capacitance
increase delay
– Resistance & capacitance a
"near" to output causes
additional delay
b
c
280
Driving Large Loads
• Off-chip loads, long wires, etc. have high capacitance
• Increasing transistor size increases driving ability
(and speed), but in turn increases gate capacitance
• Solution: stages of progressively larger transistors
– Use nopt = ln(Cbig/Cg).
– Scale by a factor of a=e
Summary: Static CMOS
• Advantages
– High Noise Margins (VOH=VDD, VOL=GND)
– No static power consumption (except for leakage)
– Comparable rise and fall times (with proper sizing)
– Robust and easy to use
• Disadvantages
– Large transistor counts (2N transistors for N inputs)
• Larger area
• More parasitic loading (2 transistor gates on each input)
– Pullup issues
• Lower driving capability of P transistors
• Series connections especially problematic
• Sizing helps, but increases loading on gate inputs
Alternative gate circuits
to Static CMOS
• Switch Logic
• nmos
• Pseudo-nmos
• DCVS Logic
• Dynamic Logic (Domino Logic)
Switch Logic
• Key idea: use transistors as switches
• Concern: switches are bidirectional
A B
AND
OR
Switch Logic - Pass Transistors
• Use n-transistor as “switches”
IN: OUT:
• “Threshold problem” VDD-Vtn
A
A
Switch Logic Example - 2-1 MUX
IN
Charge Sharing
• Consider transmission gates in series
– Each node has parasitic capacitances
– Problems occur when inputs change to
redistribute charge
– Solution: design network so there is always a path
from VDD or GND to output
Aside: Transmission Gates in Analog
• Transmission Gates
work with analog values, too!
• Example:
Voltage-Scaling D/A Converter
NMOS Logic
• Used before CMOS was widely
available
• Uses only n transistors
– Normal n transistors in pull- Passive Pullup Device:
down network depletion Mode
n-transistor (Vt < 0)
– depletion-mode n transistor
(Vt < 0) used for pull-up OUT
Pulldown
– "ratioed logic" required Network
• Tradeoffs:
– Simpler processing
– Smaller gates
– higher power!
– Additional design
considerations
for ratioed logic
290
Pseudo-nmos Logic
• Same idea, as nmos, but use p-
transistor for pullup
• "ratioed logic" required for
proper design (more Passive Pullup Device:
about this next) P-Transistor
• Tradeoffs:
OUT
– Fewer transistors -> smaller Pulldown
gates, esp. for large number Network
of inputs
– less capacitative load on
gates
that drive inputs
– larger power consumption
– less noise margin (VOL > 0)
– additional design
considerations due to ratioed
logic
Rationed Logic for Pseudo-nmos
• Approach:
– Assume VOUT=VOL =0.25*VDD
– Assume 1 pulldown transistor is on
OUT
– Equate currents in p, n transistors Pulldown
Idp
Network
– Solve for ratio between sizes of p, n Idn
transistors to get these conditions
– Further necessary
calculations series for
connections
W
1 I pnn
Idn
2 k' n L n V
gs,n
tn V 2 1
Wp
2 k'p L p 2V
gs,p tp
V
Vds,p Vds,p
2 (EQ 3 21)
Wp
Lp (EQ 3 22) Assumin g V DD
Wn 3.9
Ln 3.3V
DCVS Logic
• DCVS - Differential
Cascode Voltage Switch
• Differential inputs,
outputs
• Two pulldown
networks OUT OUT’
• Tradeoffs A
OUT OUT’
A’
B’
– Lower capacitative loading B
Pulldown Pulldown
C’
than static CMOS Network Network
Precharge
• Control - precharge clock f Signal Pulldown
Storage
Capacitance
Network
B
A
C
Precharge Evaluate Precharge
Domino Logic
• Key idea: dynamic gate + inverter
• Cascaded gates –
“monotonically increasing” CS
Pulldown
Network
B
C
in4
x1
x2
x3
Domino Logic Tradeoffs
• Fewer transistors -> smaller gates
• Lower power consumption than pseudo-nmos
• Clocking required
• Logic not complete (AND, OR, but no NOT)
More Techniques for Saving Power
• Reduce VDD (tradeoff: delay)
• Multiple Power Supplies
– High VDD for “fast” logic
– Low VDD for “slow” logic
• Dealing with leakage currents
– Multiple-Threshold CMOS (MTCMOS)
– Variable-Threshold CMOS (VTCMOS)
Choice of Layers
UNIT IV
DATAPATH SUBSYSTEMS
Topics
• Sub system design,
• Shifters,
• Adders,
• ALUs,
• Multipliers,
• Parity generators,
• Comparators,
• Zero/One detectors,
• Counters.
1’s & 0’s Detectors
• 1’s detector: N-input AND gate
• 0’s detector: NOTs + 1’s detector (N-input
NOR)
A7 A3
A6 allzeros
A5 A2
A4 1
allones A0
A3
A2
A1
A0 A7
A6
A5
A4 allones
6/3/2015 A3
A1
299
A
A0
Comparator
s
• 0’s detector: A = 00…000
• 1’s detector: A = 11…
• Equality comparator:111 A
=B
• Magnitude comparator: A
<B
300
Equality Comparator
• Check if each bit is equal (XNOR, aka equality
gate)
• 1’s detect on bitwise equality
B[3]
A [3]
B [2]
A [2] A =B
B [1]
A [1]
B [0]
A [0]
301
Magnitude Comparator
• Compute B – A and look at sign
• B – A = B + ~A + 1
• For unsigned numbers, carry out is sign bit
A
B C
B3
N A
A3 B
B2
Z
A=B
A2
B1
A1
B0
A0
302
Signed vs. Unsigned
• For signed numbers, comparison is harder
– C: carry out
– Z: zero (all bits of B – A are 0)
– N: negative (MSB of result)
– V: overflow (inputs had different signs, output sign
B)
– S: N xor V (sign of result)
–
303
Shifter
• Logical Shift:
s
– “hifts number left or right and fills with 0’s
• 1011 LSR 1 = 0101 1011 LSL1 = 0110
• Arithmetic Shift:
– Shifts number left or right. Rt shift sign extend
• 1011 ASR1 = 1101 1011 ASL1 = 0110 s
• Rotate:
– Shifts number left or right and fills with lost bits
• 1011 ROR1 = 1101 1011 ROL1 = 0111
304
Adders
• Single-bit Addition
• Carry-Ripple Adder
• Carry-Skip Adder
• Carry-Look ahead Adder
• Carry-Select Adder
• Carry-Increment Adder
• Tree Adder
305
Barrel Shifter
• Barrel shifters perform right rotations using
wrap-around wires.
• Left rotations are right rotations by N – k = k +
1 bits.
• Shifts are rotations with the end bits masked
off.
306
Single-Bit Addition
A B A B
• Half
SA
Adder SAB
Full Adder C
C out Cout
CoutB A C C MAJ ( A,
out S
S
B B,C)
A B Cout S A B C Cout S
0 0 0 0 0 0 0 0 0
0 1 0 1 0 0 1 0 1
1 0 0 1 0 1 0 0 1
1 1 1 0 0 1 1 1 0
1 0 0 0 1
1 0 1 1 0
1 1 0 1 0
1 1 1 1 1
307
PGK
• For a full adder, define what happens to
carries
• (in terms A
ofand B)
– Generate: Cout = 1 independent of C
• G=A•B
– Propagate: Cout = C
• P=A
B = 0 independent of C
– Kill: Cout
• K = ~A • ~B
308
Full Adder Design I
• Brute force implementation from eqns
SABC
MAJ ( A,
Cout
B,C)
A A B B C C
A A
B B
A B
C S B
C C
A B B
S
A C C C A
MAJ
Cout
Cout
B B
B B C A
C
A B B
A A
309
Full Adder Design II
• Factor S in terms of Cout
• S = ABC + (A + B + C)(~Cout)
• Critical path i s u sually C to Cout in
MIN O RIT Y
A
ripple
adde B
• r
C
Cout S
S
Cout
310
Layou
t
• Clever layout circumvents usual line of
diffusion
– Use wide transistors on critical path
– Eliminate output inverters
311
Full Adder Design III
• Complementary Pass Transistor Logic (CPL)
– Slightly faster, but more are a B
B C B C
A
B C B C
S Cout
A
B C B C
A
B C B C
S Cout
B
A
312
Full Adder Design IV
• Dual-rail domino
– Very fast, but large and power hungry
– Used i n very fast multipli ers
Cout _h Cout _l
C_h A_h C_l A_l
A_h B_h A_l B_l B_l
B_h
S_l S_h
C_l
C_h C_h
B_l
B_h B_h
A_h A_l
313
Carry Propagate Adders
• N-bit adder called CPA
– Each sum bit depends on all previous carries
– How do we compute all these carries quickly?
AN...1 BN...1
Cout Cout Cin
Cin
00000
Cout Cin 11111 carries
+ 1111
1111 A4...1
+0000
+0000 B4...1
SN...1 1111
0000 S4...1
314
Carry-Ripple Adder
• Simplest design: cascade full adders
– Critical path goes from Cin to Cout
– Design full adder to have fast carry delay
A4 B4 A3 B3 A2 B2
A1 B1
Cout C3 C2 C1 Cin
S4 S3 S2 S1
315
Inversion
s
• Critical path passes through majority gate
– Built from minority + inverter
– Eliminate inverter and use inverting full adder
A4 B4 A3 B3 A2 B2 A1 B1
Cout Cin
C3 C2 C1
S4 S3 S2 S1
316
Generate / Propagate
• Equations often factored into G and P
• Generate and propagate for groups spanning i:j
Gi: j Gi:k P i:k Gk
1: j
317
PG Logic
A4 B4 A3 B3 A2 B2 A1 B1
Cin
1: Bitwise PG logic
G4 P4 G3 P3 G2 P2 G1 P1 G0 P0
2: Group PG logic
C3 C2 C1 C0
3: Sum logic
C4
S4 S3 S2 S1
Cout
Carry-Ripple Revisited
P
Gi:0 Gi Gi1:0
A4 B4 B3 A2 B2 A1 B1
i Cin
A3
G4 P4 G3 P3 G2 P2 G1 P1 G0 P0
C3 C2 C1 C0
C4
S4 S3 S2 S1
Cout
Carry-Skip Adder
• Carry-ripple is slow through all N stages
• Carry-skip allows carry to skip over groups of n
bits
– Deci s i o n based o n n - bit propa g a t
A 16 :13 B 16:13 A 12:9 B1 2:9 A8 :5 B8 :5 B4:1
e signal A4:1
P 16:13 P12:9 P 8:5 P 4:1
1 1
Cout C12 C8 1 C4 1
Cin
0 + 0 + +
0 +
0
S16:13 S12:9 S4:1
S8:5
320
Carry-Lookahead Adder
• Carry-lookahead adder computes Gi:0 for
many bits in parallel.
• Uses higher-valency cells with more than two
inputs.
A16:13 B16:13 A12:9 B12:9 A8:5 B8:5 A4:1 B4:1
+ + + +
Cin
321
Carry-Select Adder
• Trick for critical paths dependent on late input
X
– Precompute two possible outputs for X = 0, 1
– Select proper output when X arrives
• Carry-se l e c t adder p r e c omput e s n -
s
A 1 6:13 B1 6:13 A12: 9 B12 :9 A 8:5 B8 :5
b–i For
t both po ssible car rie s into n-b it ms
B
0 0 4:
A 4 :1
+ + +
C12
g roup
C8
u
C0out 4
+ + + + C in
C
1
1 1 1
1
1
0
0
S16:13 S12:9 S8:5 S4:1
322
Tree Adder
• If lookahead is good, lookahead across
lookahead!
– Recursive lookahead gives O(log N) delay
• Many variations on tree adders
323
Tree Adder Taxonomy
• Ideal N-bit tree adder would have
– L = log N logic levels
– Fanout never exceeding 2
– No more than one wiring track between levels
• Describe adder with 3-D taxonomy (l, f, t)
– Logic levels: L+l
– Fanout: 2f + 1
– Wiring tracks: 2t
• Known tree adders sit on plane defined by
• l + f + t = L-1
324
Summary
Adder architectures offer area / power / delay tradeoffs.
Choose the best one for your application.
Architecture Classification Logic Max Tracks Cells
Levels Fanout
Carry-Ripple N-1 1 1 N
Carry-Skip n=4 N/4 + 5 2 1 1.25N
Carry-Inc. n=4 N/4 + 2 4 1 2N
Brent-Kung (L-1, 0, 0) 2log2N – 1 2 1 2N
Sklansky (0, L-1, 0) log2N N/2 + 1 1 0.5 Nlog2N
Kogge-Stone (0, 0, L-1) log2N 2 N/2 Nlog2N
6/3/2015 325
Multi-input Adders
• Suppose we want to add k N-bit words
– Ex: 0001 + 0111 + 1101 + 0010 = 10111
• Straightforward solution: k-1 N-input CPAs
– Large and slow
0001 0111 1101 0010
+
1000
+
10101
+
10111
6/3/2015 326
Carry Save Addition
• A full adder sums 3 inputs and produces 2
outputs
– Carry output has twice weight of sum output
• N full adders in parallel are called carry save
adder
X4 Y 4 Z 4 X3 Y 3 Z 3 X2 Y 2 Z 2 X1 Y 1 Z1
– Produce N sums and N carry outs
C4 S 4 C3 S 3 C2 S 2 C1 S 1
n-bit CSA
328 S
Multiplication
• Example: 1100 : 1210 multiplican
0101 : 510 d multiplier
1100
0000 partial
1100 products
0000
00111100 : 6010 product
• M x N-bit multiplication
– Produce N M-bit partial products
– Sum these to produce M+N-bit product
329
General Form
• Multiplicand: Y = (yM-1, yM-2, …, y1,
• y0)
Multiplier: X = (xN-1, xN-2, …, x1,
M 1 N 1 M 1
x0) P y j 2 j xi 2 i x y 2i j
N 1
i j
330
Dot Diagram
• Each dot represents a bit
x0
partial products
multiplier x
x15
331
Array Multiplier
y3 y2 y1 y0
x0
x1
CSA
Array
x2
x3
CPA
p7 p6 p5 p4 p3 p2 p1 p0
A B
Sin A Cin critical path A B
A B
B Sin
Cout Cout Cin
= Cin =
Cout Cin
Sout
Cout Sout Sout
Sout
6/3/2015 332
Rectangular Array
• Squash array to fit rectangular floorplan
y3 y2 y1 y0
x0
p0
x1
p1
x2
p2
x3
p3
p7 p6 p5 p4
333
Fewer Partial Products
• Array multiplier requires N partial products
• If we looked at groups of r bits, we could form
N/r partial products.
– Faster and smaller?
– Called radix-2r encoding
• Ex: r = 2: look at pairs of bits
– Form partial products of 0, Y, 2Y, 3Y
– First three are easy, but 3Y requires adder
334
Booth Encoding
• Instead of 3Y, try –Y, then increment next
partial product to add 4Y
• Similarly, for 2Y, try –2Y + 4Y in next partial
duct
pro
335
Booth Hardware
• Booth encoder generates control lines for
each PP
– Booth selectors choose PP bits
336
Advanced Multiplication
• Signed vs. unsigned inputs
• Higher radix Booth encoding
• Array vs. tree CSA networks
337
Unit VI
Array Sub Systems
Topics:
SRAM
DRAM
ROM
Serial
Acces
s
Mem
ories
Conte
nt
Addre 338
Semiconductor Memory Types
Semiconductor Memories
339
Semiconductor Memory Types (Cont.)
Design Issues
» Area Efficiency of Memory Array: of stored data bits
per
unit area
» Memory Access Time: the time required to store and/or
retrieve a particular data bit.
» Static and Dynamic Power Consumption
• Requirements
» Easy reading
» Easy Writing
» High density
» Speed, more speed and still more speed
349
Memory Architecture
m words
• k = Log2(m) address input …
signals
• or m = 2k words
• e.g., 4,096 x 8 memory: n bits per word
memory external
» 32,768 bits
r/w
view
» 12 address input signals 2k × n read and write
memory
enabl
» 8 input/output data signals
Memory access eA 0
…
• r/w: selects read or write A k-
1ouslyQ0
341
Semiconductor Memory Types (Cont.)
• DRAM
» A capacitor to store data, and a transistor to access the
capacitor
» Need refresh operation
» Low cost, and high density it is used for main
memory
• SRAM
» Consists of a latch
» Don’t need the refresh operation
» High speed and low power consumption it is mainly used for
cache memory and memory in hand-held devices
342
RAM: “Random-access” memory
External view
Typically volatile memory r/w 2k × n read and write
enable memory
• bits are not held without power
supply A0
Read and written to easily by micropro…cessor
during execution
Ak-1
…
Internal structure more complex than ROM Qn-1 Q0
• each input and output data line connects to each cell in its column
4×4 RAM
• rd/wr connected to every cell enable 2×4
• when row is enabled by decoder, each cell has logi cdecoder t stores
input data bit when rd/wr indicates write or ou At tha ed bit when
0
A
rd/wr
put indicates read s
1
Memory
cell
rd/wr stor
To every cell
Q3 Q2 Q1 Q0
343
Memory Chip Configuration
Row Address
M
N bits 2 Cells
Row Dec
Cell
WL N
2 Cells
I/O Interface
DL
Din
din
I/O Control
Dout dout
Column Dec.
Control
Column Addres s
Signals
M Bits
344
Static Random Access Memory (SRAM)
• SRAM: ThebtisltionereCd data can be retained inbdtieilfnienCitely, without
any need for a periodic refresh operation.
load
pass transistors to
activated by a row select Basic cross-coupled 2-inverter
(RS) signal to enable latch with 2 stable op points
read/write operators for storing one-bit
346
6T-SRAM
VDD
VDD
Depletion-Load
SRAM Cell
word line word line
347
SRAM Operation Principles
• RS=0: The word line is not selected. M3 and M4 are OFF
• One data-bit is held: The latch preserves one of its two stable states.
• If RS=0 for all rows: CC and CC are charged up to near VDD by pulling up
of MP1 and MP2 (both in saturation)
CC M1 M2 CC
word line
RS
348
SRAM Operation Principles (Cont.)
Pull-up transistor (one per column)
VDD VDD
VDD
MP1 MP2
bit line C R R bit line C
VC M3 V1 V2 M4 VC
CC M1 M2 CC
word line
RS
349
SRAM Operation Principles (Cont.)
Pull-up transistor (one per column)
VDD VDD
VDD
MP1 MP2
bit line C R R bit line C
VC M3 V1 V2 M4 VC
CC M1 M2 CC
word line
RS
350
SRAM Operation Principles (Cont.)
Pull-up transistor (one per column)
VDD VDD
VDD
MP1 MP2
bit line C R R bit line C
VC M3 V1 V2 M4 VC
CC M1 M2 CC
word line
RS
• Write “0” Operation (V1=VOH, V2=VOL at t=0-):
• VC VOL by the data-write circuitry.
• Since V1 VOL, M2 turns off, therefore V2 VOH.
351
SRAM Operation Principles (Cont.)
Pull-up transistor (one per column)
VDD VDD
VDD
MP1 MP2
bit line C R R bit line C
VC M3 V1 V2 M4 VC
CC M1 M2 CC
word line
RS
• Read “0” Operation (V1=VOL, V2=VOH at t=0-):
• VC retains pre-charge level, while VC VOL by M1 ON.
• Data-read circuitry detects small voltage difference VC – VC < 0,
and
amplifies it as a “0” data output.
352
SRAM Operation Principles (Cont.)
VDD VDD
VDD
MP1 MP2
bit line C R R bit line C
VC M3 V1 V2 M4 VC
CC M1 M2 CC
word line
RS
353
SRAM Operation Principles (Cont.)
VDD VDD
VDD
MP1 MP2
bit line C R R bit line C
VC M3 V1 V2 M4 VC
CC M1 M2 CC
word line
RS
CC M1 M2 CC
word line
RS
Advantages
• Very low standby power consumption
• Large noise margins than R-load SRAMS
• Operate at lower supply voltages than R-load SRAMS
Disadvantages
• Larger die area: To accommodate the n-well for pMOS transistors
and polysilicon contacts. The area has been reduced by using
multi- layer polysilicon and multi-layer metal processes
• CMOS more complex process
355
Dynamic Read-Write Memory (DRAM) Circuits
• SRAM: 4~6 transistors per bit
4~5 lines connecting as charge on capacitor
• DRAM: Data bit is stored as charge on
capacitor Reduced die area
Require periodic refresh
WL
M1 M2
M3 M4
parasitic storage
BL BL
capacitances
Four-Transistor DRAM Cell
356
DRAM Circuits (Cont.)
WL(read)
X M2
M1 M3
parasitic storage
WL(write) capacitances
BL(write) BL(read)
BLR BL GND
W
RWL
M3
M2
WWL
M1
358
One-Transistor DRAM Cell
WL
M1 explicit storage
capacitances
BL
359
Operation of Three-Transistor DRAM Cell
VDD
Precharge devices
MP1 MP2
PC
RS
M3
M1 M2
C2 C3
C1
WS
Data_in
DAT C2, C3 >> C1(>10C1)
A Data_out
• The binary information is stored as the charge in C1
• Storage transistor M2 is on or off depending on the charge in C1
• Pass transistors M1 and M3: access switches
• Two separate bit lines for “data read” and “data write”
360
Operation of Three-Transistor DRAM Cell (Cont.)
PC write 1 PC read 1 PC write 0 PC read
VDD ① 2 0
Precharge devices ③ 4 ⑤ 6 ⑦ 8
MP1 MP2 PC
PC WS
RS
M3 DAT
M1
C2
M2
C3
A
Din
C1
WS Stored data
Data_in Data_out RS
DATA
Dout
361
Operation of Three-Transistor DRAM Cell (Cont.)
VDD
Precharge devices
MP1 MP2
PC Pre-charge Cycle
C2 C3
RS
M3
M1 M2
C2 C3
C1
WS
Data_in Data_out
DATA
RS
M3
M1 M2
C2 C3
C1
WS
Data_in Data_out
DATA
RS
M3
M1 M2
C2 C3
C1
WS
Data_out
Data_in
DAT
A
364
Operation of Three-Transistor DRAM Cell (Cont.)
RS
M3
M1 M2
C2 C3
C1
WS
Data_in Data_out
DATA
M1
Column C2 C1
capacitance
BL
1-bit DRAM Cell
C2>>C1
366
RAM
DRAM SRAM
VDD
WL
WL
WL
DL
DL
DL
367
Semiconductor Memory Types (Cont.)
ROM: 1, nonvolatile memories
2, only can access data, cannot to modify data
3, lower cost: used for permanent memory in printers,
fax, and game machines, and ID cards
• Mask ROM: data are written during chip fabrication by a photo
mask
• PROM: data are written electrically after the chip is fabricated.
» Fuse ROM: data cannot be erased and modified.
» EPROM and EEPROM: data can be rewritten, but the number of
subsequent re-write operations is limited to 104-105.
• EPROM uses ultraviolet rays which can penetrate through the
crystal glass on package to erase whole data simultaneously.
• EEPROM uses high electrical voltage to erase data in 8 bit units.
• Flash Memory: similar to EEPROM
368
ROM: “Read-Only” Memory
Nonvolatile
Can be read from but not written to, by a
processor in an microcomputer system External view
…
Uses A k-1
• Store software program for general- …
purpose processor Qn-1
Q0
369
Example: 8 x 4 ROM
word 0
Decoder sets word 2’s line to 1 if enabl
3×8
decoder word 1
word 2
address input is 010 Ae
0
word
A1
Data lines Q3 and Q1 are set to 1 A2 line
because there is a “programmed” data
connection with word 2’s line programm
able line
Word 2 is not connected with connection
Q 3 Q2 Q1 Q0
data
lines Q2 and Q0
Output is 1010
370
Memory – ROM
Nonvolatile Memory
- SRAM and DRAM and attractive due to their speed
- however, they are volatile which means when the power is removed, the data
is lost
- before looking at the details of a Flash transistor, let’s first look at the
different types of ROM arrays and addressing modes
371
Memory – ROM
ROM Arrays
- There are two basic types of ROM arrays
1) NOR-based ROM
2) NAND-based ROM
NOR-based ROM
- All Column Lines are pulled-up using a PMOS transistor (or resistor)
- The Row Lines are connected to the gates of NMOS transistors at the intersection
of Row and Column Lines
- If the NMOS transistor is present, it will pull down the Column Line when its gate is
driven high by the Row Line
- If the NMOS transistor is absent, the Column Line will not be pulled down, so it
Memory – ROM
NOR-based ROM
- In order to Read from the array, the Row line is asserted and the desired Column line is
observed
373
Memory – ROM
NAND-based ROM
- NAND-based ROM is a different array architecture
NAND-based ROM
- In this configuration, if an NMOS is present, it will
represent a “stored 1” since in order to address its
location, the Row line is driven to a ‘0’ and the NMOS
not turned on. This leaves the Column line pulled HIGH
376
OTP ROM: One-time programmable ROM
378
Sample EPROM components
379
Sample EPROM programmers
380
EEPROM: Electrically erasable programmable
ROM
Extension of EEPROM
• Same floating gate principle
• Same write ability and storage permanence
Fast erase
• Large blocks of memory erased at once, rather than one word at a time
• Blocks typically several thousand bytes large
Writes to single words may be slower
• Entire block must be read, word updated, then entire block written back
Used with embedded microcomputer systems storing large data
items in nonvolatile memory
• e.g., digital cameras, MP3, cell phones
382
Serial Access Memories
Serial access memories do not use an address
– Shift Registers
– Serial In Parallel Out (SIPO)
– Parallel In Serial Out (PISO)
– Queues (FIFO, LIFO)
383
384
385
386
387
FIFOs are commonly used in electronic circuits for buffering and flow
control which is from hardware to software.
In its hardware form, a FIFO primarily consists of a set of read and
write pointers, storage and control logic.
Storage may be SRAM, flip-flops, latches or any other suitable form of
storage. For FIFOs of non-trivial size, a dual-port SRAM is usually
used, where one port is dedicated to writing and the other to reading.
A synchronous FIFO is a FIFO where the same clock is used for
both reading and writing. An asynchronous FIFO uses different
clocks for reading and writing.
FIFO full/empty
A hardware FIFO is used for synchronization purposes. It is often
implemented as a circular queue, and thus has two pointers:
Read Pointer/Read Address Register
Write Pointer/Write Address Register
388
389
390
Content Addressable Memories
What is CAM?
1 0 1 X X
Content Addressable Memory
00
01 0 1 1 0 X
11 1 0 0 1 1
01 0 1 1 0 X
0 1
In CAM it is the reverse: 10 0 1 1 X X
11 1 0 0 1 1
Input is associated with
something stored in the memory. 0 1 1 0 1
392
CAM for Routing Table Implementation
Source: http://pagiamtzis.com/cam/camintro.html
393
Simplified CAM Block Diagram
The input to the system is the search word.
The search word is broadcast on the search lines.
Match line indicates if there were a match btw. the search and stored word.
Encoder specifies the match location.
If multiple matches, a priority encoder selects the first match.
Hit signal specifies if there is no match.
The length of the search word is long ranging from 36 to 144 bits.
Table size ranges: a few hundred to 32K.
Address space : 7 to 15 bits.
394
CAM Memory Size
Largest available
around 18 Mbit (single
chip).
E xponential growth
93 5
CAM Basics
The search-data word is loaded
into the search-data register.
All match-lines are pre-
charged to high (temporary
match state).
Search line drivers
broadcast the search word
onto the differential search
lines.
Each CAM core compares its
stored bit against the bit on
the corresponding search-
lines. Source: K. Pagiamtzis, A. Sheikholeslami, “Content-Addressable
Memory (CAM) Circuits and Architectures: A Tutorial and
ground.
Match words that have at least Survey,” IEEE J. of Solid-state circuits. March 2006
396
one missing bit, discharge to
Type of CAMs
We can add new entries into their table to learn what they don’t
know
before.
399
Unit VII
SEMICONDUCTOR INTEGRATED CIRCUIT
DESIGN
Programmable
Logic Array (PLA)
Programmable
Array Logic(PAL)
Programmable logic
FPGAs devices (PLD)
CPLDs
Standard cells
Design Approach
Parameters influencing low power design
400
401
PLD
Programmable logic is defined as a device with configurable
logic and flip-flops linked together with programmable
interconnect.
Why we are going for PLDs
Problems by Using Basic Gates
Many components on PCB:
• As no. of components rise, nodes interconnection complexity
grow exponentially
• Growth in interconnection will cause increase in interference, PCB size, PCB
design cost, and manufacturing time
402
403
404
PROGRAMMABLE LOGIC DEVICES (PLD)
405
PLD Hierarchical Architecture
PLD
The purpose of a PLD device is to permit elaborate digital logic
designs to be implemented by the user in a single device.
406
General structure of PLDs.
407
PLD
408
PLD
The differences between the first three categories
are these:
• 1. In a ROM, the input connection matrix is hardwired.
The user can modify the output connection matrix.
• In a PAL/GAL the output connection matrix is
hardwired. The user can modify the input connection
matrix.
• In a PLA the user can modify both the input connection
matrix and the output connection matrix.
409
Programming by blowing fuses.
411
AND - PLD Notation
412
413
PLA
414
PLA
416
417
Design for PLA:
Example
• Implement the following functions using PLA
F0 = A + B' C'
F1 = A C' + A B Input Side:
F2 = B' C' + A B
F3 = B' C + A 1 = asserted in term
0 = negated in term
- = does not
Personality Matrix participate
P roduct Inputs
Outputs term A Output Side:
B C F0 F1 F2 1 = term connected to output
F3 Reuse 0 = no connection to output
AB 1 1 - 0 1 1 0 of
BC - 01 0 0 0 1 terms
A 1 - 0 0 1 0 0
C
BC - 00 1 0 1 0
A 1 - - 1 0 0 1
418
Example: Continued
A B C
F0 = A + B' C' AB
F1 = A C' + A B
F2 = B' C' + A B B’C
F3 = B' C + A
AC’
B’C’
Personality Matrix A
419
BCD to Gray Code Converter
A B C D W X Y Z A A
0 0 0 0 0 0 0 0 AB AB
0 0 0 1 0 0 0 1 CD 00 01 11 10 CD 00 01 11 10
0 0 1 0 0 0 1 1 00 0 0 X 1 00 0 1 X 0
0 0 1 1 0 0 1 0
0 1 0 0 0 1 1 0 0 1 X 0
01 0 1 X 1 01
0 1 0 1 1 1 1 0
0 1 1 0 1 0 1 0 D D
11 0 1 X X 11 0 0 X X
0 1 1 1 1 0 1 1
1 0 0 0 1 0 0 1 C C
1 0 0 1 1 0 0 0 10 0 1 X X 10 0 0 X X
1 0 1 0 X X X X
1 0 1 1 X X X X B B
1 1 0 0 X X X X
K-map for W K-map for X
1 1 0 1 X X X X
1 1 1 0 X X X X
1 1 1 1 X X X X A A
AB AB
CD 00 01 11 10 CD 00 01 11 10
0 0 1 X 0 0 0 0 X 1
0 0
Minimized Functions: 01 0 1 X 0 01 1 0 X 0
D D
11 1 1 X X 11 0 1 X X
W=A+BD+B C C
C X = B C' 10 1 1 X X 10 1 0 X X
Y=B+C
Z = A'B'C'D + B C D + A D' + B' C D' B B
K-map for Y K-map for Z
420
A B C D
A
BD
BC’
B
PLA achieves higher flexibility
C
at the cost of lower speed!
BCD
AD’
BCD’
421 W X Y Z
422
423
424
PALs
Programmable Array Logic
• a fixed OR array.
Inputs
Outputs
42
5
A simple four-input, three-output PAL device.
426
An example of using a PAL device to realize two
Boolean functions. (a) Karnaugh maps. (b) Realization.
427
PAL
W = ABC + CD
X = ABC + ACD + ACD +
BCD
Y = ACD + ACD + ABD
428
FPGA AND CPLD
429
What is an FPGA?
Before the advent of programmable logic, custom logic circuits were
built at the board level using standard components, or at the gate
level in expensive application-specific (custom) integrated circuits.
430
What does a logic cell do?
The logic cell architecture varies between different device families.
Each logic cell combines a few binary inputs (typically between 3 and
10) to one or two outputs according to a Boolean logic function
specified in the user program .
In most families, the user also has the option of registering the
combinatorial output of the cell, so that clocked logic can be easily
implemented.
LUT devices tend to be a bit more flexible and provide more inputs
per cell than multiplexer cells at the expense of propagation delay.
431
What does 'Field Programmable' mean?
Field Programmable means that the FPGA's function is defined by a
user's program rather than by the manufacturer of the device.
432
How are FPGA programs created?
Individually defining the many switch connections and cell
logic functions would be a daunting task.
433
FPGA
FPGA applications:-
i. DSP
ii. Software-defined radio
iii. Aerospace
iv. Defense system
v. ASIC Prototyping
vi. Medical Imaging
vii. Computer vision
viii. Speech Recognition
ix. Cryptography
x. Bioinformatic
xi. And others.
434
Xilinx Spartan-3E Starter Kit
FPGA
buttons LEDs
switches
435
FPGA Principles
436
FPGA structure
CLB SB CLB
SB SB SB
Interconnection Network
437
Simplified CLB Structure
Look-Up MUX
SET
Table D Q
(LUT)
CLR Q
CLB SB CLB
SB SB SB
AD B C D O
0 0 0 0 0
0 0 0 1 0 0
0
0 0 1 0 0 0
A 0
MUX O
0 0 1 1 0 0
0 SET
0 1 0 0 0 0 D Q
0
0 1 0 1 0 0
B
C 0
0 1 1 0 0 0
0 CLR Q
0 1 1 1 0 0
0
0
1 0 0 0 0 0
D 1
1 0 0 1 0
1 0 1 0 0 Configuration bits
1 0 1 1 0
1 1 0 0 0
1 1 0 1 0
1 1 1 0 0
1
6/3/2015
1 1 1 1
Example 2: Find the configuration
bits for the following circuit
A0
2-to-1 SET
MUX
D Q
A1
CLR Q
S A MU
Clock SE X
0 DT
A0 A1 S Q
AS
0 0 0
0 0 1 1 CLR Q
0 1 0
0 1 1
1 0 0
Configuration
1 0 1
bits
1 1 0
6 /3/
1 1 1
Interconnection Network
Configuration
bits 0 1
0
CLB SB CLB 0
0 0
SB SB SB
Interconnection Network
Input2
CLB0 SB0 CLB1
Input1 D
SET
Q
Output
Input2
Input3 CLR Q
SB1 SB2 SB3
Input3
CLB2 SB4 CLB3 Output
CLBs required
CLB 1 CLB 2
Input1 D
SET
Q
Output
Input2 CLR Q
Input3
0
0
MUX O MUX Output
SET
Input1 0 D Q D
SET
Q
O 1
0 Input3 1
CLR Q Q
Input2 1 CLR
1 0
0
Input2
CLB0 SB0 CLB1
Input3
CLB2 SB4 CLB3 Output
444
Routing: Select path
Input1
SB1
Configuration bits
Input2
CLB0 SB0 CLB1
0 0
0
1
0 0
SB1 SB2 SB3
SB4
Configuration bits
Input3
CLB2 SB4 CLB3 Output
0 0
1
0
445
Configuration Bitstream
The configuration bitstream must include ALL CLBs and
SBs, even unused ones
CLB0: 00011
CLB1: 01100
CLB2: XXXXX
CLB3: ?????
SB0: 000000
SB1: 000010
SB2: 000000
SB3: 000000
SB4: 000001
446
FPGA Advantages
447
FPGA EDA Tools
448
Design of approach of IC
449
450
Silicon Wafers: Basic unit
• Silicon Wafers Basic processing unit
• 150, 200, 300 mm disk, 0.5 mm thick
• Newest ones 300 mm (12 inches)
• Typical process 25 - 1000 wafers/run
• Each wafer: 100 - 1000's of microchips (die)
• Wafer cost $10 - $100's
• 200 mm wafer weight 0.040 Kg
• Typical processing costs $1200/wafer (200
mm)
• Typical processed wafer value $11,000
(all products, modest yield)
• Value/Mass of processed wafer $275,000/Kg
451
CPLD
1. Complexity of CPLD is between FPGA and PLD.
2. CPLD featured in common PLD:-
i. Non-volatile configuration memory – does not need an
external configuration PROM.
ii. Routing constraints. Not for large and deeply layered
logic.
3. CPLD featured in common FPGA:-
i. Large number of gates available.
ii. Can include complicated feedback
4.
path.
CPLD application:-
i. Address coding
ii. High performance control
logic
452 iii. Complex finite state machines
CPLD
5. CPLD architecture:-
454
Using a PROM for logic design
• CMOS Testing
• Need for testing
• Test principles
• Design strategies for test
• Chip level test techniques
• System-level test techniques
• Layout design for improved
testability
456
Need for Testing
457
Logic Verification
458
Silicon Debug
459
Manufacturing Test
460
Manufacturing Failures
461
Stuck-At Faults
462
Examples
463
Observability & Controllability
464
Test Pattern Generation
465
Test Example
» SA1
SA0
A3 n1
{0110} A2
A2 {1110}
Y
{1010}
{11 10}
A3 A
n2 n3
A1 {0100} {01 10
1
A0
A0 {0110} }
{0111}
n1 {1110} {0110}
n2 {0110} {0100}
n3 {0101} {0110}
Y {0110} {1110}
467
Scan
Flop
Normal mode: flip-flops behave as usuaSDlI Q
Contents of flops
Flop
Flop
Flop
can be scanned Flop
Flop
Flop
Logic Logic
inputs
Cloud Cloud
Flop
Flop
Flop
values scanned
Flop
Flop
Flop
in scanout
468
Scannable Flip-flops
SCAN
SCAN CLK Q
D
D 0 X
Flop
Q Q
SI 1 SI
(a)
(b)
d
D
Q
d
SCAN
d X
Q
s
s
SI
(c)
s
469
ATPG
470
Built-in Self-test
471
PRSG
Step Y
CLK Y 0 111
Q[0] Q[1] Q[2]
Flop
Flop
Flop
D D D 1 11
0
2 101
3 010
Flops reset to 111 4 100
5 001
6 01
7 1
111 (repeats)
472
BILBO
C[0]
C[1]
Q[2] / SO
Flop
Flop
Flop
SI 1
0 Q[0]
Q[1]
473
Boundary Scan
474
Boundary Scan Example
PackageInterconnect
CHIP B CHIP C
Serial Data
Out
CHIP A CHIP D
Serial Data In
475
Boundary Scan Interface
476
Summary
477