LPVLSI Unit 3 Notes
LPVLSI Unit 3 Notes
Power dissipation is essentially the rate at which energy is drawn from the
power supply, which is proportional to the average power dissipation.
Power dissipation is important from the viewpoint of cooling and packaging of
the integrated circuit (IC) chips.
On the other hand, energy consumed is important for battery-driven portable
systems. As power dissipation is proportional to the clock frequency, to reduce
power dissipation by half we may reduce the clock frequency by half.
Although this will reduce the power dissipation and keep the chip cooler, the time
required to perform a computation will double that of the previous case. In this case,
the energy is drawn from the battery at half the rate of the previous case.
But, the total energy drawn from the battery for performing a particular computation
remains the same.
The height of the graphs represents the power and the area under the curve
represents the energy. Here, the same energy is drawn for different average power
dissipation.
Dynamic power dissipation:
In CMOS circuits, power dissipation can be divided into two broad categories:
dynamic and static.
Dynamic power dissipation in CMOS circuits occur when the circuits are in working
condition or active mode, that is, there are changes in input and output conditions
with time. In this section, we introduce the following three basic mechanisms
involved in dynamic power dissipation:
Short-circuit power: Short-circuit power dissipation occurs when both the nMOS
and pMOS networks are ON. This can arise due to slow rise and fall times of the
inputs.
Switching power dissipation: As the input and output values keep on changing,
capacitive loads at different circuit points are charged and discharged, leading to
power dissipation. This is known as switching power dissipation
Glitching power dissipation: Due to a finite delay of the logic gates, there are
spurious transitions at different nodes in the circuit.
Apart from the abnormal behavior of the circuits, these transitions also result in
power dissipation known as glitching power dissipation.Static power dissipation
occurs due to various leakage mechanisms
Static power dissipation:
Static power dissipation occurs due to various leakage mechanisms.Reverse-bias p–
n junction diode leakage current
Reverse-biased p–n junction current due to the tunneling of elections from the
valence bond of the p region to the conduction bond of the n region, known as band-
to-band-tunneling current.Sub-threshold leakage current between source and drain
when the gate voltage is less than the threshold voltage Vt
Oxide-tunneling current due to a reduction in the oxide thicknessGate current due to
a hot-carrier injection of elections Gate-induced drain-leakage (GIDL) current due
to high field effect in the drain junction.Channel punch-through current due to close
proximity of the drain and the source in short-channel devices
Short-Circuit Power Dissipation:
• When there are finite rise and fall times at the input of CMOS logic gates, both
pMOS and nMOS transistors are simultaneously ON for a certain duration, shorting
the power supply line to ground. This leads to current flow from supply to ground.
• Short-circuit power dissipation takes place for input voltage in the range Vtn < Vin
< Vdd − | Vtp |, when both pMOS and nMOS transistors turn ON creating a
conducting path between Vdd and ground (GND).
• To estimate the average short-circuit current.It is assumed that τ is both rise and fall
times of the input (τr =τ f = τ ) and the inverter is symmetric, i.e., βn = βp = β and
Vtn = − Vtp = V t .The mean short-circuit current of the inverter having no load
attached to
This implies that half of the energy is stored in the capacitor, and the
remaining half (1/ 2)CL Vdd2 is dissipated in the pMOS transistor network.
Switching power reduces by 56 %, if the supply voltage is reduced from 5 to
3.3 V, and if the supply voltage is lowered to 1 V, the switching power is
reduced by 96 % compared to that of 5V.
This is the reason why voltage scaling is considered to be the most dominant
approach to reduce switching power.
There are situations where a rail-to-rail swing does not take place on a
capacitive node. This situation arises in pass transistor logic and when the
pull-up device is an enhancement-type nMOS transistor in nMOS logic
gates.In such cases, the output can only rise to Vdd − Vt.This situation also
happens in interval nodes of CMOS gates.
Instead of C L Vdd2 f for full-rail charging, the energy drawn from power
supply for charging the capacitance to ( Vdd − Vt) is given by
• Switching Activity:
• For a complex logic gate, the switching activity depends on two factors—
the topology of the gate and the statistical timing behavior of the circuit.
• To handle the transition rate variation statistically, let n( N) be the number
of 0-to-Vdd output transitions in the time interval [0,N].
• Total energy EN drawn from the power supply for this interval is given by
• For example, consider a two-input NAND gate with the truth table given in
Table. In this case,
• For NAND/NOR gates, the switching activity decreases with the increase in
the number of inputs.
• On the other hand, the switching activity at the output of an EXOR gate in
1/4, which is independent on the number of inputs. Output activities for
different gates are shown in Table.
The variation of the switching activity at the output of NAND, NOR, and EX-OR
gates with the increase in the number of inputs is shown in Fig.
• where P0 is the probability for the output is in the zero state. For n
independent inputs to a gate
Gives the output transition probabilities for NAND, NOR, and inverter.
Compared to the static CMOS, transition probabilities are more, because the
output is always pre-charged to 1, i.e., P1 = 1. It may be noted that the
switching activity for NAND and NOR gates is more in the dynamic gates
than their static gate counterparts.
Power Dissipation due to Charge Sharing:
• Moreover, in the case of dynamic gates, power dissipation takes place due to
the phenomenon of charge sharing even when the output is not 0 at the time
evaluation, i.e., the output load capacitance is not discharged, but part of the
charge of the load capacitance might get redistributed leading to a reduction
in the output voltage level.
In the next pre-charge period, the output is again pre-charged back to Vdd
The power transferred from the supply is
Then, the total static power dissipation due to diode leakage current for one million
transistors is given by
where m* is the effective mass of electron, E is the elective field at the junction, q
is the electronic charge, and h is the reduced Planck’s constant (1/2π times).
• where Na and Nd are the doping in the p and n side, respectively. εsi is the
permittivity of silicon and Vbi is the built-in voltage across the junction.
• High doping concentrations and abrupt doping profiles are responsible for a
significant band-to-band tunneling BTBT current in scaled devices through
the drain–well junction.
• The variation of the threshold voltage with respect to the substrate bias
dVth/dVbs is referred to as the substrate sensitivity:
• Taking into consideration all the three effects, e.g., weak inversion,
DIBL, body effect and subthreshold leakage can be modeled as
Narrow-Width Effect:
• The width of a gate, particularly when it becomes narrow, affects the
threshold voltage as shown in Fig.
• The reduction in threshold voltage also leads to an increase in the
subthreshold leakage current (Fig.
Vth Roll-Off:
• As the channel length is reduced, the threshold voltage of metal–oxide–
semiconductor field-effect transistor (MOSFET) decreases.
• This reduction of channel length is known as Vth roll-off.
• Figure shows the reduction of threshold voltage with a reduction in channel
length.This effect is caused by the proximity of the source and drain regions
leading to a 2D field pattern rather than a 1D field pattern in short-channel
devices.
• The bulk change that needs to be inverted by the application of gate voltage
is proportional to the area under the trapezoidal region, shown in Fig. 6.27,
given by QB‘ Wdm( L + L) / 2 .
• So, the gate voltage has to invert less bulk charge to turn the transistor on
leading to more band bending in the Si–SiO2 interface in the short-channel
device compared to long-channel devices. As a consequence, the threshold
voltage is lower for a short-channel device.
• Moreover, the effect of the source–drain depletion region is more severe for
high drain bias voltage. This results in a further decrease in threshold
voltage and larger subthreshold current.
Temperature Effect:
• There is a strong dependence of threshold voltage on temperature.
• Similarly, if a negative gate bias is applied, the electrons from the n + poly-
silicon can tunnel through the oxide layer as shown in Fig. c.
• This results in gate oxide-tunneling current, which violates the classical
infinite input impedance assumption of MOS transistors and thus affects the
circuit performance significantly.
Hot-Carrier Injection:
• Electrons and holes near the Si–SiO2 interface can gain sufficient
energy due to the high electric field in short-channel MOSFETs and
cross the interface potential barrier and enter into the oxide layer as
shown in below Fig.
• This phenomenon is known as the hot-carrier injection. Because of the
lower effective mass and smaller barrier height (3.1 eV) for electrons
compared to holes (4.5 eV), the possibility of an injection due to electrons is
more than the holes.
Gate-Induced Drain Leakage:
• Due to a high electric field ( Vdg) across the oxide, a deep depletion region
under the drain overlap region is created, which generates electrons and
holes by band-to band tunneling. The resulting drain to body current is
called GIDL current.
• When the negative gate bias is large ( v, gate at zero or negative voltages
with respect to drain at Vdd), the n + region under the gate can be
depleted and even inverted as shown in Fig.
• As a consequence, minority carriers are emitted in the drain region
underneath the gate. As the substrate is at a lower potential, the minority
carriers at the drain depletion region are swept away laterally to the
substrate, creating a GIDL current.
• At low drain doping, the electric field is not high enough to cause tunneling.
• At very high drain doping, the depletion width and hence the tunneling is
limited.The GIDL is worse for moderated doping, when the depletion width
and electric field are both considerable
Punch-Through:
• Due to the proximity of the drain and the source in short-channel devices,
the depletion regions at the source–substrate and drain–substrate junctions
extend into the channel.
• If the doping is kept constant while the channel length is reduced, the
separation between the depletion region boundaries decreases. Increased
reverse bias (higher Vds) across the junction further decreases the
separation.
• When the depletion region merge, a majority of the carriers in the source
enter into the substrate and get collected by the drain. This situation is
known as punch-through condition.
• The net effect of punch through is an increase in the subthreshold current.
• Moreover, punch through degrades the subthreshold slope.
• The punch-through voltage VPT estimates the value of Vds for which punch
through occurs at Vgs = 0:
•
where NB is the doping concentration at the bulk, L is the channel length, and
Wj is the junction width.
• The plot shows that the delay increases with the decrease in supply voltage
in a nonlinear manner, and it increases sharply as the supply voltage
approaches the threshold voltage.
• It is essential to devise a suitable mechanism to contain this loss in
performance due to the supply voltage scaling for the realization of low-
power high-performance circuits.
• The loss in performance can be compensated by using suitable techniques at
different levels of design hierarchy, that is, the physical level, logic level,
architectural level, algorithmic level, and system level.
The voltage scaling approaches can be divided into the following four categories:
• Static Voltage Scaling (SVS) In this case, fixed supply voltages are applied
to one or more subsystems or blocks.
• Multilevel Voltage Scaling (MVS) This is an extension of the SVS, where
two or few fixed discrete voltages are applied to different blocks or
subsystems
• Dynamic Voltage and Frequency Scaling (DVFS) :This is an extension of
the MVS, where a large number of discrete voltages are applied in response
to the changing workload conditions of the subsystems.
• Adaptive Voltage Scaling (AVS): This is an extension of the DVFS, where a
close-loop control system continuously monitors the workload and adjusts
the supply voltage.
Static Voltage Scaling:
• Device Feature Size Scaling (Physical Level Approach)
Constant Field Scaling
Constant Voltage Scaling
• Architectural-Level Approaches
Parallelism for Low Power
Multi-Core for Low Power
Pipelining for Low Power
Combining Parallelism with Pipelining
Voltage Scaling Using High-Level
Transformations
Device Feature Size Scaling:
• In the first physical-level-based approach, the device feature size is scaled to
overcome the loss in performance.
• Continuous improvements in process technology and photolithographic
techniques have made the fabrication of MOS transistors of smaller and
smaller dimensions to provide a higher packaging density.
• As a reduction in feature size reduces the gate capacitance, this leads to an
improvement in performance.This has opened up the possibility of scaling
device feature sizes to compensate for the loss in performance due to
voltage scaling.
• The reduction of the size, i.e., the dimensions of MOSFETs, is commonly
referred to as scaling. To characterize the process of scaling, a parameter S,
known as scaling factor, is commonly used.
• All horizontal and vertical dimensions are divided by this scaling factor,
S ˃ 1, to get the dimensions of the devices of the new generation technology.
• Obviously, the extent of scaling, in other words the value of S, is decided by
the minimum feature size of the prevalent technology.
• It has been observed that over a period of every 2 to 3 years, a new
generation technology is introduced by downsizing the device dimensions
by a factor of S, lying in the range 1.2–1.5.
• Trends in metal–oxide–semiconductor (MOS) device scaling
•
• It may be noted that the slope of all the curves in this figure is equal to the
scaling parameter S.
• Figure a and b shows the reduction in gate delay for the n-type MOS
(nMOS) and p-type MOS (pMOS) transistor, respectively.
• Figure c shows how gate oxide thickness varies with the scaling of channel
length, whereas Fig.d shows how the supply voltage is scaled with the
scaling of channel length
Basic geometry of an MOSFET and the various parameters scaled by a scaling
factor S.
• It may be noted that all the three dimensions are proportionally reduced
along with a corresponding increase in doping densities.
• There are two basic approaches of device size scaling–constant-field scaling
and constant voltage scaling.
Constant-Field Scaling:
• In this approach, the magnitudes of all the internal electric fields within the
device are preserved, while the dimensions are scaled down by a factor of S.
• This requires that all potentials must be scaled down by the same factor.
• Accordingly, supply and threshold voltages are scaled downproportionately.
This also dictates that the doping densities are to be increased by a factor of
S to preserve the field conditions.
• As both length and width parameters are scaled down by the same factor,
the W/L remains unchanged.
• So, the transconductance parameter Kn is also scaled by a factor S.
• Both linear-mode and saturation-mode drain currents are reduced by a
factor of S, as given below:
This leads to the reduction of delay time and the consequent improvement in
performance.
• Important benefits of constant-field scaling are:
(i) Smaller device sizes leading to a reduced chip size, higher yield, and more
number of integrated circuits (ICs) per wafer,
(ii) Higher speed of operation due to smaller delay
(iii) Reduced power consumption because of the smaller supply voltage and
device currents.
Constant-Voltage Scaling:
• In constant-voltage scaling, all the device dimensions are scaled down by a
factor of S just like constant-voltage scaling. However, in many situations,
scaling of supply voltage may not be feasible in practice.
• For example, if the supply voltage of a central processing unit (CPU) is
scaled down to minimize power dissipation, it leads to electrical
compatibility with peripheral devices, which usually operate at higher
supply voltages.
• It may be necessary to use multiple supply voltages and complicated-level
translators to resolve this problem. In such situations, constant-voltage
scaling may be preferred.
• In a constant-voltage scaling approach, power supply voltage and the
threshold voltage of the device remain unchanged.
• To preserve the charge–field relations, however, the doping densities have
to be scaled by a factor of S2.
Short-Channel Effects:
• Short-channel effects arise when channel length is of the same order of
magnitude as depletion region thickness of the source and drain junctions or
when the length is approximately equal to the source and drain junction
depths.
• As the channel length is reduced below 0.6 μm, the short-channel effect
starts manifesting.
• This leads to an increase in subthreshold leakage current, reduction in
threshold voltage with Vgs, and a linear increase in the saturation current
instead of square of the gate-to-source voltage.
• Moreover, if channel length is scaled down without scaling the supply
voltage (constant-voltage scaling), the electric field across a gate oxide
device continues to increase, creating a hot carrier.
• Hot carriers can cause an avalanche breakdown of the gate oxide.
• It is necessary to restrict the maximum electric field across the gate oxide to
7 MV/cm, which translates into 0.7 V/10 Å of gate oxide thickness.
• For gate oxide thickness of 70 Å, the applied gate voltage should be limited
to 4.9 V for long-term reliable operation.
• However, if voltage scaling is not done along with the scaling of feature size
because of some other design constraints, it will be necessary to use an
appropriate measure to reduce the number of hot carriers.
• Directed acyclic graph (DAG) after unrolling and using distributivity and
constant propagation
• Distribution of path delays under single supply voltage (SSV) and multiple
supply voltage (MSV)
• A number of studies have shown that the use of multiple supply voltages
results in the reduction of dynamic power from less than 10 % to about 50
%, with an average of about 40 %.
• It is possible to use more than two, say three or four, supply voltages.
However, the benefit of using multiple Vdd saturates quickly.
• Extending the approach to more than two supply voltages yields only a
small incremental benefit. The major gain is obtained by moving from a
single Vdd to a dual Vdd.
• It has been found that in a dual-Vdd/single-Vt system, the optimal lower Vdd
is about 60–70 % of the original Vdd.
• The optimal supply voltage depends on the threshold voltage Vt of the MOS
transistors as well.
• Challenges in MVS
• Voltage Scaling Interfaces
• Converter Placement
• Floor Planning, Routing, and Placement
• Static Timing Analysis
• Power-Up and Power-Down Sequencing
• Clock Distribution
• Low-Voltage Swing
Dynamic Voltage and Frequency Scaling(DVFS):
• DVFS has emerged as a very effective technique to reduce CPU energy.
• The technique is based on the observation that for most of the real-life
applications, the workload of a processor varies significantly with time and
the workload is bursty in nature for most of the applications.
• The energy drawn for the power supply, which is the integration of power
over time, can be significantly reduced.
• This is particularly important for battery-powered portable systems.
• Basic Approach
• The energy drawn from the power supply can be reduced by using the
following two approaches:
• Dynamic Frequency Scaling
• Dynamic Voltage and Frequency Scaling
• Dynamic Frequency Scaling:
Four different cases with two different workloads and with voltage and frequency
scaling
DVFS with Varying Work Load:
• The implementation of the DVFS system will require the following
hardware building blocks:
• Variable voltage processor μ( r)
• Variable voltage generator V( r)
• Variable frequency generator f( r)
• Variable voltage processor μ( r)
• The need of a processor which can operate over a frequency range with a
corresponding lower supply voltage range can be manufactured using the
present-day process technology and several such processors are
commercially available.
• Variable voltage processor μ( r)
• Transmeta’s TM 5400 or “Crusoe” processor and Strong ARM processor
are examples of such variable voltage processors.
• Transmeta’s Crusoe processor can operate over the voltage range of 1.65–
1.1 V, with the corresponding frequency range of 700– 200 MHz.
• Variable voltage processor μ( r)
It allows the following adjustments:
• Frequency change in steps of 33 MHz
• Voltage change in steps of 25 mV
• Up to 200 frequency/voltage change per second
• Another processor, Strong ARM, also allows voltage scaling.
• Experimental results performed on Strong ARM 1100 are shown in Fig.
7.26.
• The diagram shows the operating frequency and corresponding supply
voltages for the ARM processor.