PPT

Comparative Analysis of Power Yield
Improvement Under Process Variations
of Sub-Threshold Flip-Flops
Hassan Mostafa & M. Anis & M. Elmasry
University of Waterloo, Ontario, Canada
Outline
Introduction and Background
Motivation and Objectives
Simulation Procedure
Results and Discussions
Conclusion
Introduction and Background
ISCAS 2010
Slide 2
Outline
Introduction and Background
Variability
Classification and Design Methodologies
Sources
Variability and Yield
Variability and Sub-threshold logic
Motivation and Objectives
Simulation Procedure
Results and Discussions
Conclusion
Introduction and Background
ISCAS 2010
Slide 3
Variability Classification
Die-to-Die (D2D)
Affects all devices on the chip in the same way
e.g., all devices on a chip have the same Vt
Within-Die (WID)
Variations within a single chip
Affecting devices on the same chip differently
e.g., devices on the same chip have different Vt
D2D
WID
Die index
Introduction and Background
ISCAS 2010
Slide 4
Design Methodologies
Design Methodology
Performance Distribution
Cost (e.g. power)
(Overhead)
Nominal
FAIL
Frequency
50% Timing Yield
Corner-Based
FAIL
100% Timing Yield
Statistical
FAIL
>99.9% Timing Yield
www.nowpublishers.com
Introduction and Background
ISCAS 2010
Slide 5
Process Variations Sources
Random Dopant Fluctuations (RDF)
As CMOS devices are scaled, number of
dopant atoms decreases
The number of dopant atoms has variations
around its nominal value resulting in Vt
variations
σVt α (WL)-0.5
[1
]
As transistor area decreases with scaling,
σVt increases
Channel Length Variation
Difficulty to control the critical dimensions
at sub-wavelength lithography
Large variations in the channel length (L)
Vt α exp(-L) due to short channel effects
A small variations in L results in large
variations in Vt
Sub-wavelength lithography
1

m
Lithography
Wavelength
365nm
248nm
193nm
180nm
130nm
100
nm
10
nm
S. Borkar et al., DAC04
1980
1990
2000
Variations increase with technology scaling
Introduction and Background
ISCAS 2010
Gap
90nm
65nm`
45nm
Generation
32nm
Slide 6
13nm
EUV
2010
2020
Variability and Yield
Process variations causes the device
parameters to have fluctuations around
their nominal values
Delay distribution
The system parameters such as delay
and power have a spread around their
nominal values
This result in a percentage of the
systems does not meet the required
function or the required parameter
constraint
 Yield loss
PASS FAIL
Delay
Target delay
Variability results in Yield loss
Introduction and Background
ISCAS 2010
Slide 7
Variability and Sub-threshold logic
Sub-threshold logic is adopted for low
power consumption when performance
is of secondary importance
Variability increases in sub-threshold
circuits due to the exponential
relationship between the current and Vt
Superthreshold
Energy
The minimum energy occurs at the
sub-threshold operating region
Subthreshold
Vt
Minimum Energy
Variability increases in sub-threshold circuits
Introduction and Background
ISCAS 2010
Slide 8
VDD
Outline
Introduction and Background
Motivation and Objectives
Simulation Procedure
Results and Discussions
Conclusion
Motivation and Objectives
ISCAS 2010
Slide 9
Motivation
With technology scaling:
Variability is getting worse resulting in yield loss (higher
cost), especially, in sub-threshold circuits.
Flip-Flops
The continued demand for low power consumption leads to
sub-threshold circuit design
There are several Flip-Flops topologies which have different
power and performance trade-offs
Flip-Flops designers need some design guidelines on selecting
the best Flip-Flop topology suitable for their application
constraints considering power, performance, and yield
Motivation and Objectives
ISCAS 2010
Slide 10
Objectives
The power yield improvement of the sub-threshold flipflops is of paramount importance, especially, in
applications with strict power constraints. This power
yield improvement results in excess delay and energy
overheads.
The questions are:
Which flip-flop topology exhibits the largest overheads ?
Which flip-flop topology exhibits the lowest overheads?
Motivation and Objectives
ISCAS 2010
Slide 11
Outline
Introduction and Background
Motivation and Objective
Simulation Procedure
Flip-Flops Selection and Design
Power Yield Improvement Using Gate Sizing
Results and Discussions
Conclusion
Simulation Procedure
ISCAS 2010
Slide 12
Flip-Flops Selection (3 topologies)
1. Master-Slave Topology
Good hold time & Relatively high latency delay
Selected Flip-Flops:
Transmission-Gate Master-Slave Flip-Flop (TG-MSFF) : implemented in
IBM PowerPC 603 processor and used in standard libraries (AMS) [1]
Modified-Clocked CMOS-Master-Slave Flip-Flop (MC2MOS-MSFF) :
used in standard libraries (AMS) [2]
TG-MSFF
M-C2MOS-MSFF
[1] G. Gerosa et al., JSSC94; [2] V. Stojanovic et al., JSSC99
Simulation Procedure
ISCAS 2010
Slide 13
Flip-Flops Selection (3 topologies)
2. Pulsed Flip-Flops
Samples the input data at a short transparency period around the
clock edge
High performance
High power consumption & poor hold time
Selected Flip-Flop:
Semi-Dynamic Flip-Flop (SD-FF) [1] :
One of the fastest flip-flops structures
The most convenient structure for high performance applications
SD-FF
[1] F. Klass, VLSI Dig. 98
Simulation Procedure
ISCAS 2010
Slide 14
Flip-Flops Selection (3 topologies)
3. Sense-amplifier based flip-flop topology (SA-FF)
High performance at moderate power consumption
Represents a compromise between the MS-FF and the PulsedFF topologies
Uses differential pair architecture
Selected Flip-Flop:
Strong Arm 110 Flip-Flop (SA-FF) [1]:
implemented in the high
performance WD21264 Alpha
processor
A comparative analysis among the selected flip-flops will aid the flip-flops designers to:
Choose the best flip-flop topology based on their applications
power, performance, and yield constrains
[1]U. Ko et al.,ISLEPD96
Simulation Procedure
ISCAS 2010
Slide 15
Flip-Flops Design
Optimum Power Delay Product (PDP) Design
All flip-flops sizes are optimized to achieve minimum EDP
The optimization process is performed by using the CFSQP
(C-version Feasible Sequential Quadratic Programming)
optimization package
Functional Yield Improvement Using Setup Time Margin
Monte Carlo Analysis including D2D and WID variations is
conducted using industrial STMicroelectronics 65-nm
Technology node.
Due to variability, the setup time of some of the flip-flops
samples is violated and these samples malfunction reducing
the functional yield
A setup time margin is added to improve the functional yield
> 99.9%
Simulation Procedure
ISCAS 2010
Slide 16
Power Yield Improvement Using Gate Sizing
Main idea
Find the transistors sizes that shift the power distribution to
achieve a power yield ≥ a given yield (Yo)
The power exhibits a log-normal distribution
Statistical gate sizing to shift the power distribution mean
Simulation Procedure
ISCAS 2010
Slide 17
Power Yield Improvement Using Gate Sizing
Algorithm
Find the mean and standard deviation of the power distribution
of the initially sized sub-threshold flip-flops (PO and σ )
Using PO and σ , find the equivalent normal distribution mean
and standard deviation ( μln and σln )
Using μln and σln , find the geometric mean and standard
deviation of the log-normal distribution (μg and σg )
Using μg and σg , find the new target power distribution mean
(PO ‘)
n = 3 for YO = 99.87%
Simulation Procedure
ISCAS 2010
Slide 18
Power Yield Improvement Using Gate Sizing
Algorithm Flowchart
Calculate the gate sizing
Calculate the power
that
Repeat
Power
match
Initial
ifConstraint
the
gate
P ’ power
and
sizing
minimize
(P
yield
)
mean
Calculate
ando variability
P O’ O
(obtained
Power
Constraint
the
Yield
for
delay
Constraint
optimum
is and
not met
EDP)
(YO)
(PO and σ)
energy overheads
Simulation Procedure
ISCAS 2010
Slide 19
Outline
Introduction and Background
Motivation and Objective
Simulation Procedure
Results and Discussions
Conclusion
Results and Discussions
ISCAS 2010
Slide 20
The delay, energy, and EDP Overheads Required
for Power Yield Improvement
50
Normalized delay overhead
40
Normalized energy overhead
30
Normalized EDP overhead
20
10
0
TG-MSFF
MC2MOS-MSFF
SD-FF
SA-FF
SA-FF exhibits the lowest overheads . Thus, the SA-FF is recommended
for sub-threshold operation.
MC2MOS-FF exhibits the highest overheads. Therefore, the MC2MOS-FF
is not recommended at all for sub-threshold operation.
TG-MSFF shows lower overheads than that of the SD-FF for power yield
improvement.
ISCAS 2010
Slide 21
The delay, energy, and EDP Overheads Required
for Power Yield Improvement
Scattered plots:
TG-MSFF
ISCAS 2010
M-C2MOS-MSFF
Slide 22
The delay, energy, and EDP Overheads Required
for Power Yield Improvement
Scattered plots:
SD-FF
ISCAS 2010
SA-FF
Slide 23
Outline
Introduction and Background
Motivation and Objective
Simulation Procedure
Results and Discussions
Conclusion
Conclusions
ISCAS 2010
Slide 24
Conclusion
A comparative analysis among different subthreshold flip-flops topologies is performed
considering the required delay and energy
overheads for power yield improvement .
This work demonstrates that the SA-FF is the
recommended candidate for sub-threshold
operation in the nanometer regime whereas
the MC2MOS-MSFF is not recommended due
to its large delay and energy overheads.
Conclusions
ISCAS 2010
Slide 25
THANK YOU
ISCAS 2010
Slide 26