DT1 (intro)
DT1 (intro)
Giovanni De Micheli
Integrated Systems Laboratory
This presentation can be used for non-commercial purposes as long as this note and the copyright footers are not removed
© Giovanni De Micheli – All rights reserved
Course objectives
Objective
Electronic systems and their requirements
Integrated circuits
Design styles
Infrastructural
Core
Sensory
swarm
Mobile
access
(c) Giovanni De Micheli [Courtesy: J. Rabaey]
Requirements for electronic chip design
[Tilera]
(c) Giovanni De Micheli 7
New communication structures
Design requirements:
Predictable design
Fast design closure
[Vangal, Intel07]
Modular and flexible interconnect
Reliable on-chip communication
Structured design with synthesis and optimization support
NoC
NoC
Packets Models
Blocks
Routes
IP Cores
Topology FPGA
Comm Back
Graph Synthesis Topology
NoC End
Generation Flow
PE Network ASIC
Floorplan
Interface Constraints
Goals
(c) Giovanni De Micheli 8
Systems on Chip
Large-scale
Billion transistor chips
Multi-cores, multi-threaded sw
Power-consumption limited
Dark silicon
Very expensive to design
Non recurring engineering (NRE) costs
NVIDIA TEGRA Processor
Migration toward software
[Fraunhofer]
Electrical and manufacturing constraints limit
heterogeneous planar integration
15 mm
8
GP U mm
2
00
1,4
r ( s)
r pose
n Inte
o
Silic
GPU Die (21B Transistors @ 12 Nanometers TSMC, 300W) + 4 HBM2 Stacks (4 4GB) Onto
Several Silicon Interposer“Volta”
Die Stitched Together, price at launch $10,000
5.5D-IC (3D + 2.5D) Integration; Source: J.-H. Huang, et al., NVIDIA, GPU Technology Conference 2017
[Courtesy: STMicroelectronics]
[INTEL, 2017]
(c) Giovanni De Micheli
Carbon Nanotube Transistors
Custom:
All geometries are designed ad hoc
Mainly abandoned, but useful for high-performance components
Semicustom:
Optimality is traded-off for regularity
Performance penalty is small, cost is lower
Intel 4004
Routing channel
requirements are
reduced by presence
of more interconnect
layers
Cell-structure
hidden under
interconnect layers
polysilicon
VD D
metal
rows of Uncommited
uncommitted possible
cells GND contact Cell
In 1 In 2 In 3 In4
routing
channel Committed
Cell
(4-input NOR)
Out
Standard-cell like
floorplan
I/O Buffers
I/O Buffers
R ow s o f logi c m odule s
R outi ng c hannel s
I/O Buffers
Embedded memories
Embedded PowerPc
Hardwired multipliers
High-speed I/O
(c) Giovanni De Micheli 36
Courtesy Xilinx
Integrated Circuit Design Styles
Custom Semicustom
Cell-based Array-based
Cost [log]
chips/year [log]
Design Cost
(aka NRE)
Objective
Electronic design automation
Synthesis and optimization
Multi-criteria optimization
Architectural level
Operations implemented by resources
Logic level
Logic functions implemented by gates
Geometrical level
Transistors and wires
Architectural-level synthesis
Determine macroscopic structure
Interconnection of major building blocks
Logic-level synthesis
Determine the microscopic structure
Interconnection of logic gates
Physical design
Geometrical-level synthesis
Determine positions and connections
b-view s-view
a-synthesis
a-level
l-synthesis
l-level
p-design
g-level
p-view
Testability, dependability, …
Area
Area
Area
Max
e
tim
l e-
c
Cy
Latency
Latency
Latency
Latency
Max
Multi-criteria optimization
Multiple objectives
Pareto point:
A point of the design space is a Pareto point if there is no other
point with:
At least one inferior objectives
All other objectives inferior or equal
diffeq {
read ( x, y, u, dx, a ) ;
repeat {
xl = x + dx;
ul = u – ( 3 . x . u . dx ) – ( 3 . y . dx ) ;
yl = y + u . dx ;
c=x<a;
x = xl; u = ul; y = yl ;
until ( c );
write ( y )
}
STEERING CONTROL
ALU
* & UNIT
MEMORY
STEERING CONTROL
ALU
* ALU
* &
MEMORY
UNIT
Area
15
(2,2)
13 (2,1)
12
10
8 X(1,2)
7 (1,1)
5
Latency
1 2 3 4 5 6 7 8