0% found this document useful (0 votes)

54 views21 pages

CS 6290 Many-Core & Interconnect: Milos Prvulovic Fall 2007

The document discusses different types of interconnection networks for many-core processors, including shared medium networks, switched networks, and various routing techniques. It covers topics like distributed arbitration, circuit switching vs packet switching, store-and-forward routing, wormhole routing, cut-through routing, switch technology including crossbars and omega networks, and network topologies like meshes for on-chip networks. It also discusses issues with shared caches and solutions like non-uniform cache access (NUCA) architectures with block migration between cache banks.

Uploaded by

Majdi M. Ababneh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

54 views21 pages

CS 6290 Many-Core & Interconnect: Milos Prvulovic Fall 2007

Uploaded by

Majdi M. Ababneh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 21

CS 6290

Many-core & Interconnect

Milos Prvulovic
Fall 2007

Interconnection Networks
Classification: Shared Medium or Switched

Shared Media Networks
Need arbitration to decide who gets to talk
Arbitration can be centralized or distributed
Centralized not used much for networks
Special arbiter device (or must elect arbiter)
Good performance if arbiter far away? Nah.
Distributed arbitration
Check if media already used (carrier sensing)
If media not used now, start sending
Check if another also sending (collision detection)
If collision, wait for a while and retry
For a while is random (otherwise collisions repeat forever)
Exponential back-off to avoid wasting bandwidth on collisions

Switched Networks
Need switches
Introduces switching overheads
No time wasted on arbitration and collisions
Multiple transfers can be in progress
If they use different links, of course
Circuit or Packet Switching
Circuit switching: end-to-end connections
Reserves links for a connection (e.g. phone network)
Packet switching: each packet routed separately
Links used only when data transferred (e.g. Internet Protocol)

Routing
Shared media has trivial routing (broadcast)
In switched media we can have
Source-based (source specifies route)
Virtual circuits (end-to-end route created)
When connection made, set up route
Switches forward packets along the route
Destination-based (source specifies destination)
Switches must route packet toward destination
Also can be classified into
Deterministic (one route from a source to a destination)
Adaptive (different routes can be used)

Routing Methods for Switches
Store-and-Forward
Switch receives entire packet, then forwards it
If error occurs when forwarding, switch can re-send
Wormhole routing
Packet consists of flits (a few bytes each)
First flit contains header w/ destination address
Switch gets header, decides where to forward
Other flits forwarded as they arrive
Looks like packet worming through network
If an error occurs along the way, sender must re-send
No switch has the entire packet to re-send it

Cut-Through Routing
What happens when link busy?
Header arrives to switch, but outgoing link busy
What do we do with the other flits of the packet?
Wormhole routing: stop the tail when head stops
Now each flit along the way blocks the a link
One busy link creates other busy links => traffic jam
Cut-Through Routing
If outgoing link busy, receive and buffer incoming flits
The buffered flits stay there until link becomes free
When link free, the flits start worming out of the switch
Need packet-sized buffer space in each switch
Wormhole Routing switch needs to buffer only one flit

Routing: Network Latency
Switch Delay
Time from incoming to outgoing link in a switch
Switches
Number of switches along the way
Transfer time
Time to send the packet through a link
Store-and-Forward end-to-end transfer time
(Switches*SwitchDelay)+(TransferTime*(Switches+1))
Wormhole or Cut-Through end-to-end transfer time
(Switches*SwitchDelay) + TransferTime
Much better if there are many switches along the way
See the example on page 811

Switch Technology
What do we want in a switch
Many input and output links
Usually number of input and output links the same
Low contention inside the switch
Best if there is none (only external links cause contention)
Short switching delay
Crossbar
Very low switching delay, no internal contention
Complexity grows as square of number of links
Can not have too many links (e.g. up to 64 in and 64 out)

Switch Technology
What do we want in a switch
Many input and output links
Usually number of input and output links the same
Low contention inside the switch
Best if there is none (only external links cause contention)
Short switching delay
Crossbar
Very low switching delay, no internal contention
Complexity grows as square of number of links
Can not have too many links (e.g. up to 64 in and 64 out)
Omega Network
Build switches with more ports using small crossbars
Lower complexity per link, but longer delay and more contention

Switch Technology

Network Topology

Network Topology
What do we want in a network topology
Many nodes, high bandwidth, low contention, low
latency
Low latency: few switches along any route
For each (src, dest) pair, we choose shortest route
Longest such route over all (src,dst) pairs: network diameter
We want networks with small diameter!
Low contention: high aggregate bandwidth
Divide network into two groups, each with half the nodes
Total bandwidth between groups is bisection bandwidth
Actually, we use the minimum over all such bisections

On-Chip Networks
Well have many cores on-chip
Need switched network to provide bandwidth
Need to map well onto chip surface
E.g. hypercube is not great
Mesh or grid should work well, torus OK too
Limited ports per switch (CPU & 4 neighbors)
All links short (going to neighbors)
Many parallel algorithms map well onto grids
Matrices, grids, etc.

Trouble with shared caches
Private caches OK
Each placed with its own processor
We want a shared cache, too
Fits more data than if broken into private
caches
Private caches replicate data
Dynamically shared
Threads that need more space get more space
But how do we make a shared cache fast

Trouble with shared caches
Private caches OK
Each placed with its own processor
We want a shared cache, too
Fits more data than if broken into private
caches
Private caches replicate data
Dynamically shared
Threads that need more space get more space
But how do we make a shared cache fast

Non-uniform Cache Arch. (NUCA)
Bank
Switch
CPU
Request 0x.3 Request 0x.C

S-NUCA Perormance
Fast access to nearby banks
Slow access to far-away banks
Average better than worst-case

CPU
D-NUCA Solution
A B

D-NUCA Perormance
Fast access to nearby banks
Slow access to far-away banks
Average much better than worst-case
But we keep moving blocks
Lots of power-hungry activity
Need smart policies for block migration
Move blocks less frequently
But get most of the benefit of being able to
move

D-NUCA Issues
Blocks keep moving, how do we find them?
One solution: Use an on-chip directory!
Use direct mapping to assign a home bank
If we dont know where the block is,
ask the home bank
If we move the block, tell the home bank
If we think we know where the block is,
look there. If its been moved, ask home bank

Photoshop MCQ Questions and Answers
73% (15)
Photoshop MCQ Questions and Answers
9 pages
200-301 CCNA (Cisco Certified Network Associate) Study Guide
From Everand
200-301 CCNA (Cisco Certified Network Associate) Study Guide
Anand Vemula
No ratings yet
CN Unit 1
No ratings yet
CN Unit 1
81 pages
HPC1
No ratings yet
HPC1
87 pages
Quantum Mechanics - Special Chapters PDF
No ratings yet
Quantum Mechanics - Special Chapters PDF
398 pages
Chapter1 - Networking Concepts
100% (1)
Chapter1 - Networking Concepts
106 pages
4 - Interconnection Networks
No ratings yet
4 - Interconnection Networks
57 pages
3-Topology (Line Configuration, Data Flow) ,-25-07-2024
No ratings yet
3-Topology (Line Configuration, Data Flow) ,-25-07-2024
87 pages
Computer Network Lab Experiments-1
No ratings yet
Computer Network Lab Experiments-1
59 pages
Module 2 (Part) - 3
No ratings yet
Module 2 (Part) - 3
84 pages
UNIT 1 Networking Fundamentals
No ratings yet
UNIT 1 Networking Fundamentals
39 pages
Session 6
No ratings yet
Session 6
70 pages
Static and Dynamic
No ratings yet
Static and Dynamic
43 pages
Algorithms: Notes For Professionals
100% (1)
Algorithms: Notes For Professionals
252 pages
TCS Full
No ratings yet
TCS Full
179 pages
CN Module 1 Final
No ratings yet
CN Module 1 Final
59 pages
Unit 1 CN
No ratings yet
Unit 1 CN
20 pages
Switches, Routers and Networks
No ratings yet
Switches, Routers and Networks
104 pages
PowerPoint Merge
No ratings yet
PowerPoint Merge
124 pages
Notes Multiprocessor
No ratings yet
Notes Multiprocessor
19 pages
Switching Techniques - Tpoint Tech
No ratings yet
Switching Techniques - Tpoint Tech
36 pages
10 Switches
No ratings yet
10 Switches
36 pages
DCN-unit-1 Modified
No ratings yet
DCN-unit-1 Modified
112 pages
Module 4 Distributed System
No ratings yet
Module 4 Distributed System
40 pages
Class Xi Unit-2 Networking and Internet Quick Notes and MCQ
No ratings yet
Class Xi Unit-2 Networking and Internet Quick Notes and MCQ
14 pages
18 Interconnects
No ratings yet
18 Interconnects
38 pages
Computer Networks
No ratings yet
Computer Networks
25 pages
Unit IV Course Material Comp - Networks
No ratings yet
Unit IV Course Material Comp - Networks
37 pages
ECE458 Communication Networks Notes
No ratings yet
ECE458 Communication Networks Notes
55 pages
Aca Unit-3
No ratings yet
Aca Unit-3
10 pages
Computer Networks Bca
No ratings yet
Computer Networks Bca
50 pages
Computer Networks Notes
No ratings yet
Computer Networks Notes
7 pages
Networking Ceh
No ratings yet
Networking Ceh
98 pages
Ids Unit 1
No ratings yet
Ids Unit 1
24 pages
Lecture2 NP Sep 2018
No ratings yet
Lecture2 NP Sep 2018
43 pages
Value of Expression 1 - 2 3 4 Sis: 2. 3, 3-Digit
No ratings yet
Value of Expression 1 - 2 3 4 Sis: 2. 3, 3-Digit
4 pages
Atlas Copco Pf4000 Manual
67% (6)
Atlas Copco Pf4000 Manual
476 pages
CCNA - Preperation-Day1
No ratings yet
CCNA - Preperation-Day1
122 pages
CSC 222 Digital Communication Lect 6
No ratings yet
CSC 222 Digital Communication Lect 6
25 pages
CCNA3 Study Guide
100% (1)
CCNA3 Study Guide
42 pages
2.RGP Corneal Lens
No ratings yet
2.RGP Corneal Lens
13 pages
CN Unit 1
No ratings yet
CN Unit 1
63 pages
Telecom Tutorial
No ratings yet
Telecom Tutorial
23 pages
PC System Power Supply Diagrams, Schematics and Service Manuals PDF - Google Search PDF
50% (2)
PC System Power Supply Diagrams, Schematics and Service Manuals PDF - Google Search PDF
1 page
Slingshot Elastics Test
100% (1)
Slingshot Elastics Test
12 pages
BS EN 12524-2000 Hygrothermal Properties
No ratings yet
BS EN 12524-2000 Hygrothermal Properties
14 pages
Connet Devices
No ratings yet
Connet Devices
13 pages
01-Bowles-Foundation Analysis and Design PDF
No ratings yet
01-Bowles-Foundation Analysis and Design PDF
6 pages
CH-10 Boiler Performance
No ratings yet
CH-10 Boiler Performance
19 pages
Introduction To Computer Networks
No ratings yet
Introduction To Computer Networks
79 pages
Introduction To MIMD Architectures
No ratings yet
Introduction To MIMD Architectures
17 pages
Raids and Availability
No ratings yet
Raids and Availability
3 pages
Interconnection of Networks
No ratings yet
Interconnection of Networks
3 pages
Updated Networking Notes
No ratings yet
Updated Networking Notes
28 pages
Chemistry Acid and Basic Radicals
87% (15)
Chemistry Acid and Basic Radicals
1 page
Att 8 - ASTM B8-4
No ratings yet
Att 8 - ASTM B8-4
7 pages
Network 2: Protocols, Routing, Wireless: Prof - Lawrence Rauchwerger
No ratings yet
Network 2: Protocols, Routing, Wireless: Prof - Lawrence Rauchwerger
36 pages
Lec CH 17 Symmetric Faults
No ratings yet
Lec CH 17 Symmetric Faults
16 pages
Lecture 3 - 3 Evaluating Static Interconnection Networks
No ratings yet
Lecture 3 - 3 Evaluating Static Interconnection Networks
41 pages
Lecture Note On Switch Architectures
No ratings yet
Lecture Note On Switch Architectures
63 pages
Lan and Inter-Working Devices: M.S.Chawla Sde (Computer) RTTC Rajpura
No ratings yet
Lan and Inter-Working Devices: M.S.Chawla Sde (Computer) RTTC Rajpura
37 pages
Chapter 2 - Parallel Programming Platforms
No ratings yet
Chapter 2 - Parallel Programming Platforms
33 pages
Shop 04 PEB Data
No ratings yet
Shop 04 PEB Data
9 pages
Lecture 18 - 19 - Switching Cont. - Message Switching
No ratings yet
Lecture 18 - 19 - Switching Cont. - Message Switching
28 pages
Ch2. Basics of Python Programming: Dr. Tulika Assistant Professor Department of Computer Science Miranda House
No ratings yet
Ch2. Basics of Python Programming: Dr. Tulika Assistant Professor Department of Computer Science Miranda House
47 pages
Review Questions - ch09
No ratings yet
Review Questions - ch09
10 pages
Appendix F: Authors: John Hennessy & David Patterson
No ratings yet
Appendix F: Authors: John Hennessy & David Patterson
33 pages
Lecture 3.2.4 (Various Interconnection Networks)
No ratings yet
Lecture 3.2.4 (Various Interconnection Networks)
5 pages
Frese OPTIMA Compact Actuators
No ratings yet
Frese OPTIMA Compact Actuators
6 pages
Computer Networks vs. Distributed Systems
No ratings yet
Computer Networks vs. Distributed Systems
68 pages
Lect Networking Primer
No ratings yet
Lect Networking Primer
51 pages
9100 Manual
No ratings yet
9100 Manual
11 pages
1multiprocessors and Multicomputers: A. Multiprocessor System Interconnects
No ratings yet
1multiprocessors and Multicomputers: A. Multiprocessor System Interconnects
16 pages
Lec09 Switches
No ratings yet
Lec09 Switches
35 pages
Husqvarna 2003 SM WRE 125 Manual
No ratings yet
Husqvarna 2003 SM WRE 125 Manual
2 pages
Intrinsic Viscosities and Unperturbed Dimensions of Long Chain Molecules
No ratings yet
Intrinsic Viscosities and Unperturbed Dimensions of Long Chain Molecules
117 pages
Unit-I INTRODUCTION: Goal and Application of Network, Network Structure and Architecture, Network Topology, Terminal Handling
No ratings yet
Unit-I INTRODUCTION: Goal and Application of Network, Network Structure and Architecture, Network Topology, Terminal Handling
6 pages
Distributed Memory Machines
No ratings yet
Distributed Memory Machines
10 pages
Comparison of Shielding Methods
No ratings yet
Comparison of Shielding Methods
2 pages
Mip Report
No ratings yet
Mip Report
22 pages
Computer Networks Networking
No ratings yet
Computer Networks Networking
7 pages
Bhumika Di Ip
No ratings yet
Bhumika Di Ip
20 pages
Circle The Correct Answer:: Name (Last, First) : W#
No ratings yet
Circle The Correct Answer:: Name (Last, First) : W#
1 page
Piezoelectric Energy Harvesting As Opportunity of Powering Intelligent Implants and Prostheses
No ratings yet
Piezoelectric Energy Harvesting As Opportunity of Powering Intelligent Implants and Prostheses
4 pages
Internal Test I STA
No ratings yet
Internal Test I STA
2 pages
قوانين الفصول بملف واحد فيزياء السادس علمي للاستاذ سعيد محي تومان PDF PDF Mathematical Analysis Teaching Mathematics
No ratings yet
قوانين الفصول بملف واحد فيزياء السادس علمي للاستاذ سعيد محي تومان PDF PDF Mathematical Analysis Teaching Mathematics
1 page
Fractional Fourier Transform
No ratings yet
Fractional Fourier Transform
28 pages
Polygenic Risk in Families With Spon
No ratings yet
Polygenic Risk in Families With Spon
8 pages
Electric Power Generation2
No ratings yet
Electric Power Generation2
28 pages
How To Reduce EMI in Switching Power Supplies
No ratings yet
How To Reduce EMI in Switching Power Supplies
3 pages
Akka HTTP
No ratings yet
Akka HTTP
23 pages
AD8302
No ratings yet
AD8302
24 pages
Chapter 10
No ratings yet
Chapter 10
15 pages
10 1016@j Mineng 2019 02 012 PDF
No ratings yet
10 1016@j Mineng 2019 02 012 PDF
7 pages
SPANNING TREE PROTOCOL: Most important topic in switching
From Everand
SPANNING TREE PROTOCOL: Most important topic in switching
Mulayam Singh
No ratings yet
HW 00 Quick Online Tutorial
No ratings yet
HW 00 Quick Online Tutorial
2 pages
Java Sript
No ratings yet
Java Sript
2 pages
Table 5.3: Specifications of Optimized Equivalent Anisotropic Microstrip Line After
No ratings yet
Table 5.3: Specifications of Optimized Equivalent Anisotropic Microstrip Line After
5 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

CS 6290 Many-Core & Interconnect: Milos Prvulovic Fall 2007

Uploaded by

CS 6290 Many-Core & Interconnect: Milos Prvulovic Fall 2007

Uploaded by

CS 6290

Many-core & Interconnect

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.