0% found this document useful (0 votes)

78 views2 pages

Flow Slice PDF

A paper about load balancing

Uploaded by

dodownload

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

78 views2 pages

Flow Slice PDF

A paper about load balancing

Uploaded by

dodownload

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Flow-Slice: A Novel Load-Balancing Scheme for Multi-Path Switching Systems

1 1 1 1 2 3 Lei Shi , Bin Liu , Changhua Sun , Zhengyu Yin , Laxmi Bhuyan , H. Jonathan Chao
1

Department of Computer Science and Technology Tsinghua University Beijing, China

Department of Computer Science and Engineering University of California, Riverside CA 92521, U.S.A

Department of Electrical and Computer Engineering Polytechnic University Brooklyn, NY 11201, U.S.A

{shijim,sch04,yzy04}@mails.thu .edu.cn, liub@tsinghua.edu.cn ABSTRACT

bhuyan@cs.ucr.edu

chao@poly.edu

Multi-Path Switching systems (MPS) are intensively used in the state-of-the-art core routers. One of the most intractable issues is how to load-balance traffic across its multiple paths while not disturbing the intra-flow packet orders. In this paper, based on the studies of tens of real Internet traces, we develop a novel scheme, namely Flow-Slice (FS), which cuts off each flow into flow-slices at every intra-flow interval larger than a slicing threshold set to 1ms~4ms and balances the load on the finer granularity. Through theoretical analyses and comprehensive trace-driven simulations, we show that FS achieves impressive load-balancing performance with little hardware cost while limiting the packet out-of-order chances to a negligible level (below 10-6).

The rule-of-thumb on this problem advocates packet-based solutions where the traffic is optimally balanced. However, in this way, packets in the same flow may be forwarded in the separate paths and experience various delays, thus violating the intra-flow packet ordering requirement. Although timestamp or sequence based re-sequencers can be added to restore packet orders, they are often shown to be costly and not scalable. By timestamp based re-sequencer [1], each packet is stalled statically (or adaptively) by the system delay upper bound, which will impose a huge delay penalty. On the other hand, the sequence based re-sequencer [2] will need to maintain at least N re-sequencers at each output, leading to O(N2) complexity. (N is the number of ports in a square MPS.) In a 1024-port/16-plane/8-priority-class 3-stage-Clos based MPS, it should maintain 4M re-sequencing FIFOs at each output. To avoid the packet out-of-order, another choice is to use flowbased load-balancing algorithms [3]. They dispatch packets in the same flow to a fixed switching path by hashing its 5-tuple to path ID. However, hashing solution will lead to severe load-imbalance. It is further shown by our evaluation results. In this paper, we present a new scheme, namely Flow-Slice (FS), that perfectly achieves the three objectives defined above. Our idea is inspired from the observations on tens of broadly located Internet traces that the intra-flow packet intervals are often, say in 40%~50% percentages, larger than the delay upper bound at MPS which is calculated statistically. As such, if we cut off each flow at every packet interval larger than a slicing threshold equaling to this bound and balance the load on the generated flow-slices, the three objectives are met triply. The load-balancing uniformity of FS is only moderately degraded from the optimal load-balancing; The intra-flow packet order is kept intact as their arrivals. Exceptions only happen in a negligible level. (below 10-6); The flow-slice table size to implement FS is limited below 1.8MB under 40Gbps line rate, which can be placed onchip to provide an ultra-fast access speed.

Categories and Subject Descriptors

C.2.1 [Computer-Communication Networks]: Network Architecture and Design Packet-switching networks

General Terms
Algorithms, Measurement, Performance

Keywords
Flow-Slice, Load-Balancing, Multi-Path Switching

1. INTRODUCTION
One of the major issues in designing MPS is the load-balancing problem defined as how to distribute incoming traffic A(t) across its k switching paths to meet the three objectives simultaneously: Uniform load-sharing: The traffic destined for each output should be dispatched to all the switching paths uniformly; Intra-flow packet ordering: The packets in the same flow should depart MPS as their arrival orders; Low complexity: The load-balancing and the additional resequencing mechanisms should work fast enough to catch up with the switch fabrics line rate.
Copyright is held by the author/owner(s). ANCS07, December 34, 2007, Orlando, Florida, USA. ACM 978-1-59593-945-6/07/0012.

2. FLOW-SLICE
Definition: A flow-slice is a sequence of packets in a flow, where every intra-flow interval between two consecutive packets is smaller than or equal to a slicing threshold ST.

Figure 1. Flow-slice size.

Figure 2. Speedup requirements in PPS.

Figure 3. Average packet delay in PPS.

Consider a PPS with port number (N) below 32 and R/k=5Gbps, where R denotes the external line rate and k denotes the switch plane number, the STmin is shown in Figure 2, as a inverse function of the provided minimal speedup S of the PPS. We observe that a speedup of 1.409 is sufficient to ensure STmin2ms for all traces. Given a slightly larger speedup of 1.627, STmin1ms can be expected. For the typical LBvN and M2Clos design, the speedup of 2 is required to ensure STmin4ms. Figure 4. Packet out-of-order probability in PPS. Flow-slices can be seen as mini-flows created by cutting off every intra-flow interval larger than ST. Compared with the original 5tuple flows, three specific properties are observed for flow-slice in all the traces we study. Property 1 (Small Size): Both the average per-flow-slice packet count and the average per-flow-slice size are much smaller than those of the 5-tuple flows. Figure 1 shows the average per-flow-slice size, while the per-flow sizes shown by the intersections of the curves and the Y axis are much larger. Using the per-flow-slice (per-flow) size to indicate the load-balancing granularity, the flow-based algorithm is 3.5~12 times coarser than the packet-based one, while the flow-slice based algorithm is only 41%~97% coarser at ST=1ms. Property 2 (Light-Tailed Size Distribution): The per-flow-slice packet count and size distributions are light-tailed while the perflow distributions are heavy-tailed. Property 3 (Fewer Active Flow-Slices): The active flow-slice number is 1~2 magnitudes fewer than the active 5-tuple flow.

4. EVALUATIONS
We establish prototypes for all the three MPS by software modeling. Specifically, the PPS prototype has 32 external ports working at 40Gbps and 8 parallel switch planes working at 5Gbps. No speedup is provided. We use homogeneous real trace data sets collected at CERNET backbone to generate the traffic at each input. Each segment has an average traffic speed around 3.5Gbps and is condensed to simulate the expected traffic rate. Each incoming packets information, including the packet arrival time, packet length and 5-tuple, are extracted from the trace files. In each test slot, 1.2 billion packets are injected to the prototype. Figure 3 depicts the average packet delay experienced in PPS when the traffic arrival is uniform. At the load rate above 0.85, FS with slicing threshold of 1ms receives the average delay only one times larger than the optimal Round-Robin (RR) algorithm; while the hashing algorithms and the re-sequencing methods are generally more than six times larger. Figure 4 depicts the packet out-of-order probability. RR without re-sequencer consistently disorders more than 2% packets, while FS limits the packet outof-order at a negligible level (below 10-6) if only slicing threshold is no less than 1ms and load rate is no larger than 0.8. This corresponds to a speedup requirement of 1.25.

3. ADMISSIBLE SLICING THRESHOLD

Theorem (Packet Out-of-order Probability): Setting a slicing threshold ST for MPS adopting FS, which leads to a statistical delay upper bound of D1 in 1 confidence interval, the packet out-of-order probability in MPS will be guaranteed of no more than , if only it suffices ST D1 . The slicing threshold ST is defined to be admissible if it guarantees a packet out-of-order probability of no more than 10-6. We are most interested in the smallest admissible slicing threshold (STmin), as it provides the best load-balancing performance while satisfying the packet out-of-order requirement. We calculate the STmin for three popular MPS designs, including Parallel Packet Switch (PPS), Load-Balanced Birkhoff-von Neumann switch (LBvN) and Multi-plane Multi-stage Clos network based switch (M2Clos).

ACKNOWLEDGMENTS
This work is partially supported by National Science Foundation of China (No. 60573121, 60625201), and National Basic Research Program of China (973 program, No. 2007CB310702).

5. REFERENCES
[1] J. S. Turner, "Resequencing Cells in an ATM Switch," Tech. Rep., WUCS-91-21, Feb. 1991. [2] D. A. Khotimsky and S. Krishnan, "Evaluation of Open-loop Sequence Control Schemes for Multi-path Switches," in Proc. IEEE ICC, pp. 2116-2120, 2002. [3] L. Shi, W. Li, B. Liu, and X. Wang, "Flow Mapping in the Load Balancing Parallel Packet Switches," in Proc. IEEE HPSR, pp. 254-258, 2005.

On Wireless Link Scheduling and Flow Control - Gore
No ratings yet
On Wireless Link Scheduling and Flow Control - Gore
213 pages
Fast ReRouting AnnaFormat Finished2
No ratings yet
Fast ReRouting AnnaFormat Finished2
184 pages
Sizing Router Buffer
No ratings yet
Sizing Router Buffer
112 pages
Project Manager RAN Integration - Ericsson
No ratings yet
Project Manager RAN Integration - Ericsson
2 pages
Modeling and Optimization of Latency in Erasure-Coded Storage Systems
No ratings yet
Modeling and Optimization of Latency in Erasure-Coded Storage Systems
141 pages
Book Scheduling2008
No ratings yet
Book Scheduling2008
124 pages
98-366 2E Lesson 1 Slides
No ratings yet
98-366 2E Lesson 1 Slides
54 pages
Informatics 11 00058
No ratings yet
Informatics 11 00058
29 pages
1999 Icon
No ratings yet
1999 Icon
7 pages
1998 Globecom
No ratings yet
1998 Globecom
6 pages
Shell Air Tool S2 A 150: Performance, Features & Benefits
No ratings yet
Shell Air Tool S2 A 150: Performance, Features & Benefits
2 pages
M2-L2 Lab User Story and Product Backlog Worksheet
No ratings yet
M2-L2 Lab User Story and Product Backlog Worksheet
1 page
Stud CSA Mod4 p1MsgPassng
No ratings yet
Stud CSA Mod4 p1MsgPassng
40 pages
(THESIS) Enhanced Fast Rerouting Mechanisms For Protected Traffic in MPLS Networks
No ratings yet
(THESIS) Enhanced Fast Rerouting Mechanisms For Protected Traffic in MPLS Networks
189 pages
Prem Kumar 2017
No ratings yet
Prem Kumar 2017
5 pages
Proposed Scheme
No ratings yet
Proposed Scheme
11 pages
Gps PDF
No ratings yet
Gps PDF
14 pages
Multiple Fault Tolerance in Mpls Network Using Open Source Network Simulator
No ratings yet
Multiple Fault Tolerance in Mpls Network Using Open Source Network Simulator
8 pages
Saras Thesis
No ratings yet
Saras Thesis
231 pages
Refueller Fabrication Manual
100% (1)
Refueller Fabrication Manual
256 pages
Improving SCFQ To Support Bursty Traffic
No ratings yet
Improving SCFQ To Support Bursty Traffic
11 pages
Distributed Systems Concepts: Ch. 10 and 14-17
No ratings yet
Distributed Systems Concepts: Ch. 10 and 14-17
53 pages
RAM & SSD Upgrades - HP - Compaq Pavilion x360 14-Ba003tx
No ratings yet
RAM & SSD Upgrades - HP - Compaq Pavilion x360 14-Ba003tx
8 pages
Design & Verification Multimedia Using Routing IP
No ratings yet
Design & Verification Multimedia Using Routing IP
4 pages
Research Inventy: International Journal of Engineering and Science
No ratings yet
Research Inventy: International Journal of Engineering and Science
5 pages
Guidelines For Students
No ratings yet
Guidelines For Students
2 pages
Nan Su2018 - A Highly Efficient Dynamic Router For Application Oriented Network On Chip
No ratings yet
Nan Su2018 - A Highly Efficient Dynamic Router For Application Oriented Network On Chip
11 pages
Datasheet - Acuvim-L Multifunction Power and Energy Meter
No ratings yet
Datasheet - Acuvim-L Multifunction Power and Energy Meter
10 pages
Its Implementation in NS2 A New Path Computation Algorithm and
No ratings yet
Its Implementation in NS2 A New Path Computation Algorithm and
6 pages
Building Material Specification
No ratings yet
Building Material Specification
88 pages
Fastpass SIGCOMM14 Perry
No ratings yet
Fastpass SIGCOMM14 Perry
12 pages
Java Abstract 2010 & 2009
No ratings yet
Java Abstract 2010 & 2009
60 pages
Buffer Space Allocation For Real-Time Channels in A Packet-Switching Network
No ratings yet
Buffer Space Allocation For Real-Time Channels in A Packet-Switching Network
21 pages
Info Compr I Yank
No ratings yet
Info Compr I Yank
10 pages
Making Parallel Packet Switches Practical: Sundar Iyer, Nick Mckeown
No ratings yet
Making Parallel Packet Switches Practical: Sundar Iyer, Nick Mckeown
8 pages
Fairqueue
No ratings yet
Fairqueue
22 pages
A Scalable Memory-Ef Cient Architecture For Parallel Shared Memory Switches
No ratings yet
A Scalable Memory-Ef Cient Architecture For Parallel Shared Memory Switches
5 pages
Mini Project
No ratings yet
Mini Project
12 pages
Joint Flow Routing and Relay Node Assignment
No ratings yet
Joint Flow Routing and Relay Node Assignment
22 pages
Pembagian Trafik Jaringan Komputer
No ratings yet
Pembagian Trafik Jaringan Komputer
16 pages
Path Allocation in Backbone Networks Project Repor
No ratings yet
Path Allocation in Backbone Networks Project Repor
125 pages
Load Balancing Multipath Switching System With Flow Slice
No ratings yet
Load Balancing Multipath Switching System With Flow Slice
7 pages
The Stratified Round Robin Scheduler: Design, Analysis and Implementation
No ratings yet
The Stratified Round Robin Scheduler: Design, Analysis and Implementation
12 pages
A Data Throughput Prediction Using Scheduling and Assignment Technique
No ratings yet
A Data Throughput Prediction Using Scheduling and Assignment Technique
5 pages
IJCER (WWW - Ijceronline.com) International Journal of Computational Engineering Research
No ratings yet
IJCER (WWW - Ijceronline.com) International Journal of Computational Engineering Research
4 pages
Load Sharing With OCGRR For Network Processors Which Supports Different Services
No ratings yet
Load Sharing With OCGRR For Network Processors Which Supports Different Services
6 pages
Round Robin Algorithm
No ratings yet
Round Robin Algorithm
27 pages
Analysis, Approximations and Admission Control of A Multi-Service Multiplexing System With Priorities
No ratings yet
Analysis, Approximations and Admission Control of A Multi-Service Multiplexing System With Priorities
10 pages
Improving Fairness in Packetized Computer Data Networks
No ratings yet
Improving Fairness in Packetized Computer Data Networks
29 pages
Multicost (Or Qos) Routing: Minimize F (V) F (V
No ratings yet
Multicost (Or Qos) Routing: Minimize F (V) F (V
23 pages
Diffserv Traffic Management With MPLS: Paulo Rogério Pereira, Pasquale Lepera, Augusto Casaca
No ratings yet
Diffserv Traffic Management With MPLS: Paulo Rogério Pereira, Pasquale Lepera, Augusto Casaca
4 pages
Networks : Partial Disjoint For Multi-Layer Protection
No ratings yet
Networks : Partial Disjoint For Multi-Layer Protection
6 pages
Qyality of Service
No ratings yet
Qyality of Service
19 pages
William Stallings Data and Computer Communications: Packet Switching
No ratings yet
William Stallings Data and Computer Communications: Packet Switching
59 pages
Design Schemes For Mpls Fast Reroute: Olexandr Lemeshko, Alla Romanyuk, Helen Kozlova
No ratings yet
Design Schemes For Mpls Fast Reroute: Olexandr Lemeshko, Alla Romanyuk, Helen Kozlova
2 pages
36-QoS TBF WFQ
No ratings yet
36-QoS TBF WFQ
9 pages
Fast Data Collection
No ratings yet
Fast Data Collection
10 pages
CH 5 Network Layer Congestion
No ratings yet
CH 5 Network Layer Congestion
42 pages
Congestion Control Algorithms
No ratings yet
Congestion Control Algorithms
42 pages
Homework - ECE 346 Fall 2010 - Classes 7-12
No ratings yet
Homework - ECE 346 Fall 2010 - Classes 7-12
12 pages
Ed 14 Module 4
No ratings yet
Ed 14 Module 4
16 pages
R07-200 RNKG PDF
No ratings yet
R07-200 RNKG PDF
4 pages
A Hierarchical Multilayer Qos Routing System With Dynamic Sla Management
No ratings yet
A Hierarchical Multilayer Qos Routing System With Dynamic Sla Management
14 pages
Paper 3 - Slides Qsic-Cidb Act2
No ratings yet
Paper 3 - Slides Qsic-Cidb Act2
22 pages
Sistem de Balustrada Vormatic - KGS - GB
No ratings yet
Sistem de Balustrada Vormatic - KGS - GB
16 pages
Software Design Patterns
100% (1)
Software Design Patterns
17 pages
API 1104 Code Clinic (Nineteenth Edition)
No ratings yet
API 1104 Code Clinic (Nineteenth Edition)
24 pages
Astm D638-22
No ratings yet
Astm D638-22
7 pages
Lecture12 Routers
No ratings yet
Lecture12 Routers
64 pages
CH 24 QoS Part2-1
No ratings yet
CH 24 QoS Part2-1
19 pages
An Overview of Motherboard Types - CompTIA A+ 220-801 - 1.2 - Professor Messer IT Certification Training Courses
100% (1)
An Overview of Motherboard Types - CompTIA A+ 220-801 - 1.2 - Professor Messer IT Certification Training Courses
4 pages
Site Inspection HSE Observations Report No.01 - 28.06.2022 N10
100% (1)
Site Inspection HSE Observations Report No.01 - 28.06.2022 N10
2 pages
Mini Cooper
No ratings yet
Mini Cooper
9 pages
Network
No ratings yet
Network
3 pages
"Leaky Bucket Algorithm": Computer Networks Minor Project Report On
100% (1)
"Leaky Bucket Algorithm": Computer Networks Minor Project Report On
13 pages
Exam 1
No ratings yet
Exam 1
8 pages
E Kubilinskas THESIS
No ratings yet
E Kubilinskas THESIS
254 pages
Compal La-A971p r0.3 Schematics PDF
No ratings yet
Compal La-A971p r0.3 Schematics PDF
53 pages
Type RI Contract NB 165895 INVOICE NB 132829 Client NB 388603 PDF
No ratings yet
Type RI Contract NB 165895 INVOICE NB 132829 Client NB 388603 PDF
3 pages
As 1019-2000 Internal Combustion Engines - Spark Emission Control Devices
No ratings yet
As 1019-2000 Internal Combustion Engines - Spark Emission Control Devices
7 pages
High Performance Network-on-Chip Through MPLS
No ratings yet
High Performance Network-on-Chip Through MPLS
4 pages
Quality Principles and Concepts
No ratings yet
Quality Principles and Concepts
32 pages
Info - Iec60794 5 20 (Ed1.0) en
No ratings yet
Info - Iec60794 5 20 (Ed1.0) en
7 pages
TG550 Service Manual
No ratings yet
TG550 Service Manual
27 pages
Guide Specifications Symmetra 96 160kVA
No ratings yet
Guide Specifications Symmetra 96 160kVA
8 pages
ZXHN F620 Datasheet
50% (2)
ZXHN F620 Datasheet
2 pages
l2vpn Tutorial
No ratings yet
l2vpn Tutorial
88 pages
Ansi Fci70-2
No ratings yet
Ansi Fci70-2
3 pages
Routing in Wireless Mesh Networks
From Everand
Routing in Wireless Mesh Networks
Raghav Kumar
No ratings yet
Study Guide Designing Cisco Data Centre Infrastructure (300-610) Exam
From Everand
Study Guide Designing Cisco Data Centre Infrastructure (300-610) Exam
Anand Vemula
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Flow Slice PDF

Uploaded by

Flow Slice PDF

Uploaded by

Flow-Slice: A Novel Load-Balancing Scheme for Multi-Path Switching Systems

Department of Computer Science and Technology Tsinghua University Beijing, China

{shijim,sch04,yzy04}@mails.thu .edu.cn, liub@tsinghua.edu.cn ABSTRACT

Categories and Subject Descriptors

Figure 1. Flow-slice size.

Figure 2. Speedup requirements in PPS.

Figure 3. Average packet delay in PPS.

3. ADMISSIBLE SLICING THRESHOLD

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.