Lecture 4 - Failure Detection and Membership
Computing
Fall 2022
Dr. Zeshan Iqbal
A Challenge
• You’ve been put in charge of a datacenter, and your manager has told you, “Oh no! We don’t have any failures in our datacenter!”
Failures are the Norm
… not the exception, in datacenters.
• When you have 120 servers in the DC, the mean time to failure (MTTF) of the next machine is 1 month.
• When you have 12,000 servers in the DC, the MTTF of the next machine drops to about 7.2 hours! (A rough calculation is sketched after the list below.)
Two options for detecting these failures:
1. Hire 1000 people, each to monitor one machine in the datacenter and report to you when it fails.
2. Write a distributed failure detector program that automatically detects failures and reports to your workstation.
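A back-of-the-envelope check of the numbers above, under an illustrative assumption: failures are independent and each machine has an MTTF of roughly 10 years (120 months), which is consistent with the 120-server figure.

```latex
% Assumption (not from the slides): per-machine MTTF of ~120 months, independent failures.
\[
\mathrm{MTTF}_{\mathrm{DC}} \approx \frac{\mathrm{MTTF}_{\mathrm{machine}}}{N}
\]
\[
N = 120:\ \frac{120\ \text{months}}{120} = 1\ \text{month},
\qquad
N = 12{,}000:\ \frac{120\ \text{months}}{12{,}000} = 0.01\ \text{month} \approx 7.2\ \text{hours}.
\]
```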
Target Settings
• Process ‘group’-based systems
– Clouds/Datacenters
– Replicated servers
– Distributed databases
Two sub-protocols
[Figure: each application process pi maintains a Group Membership List of the other processes pj in the group (1000’s of processes), communicating over the unreliable communication network.]
Group Membership Protocol
[Figure: the protocol’s components over the unreliable communication network:
I. pj crashes;
II. Failure Detector: some process pi finds out quickly;
III. Dissemination of the failure information to the rest of the group.
Fail-stop failures only.]
Next
• How do you design a group membership protocol?
I. pj crashes
• Nothing we can do about it!
• A frequent occurrence
• Common case rather than exception
• Frequency goes up linearly with size of datacenter
Distributed Failure Detectors: Properties
• Completeness
• Accuracy
• Speed
– Time to first detection of a failure
• Scale
– Equal Load on each member
– Network Message Load
Completeness and accuracy are impossible to guarantee together in lossy networks [Chandra and Toueg]. If they were both achievable, we could solve consensus (but consensus is known to be unsolvable in asynchronous systems).
What Real Failure Detectors Prefer
• Completeness: guaranteed
• Accuracy: partial/probabilistic guarantee
• Speed
– Time to first detection of a failure (i.e., time until some process detects the failure)
• Scale
– Equal Load on each member
– Network Message Load
Failure Detector Properties
• Completeness
• Accuracy
• Speed
– Time to first detection of a failure
• Scale
– Equal Load on each member
– Network Message Load
These properties should hold in spite of arbitrary, simultaneous process failures.
Centralized Heartbeating
[Figure: every process pi periodically sends (pi, Heartbeat Seq. l++) to a single central process pj.]
• Heartbeats sent periodically
• If a heartbeat is not received from pi within the timeout, mark pi as failed
☹ The central process is a hotspot
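The timeout rule above is shared by the centralized, ring, and all-to-all heartbeating variants on these slides; a minimal receiver-side sketch in Python follows. Names such as HeartbeatTracker and the TIMEOUT value are illustrative assumptions, not from the lecture.

```python
import time

TIMEOUT = 3.0  # seconds without a heartbeat before declaring failure (illustrative value)

class HeartbeatTracker:
    """Receiver-side bookkeeping: remember the last heartbeat seen from each process."""

    def __init__(self):
        self.last_seen = {}  # process id -> local time of the last accepted heartbeat
        self.last_seq = {}   # process id -> highest heartbeat sequence number seen

    def on_heartbeat(self, pid, seq):
        # Accept only fresh heartbeats (sequence numbers increase monotonically).
        if seq > self.last_seq.get(pid, -1):
            self.last_seq[pid] = seq
            self.last_seen[pid] = time.time()

    def failed_processes(self):
        # Any process whose last heartbeat is older than TIMEOUT is marked as failed.
        now = time.time()
        return [pid for pid, t in self.last_seen.items() if now - t > TIMEOUT]
```

In centralized heartbeating only the central process runs this tracker (hence the hotspot); in all-to-all heartbeating every member runs it for every other member, which equalizes the load but lets a single lost heartbeat cause a false detection.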
Ring Heartbeating
[Figure: processes arranged in a virtual ring; each process pi periodically sends (pi, Heartbeat Seq. l++) to its neighbor(s) pj on the ring.]
☹ Unpredictable on simultaneous multiple failures
All-to-All Heartbeating
[Figure: each process pi periodically sends (pi, Heartbeat Seq. l++) to every other process pj.]
☺ Equal load per member
☹ Single heartbeat loss → false detection
Next
• How do we increase the robustness of all-to-all heartbeating?
Gossip-style Heartbeating
[Figure: each process pi maintains an array of (member, Heartbeat Seq. l) entries and periodically sends it to a randomly chosen subset of members.]
☺ Good accuracy properties
Gossip-Style Failure Detection
Each membership list entry is (Address, Heartbeat Counter, Time (local)).
Membership list at node 1:
  1  10120  66
  2  10103  62
  3  10098  63
  4  10111  65
Membership list at node 2, before receiving node 1's gossip:
  1  10118  64
  2  10110  64
  3  10090  58
  4  10111  65
Membership list at node 2, after merging node 1's list (local time 70):
  1  10120  70
  2  10110  64
  3  10098  70
  4  10111  65
Protocol:
• Nodes periodically gossip their membership list: pick random nodes, send them the list
• On receipt, the received list is merged with the local list: for each member, the higher heartbeat counter wins, and updated entries are stamped with the local time
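A minimal sketch of the gossip and merge steps in Python. The dictionary layout, the GOSSIP_FANOUT value, and the method names are illustrative assumptions; the merge rule itself (keep the higher heartbeat counter and stamp the entry with local time) is the one shown in the example lists above.

```python
import random
import time

GOSSIP_FANOUT = 2  # number of random peers gossiped to in each period (illustrative)

class GossipMembership:
    def __init__(self, my_addr, members):
        self.my_addr = my_addr
        now = time.time()
        # member address -> [heartbeat counter, local time of last update]
        self.table = {m: [0, now] for m in members}

    def tick(self):
        """Once per gossip period: bump own heartbeat, pick random peers, return (peers, counters)."""
        self.table[self.my_addr][0] += 1
        self.table[self.my_addr][1] = time.time()
        peers = [m for m in self.table if m != self.my_addr]
        targets = random.sample(peers, min(GOSSIP_FANOUT, len(peers)))
        return targets, {m: hb for m, (hb, _) in self.table.items()}

    def merge(self, received_counters):
        """Merge a received {member: heartbeat counter} map into the local list."""
        now = time.time()
        for member, hb in received_counters.items():
            if hb > self.table.get(member, [-1, now])[0]:
                self.table[member] = [hb, now]  # higher counter wins; stamp with local time
```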
Gossip-Style Failure Detection
• What if an entry pointing to a failed node is deleted right after Tfail (=24) seconds?
[Figure: membership lists at nodes 1 and 2; the current time at node 2 is 75, and member 3's entry (heartbeat counter 10098, last updated at local time 50) has not been updated for more than Tfail seconds.]
• If the entry were deleted as soon as it times out, a subsequent gossip from a node that has not yet timed out member 3 would re-insert the entry as if the member were alive; hence a timed-out entry is first marked as failed and only deleted after a further waiting period.
SWIM Failure Detector Protocol
[Figure: during each protocol period of T' time units, process pi pings a random member pj; if no ack arrives, pi sends ping-req(pj) to K randomly selected processes, which ping pj on pi's behalf and relay any ack back to pi.]
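A minimal, single-threaded sketch of one SWIM protocol period. The helpers send_ping(), send_ping_req(), and wait_for_ack() are hypothetical stand-ins for the network layer, and the timeout values are illustrative; only the ping / ping-req(K) structure comes from the slide.

```python
import random

K = 3                 # number of helpers for indirect pinging (illustrative)
PING_TIMEOUT = 0.5    # seconds to wait for a direct ack (illustrative)
PERIOD_TIMEOUT = 2.0  # remainder of the protocol period T' (illustrative)

def protocol_period(pi, members, send_ping, send_ping_req, wait_for_ack):
    """One SWIM protocol period run by process pi. Returns the pinged member and a verdict."""
    # 1. Pick a random ping target pj (not ourselves).
    pj = random.choice([m for m in members if m != pi])

    # 2. Direct ping.
    send_ping(pi, pj)
    if wait_for_ack(pj, timeout=PING_TIMEOUT):
        return pj, "alive"

    # 3. No direct ack: ask K random other members to ping pj on our behalf.
    helpers = random.sample([m for m in members if m not in (pi, pj)],
                            k=min(K, len(members) - 2))
    for h in helpers:
        send_ping_req(pi, h, target=pj)

    # 4. If any helper relays an ack before the period ends, pj is alive;
    #    otherwise pi declares pj failed (or suspected; see the suspicion mechanism later).
    if wait_for_ack(pj, timeout=PERIOD_TIMEOUT):
        return pj, "alive"
    return pj, "failed"
```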
Time-bounded Completeness
• Key: select each membership element once as a ping target in a traversal
– Round-robin pinging
– Random permutation of the list after each traversal
• Each failure is detected in worst case 2N-1 (local) protocol periods
• Preserves FD properties
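A small sketch of the target-selection rule above: traverse the membership list round-robin and re-permute it randomly after each full traversal. The generator form is an illustrative choice, not from the lecture.

```python
import random

def ping_targets(membership_list):
    """Yield ping targets round-robin, reshuffling the order after each full traversal."""
    order = list(membership_list)
    while True:
        random.shuffle(order)    # new random permutation for this traversal
        for member in order:
            yield member         # every member is selected exactly once per traversal
```

Because every member is selected exactly once per traversal, a crashed process is pinged within at most two traversals (at worst it was just visited when it crashed and lands last in the next permutation), which is where the worst-case bound of 2N-1 protocol periods comes from.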
Next
• How do failure detectors fit into the big picture of a group membership protocol?
• What are the missing blocks?
III. Dissemination
[Figure: the group membership protocol components again, over the unreliable communication network (fail-stop failures only), now asking HOW component III, dissemination, is achieved.]
Dissemination Options
• Multicast (Hardware / IP)
– unreliable
– multiple simultaneous multicasts
• Point-to-point (TCP / UDP)
– expensive
• Zero extra messages: Piggyback on Failure Detector messages
– Infection-style Dissemination
Infection-style Dissemination
[Figure: the same SWIM ping / ack / ping-req(K) exchanges as before (protocol period = T time units), but with membership information piggybacked on each message.]
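A sketch of the piggybacking idea, assuming a hypothetical bounded buffer of recent membership updates; the fixed RETRANSMIT_BUDGET and MAX_PIGGYBACK values are illustrative simplifications, not from the lecture.

```python
from collections import deque

MAX_PIGGYBACK = 6       # max updates attached to one message (illustrative)
RETRANSMIT_BUDGET = 10  # how many times each update is piggybacked before being dropped (illustrative)

class DisseminationBuffer:
    """Buffer of recent membership updates to piggyback on failure-detector messages."""

    def __init__(self):
        self.updates = deque()  # each item: [update, remaining piggyback count]

    def add(self, update):
        # e.g. update = ("failed", "pj") or ("joined", "pk")
        self.updates.append([update, RETRANSMIT_BUDGET])

    def piggyback(self):
        """Return up to MAX_PIGGYBACK updates to attach to the next ping/ping-req/ack."""
        chosen = []
        for item in list(self.updates)[:MAX_PIGGYBACK]:
            chosen.append(item[0])
            item[1] -= 1
            if item[1] == 0:
                self.updates.remove(item)
        return chosen
```

Every outgoing ping, ping-req, and ack carries the result of piggyback(); receivers apply the updates to their membership lists, so failure information spreads infection-style with zero extra messages.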
Suspicion Mechanism
• False detections, due to
– Perturbed processes
– Packet losses, e.g., from congestion
• Indirect pinging may not solve the problem
• Key: suspect a process before declaring it as
failed in the group
Suspicion Mechanism
[Figure: the suspicion mechanism at process pi.]
Suspicion Mechanism
• Distinguish multiple suspicions of a process
– Per-process incarnation number
– Inc # for pi can be incremented only by pi
• e.g., when it receives a (Suspect, pi) message
– Somewhat similar to DSDV (routing protocol in ad-hoc networks)
• Higher inc # notifications override lower inc #'s
• Within an inc #: (Suspect, inc #) > (Alive, inc #)
• (Failed, inc #) overrides everything else
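A sketch of the override rules just listed, using an illustrative numeric ranking of message kinds; only the ordering rules themselves come from the slide.

```python
# Precedence of message kinds *within the same incarnation number*:
# Suspect overrides Alive; Failed overrides everything regardless of inc #.
PRECEDENCE = {"Alive": 0, "Suspect": 1}

def overrides(new_msg, old_msg):
    """True if new_msg = (kind, inc #) should override old_msg for some member pi."""
    new_kind, new_inc = new_msg
    old_kind, old_inc = old_msg
    if new_kind == "Failed":            # (Failed, inc #) overrides everything else
        return True
    if old_kind == "Failed":
        return False
    if new_inc != old_inc:              # higher inc # notifications override lower inc #'s
        return new_inc > old_inc
    return PRECEDENCE[new_kind] > PRECEDENCE[old_kind]   # within an inc #: Suspect > Alive
```

For example, overrides(("Suspect", 5), ("Alive", 5)) is True, and overrides(("Alive", 6), ("Suspect", 5)) is also True, because pi refutes a suspicion by incrementing its own incarnation number.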
Wrap Up
• Failures are the norm, not the exception, in datacenters
• Every distributed system uses a failure detector
• Many distributed systems use a membership service