
Fall 2017 :: CSE 306

Implementing Locks
Nima Honarmand
(Based on slides by Prof. Andrea Arpaci-Dusseau)

Lock Implementation Goals


• We evaluate lock implementations along the following lines

• Correctness
  • Mutual exclusion: only one thread in critical section at a time
  • Progress (deadlock-free): if several simultaneous requests, must allow one to proceed
  • Bounded wait (starvation-free): must eventually allow each waiting thread to enter

• Fairness: each thread waits for the same amount of time
  • Also, threads acquire locks in the same order as requested

• Performance: CPU time is used efficiently



Building Locks
• Locks are variables in shared memory
• Two main operations: acquire() and release()
• Also called lock() and unlock()

• To check if locked, read variable and check value


• To acquire, write “locked” value to variable
• Should only do this if already unlocked
• If already locked, keep reading value until unlock
observed

• To release, write “unlocked” value to variable



First Implementation Attempt


• Using normal load/store instructions

bool lock = false; // shared variable

void acquire(bool *lock) {
    while (*lock) /* wait */ ;
    *lock = true;   // the final check of the while condition and this write
                    // would need to happen atomically (they don't here)
}

void release(bool *lock) {
    *lock = false;
}

• This does not work. Why?


• Checking and writing of the lock value in acquire() need
to happen atomically.

Solution: Use Atomic RMW Instructions


• Atomic Instructions guarantee atomicity
• Perform Read, Modify, and Write atomically (RMW)
• Many flavors in the real world
• Test and Set
• Fetch and Add
• Compare and Swap (CAS)
• Load Linked / Store Conditional
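
• For comparison, a sketch of compare-and-swap (CAS) semantics in the same style as the TAS and FAA pseudocode on the following slides (this sketch is mine, not from the slides):

// CAS semantics (pseudocode): atomically, if *addr holds `expected`,
// store `newval` and report success; otherwise leave *addr unchanged.
int CAS(int *addr, int expected, int newval) {
    int old = *addr;
    if (old == expected)
        *addr = newval;
    return old == expected;   // 1 if the swap happened, 0 otherwise
}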

Example: Test-and-Set
Semantics:
// return what was pointed to by addr
// at the same time, store newval into addr atomically
int TAS(int *addr, int newval) {
    int old = *addr;
    *addr = newval;
    return old;
}

Implementation in x86:
int TAS(volatile int *addr, int newval) {
    int result = newval;
    asm volatile("lock; xchg %0, %1"
                 : "+m" (*addr), "=r" (result)
                 : "1" (newval)
                 : "cc");
    return result;
}
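
• As an aside (my addition, not from the slides): compilers expose builtins with the same effect, so a portable sketch could avoid the hand-written assembly:

// TAS via GCC/Clang's __atomic_exchange_n builtin: atomically stores newval
// into *addr and returns the previous value (same semantics as above).
static inline int TAS(int *addr, int newval) {
    return __atomic_exchange_n(addr, newval, __ATOMIC_SEQ_CST);
}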

Lock Implementation with TAS


typedef struct __lock_t {
    int flag;
} lock_t;

void init(lock_t *lock) {
    lock->flag = ??;
}

void acquire(lock_t *lock) {
    while (????)
        ; // spin-wait (do nothing)
}

void release(lock_t *lock) {
    lock->flag = ??;
}

Lock Implementation with TAS


typedef struct __lock_t {
    int flag;
} lock_t;

void init(lock_t *lock) {
    lock->flag = 0;
}

void acquire(lock_t *lock) {
    while (TAS(&lock->flag, 1) == 1)
        ; // spin-wait (do nothing)
}

void release(lock_t *lock) {
    lock->flag = 0;
}
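
• A self-contained usage sketch (mine, not from the slides): the TAS spinlock above protecting a shared counter incremented by two POSIX threads. Here TAS() uses a compiler builtin rather than the slide's inline asm.

#include <pthread.h>
#include <stdio.h>

typedef struct { int flag; } lock_t;

static int TAS(int *addr, int newval) {
    return __atomic_exchange_n(addr, newval, __ATOMIC_SEQ_CST);
}
static void init(lock_t *l)    { l->flag = 0; }
static void acquire(lock_t *l) { while (TAS(&l->flag, 1) == 1) ; }
static void release(lock_t *l) { l->flag = 0; }

static lock_t counter_lock;
static int counter = 0;

static void *worker(void *arg) {
    for (int i = 0; i < 100000; i++) {
        acquire(&counter_lock);
        counter++;                      // critical section
        release(&counter_lock);
    }
    return NULL;
}

int main(void) {
    pthread_t t1, t2;
    init(&counter_lock);
    pthread_create(&t1, NULL, worker, NULL);
    pthread_create(&t2, NULL, worker, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    printf("counter = %d\n", counter);  // expect 200000 under mutual exclusion
    return 0;
}

• Compile with -pthread; without the lock, lost updates would make the final count fall short.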

Evaluating Our Spinlock


• Lock implementation goals
1) Mutual exclusion: only one thread in critical section at a
time
2) Progress (deadlock-free): if several simultaneous requests,
must allow one to proceed
3) Bounded wait: must eventually allow each waiting thread
to enter
4) Fairness: threads acquire lock in the order of requesting
5) Performance: CPU time is used efficiently

• Which ones are NOT satisfied by our lock impl?


• 3, 4, 5

Our Spinlock is Unfair

[Timeline: thread A repeatedly locks and unlocks during its time slices, while
thread B spends every one of its time slices spinning and never gets the lock.]

Scheduler is independent of locks/unlocks

Fairness and Bounded Wait


• Use ticket locks
  • Idea: reserve each thread’s turn to use the lock
  • Each thread spins until its turn

• Use a new atomic primitive: fetch-and-add
  • Acquire: grab a ticket using fetch-and-add
  • Spin while the thread’s ticket != turn
  • Release: advance to the next turn

Semantics:
int FAA(int *ptr) {
    int old = *ptr;
    *ptr = old + 1;
    return old;
}

Implementation:
// Let’s use GCC’s built-in
// atomic functions this time around
__sync_fetch_and_add(ptr, 1)
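
• Tying the two together, a minimal FAA() wrapper (a sketch matching the semantics above, not shown on the slide) that the ticket-lock code below can call:

// Atomically increments *ptr and returns the value it held beforehand,
// using GCC/Clang's __sync_fetch_and_add builtin.
static inline int FAA(int *ptr) {
    return __sync_fetch_and_add(ptr, 1);
}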



Ticket Lock Example


Initially, turn = ticket = 0

A lock(): gets ticket 0, spins until turn == 0
→ A runs
B lock(): gets ticket 1, spins until turn == 1
C lock(): gets ticket 2, spins until turn == 2
A unlock(): turn++ (turn = 1)
→ B runs
A lock(): gets ticket 3, spins until turn == 3
B unlock(): turn++ (turn = 2)
→ C runs
C unlock(): turn++ (turn = 3)
→ A runs
A unlock(): turn++ (turn = 4)
C lock(): gets ticket 4
→ C runs

Ticket Lock Implementation


typedef struct {
    int ticket;
    int turn;
} lock_t;

void lock_init(lock_t *lock) {
    lock->ticket = 0;
    lock->turn = 0;
}

void acquire(lock_t *lock) {
    int myturn = FAA(&lock->ticket);
    while (lock->turn != myturn); // spin
}

void release(lock_t *lock) {
    lock->turn += 1;
}
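
• A practical caveat (my note, not on the slide): with compiler optimizations enabled, the empty spin loop may never re-read turn from memory; a real implementation would mark it volatile or use an atomic load, e.g.:

typedef struct {
    int ticket;
    volatile int turn;   // volatile so the spin loop re-loads it every iteration
} lock_t;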

Busy-Waiting (Spinning) Performance


• Good when…
• many CPUs
• locks held a short time
• advantage: avoid context switch

• Awful when…
• one CPU
• locks held a long time
• disadvantage: spinning is wasteful

CPU Scheduler Is Ignorant


• …of busy-waiting locks

[Timeline: A acquires the lock, then B, C, and D each spin through an entire
time slice before A is scheduled again and can release it.]

CPU scheduler may run B instead of A, even though B is waiting for A

Ticket Lock with yield()


typedef struct {
    int ticket;
    int turn;
} lock_t;

void acquire(lock_t *lock) {
    int myturn = FAA(&lock->ticket);
    while (lock->turn != myturn)
        yield();
}

void release(lock_t *lock) {
    lock->turn += 1;
}
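
• The slides leave yield() abstract; on Linux, one concrete stand-in (my assumption, not the slide's) is sched_yield():

#include <sched.h>

// Hypothetical definition of the yield() used above: ask the scheduler
// to run some other ready thread instead of us.
static inline void yield(void) {
    sched_yield();
}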

Yielding instead of Spinning

[Timeline comparison. Without yield: B, C, and D each burn a full time slice
spinning between A’s lock and unlock. With yield: each waiter gives up the CPU
immediately, so A runs again and releases the lock much sooner, and B acquires
it next.]



Evaluating Ticket Lock


• Lock implementation goals
1) Mutual exclusion: only one thread in critical section at a
time
2) Progress (deadlock-free): if several simultaneous requests,
must allow one to proceed
3) Bounded wait: must eventually allow each waiting thread
to enter
4) Fairness: threads acquire lock in the order of requesting
5) Performance: CPU time is used efficiently

• Which ones are NOT satisfied by our lock impl?


• 5 (even with yielding, too much overhead)

Spinning Performance
• Wasted time
• Without yield: O(threads × time_slice)
• With yield: O(threads × context_switch_time)

• So even with yield, spinning is slow with high thread contention

• Next improvement: instead of spinning, block and put the thread on a wait queue

Blocking Locks
• acquire() removes waiting threads from run queue using
special system call
• Let’s call it park() — removes current thread from run queue
• release() returns waiting threads to run queue using special
system call
• Let’s call it unpark(tid) — returns thread tid to run queue

• Scheduler runs any thread that is ready


• No time wasted on waiting threads when lock is not available
• Good separation of concerns
• Keep waiting threads on a wait queue instead of scheduler’s run queue

• Note: park() and unpark() are made-up syscalls — inspired by Solaris’ lwp_park() and lwp_unpark() system calls

Building a Blocking Lock


typedef struct {
    int lock;
    int guard;
    queue_t q;
} lock_t;

void acquire(lock_t *l) {
    while (TAS(&l->guard, 1) == 1);
    if (l->lock) {
        queue_add(l->q, gettid());
        l->guard = 0;
        park(); // blocked
    } else {
        l->lock = 1;
        l->guard = 0;
    }
}

void release(lock_t *l) {
    while (TAS(&l->guard, 1) == 1);
    if (queue_empty(l->q))
        l->lock = false;
    else
        unpark(queue_remove(l->q));
    l->guard = false;
}

1) What is guard for?
2) Why okay to spin on guard?
3) In release(), why not set lock=false when unparking?
4) Is the code correct?
   • Hint: there is a race condition

Race Condition
Thread 1 in acquire():
    if (l->lock) {
        queue_add(l->q, gettid());
        l->guard = 0;
        // <Thread 1 is preempted here>

Thread 2 in release():
    while (TAS(&l->guard, 1) == 1);
    if (queue_empty(l->q))
        l->lock = false;
    else
        unpark(queue_remove(l->q));

Thread 1 (resumes):
        park();

• Problem: guard not held when calling park()


• Thread 2 can call unpark() before Thread 1 calls park()

Solving Race Problem: Final Correct Lock


typedef struct {
    int lock;
    int guard;
    queue_t q;
} lock_t;

void acquire(lock_t *l) {
    while (TAS(&l->guard, 1) == 1);
    if (l->lock) {
        queue_add(l->q, gettid());
        setpark();       // new: tell the OS we are about to park()
        l->guard = 0;
        park();          // blocked
    } else {
        l->lock = 1;
        l->guard = 0;
    }
}

void release(lock_t *l) {
    while (TAS(&l->guard, 1) == 1);
    if (queue_empty(l->q))
        l->lock = false;
    else
        unpark(queue_remove(l->q));
    l->guard = false;
}

• setpark() informs the OS of my plan to park() myself
• If there is an unpark() between my setpark() and park(), park() will return immediately (no blocking)

Different OS, Different Support


• park, unpark, and setpark inspired by Solaris
• Other OSes provide different mechanisms to
support blocking synchronization
• E.g., Linux has a mechanism called futex
• With two basic operations: wait and wakeup
• It keeps the queue in kernel
• It renders guard and setpark unnecessary

• Read more about futex in OSTEP (brief) and in an optional reading (detailed)
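
• As an illustration (my sketch, not from the slides or OSTEP), a minimal futex-based lock could look roughly like this, with 0 = free and 1 = held; futex_lock() and futex_unlock() are names I made up here:

#include <linux/futex.h>
#include <sys/syscall.h>
#include <unistd.h>

// Thin wrapper over the raw futex(2) system call (glibc provides no wrapper).
static int futex(int *uaddr, int op, int val) {
    return syscall(SYS_futex, uaddr, op, val, NULL, NULL, 0);
}

void futex_lock(int *lock) {
    // Try to grab the lock; while it is held, sleep in the kernel until woken.
    while (__sync_lock_test_and_set(lock, 1) == 1) {
        // FUTEX_WAIT only blocks if *lock still equals 1, avoiding lost wakeups.
        futex(lock, FUTEX_WAIT, 1);
    }
}

void futex_unlock(int *lock) {
    __sync_lock_release(lock);   // atomically set *lock back to 0
    futex(lock, FUTEX_WAKE, 1);  // wake at most one waiter (even if none exist)
}

• The kernel keeps the wait queue keyed by the lock word’s address, which is why no separate guard variable or setpark() is needed in this sketch.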

Spinning vs. Blocking


• Each approach is better under different circumstances

• Uniprocessor
• Waiting process is scheduled → Process holding lock can’t be
• Therefore, waiting process should always relinquish processor
• Associate queue of waiters with each lock (as in previous
implementation)

• Multiprocessor
• Waiting process is scheduled → Process holding lock might be
• Spin or block depends on how long before lock is released
• Lock is going to be released quickly → Spin-wait
• Lock released slowly → Block

Two-Phase Locking
• A hybrid approach that combines best of spinning
and blocking

• Phase 1: spin for a short time, hoping the lock becomes available soon

• Phase 2: if lock not released after a short while, then block

• Question: how long to spin for?


• There’s a nice theory (next slide) which is in practice
hard to implement, so just spin for a few iterations

Two-Phase Locking Spin Time


• Say cost of context switch is C cycles and lock will become
available after T cycles

• Algorithm: spin for C cycles before blocking

• We can show this is a 2-approximation of the optimal solution

• Two cases:
• T < C: optimal would spin for T (cost = T), so do we (cost = T)
• T ≥ C: optimal would immediately block (cost = C), we spin for C and
then block (cost = C + C = 2C)
• So, our cost is at most twice that of optimal algorithm

• Problems with implementing this theory?
  1) Difficult to know C (it is non-deterministic)
  2) Needs a low-overhead, high-resolution timing mechanism to know when C cycles have passed
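
• A rough sketch (mine, not from the slides) of a two-phase acquire on top of the blocking lock above; SPIN_LIMIT and try_acquire() are hypothetical names, and the fixed iteration count stands in for the hard-to-measure C:

#define SPIN_LIMIT 1000   // "a few iterations", per the slide; value is arbitrary

// Non-blocking attempt that reuses the blocking lock's guard protocol.
int try_acquire(lock_t *l) {
    int got = 0;
    while (TAS(&l->guard, 1) == 1);   // guard is only held briefly
    if (!l->lock) { l->lock = 1; got = 1; }
    l->guard = 0;
    return got;
}

void acquire_two_phase(lock_t *l) {
    for (int i = 0; i < SPIN_LIMIT; i++)   // phase 1: spin for a short time
        if (try_acquire(l))
            return;
    acquire(l);                            // phase 2: block (earlier slide)
}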
