Chapter 6: Synchronization
Tools
Operating System Concepts – 10th Edition Silberschatz, Galvin and Gagne
Outline
Background
The Critical-Section Problem
Peterson’s Solution
Hardware Support for Synchronization
Mutex Locks
Semaphores
Monitors
Liveness
Evaluation
Operating System Concepts – 10th Edition 6.2 Silberschatz, Galvin and Gagne
Objectives
Describe the critical-section problem and
illustrate a race condition
Illustrate hardware solutions to the
critical-section problem using memory
barriers, compare-and-swap operations,
and atomic variables
Demonstrate how mutex locks,
semaphores, monitors, and condition
variables can be used to solve the critical
section problem
Evaluate tools that solve the critical-
section problem in low-, Moderate-, and
high-contention scenarios
Operating System Concepts – 10th Edition 6.3 Silberschatz, Galvin and Gagne
Background
Processes can execute concurrently
• May be interrupted at any time, partially
completing execution
Concurrent access to shared data may result in data
inconsistency
Maintaining data consistency requires mechanisms
to ensure the orderly execution of cooperating
processes
We illustrated in chapter 4 the problem when we
considered the Bounded Buffer problem with use of a
counter that is updated concurrently by the producer
and consumer,. Which lead to race condition.
Operating System Concepts – 10th Edition 6.4 Silberschatz, Galvin and Gagne
Race Condition
Processes P0 and P1 are creating child processes using
the fork() system call
Race condition on kernel variable next_available_pid
which represents the next available process identifier
(pid)
Unless there is a mechanism to prevent P0 and P1 from
accessing the variable next_available_pid the same
pid could be assigned to two different processes!
Operating System Concepts – 10th Edition 6.5 Silberschatz, Galvin and Gagne
Critical Section Problem
Consider system of n processes {p0, p1, … pn-1}
Each process has critical section segment of code
• Process may be changing common variables,
updating table, writing file, etc.
• When one process in critical section, no other may
be in its critical section
Critical section problem is to design protocol to solve
this
Each process must ask permission to enter critical
section in entry section, may follow critical section
with exit section, then remainder section
Operating System Concepts – 10th Edition 6.6 Silberschatz, Galvin and Gagne
Critical Section
General structure of process Pi
Operating System Concepts – 10th Edition 6.7 Silberschatz, Galvin and Gagne
Critical-Section Problem (Cont.)
Requirements for solution to critical-section problem
1. Mutual Exclusion - If process Pi is executing in its critical
section, then no other processes can be executing in
their critical sections
2. Progress - If no process is executing in its critical section
and there exist some processes that wish to enter their
critical section, then the selection of the process that
will enter the critical section next cannot be postponed
indefinitely
3. Bounded Waiting - A bound must exist on the number of
times that other processes are allowed to enter their
critical sections after a process has made a request to
enter its critical section and before that request is
granted
• Assume that each process executes at a nonzero
speed
• No assumption concerning relative speed of the n
processes 6.8 Silberschatz, Galvin and Gagne
Operating System Concepts – 10th Edition
Interrupt-based Solution
Entry section: disable interrupts
Exit section: enable interrupts
Will this solve the problem?
• What if the critical section is code that runs for an hour?
• Can some processes starve – never enter their critical section.
• What if there are two
CPUs?
Operating System Concepts – 10th Edition 6.9 Silberschatz, Galvin and Gagne
Software Solution 1
Two process solution
Assume that the load and store machine-
language instructions are atomic; that is, cannot
be interrupted
The two processes share one variable:
• int turn;
The variable turn indicates whose turn it is to
enter the critical section
Operating System Concepts – 10th Edition 6.10 Silberschatz, Galvin and Gagne
Algorithm for Process Pi
while (true){
turn = i;
while (turn = = j)
;
/* critical section */
turn = j;
/* remainder section */
Operating System Concepts – 10th Edition 6.11 Silberschatz, Galvin and Gagne
Correctness of the Software Solution
Mutual exclusion is preserved
Pi enters critical section only if:
turn = i
and turn cannot be both 0 and 1 at the same time
What about the Progress requirement?
What about the Bounded-waiting requirement?
Operating System Concepts – 10th Edition 6.12 Silberschatz, Galvin and Gagne
Peterson’s Solution
Two process solution
Assume that the load and store machine-
language instructions are atomic; that is, cannot
be interrupted
The two processes share two variables:
• int turn;
• boolean flag[2]
The variable turn indicates whose turn it is to
enter the critical section
The flag array is used to indicate if a process is
ready to enter the critical section.
• flag[i] = true implies that process Pi is
ready!
Operating System Concepts – 10th Edition 6.13 Silberschatz, Galvin and Gagne
Algorithm for Process Pi
while (true){
flag[i] = true;
turn = j;
while (flag[j] && turn = = j)
;
/* critical section */
flag[i] = false;
/* remainder section */
Operating System Concepts – 10th Edition 6.14 Silberschatz, Galvin and Gagne
Correctness of Peterson’s Solution
Provable that the three CS requirement are met:
1. Mutual exclusion is preserved
Pi enters CS only if:
either flag[j] = false or turn = i
2. Progress requirement is satisfied
3. Bounded-waiting requirement is met
Operating System Concepts – 10th Edition 6.15 Silberschatz, Galvin and Gagne
Peterson’s Solution and Modern Architecture
Although useful for demonstrating an algorithm,
Peterson’s Solution is not guaranteed to work on
modern architectures.
• To improve performance, processors and/or
compilers may reorder operations that have no
dependencies
Understanding why it will not work is useful for
better understanding race conditions.
For single-threaded this is ok as the result will
always be the same.
For multithreaded the reordering may produce
inconsistent or unexpected results!
Operating System Concepts – 10th Edition 6.16 Silberschatz, Galvin and Gagne
Modern Architecture Example
Two threads share the data:
boolean flag = false;
int x = 0;
Thread 1 performs
while (!flag)
;
print x
Thread 2 performs
x = 100;
flag = true
What is the expected output?
100
Operating System Concepts – 10th Edition 6.17 Silberschatz, Galvin and Gagne
Modern Architecture Example (Cont.)
However, since the variables flag and x are
independent of each other, the instructions:
flag = true;
x = 100;
for Thread 2 may be reordered
If this occurs, the output may be 0!
Operating System Concepts – 10th Edition 6.18 Silberschatz, Galvin and Gagne
Peterson’s Solution Revisited
The effects of instruction reordering in Peterson’s
Solution
This allows both processes to be in their critical section
at the same time!
To ensure that Peterson’s solution will work correctly on
modern computer architecture we must use Memory
Barrier.
Operating System Concepts – 10th Edition 6.19 Silberschatz, Galvin and Gagne
Memory Barrier
Memory model are the memory guarantees a
computer architecture makes to application
programs.
Memory models may be either:
• Strongly ordered – where a memory modification
of one processor is immediately visible to all
other processors.
• Weakly ordered – where a memory modification
of one processor may not be immediately visible
to all other processors.
A memory barrier is an instruction that forces any
change in memory to be propagated (made visible)
to all other processors.
Operating System Concepts – 10th Edition 6.20 Silberschatz, Galvin and Gagne
Memory Barrier Instructions
When a memory barrier instruction is performed, the
system ensures that all loads and stores are
completed before any subsequent load or store
operations are performed.
Therefore, even if instructions were reordered, the
memory barrier ensures that the store operations
are completed in memory and visible to other
processors before future load or store operations
are performed.
Operating System Concepts – 10th Edition 6.21 Silberschatz, Galvin and Gagne
Memory Barrier Example
Returning to the example of slides 6.17 - 6.18
We could add a memory barrier to the following
instructions to ensure Thread 1 outputs 100:
Thread 1 now performs
while (!flag)
memory_barrier();
print x
Thread 2 now performs
x = 100;
memory_barrier();
flag = true
For Thread 1 we are guaranteed that that the
value of flag is loaded before the value of x.
For Thread 2 we ensure that the assignment to x
occurs before the assignment flag.
Operating System Concepts – 10th Edition 6.22 Silberschatz, Galvin and Gagne
Synchronization Hardware
Many systems provide hardware support for
implementing the critical section code.
Uniprocessors – could disable interrupts
• Currently running code would execute without
preemption
• Generally too inefficient on multiprocessor systems
Operating systems using this not broadly
scalable
We will look at three forms of hardware support:
1. Hardware instructions
2. Atomic variables
Operating System Concepts – 10th Edition 6.23 Silberschatz, Galvin and Gagne
Hardware Instructions
Special hardware instructions that allow us
to either test-and-modify the content of a
word, or two swap the contents of two
words atomically (uninterruptedly.)
• Test-and-Set instruction
• Compare-and-Swap instruction
Operating System Concepts – 10th Edition 6.24 Silberschatz, Galvin and Gagne
The test_and_set Instruction
Definition
boolean test_and_set (boolean
*target)
{
boolean rv = *target;
*target = true;
return rv:
}
Properties
• Executed atomically
• Returns the original value of passed
parameter
• Set the new value of passed parameter to
true
Operating System Concepts – 10th Edition 6.25 Silberschatz, Galvin and Gagne
Solution Using test_and_set()
Shared boolean variable lock, initialized to false
Solution:
do {
while (test_and_set(&lock))
; /* do nothing */
/* critical section */
lock = false;
/* remainder section */
} while (true);
Does it solve the critical-section problem?
Operating System Concepts – 10th Edition 6.26 Silberschatz, Galvin and Gagne
The compare_and_swap Instruction
Definition
int compare_and_swap(int *value, int expected, int new_value)
{
int temp = *value;
if (*value == expected)
*value = new_value;
return temp;
}
Properties
• Executed atomically
• Returns the original value of passed parameter value
• Set the variable value the value of the passed
parameter new_value but only if *value == expected is
true. That is, the swap takes place only under this
condition.
Operating System Concepts – 10th Edition 6.27 Silberschatz, Galvin and Gagne
Solution using compare_and_swap
Shared integer lock initialized to 0;
Solution:
while (true){
while (compare_and_swap(&lock, 0, 1) != 0)
; /* do nothing */
/* critical section */
lock = 0;
/* remainder section */
}
Does it solve the critical-section problem?
Operating System Concepts – 10th Edition 6.28 Silberschatz, Galvin and Gagne
Bounded-waiting with compare-and-swap
while (true) {
waiting[i] = true;
key = 1;
while (waiting[i] && key == 1)
key = compare_and_swap(&lock,0,1);
waiting[i] = false;
/* critical section */
j = (i + 1) % n;
while ((j != i) && !waiting[j])
j = (j + 1) % n;
if (j == i)
lock = 0;
else
waiting[j] = false;
/* remainder section */
}
Operating System Concepts – 10th Edition 6.29 Silberschatz, Galvin and Gagne
Atomic Variables
Typically, instructions such as compare-and-
swap are used as building blocks for other
synchronization tools.
One tool is an atomic variable that provides
atomic (uninterruptible) updates on basic data
types such as integers and booleans.
For example:
• Let sequence be an atomic variable
• Let increment() be operation on the atomic
variable sequence
• The Command:
increment(&sequence);
ensures sequence is incremented without
interruption:
Operating System Concepts – 10th Edition 6.30 Silberschatz, Galvin and Gagne
Atomic Variables
The increment() function can be implemented as follows:
void increment(atomic_int *v)
{
int temp;
do {
temp = *v;
}
while (temp != (compare_and_swap(v,temp,temp+1));
}
Operating System Concepts – 10th Edition 6.31 Silberschatz, Galvin and Gagne
Mutex Locks
Previous solutions are complicated and generally
inaccessible to application programmers
OS designers build software tools to solve critical
section problem
Simplest is mutex lock
• Boolean variable indicating if lock is available or not
Protect a critical section by
• First acquire() a lock
• Then release() the lock
Calls to acquire() and release() must be atomic
• Usually implemented via hardware atomic
instructions such as compare-and-swap.
But this solution requires busy waiting
• This lock therefore called a spinlock
Operating System Concepts – 10th Edition 6.32 Silberschatz, Galvin and Gagne
Solution to CS Problem Using Mutex Locks
while (true) {
acquire lock
critical section
release lock
remainder section
}
Operating System Concepts – 10th Edition 6.33 Silberschatz, Galvin and Gagne
Semaphore
Synchronization tool that provides more
sophisticated ways (than Mutex locks) for
processes to synchronize their activities.
Semaphore S – integer variable
Can only be accessed via two indivisible (atomic)
operations
• wait() and signal()
Originally called P() and V()
Definition of the wait() operation
wait(S) {
while (S <= 0)
; // busy wait
S--;
}
Definition of the signal() operation
signal(S) {
S++;
}
Operating System Concepts – 10th Edition 6.34 Silberschatz, Galvin and Gagne
Semaphore (Cont.)
Counting semaphore – integer value can range
over an unrestricted domain
Binary semaphore – integer value can range
only between 0 and 1
• Same as a mutex lock
Can implement a counting semaphore S as a
binary semaphore
With semaphores we can solve various
synchronization problems
Operating System Concepts – 10th Edition 6.35 Silberschatz, Galvin and Gagne
Semaphore Usage Example
Solution to the CS Problem
• Create a semaphore “mutex” initialized to 1
wait(mutex);
CS
signal(mutex);
Consider P1 and P2 that with two statements S1
and S2 and the requirement that S1 to happen
before S2
• Create a semaphore “synch” initialized to 0
P1:
S1;
signal(synch);
P2:
wait(synch);
S2;
Operating System Concepts – 10th Edition 6.36 Silberschatz, Galvin and Gagne
Semaphore Implementation
Must guarantee that no two processes can execute
the wait() and signal() on the same semaphore at
the same time
Thus, the implementation becomes the critical
section problem where the wait and signal code are
placed in the critical section
Could now have busy waiting in critical section
implementation
• But implementation code is short
• Little busy waiting if critical section rarely
occupied
Note that applications may spend lots of time in
critical sections and therefore this is not a good
solution
Operating System Concepts – 10th Edition 6.37 Silberschatz, Galvin and Gagne
Semaphore Implementation with no Busy waiting
With each semaphore there is an associated
waiting queue
Each entry in a waiting queue has two data items:
• Value (of type integer)
• Pointer to next record in the list
Two operations:
• block – place the process invoking the operation
on the appropriate waiting queue
• wakeup – remove one of processes in the waiting
queue and place it in the ready queue
Operating System Concepts – 10th Edition 6.38 Silberschatz, Galvin and Gagne
Implementation with no Busy waiting (Cont.)
Waiting queue
typedef struct {
int value;
struct process *list;
} semaphore;
Operating System Concepts – 10th Edition 6.39 Silberschatz, Galvin and Gagne
Implementation with no Busy waiting (Cont.)
wait(semaphore *S) {
S->value--;
if (S->value < 0) {
add this process to S->list;
block();
}
}
signal(semaphore *S) {
S->value++;
if (S->value <= 0) {
remove a process P from S->list;
wakeup(P);
}
}
Operating System Concepts – 10th Edition 6.40 Silberschatz, Galvin and Gagne
Problems with Semaphores
Incorrect use of semaphore operations:
• signal(mutex) …. wait(mutex)
• wait(mutex) … wait(mutex)
• Omitting of wait (mutex) and/or signal
(mutex)
These – and others – are examples of what can
occur when semaphores and other synchronization
tools are used incorrectly.
Operating System Concepts – 10th Edition 6.41 Silberschatz, Galvin and Gagne
Monitors
A high-level abstraction that provides a convenient and
effective mechanism for process synchronization
Abstract data type, internal variables only accessible by
code within the procedure
Only one process may be active within the monitor at a
time
Pseudocode syntax of a monitor:
monitor monitor-name
{
// shared variable declarations
procedure P1 (…) { …. }
procedure P2 (…) { …. }
procedure Pn (…) {……}
initialization code (…) { … }
}
Operating System Concepts – 10th Edition 6.42 Silberschatz, Galvin and Gagne
Schematic view of a Monitor
Operating System Concepts – 10th Edition 6.43 Silberschatz, Galvin and Gagne
Monitor Implementation Using Semaphores
Variables
semaphore mutex
mutex = 1
Each procedure P is replaced by
wait(mutex);
…
body of P;
…
signal(mutex);
Mutual exclusion within a monitor is ensured
Operating System Concepts – 10th Edition 6.44 Silberschatz, Galvin and Gagne
Condition Variables
condition x, y;
Two operations are allowed on a condition variable:
• x.wait() – a process that invokes the operation is
suspended until x.signal()
• x.signal() – resumes one of processes (if any) that
invoked x.wait()
If no x.wait() on the variable, then it has no
effect on the variable
Operating System Concepts – 10th Edition 6.45 Silberschatz, Galvin and Gagne
Monitor with Condition Variables
Operating System Concepts – 10th Edition 6.46 Silberschatz, Galvin and Gagne
Usage of Condition Variable Example
Consider P1 and P2 that that need to execute two
statements S1 and S2 and the requirement that S1 to
happen before S2
• Create a monitor with two procedures F1 and F2
that are invoked by P1 and P2 respectively
• One condition variable “x” initialized to 0
• One Boolean variable “done”
• F1:
S1;
done = true;
x.signal();
• F2:
if done = false
x.wait()
S2;
Operating System Concepts – 10th Edition 6.47 Silberschatz, Galvin and Gagne
Monitor Implementation Using Semaphores
Variables
semaphore mutex; // (initially = 1)
semaphore next; // (initially = 0)
int next_count = 0; // number of processes waiting
inside the monitor
Each function P will be replaced by
wait(mutex);
…
body of P;
…
if (next_count > 0)
signal(next)
else
signal(mutex);
Mutual exclusion within a monitor is ensured
Operating System Concepts – 10th Edition 6.48 Silberschatz, Galvin and Gagne
Implementation – Condition Variables
For each condition variable x, we have:
semaphore x_sem; // (initially = 0)
int x_count = 0;
The operation x.wait() can be implemented as:
x_count++;
if (next_count > 0)
signal(next);
else
signal(mutex);
wait(x_sem);
x_count--;
Operating System Concepts – 10th Edition 6.49 Silberschatz, Galvin and Gagne
Implementation (Cont.)
The operation x.signal() can be implemented as:
if (x_count > 0) {
next_count++;
signal(x_sem);
wait(next);
next_count--;
}
Operating System Concepts – 10th Edition 6.50 Silberschatz, Galvin and Gagne
Resuming Processes within a Monitor
If several processes queued on condition
variable x, and x.signal() is executed,
which process should be resumed?
FCFS frequently not adequate
Use the conditional-wait construct of the form
x.wait(c)
where:
• c is an integer (called the priority
number)
• The process with lowest number (highest
priority) is scheduled next
Operating System Concepts – 10th Edition 6.51 Silberschatz, Galvin and Gagne
Single Resource allocation
Allocate a single resource among competing processes
using priority numbers that specifies the maximum time
a process plans to use the resource
R.acquire(t);
...
access the resurce;
...
R.release;
Where R is an instance of type ResourceAllocator
Operating System Concepts – 10th Edition 6.52 Silberschatz, Galvin and Gagne
Single Resource allocation
Allocate a single resource among competing processes
using priority numbers that specifies the maximum time
a process plans to use the resource
The process with the shortest time is allocated the
resource first
Let R is an instance of type ResourceAllocator (next
slide)
Access to ResourceAllocator is done via:
R.acquire(t);
...
access the resurce;
...
R.release;
Where t is the maximum time a process plans to use the
resource
Operating System Concepts – 10th Edition 6.53 Silberschatz, Galvin and Gagne
A Monitor to Allocate Single Resource
monitor ResourceAllocator
{
boolean busy;
condition x;
void acquire(int time) {
if (busy)
x.wait(time);
busy = true;
}
void release() {
busy = false;
x.signal();
}
initialization code() {
busy = false;
}
}
Operating System Concepts – 10th Edition 6.54 Silberschatz, Galvin and Gagne
Single Resource Monitor (Cont.)
Usage:
acquire
...
release
Incorrect use of monitor operations
• release() … acquire()
• acquire() … acquire())
• Omitting of acquire() and/or release()
Operating System Concepts – 10th Edition 6.55 Silberschatz, Galvin and Gagne
Liveness
Processes may have to wait indefinitely while trying to
acquire a synchronization tool such as a mutex lock or
semaphore.
Waiting indefinitely violates the progress and bounded-
waiting criteria discussed at the beginning of this
chapter.
Liveness refers to a set of properties that a system must
satisfy to ensure processes make progress.
Indefinite waiting is an example of a liveness failure.
Operating System Concepts – 10th Edition 6.56 Silberschatz, Galvin and Gagne
Liveness
Deadlock – two or more processes are waiting
indefinitely for an event that can be caused by only one
of the waiting processes
Let S and Q be two semaphores initialized to 1
P0 P1
wait(S); wait(Q);
wait(Q); wait(S);
... ...
signal(S); signal(Q);
signal(Q); signal(S);
Consider if P0 executes wait(S) and P1 wait(Q). When P0
executes wait(Q), it must wait until P1 executes
signal(Q)
However, P1 is waiting until P0 execute signal(S).
Since these signal() operations will never be executed,
P0 and P1 are deadlocked.
Operating System Concepts – 10th Edition 6.57 Silberschatz, Galvin and Gagne
Liveness
Other forms of deadlock:
Starvation – indefinite blocking
• A process may never be removed from the
semaphore queue in which it is suspended
Priority Inversion – Scheduling problem when lower-
priority process holds a lock needed by higher-priority
process
• Solved via priority-inheritance protocol
Operating System Concepts – 10th Edition 6.58 Silberschatz, Galvin and Gagne
End of Chapter 6
Operating System Concepts – 10th Edition Silberschatz, Galvin and Gagne
Priority Inheritance Protocol
Consider the scenario with three processes P1, P2, and
P3.
• P1 has the highest priority, P2 the next highest, and
P3 the lowest.
Assume that P3 is holding semaphore S and that P1 is
waiting to S to be released
Assume that P2 is assigned the CPU and preempts P3
• P3 is still holding semaphore S
• P1 is waiting to S to be released
What has happened is that P2 - a process with a lower
priority than P1 - has indirectly prevented P3 from
gaining access to the resource.
To prevent this from occurring, a priority inheritance
protocol is used. This simply allows the priority of the
highest thread waiting to access a shared resource to
be assigned to the thread currently using the resource.
Thus, the current owner of the resource is assigned the
priority of the highest priority thread wishing to acquire
the resource.
Operating System Concepts – 10th Edition 6.60 Silberschatz, Galvin and Gagne
Usage of Condition Variable Example
Consider P1 and P2 that that need to execute two
statements S1 and S2 and the requirement that S1 to
happen before S2
• Create a monitor with two procedures F1 and F2
that are invoked by P1 and P2 respectively
• One condition variable “x” initialized to 0
• One Boolean variable “done”
• F1:
S1;
done = true;
x.signal();
• F2:
if done = false
x.wait()
S2;
Operating System Concepts – 10th Edition 6.61 Silberschatz, Galvin and Gagne