Module 2 os (2)
Module 2 os (2)
The Process
A process is more than the program code which is also called text section.
It contains program counter which represents the current activity and also the
contents of the processor's registers.
A process also consists of a process stack section which contains temporary data &
data section which contains global variables.
A process may also include a heap, which is memory that is dynamically allocated
during process run time.
The structure of a process in memory is shown in below figure.
Process State
As process executes it changes its state and each process may be in one of
the following states:
o New: The process is being created
o Running: Instructions are being executed
o Waiting: The process is waiting for some event to occur
o Ready: The process is waiting to be assigned to a process
o Terminated: The process has finished execution
Only one process can be running on any processor at any instant. Many processes
may be ready and waiting.
The state diagram corresponding to these states is shown below figure.
The PCB contains important information about the specific process including,
o Process state: The current state of the process i.e., whether it is ready, running, waiting, halted
and so on.
o Program counter: Indicates the address of the next instruction to be executed for a process.
o CPU registers: The registers vary in number and type. Along with program counter this state
information should be saved to allow process to be continued correctly after an interrupt occurs as
shown in below figure.
o CPU scheduling information: This information includes a process priority, pointers to
scheduling queues, and any other scheduling parameters.
o Memory-management information: This information may include the value of base and limit
registers, the page tables, or the segment tables, depending on the memory system used by the OS.
o Accounting information: This information includes the amount of CPU and real
time used, time limits, account numbers, job or process numbers, and so on.
o I/O status information: This information includes the list of I/O devices
allocated to the process, a list of open files, and so on.
Threads
Scheduling queues
o Ready queue: The processes that are placed in main memory and are ready and
waiting to execute are placed in a list called the ready queue. This is in the form
of linked list, the header contains pointer to the first and final PCB in the list.
Each PCB contains a pointer field that points to next PCB in ready queue.
o Device queue: The list of processes waiting for a particular I/O device is called
device queue. When the CPU is allocated to a process it may execute for
sometime and may quit or interrupted or wait for the occurrence of a particular
event like completion of an I/O request, but the I/O may be busy with some other
processes. In this case the process must wait for I/O and it will be placed in device
queue. Each device will have its own queue.
The below figure Shows Ready queue and various I/O Device queues
A new process is initially put in the ready queue and it waits there until it is selected
for execution or dispatched. Once the process is assigned CPU and is executing, the
following events can occur,
o It can execute an I/O request and is placed in I/O queue.
o The process can create a sub process & wait for its termination.
o The process may be removed from the CPU as a result of interrupt and can be
put back into ready queue.
Schedulers
Context Switch
When an interrupt occurs, the system needs to save the current context of the
process running on the CPU. The context is represented in the PCB of the process.
When CPU switches to another process, the system must save the state of the old
process and load the saved state for the new process. A state save of the current state
of the CPU, and then state restore to resume operations is performed.
Context-switch time is overhead and the system does no useful work while
switching. Context-switch times are highly dependent on hardware support. Context-
switch speed varies from machine to machine, depending on the memory speed, the
number of registers that must be copied, and the existence of special instructions.
Process Creation
A process may create several new processes by some create-process system call,
during the course of execution.
The creating process is called parent process and the created one is called the child
process. Each of the new process may in turn create other processes, forming a tree
of processes. Processes are identified by unique process identifier ( or pid).
Below Figure. Shows the process tree for the solaris OS. The process at the top of
the tree is sched process, with pid of 0,and this creates several children processes.
The sched process creates several children processes including pageout and fsflush.
These processes are responsible for managing memory and file systems. The sched
process also creates the init process, which serves as the root parent process for all
user processes. These processes are responsible for managing memory and file
systems.
inetd and dtlogin are two children of init where inetd is responsible for networking
services such as telnet and ftp; dtlogin is the process representing a user login
screen.
When a user logs in, dtlogin creates an X-windows session (X session), which in
turns creates the sdt_shel process. Below sdt_shel, a user's command-line shell, the
C-shell or csh is created. In this command line interface, the user can then invoke
various child processes, such as the ls and cat commands.
There is also csh process with pid of 7778 representing a user who has logged onto
the system using telnet. This user has started the Netscape browser (pid of 7785) and
the emacs editor (pid of 8105).
A process needs certain resources to accomplish its task. Along with the various
logical and physical resources that a process obtains when it is created, initialization
data may be passed along by the parent process to the child process.
When a process creates a new process, two possibilities exist in terms of execution.
1. The parent continues to execute concurrently with its children.
2. The parent waits until some or all of the children have terminated.
There are also two possibilities in terms of the address space of the new process.
1. The child process is a duplicate of the parent process.
2. The child process has a new program loaded into it.
In UNIX OS, fork () system call creates new process. In windows Create Process ()
does the job.
Exec () system call Is called after a fork () to replace the process memory space with
a new program.
int main()
{
Pid_t pid;
pid=fork();/*fork another process*/
if (pid<0)/*error occurred */
{
fprintf(stderr,"Fork Failed");
exit(-1);
}
Else if (pid ==0) /*child process*/
{
execlp("/bin/ls","ls",NULL);
}
else /*parentprocess*/
{
wait(NULL);/*parent will wait for the child to complete*/
printf ("Child Complete");
exit(0);
}
}
If there are two different processes running a copy of the same program, the pid for
child is zero and for the parent it is greater than zero. The parent process waits for
the child process to complete with the wait () system call.
When the child process completes, the parent process resumes from the call to wait
(), where it completes using exit () system call. This is shown in below figure.
Process Termination
A process terminates when it finishes executing its last statement and asks the
operating system to delete it by using exit() system call.
Process resources are de allocated by the operating system. A process can terminate
another process via Terminate Process() system call. A Parent may terminate
execution of children processes (abort) for the following reasons.
o Child has exceeded usage of allocated resources.
o Task assigned to child is no longer required.
o If parent is exiting some operating system do not allow child to continue if its
parent terminates.
Some systems does not allow child to exist if its parent has terminated. If process
terminates then all its children must also be terminated, this phenomenon is referred
as cascading termination.
3.4 Inter-process Communication (IPC)
Processes executing concurrently in the operating system may be either independent
Processes or cooperating processes.
A process is independent if it cannot affect or be affected by the other processes
executing in the system. Any process that does not share data with any other process is
independent.
A process is cooperating if it can affect or be affected by the other processes executing
in the system. Any process that shares data with other processes is a cooperating
process.
Advantages of process cooperation are,
o Information sharing: several users may be interested in same piece of
information, so an environment must be provided to allow concurrent access to
such information.
o Computation speed-up: If we want particular task to run faster, it can be broken
into subtasks, each of which will be executing in parallel with each other.
o Modularity: If the system is to be constructed in modular fashion, then system
functions can be divided into separate processes or threads.
o Convenience: An individual user may work on many tasks at the same time.
Sujatha GS, Asst prof Dept of Computer Science Page 9
Operating systems
#define BUFFER_SIZE 10
typedef struct {
……..
……..
}item;
Itembuffer[BUFFER_SIZE];
int in = 0;
int out =0;
The shared buffer is implemented as a circular array with two logical pointers in and
out. The variable in points to the next free position in the buffer; out points to the
first full position in the buffer. The buffer is empty when in==out, the buffer is full
when ((in+ 1)% BUFFER_SIZE) = = out.
The code for the producer process is shown below.
Item next Produced;
while(true)
{
/*produce an item in next Produced*/
While (((in+1)%BUFFER_SIZE)==out);//do nothing
Buffer [in] = next Produced;
in =(in +1)%BUFFER_SIZE;
}
Naming
Processes that want to communicate must have a way to refer to each other.
They can use either direct or indirect communication.
Under Direct Communication processes must name each other explicitly
The send() and receive() primitives are defined as,
o Send( P, message)–send a message to process P.
o Receive( Q, message)–receive a message from process Q.
Properties of communication link in this scheme are,
o Links are established automatically between every pair of processes that
want to communicate.
o A link is associated with exactly two communicating processes.
o Between each pair there exists exactly one link.
This scheme exhibits two types of addressing,
o Symmetry: Both sender and receiver must name the other to communicate.
o Asymmetry: Only the sender names the recipient, the recipient is not
required to name the sender. The send() and receive() primitives are defined
as,
Send (P, message)–send a message to process P
Receive (id, message) – receive a message from any process; the
variable id is set to the name of the process with which
communication has taken place.
In Indirect Communication messages are sent and received from mail boxes
(also referred to as ports)
A mailbox can be viewed abstractly as an object into which messages can be
placed by processes and from which messages can be removed.
Each mailbox has a unique id and processes can communicate only if they share
a mailbox.
Synchronization
Buffering
Messages exchanged by communicating processes reside in a temporary queue,
and such queues can be implemented in one of three ways,
1. Zero capacity– Maximum length is zero and sender must wait for
the receiver.
2. Bounded capacity – finite length of n messages, sender must wait if
link is full.
3. Unbounded capacity–infinite length and sender never waits.
Threads
Motivation
Many software packages that run on modern desktop PCs are multithreaded.
An application is implemented as a separate process with several threads of
control. Eg: A Web browser might have one thread to display images or text while
another thread retrieves data from the network.
As process creation takes more time than thread creation it is more efficient to
use process that contains multiple threads. So, that the amount of time that a client
have to wait for its request to be serviced from the web server will be less.
Threads also play an important role in remote procedure call.
Benefits
Support for threads may be provided either at user level for user threads or by the
kernel for kernel threads.
There must be a relationship between user threads and kernel threads. Three
Common ways of establishing this relationship are,
Many-to-One
Many user-level threads are mapped to single kernel thread as shown in below figure
This model is efficient as the thread management is done by the thread library in
user space, but the entire process will block if a thread makes a blocking system
call.
As only one thread can access the kernel thread at a time, multiple threads are
unable to run in parallel on multiprocessors.
Examples: Solaris Green Threads, GNU Portable Threads
One-to-One
Many-to-Many Model
One-to-One model restricts creating more user threads and Many-to-One model
allows creating more user threads but kernel can schedule only one thread at a
time. These drawbacks can be overcome by Many-to-Many model as shown in
below figure. (Left side)
Many-to-Many model allows many user level threads to be mapped to many kernel
threads.
It allows the operating system to create a sufficient number of kernel threads.
When thread performs a blocking system call, the kernel can schedule another
thread for execution.
It allows user-level thread to be bound to a kernel thread and this is referred as
two-level model as shown in below figure. (Right side)
Examples: IRIX, HP-UX, Solaris OS.
Thread Libraries
A thread library provides the programmer an API for creating and managing threads.
There are two primary ways of implementing a thread library.
o The first approach is to provide a library entirely in user space with no kernel
support. All code and data structures for the library exist in user space.
o The second approach is to implement a kernel-level library supported directly
by the operating system.
Three primary thread libraries are
o POSIX P threads : extension of posix standard, they may be provided as either
a user bor kernel library.
o Win 32 threads: is a kernel level library available on windows systems.
o Java threads: API allows creation and management directly in Java programs.
However, on windows java threads are implemented using win32 and on
UNIX and Linux using P threads
P threads
P threads, the threads extension of the POSIX standard, may be provided as either
a user or kernel-level library.
P threads refers to the POSIX standard (IEEE1003.1c) defining an API for
thread creation and synchronization.
This is a specification for thread behavior, not an implementation.
Operating system designers may implement the specification in any way they wish.
Numerous systems implement the P threads specification, including Solaris, Linux,
Mac OS X, and Tru64 UNIX.
Shareware implementations are available in the public domain for the various
Windows operating systems as well.
#include <pthread.h>
#include<stdio.h>
int sum; /*this data is shared by the thread(s)*/
void*runner(void *param); /* the thread */
int main (int argc,char*argv[])
{
pthread_t tid; /*the thread identifier*/
pthread_attr_tattr; /*set of thread attributes*/
if(argc !=2)
{
fprintf(stderr,"usage: a.out <integer value>\n");
return-1;
}
If (atoi(argv[1]) <0)
{
fprintf(stderr,"%d must be>=0\n",atoi(argv[1]));
return-1;
}
pthread_attr_init(&attr); /* get the default attributes */
pthread_create(&tid,&attr,runner,argv[1]); /* create the thread */
pthread_join(tid,NULL); /*wait for the thread to exit*/
printf("sum= %d\n",sum);
}
Win32 Threads
if(Thread Handle!=NULL)
{
Wait For Single Object (Thread Handle,INFINITE);
//now wait for the thread to finish Close Handle (Thread Handle); // close the
thread handle printf ("surn=%d\n", Sum);
}
}
Java Threads
The Java thread API allows thread creation and management directly in Java programs.
Threads are the fundamental model of program execution in a Java program, and the
Java language and its API provide a rich set of features for the creation and
management of threads.
All Java programs comprise at least a single thread of control and even a simple Java
program consisting of only a main ()method runs as a single thread in the JVM.
There are two techniques for creating threads in a Java program. One approach is to
create a new class that is derived from the Thread class and to override its run()
method. An alternative and more commonly used technique is to define a class that
implements the Runnable interface. The Runnable interface is defined as follows:
When a class implements Runnable, it must define a run() method. The code
implementing the run() method runs as a separate thread.
Creating a Thread object does not specifically create the new thread but it is the
start()method that actually creates the new thread. Calling the start() method for the
new object does two things:
o It allocates memory and initializes a new thread in the JVM.
o It calls the run () method, making the thread eligible to be run by the JVM.
As Java is a pure object-oriented language, it has no notion of global data. If two or
more threads have to share data means then the sharing occurs by passing reference to
the shared object to the appropriate threads.
This shared object is referenced through the appropriate getSum() and setSum()
methods.
As the Integer class is immutable, that is, once its value is set it cannot change, a new
sum class is designed.
The parent threads in Java uses join() method to wait for the child threads to finish
before proceeding.
Class Sum
{
Private int sum;
Public int getSum()
{
Return sum;
}
Threading Issues
The semantics of the fork() and exec() system calls change in a multithreaded
program. Some UNIX systems have chosen to have two versions of fork(),
one that duplicates all threads and another that duplicates only the thread that
invoked the fork() system call.
If a thread invokes the exec() system call, the program specified in the
parameter to exec() will replace the entire process including all threads.
Which of the two versions of fork() to use depends on the application. If
exec()is called immediately after forking, then duplicating all threads is
unnecessary, as the program specified in the parameters to exec() will replace
the process. In this instance, duplicating only the calling thread is appropriate.
Thread Cancellation
Thread cancellation is the task of terminating a thread before it has completed. For
example, if multiple threads are concurrently searching through a database and one
thread returns the result, the remaining threads might be cancelled.
A thread that is to be cancelled is often referred to as the target thread.
Cancellation of a target thread may occur in two different scenarios:
o Asynchronous cancellation: One thread immediately terminates the target
thread.
o Deferred cancellation: The target thread periodically checks whether it
should terminate, allowing it an opportunity to terminate itself in an orderly
fashion.
Signal Handling
A signal is used in UNIX systems to notify a process that a particular event has
occurred.
Thread Pools
The idea behind a thread pool is to create a number of threads at process startup and
place them into a pool, where they sit and wait for work.
When a server receives a request, it awakens a thread from this pool and passes the
request to it to service.
Once the thread completes its service, it returns to the pool and waits for more work.
If the pool contains no available thread, the server waits until one becomes free.
The benefits of Thread pools are,
o Servicing a request with an existing thread is usually faster than waiting
to create a thread.
o A thread pool limits the number of threads that exist at any one point. This
is particularly important on systems that cannot support a large number of
concurrent threads.
The number of threads in the pool can be set based on factors such as the number of
CPUs in the system, the amount of physical memory, and the expected number of
concurrent client requests.
Thread-Specific Data
Threads belonging to a process share the data of the process. This sharing of data
provides one of the benefits of multi threaded programming. But, in some
circumstances, each thread might need its own copy of certain data. Such data is called
as thread-specific data. For example, in a transaction-processing system, we might
service each transaction in a separate thread. Furthermore, each transaction may be
assigned a unique identifier.
Most thread libraries including Win32 and P threads provide support for thread-
specific data.
Scheduler Activations
is about to block and identifying the specific thread. The kernel then allocates a new
virtual processor to the application.
The application runs an upcall handler on this new virtual processor, which saves the
state of the blocking thread and gives up the virtual processor on which the blocking
thread is running. The upcall handler then schedules another thread that is eligible to
run on the new virtual processor.
When the event that the blocking thread was waiting for occurs, the kernel makes
another upcall to the thread library informing it that the previously blocked thread is
now eligible to run.
The upcall handler for this event also requires a virtual processor, and the kernel may
allocate a new virtual processor or preempt one of the user threads and run the upcall
handler on its virtual processor.
After marking the unblocked thread as eligible to run, the application schedules an
eligible thread to run on an available virtual processor.
Differences
Process Thread
1. It is called heavy weight process. It is called light weight process.
2. Process switching needs interface with Thread switching does not need
OS. interface with OS.
Multiple processes use more Multiple threaded processes use fewer
3.
resources than multiple threads. resources than multiple processes.
In multiple process implementations
each process executes same code but All threads can share same set of open
4.
has its own memory and file files.
resources.
If one server process is blocked no While one server thread is blocked and
5. Other server process can execute waiting, second thread in the same task
until the first process unblocked. could run.
One thread can read, write or even
In multiple processes each
6. process operates independently of Completely wipeout another
threads stack.
others.
PROCESS SHEDULING
Basic Concepts
In a single-processor system, only one process can run at a time and others must wait
until the CPU is free and can be rescheduled.
The objective of multi programming is to have some process running at all times, to
maximize CPU utilization
.
With multiprogramming, several processes are kept in memory at one time. When one
process has to wait, the operating system takes the CPU away from that process and
gives the CPU to another process.
The CPU is one of the primary computer resources. Thus, its scheduling is central to
Operating-system design.
The duration of CPU bursts vary from process to process and from computer to
computer.
CPU Scheduler
Whenever the CPU becomes idle, the operating system must select one of the
processes in the ready queue to be executed. This selection is carried out by the
short-term scheduler (or CPU scheduler).
The scheduler selects a process from the processes in memory that are ready to
execute and allocates the CPU to that process.
A ready queue can be implemented as a FIFO queue, a priority queue, a tree, or
simply an unordered linked list. But all the processes in the ready queue are lined
up waiting for a chance to run on the CPU. The records in the queues are Process
Control Blocks (PCBs) of the processes.
Preemptive scheduling
When scheduling takes place only under circumstances 1 and 4, we say that the
scheduling scheme is non preemptive or cooperative; otherwise, it is preemptive.
Under non preemptive scheduling, once the CPU has been allocated to a process,
the process keeps the CPU until it releases the CPU either by terminating or by
Dispatcher
Scheduling Criteria
Many criteria have been suggested for comparing CPU scheduling algorithms. The
Criteria include the following :
o CPU utilization: CPU must be kept as busy as possible.CPU utilization can range
from 0 to 100 percent. In a real system, it should range from 40 percent (for a
lightly loaded system) to 90 percent (for a heavily used system).
o Throughput: If the CPU is busy executing processes, then work is being done.
One measure of work is the number of processes that are completed per time unit,
called through put. For long processes, this rate may be one process per hour; for
short transactions, it may be10 processes per second.
o Turnaround time: The interval from the time of submission of a process to the
time of completion is the turnaround time. Turnaround time is the sum of the
periods spent waiting to get into memory, waiting in the ready queue, executing on
the CPU, and doing I/O.
o Waiting time: The CPU scheduling algorithm does not affect the amount of time
during which a process executes or does I/O; it affects only the amount of time that
a process spends waiting in the ready queue. Waiting time is the sum of the periods
spent waiting in the ready queue.
o Response time: The measure of the time from the submission of a request until the
first response is produced. This measure, called response time, is the time it takes
to start responding, not the time it takes to output the response.
Scheduling Algorithms
CPU Scheduling deals with the problem of deciding which of the processes in the ready
queue is to be allocated the CPU. Following are some scheduling algorithms,
o FCFS Scheduling.
o Round Robin Scheduling.
o SJF Scheduling.
o Priority Scheduling.
o Multilevel Queue Scheduling.
o Multilevel Feedback Queue Scheduling.
If the processes arrive in the order P1, P2, P3, and are served in FCFS order, we get
the result shown in the following Gantt chart:
The waiting time is 0 milliseconds for process P1, 24 milliseconds for process P 2, and
27 milliseconds for process P3 . Thus, the average waiting time is (0 + 24 + 27)/3 =
17milliseconds.
The FCFS scheduling algorithm is non preemptive. Once the CPU has been allocated
to a process, that process keeps the CPU until it releases the CPU, either by
terminating or by requesting I/O. The FCFS algorithm is thus particularly troublesome
for time-sharing systems, where it is important that each user get a share of the CPU at
regular intervals.
Shortest-Job-First Scheduling
This algorithm associates with each process the length of the process's next CPU burst.
When the CPU is available, it is assigned to the process that has the smallest next CPU
burst.
If the next CPU bursts of two processes are the same, FCFS scheduling is used to
break the tie.
As an example of SJF scheduling, consider the following set of processes, with the
length of the CPU burst given in milliseconds:
Using SJF scheduling, we would schedule these processes according to the following
Gantt chart:
The waiting time is 3 milliseconds for process P1, 16 milliseconds for process P2 , 9
milliseconds for process P3 , and 0 milliseconds for process P4. Thus, the average
waiting time is (3 +16 +9 +0)/4=7 milliseconds.
The SJF scheduling algorithm is optimal; it gives the minimum average waiting time
for a given set of processes. Moving a short process before a long one, decreases the
waiting time of the short process more than it increases the waiting time of the long
process. Consequently, the average waiting time decreases.
The SJF algorithm can be either preemptive or non preemptive. The choice arises
when a new process arrives at the ready queue while a previous process is still
executing. The next CPU burst of the newly arrived process may be shorter than what
is left of the currently executing process. A preemptive SJF algorithm will preempt the
currently executing process, whereas a non preemptive SJF algorithm will allow the
currently running process to finish its CPU burst. Preemptive SJF scheduling is
sometimes called shortest-remaining-time-first scheduling.
As an example, consider the following four processes, with the length of the CPU
burst given in milliseconds:
If the processes arrive at the ready queue at the times shown and need the indicated
burst times, then the resulting preemptive SJF schedule is as depicted in the following
Gantt chart:
Process P1 is started at time 0, since it is the only process in the queue. Process P2
arrives at time1.
The remaining time for process P1 (7 milliseconds) is larger than the time required by
process P2 (4milliseconds), so process P1 is preempted, and process P2 is scheduled.
The average waiting time for this example is((10-1)+(1-1)+(17-2)+(5-3))/4=26/4=
6.5 milliseconds. Non preemptive SJF scheduling would result in an average waiting
time of 7.75 milliseconds.
Priority Scheduling
The SJF algorithm is a special case of the general priority scheduling algorithm.
A priority is associated with each process, and the CPU is allocated to the process
with the highest priority.
Equal-priority processes are scheduled in FCFS order.
An SJF algorithm is simply a priority algorithm where the priority (p) is the inverse of
the (predicted) next CPU burst. The larger the CPU burst, the lower the priority, and
vice versa.
As an example, consider the following set of processes, assumed to have arrived at
time 0, in the order P1, P2, … ,P5, with the length of the CPU burst given in
milliseconds:
Round-Robin Scheduling
Time quantum, and the CPU is given to the next process in the queue ie. Process P2. Since
process P2 does not need 4 milliseconds, it quits before its time quantum expires.
The CPU is then given to the next process P3. Once each process has received 1 time
quantum, the CPU is returned to process P1 for an additional time quantum. The
resulting RR schedule is
Another class of scheduling algorithms has been created for situations in which
processes are easily classified into different groups. For example, a common division
is made between fore ground (inter active) processes and back ground (batch)
processes.
These two types of processes have different response-time requirements and may have
Different scheduling needs.
Foreground processes have priority over background processes.
A multilevel queue scheduling algorithm partitions the ready queue into
several separate queues as shown in below figure.
The processes are permanently assigned to one queue based on some property of the
process, such as memory size, process priority, or process type.
Each queue has its own scheduling algorithm. For example, separate queues might be
used for foreground and back ground processes.
The foreground queue might be scheduled by an RR algorithm, while the background
queue is scheduled by an FCFS algorithm.
When the multilevel queue scheduling algorithm is used, processes are permanently
assigned to a queue when they enter the system.
The multi level feedback-queue scheduling algorithm allows a process to move
between queues. The idea is to separate processes according to the characteristics of
their CPU bursts.
If a process uses too much CPU time, it will be moved to a lower-priority queue. This
scheme leaves I/O-bound and inter active processes in the higher-priority queues.
A process that waits too long in a lower-priority queue may be moved to a higher-
priority queue. This form of aging prevents starvation.
For example, consider a multilevel feedback-queue scheduler with three queues,
numbered from 0 to 2 as shown in below figure.
The scheduler first executes all processes in queue0. Only when queue0 is empty it
will execute processes in queue1.
Similarly, processes in queue 2 will only be executed if queues 0 and 1 are empty.A
process that arrives for queue 1 will preempt a process in queue2.
A process in queue 1 will in turn be preempted by a process arriving for queue 0.
A process entering the ready queue is put in queue 0. A process in queue 0 is given a
time quantum of 8 milliseconds. If it does not finish within this time, it is moved to the
tail of queue 1. If queue 0 is empty, the process at the head of queue 1 is given a
quantum of 16 milliseconds. If it does not complete, it is preempted and is put into
queue 2. Processes in queue 2 are scheduled on an FCFS basis but they run only when
0 and 1are empty queue.
This scheduling algorithm gives highest priority to any process with a CPU burst of 8
milliseconds or less. Such a process will quickly get the CPU, finish its CPU burst, and
go off to its next I/O burst. Processes that need more than 8 but less than 24
milliseconds are also served quickly, although with lower priority than shorter
processes. Long processes automatically sink to queue 2 and are served in FCFS order
with any CPU cycles left over from queues 0 and1.
A multilevel feedback-queue scheduler is defined by the following parameters:
o The number of queues.
o The scheduling algorithm for each queue.
o The method used to determine when to upgrade a process to a higher-priority
queue.
o The method used to determine when to demote a process to a lower-priority
queue.
o The method used to determine which queue a process will enter when
that process needs service.
Multiple-Processor Scheduling
a. Processor Affinity
The data most recently accessed by the process is populated in the cache for the
processor and successive memory accesses by the process are often satisfied in
cache memory.
If the process migrates to another processor, the contents of cache memory must
be invalidated for the processor being migrated from, and the cache for the
processor being migrated to must be re-populated. Because of the high cost of
invalidating and re-populating caches, most SMP systems try to avoid migration
of processes from one processor to another and instead tries to keep a process
running on the same processor. This is known as processor affinity, i.e., a process
has an affinity for the processor on which it is currently running.
Processor affinity takes several forms. When an operating system has a policy of
attempting to keep a process running on the same processor but not guaranteeing
that it will do so, a situation is known as soft affinity. Here, it is possible for a
process to migrate between processors.
Some systems such as Linux provide system calls that support hard affinity,
thereby allowing a process to specify that it must not migrate to other processors.
b. Load Balancing
With push migration, a specific task periodically checks the load on each processor
and if it finds an imbalance it evenly distributes the load by moving (or pushing)
processes from overloaded to idle or less-busy processors.
Pull migration occurs when an idle processor pulls a waiting task from a busy
processor.
c. Symmetric Multithreading
Thread Scheduling
On operating systems that support user-level and kernel-level threads, the kernel-level
threads are being scheduled by the operating system.
User-level threads are managed by a thread library, and the kernel is unaware of them.
To run on a CPU, user-level threads must be mapped to an associated kernel-level
thread, although this mapping may be indirect and may use a lightweight
process(LWP).
One distinction between user-level and kernel-level threads lies in how they are
scheduled. On systems implementing the many-to-one and many-to-many models, the
thread library schedules user-level threads to run on an available LWP, a scheme
P thread Scheduling
o pthread_attr_setscope(pthread_attr_t*attr,int scope)
o pthread_attr_getscope(pthread_attr_t*attr,int* scope)
The first parameter for both functions contains a pointer to the attribute set
for the thread.
The second parameter for the pthread_attr_setscope () function is passed
either the THREAD_SCOPE_SYSTEM or
PTHREAD_SCOPE_PROCESS value, indicating how the contention scope
is to be set. In the case of pthread_attr_getscope(), this second parameter
contains a pointer to an int value that is set to the current value of the
contention scope. If an error occurs, each of these functions returns non-
zero values.