2.
2.
presentation 2
Parallelism in uniprocessor systems
The early computer has only one ALU in its CPU and
hence performing a long sequence of ALU instructions takes
more amount of time.
Eg.The CDC-6600 (was the flagship of the 6000 series of
mainframe computer systems manufactured by Control
Data Corporation) has 10 functional units built into its
CPU. These 10 units are independent of each other and
may operate simultaneously. A score board is used to
keep track of the availability of the functional units and
registers being demanded. With 10 functional units and 24
registers available, the instruction issue rate can be
significantly increased.
2.Parallelism and pipelining within
the CPU :
Parallel adders, using such techniques as carry-
look ahead and carry –save, are now built into
almost all ALUs. High speed multiplier recording
and convergence division are techniques for
exploring parallelism and the sharing of hardware
resources for the functions of multiply and Divide.
The use of multiple functional units is a form of
parallelism with the CPU.
Various phases of instructions executions are now
pipelined, including instruction fetch, decode,
operand fetch, arithmetic logic execution, and
store result.
3. Overlapped CPU and I/O
operations :
I/O operations can be performed simultaneously
with the CPU competitions by using separate I/O
controllers, channels, or I/O processors.
The direct memory access (DMA) channel can be
used to provide direct information transfer
between the I/O devices and the main
memory. The DMA is conducted on a cycle stealing
basis, which is apparent to the CPU
4. Use of a hierarchical memory system :