Week - 01 - Lec1 03 03 2021
Week - 01 - Lec1 03 03 2021
Distributed Computing
Lecture # 01
Spring 2021
Agenda
2
Motivating Parallelism
Computing vs Systems
CS443
3
Motivating Parallelism
CS443
Motivating Parallelism
4
Moore’s Law
Proposed by Gorden E. Moore in 1965 and revised in 1975.
CS443
Motivating Parallelism
6
Moore’s Law
More computational power implicitly means more transistors.
Then why need second interpretation?
Let’s have a look on empirical data from 1970 to 2009
In 1970’s (i.e., from 1970 to 1979), processor speeds ranged from 740 KHz to
8 Mhz. Difference shows that both the interpretations are correct.
From 2000 to 2009, Speeds ranged from 1.3 GHz to 2.8 GHz.
Relative speed difference is too low but, number of integrated transistors
ranged from 37.5 million to 904 million.
So, second interpretation is more accurate.
CS443
Motivating Parallelism
7
Moore’s Law
Why doubling the transistors does not doubles the speed?
Increase in number of transistor per processor is due to multi-core CPU’s.
It means, to follow Moore’s law, companies had to:
Introduce ULSI( ultra large-scale integrations)
And multi-core processing era.
Will Moore’s law hold forever?
Adding multiple cores on single chip causes heat issues.
Furthermore, increasing the number of cores, may not be able to increase
speeds [Due to inter-process interactions].
Moreover, transistors would eventually reach the limits of miniaturization
at atomic levels
CS443
Motivating Parallelism
8
Moore’s Law
So, we must look for efficient parallel [multi-core] software solutions
to fulfill our future computational needs.
Solution[s]?
Need to find more scalable distributed and hybrid solutions
CS443
Motivating Parallelism
9
CS443
Motivating Parallelism
10
Any analyses on this data must be performed over the network using
parallel computing techniques
CS443
Computing vs Systems
11
Distributed Systems
A collection of autonomous computers, connected through a network
and distribution middleware.
This enables computers to coordinate their activities, and to share the resources
of the system.
The system is usually perceived as a single integrated computing facility.
Mostly concerned with the hardware-based accelerations
Distributed Computing
A specific use of distributed systems to split a large and complex
processing into subparts, and then execute the parts in parallel to
increase the productivity.
Computing mainly concerned with software-based accelerations (i.e.,
designing and implementing parallel algorithms)
CS443
Parallel vs Distributed Computing
12
Distributed Computing
This type of computing is mainly concerned with developing
algorithms for the distributed cluster systems.
Here distributed means a geographical distance between the
computers without any shared-Memory.
CS443
Practical Applications of P&D Computing
13
Scientific Applications
Bioinformatics
DNA sequence alignment (fine-grained, Coarse-grained, and hybrid solutions)
Applications in astrophysics have explored the evolution of galaxies,
thermonuclear processes, and applications in the analysis of
extremely large datasets from telescope.
Advances in computational physics and chemistry have explored
new materials, understanding of chemical pathways, and more
efficient processes
e.g., Large Hydron Collider (LHC) at European Organization for Nuclear
Research (CERN) generates petabytes of data for a single collision.
CS443
Practical Applications of P&D Computing
14
Scientific Applications
Weather modeling for simulating the track of natural hazards like the
extreme cyclones (storms).
Flood prediction
CS443
Practical Applications of P&D Computing
15
Commercial Applications
Some of the largest parallel computers power the wall street.
Large scale servers (mail and web servers) are often implemented
using parallel platforms.
CS443
Practical Applications of P&D Computing
16
Other Applications
Network intrusion detection: A large amount of data needs to be
analyzed and processed
Graphics processing
CS443
Limitations of Parallel Computing:
17
CS443
References
18
1. Kumar, V., Grama, A., Gupta, A., & Karypis, G. (1994). Introduction to
parallel computing (Vol. 110). Redwood City, CA: Benjamin/Cummings.
CS443
Questions
19
CS443