
Parallel Models of Computation

Developing a standard parallel model of computation for analyzing algorithms has proven
difficult because different parallel computers tend to vary significantly in their organizations.
In spite of this difficulty, useful parallel models have emerged, along with a deeper
understanding of the modeling process. In this section we describe three important principles
that have emerged.

1. Work-efficiency. In designing a parallel algorithm, it is more important to make it efficient than to make it asymptotically fast. The efficiency of an algorithm is determined by the total number of operations, or work, that it performs. On a sequential machine, an
algorithm's work is the same as its time. On a parallel machine, the work is simply the
processor-time product. Hence, an algorithm that takes time t on a P-processor machine
performs work W = Pt. In either case, the work roughly captures the actual cost to
perform the computation, assuming that the cost of a parallel machine is proportional to
the number of processors in the machine. We call an algorithm work-efficient (or just
efficient) if it performs the same amount of work, to within a constant factor, as the
fastest known sequential algorithm. For example, a parallel algorithm that sorts n keys in O(√n log n) time using √n processors is efficient since the work, O(n log n), is as good as any (comparison-based) sequential algorithm. However, a sorting algorithm that runs in O(log n) time using n² processors is not efficient. The first algorithm is better than the second - even though it is slower - because its work, or cost, is smaller (a short calculation after this list makes the comparison concrete). Of course, given two parallel algorithms that perform the same amount of work, the faster one is generally better.
2. Emulation. The notion of work-efficiency leads to another important observation: a
model can be useful without mimicking any real or even realizable machine. Instead, it
suffices that any algorithm that runs efficiently in the model can be translated into an
algorithm that runs efficiently on real machines. As an example, consider the widely-used
parallel random-access machine (PRAM) model. In the PRAM model, a set of processors
share a single memory system. In a single unit of time, each processor can perform an
arithmetic, logical, or memory access operation. This model has often been criticized as
unrealistically powerful, primarily because no shared memory system can perform
memory accesses as fast as processors can execute local arithmetic and logical operations.
The important observation, however, is that for a model to be useful we only require that
algorithms that are efficient in the model can be mapped to algorithms that are efficient
on realistic machines, not that the model is realistic. In particular, any algorithm that runs efficiently in a P-processor PRAM model can be translated into an algorithm that runs efficiently on a (P/L)-processor machine with a latency-L memory system, a much more realistic machine. In the translated algorithm, each of the P/L processors emulates L PRAM processors. The latency is "hidden" because a processor has useful work to perform while waiting for a memory access to complete. Although the translated algorithm is a factor of L slower than the PRAM algorithm, it uses a factor of L fewer processors, and hence is equally efficient (a toy simulation of this schedule appears after the list).
3. Modeling Communication. To get the best performance out of a parallel machine, it is
often helpful to model the communication capabilities of the machine, such as its latency,
explicitly. The most important measure is the communication bandwidth. The bandwidth
available to a processor is the maximum rate at which it can communicate with other
processors or the memory system. Because it is more difficult to hide insufficient
bandwidth than large latency, some measure of bandwidth is often included in parallel
models. Sometimes the specific topology of the communication network is modeled as
well. Although including this level of detail in the model often complicates the design of
parallel algorithms, it's essential for designing the low-level communication primitives for
the machine. In addition to modeling basic communication primitives, other operations
supported by hardware, including synchronization and concurrent memory accesses, are
often modeled, as well as operations that mix computation and communication, such as
fetch-and-add and scans. A final consideration is whether the machine supports shared
memory, or whether all communication relies on passing messages between the
processors.
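
To make the work comparison in item 1 concrete, the few lines of Python below (our illustration; constant factors are dropped and only the asymptotic expressions are evaluated) compute W = Pt for both sorting algorithms at n = 1,000,000:

    import math

    n = 1_000_000

    # Algorithm 1: P = sqrt(n) processors, t ~ sqrt(n) * log n
    w1 = math.sqrt(n) * (math.sqrt(n) * math.log2(n))   # ~ n log n ~ 2e7

    # Algorithm 2: P = n^2 processors, t ~ log n
    w2 = n**2 * math.log2(n)                            # ~ n^2 log n ~ 2e13

    # The slower algorithm performs roughly a factor of n less work.
    print(f"work of algorithm 1: {w1:.1e}, work of algorithm 2: {w2:.1e}")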
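The emulation in item 2 can also be seen in miniature. The toy, sequential Python simulation below (our sketch, not a construction from the text) has P/L physical processors each round-robin over L virtual PRAM processors, so a memory request issued on behalf of one virtual processor has L full cycles to complete before that processor is scheduled again:

    # Toy emulation (illustrative assumptions: L divides P, every memory
    # reply arrives within L machine cycles, one unit-time op per PRAM step).
    L = 4                      # memory latency, in machine cycles
    P = 16                     # virtual PRAM processors being emulated
    PHYS = P // L              # physical processors actually available

    memory = [0] * P           # shared memory, one cell per virtual processor

    def pram_step(vp):
        """One unit-time PRAM operation issued for virtual processor vp."""
        memory[vp] += vp

    # One time step of the P-processor PRAM costs L machine cycles here, so
    # the emulation runs a factor of L slower but on a factor of L fewer
    # processors: the work, PHYS * L = P, is unchanged.
    for cycle in range(L):
        for p in range(PHYS):          # the PHYS processors act concurrently
            vp = p * L + cycle         # the virtual processor p serves this cycle
            pram_step(vp)              # by the time p returns to vp, L cycles
                                       # have elapsed and its memory reply is in

    assert memory == list(range(P))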

Algorithmic Techniques

A major advance in parallel algorithms has been the identification of fundamental algorithmic
techniques. Some of these techniques are also used by sequential algorithms, but play a more
prominent role in parallel algorithms, while others are unique to parallelism. Here we list
some of these techniques with a brief description of each.

1. Divide-and-Conquer. Divide-and-conquer is a natural paradigm for parallel algorithms. After dividing a problem into two or more subproblems, the subproblems can be solved in parallel. Typically the subproblems are solved recursively and thus the next divide step yields even more subproblems to be solved in parallel. For example, suppose we want to compute the convex hull of a set of n points in the plane (i.e., compute the smallest convex polygon that encloses all of the points). This can be implemented by splitting the points into the leftmost n/2 and rightmost n/2, recursively finding the convex hull of each set in parallel, and then merging the two resulting hulls (a sketch of this pattern appears after the list). Divide-and-conquer has proven to be one of the most powerful techniques for solving problems in parallel, with applications ranging from linear systems to computer graphics and from factoring large numbers to n-body simulations.
2. Randomization. The use of random numbers is ubiquitous in parallel algorithms.
Intuitively, randomness is helpful because it allows processors to make local decisions
which, with high probability, add up to good global decisions. For example, suppose we
want to sort a collection of integer keys. This can be accomplished by partitioning the keys into buckets and then sorting within each bucket. For this to work well, the buckets must represent non-overlapping intervals of integer values, and contain approximately the same number of keys. Randomization is used to determine the boundaries of the intervals. First, each processor selects a random sample of its keys. Next, all of the selected keys are sorted together. Finally, these sample keys are used as the boundaries (this sampling step is sketched after the list). Such random sampling is
also used in many parallel computational geometry, graph, and string matching
algorithms. Other uses of randomization include symmetry breaking, load balancing, and
routing algorithms.
3. Parallel Pointer Manipulations. Many of the traditional sequential techniques for
manipulating lists, trees, and graphs do not translate easily into parallel techniques. For
example techniques such as traversing the elements of a linked list, visiting the nodes of a
tree in postorder, or performing a depth-first traversal of a graph appear to be inherently
sequential. Fortunately, each of these techniques can be replaced by efficient parallel
techniques. These parallel techniques include pointer jumping, the Euler-tour technique,
ear decomposition, and graph contraction. For example, one way to label each node of
an n-node list (or tree) with the label of the last node (or root) is to use pointer jumping.
In each pointer-jumping step, each node in parallel replaces its pointer with that of its successor (or parent). After at most ⌈log n⌉ steps, every node points to the same node, the end of the list (or root of the tree); a sketch of this loop follows the list.
4. Others. Other useful techniques include finding small graph separators for partitioning
data among processors to reduce communication, hashing for balancing load across
processors and mapping addresses to memory, and iterative techniques as a replacement
for direct methods for solving linear systems.
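
Below is a minimal sketch of the divide-and-conquer pattern from item 1, applied to the planar convex hull. The helper names are ours, and the merge simply re-runs the hull algorithm on the union of the two sub-hulls, which is correct but simpler (and asymptotically weaker) than the tangent-finding merge an optimal algorithm would use:

    from concurrent.futures import ProcessPoolExecutor

    def cross(o, a, b):
        """Twice the signed area of triangle (o, a, b)."""
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    def hull(points):
        """Andrew's monotone chain; expects points sorted by (x, y)."""
        if len(points) <= 2:
            return list(points)
        lower, upper = [], []
        for chain, seq in ((lower, points), (upper, reversed(points))):
            for p in seq:
                while len(chain) >= 2 and cross(chain[-2], chain[-1], p) <= 0:
                    chain.pop()
                chain.append(p)
        return lower[:-1] + upper[:-1]

    def parallel_hull(points):
        points = sorted(points)
        mid = len(points) // 2
        with ProcessPoolExecutor() as ex:     # solve the two halves in parallel
            left, right = ex.map(hull, [points[:mid], points[mid:]])
        return hull(sorted(left + right))     # merge: hull of the two sub-hulls

    if __name__ == "__main__":                # guard required for process pools
        import random
        pts = [(random.random(), random.random()) for _ in range(100_000)]
        print(len(parallel_hull(pts)))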
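The random-sampling step of item 2 is sketched next (the oversampling factor and helper names are our choices; real sample sorts tune these carefully). Each processor's slice contributes a small random sample, the combined sample is sorted, and evenly spaced sample keys become the bucket boundaries:

    import bisect
    import random

    def sample_boundaries(key_slices, num_buckets, oversample=8):
        """Pick num_buckets - 1 splitters by random sampling."""
        sample = []
        for keys in key_slices:            # each "processor" samples locally
            sample.extend(random.sample(keys, min(len(keys), oversample)))
        sample.sort()                      # all selected keys sorted together
        step = max(1, len(sample) // num_buckets)
        return sample[step::step][:num_buckets - 1]

    # Usage: partition into buckets, which a real machine sorts in parallel.
    slices = [[random.randrange(10**6) for _ in range(1000)] for _ in range(4)]
    bounds = sample_boundaries(slices, num_buckets=4)
    buckets = [[] for _ in range(4)]
    for keys in slices:
        for k in keys:
            buckets[bisect.bisect_right(bounds, k)].append(k)
    for b in buckets:                      # independent, hence parallelizable
        b.sort()
    assert [k for b in buckets for k in b] == sorted(k for s in slices for k in s)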
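Finally, a sketch of pointer jumping from item 3. On a real parallel machine all nodes update simultaneously; this sequential Python version (ours) imitates that by building each round from a snapshot of the previous one:

    def pointer_jump(succ):
        """succ[i] is node i's successor; the last node points to itself.
        Returns the final pointers (all at the last node) and the round count."""
        rounds = 0
        while any(succ[i] != succ[succ[i]] for i in range(len(succ))):
            succ = [succ[succ[i]] for i in range(len(succ))]   # one parallel step
            rounds += 1
        return succ, rounds

    succ = [1, 2, 3, 4, 5, 5]            # a 6-node list ending at node 5
    labels, rounds = pointer_jump(succ)
    assert labels == [5] * 6             # every node now knows the last node
    assert rounds <= 3                   # at most ceil(log2 n) rounds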

These techniques have led to efficient parallel algorithms in most problem areas for which
efficient sequential algorithms are known. In fact, some of the techniques originally
developed for parallel algorithms have led to improvements in sequential algorithms.
