
High Performance Computing - Detailed Notes

Module 1: Introduction to Parallel Computing (10 hours)

1.1 What is Parallel Computing?

Parallel computing is a type of computation in which many calculations or processes are carried out simultaneously. It leverages multiple processors to solve a problem faster.

1.2 Terminologies in Parallel Processing

- Task: A logical unit of work.

- Process: An instance of a running program.

- Thread: A component of a process that can run independently.

- Speedup: Ratio of the execution time on a single processor to the execution time on p processors (S = T1 / Tp).

- Efficiency: Ratio of speedup to the number of processors (E = S / p).
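
For example, if a job takes 100 s on one processor and 25 s on 8 processors, the speedup is 100 / 25 = 4 and the efficiency is 4 / 8 = 0.5 (50%).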

1.3 Types of Parallelism

- Data Parallelism: Performing the same task on different pieces of distributed data.

- Control Parallelism: Executing different tasks simultaneously.

- Pipelining: Breaking a task into stages with each stage handled by a different processor.

1.4 Scalability

Refers to how well a parallel system can increase performance when additional resources are added.

1.5 Control Parallel & Data Parallel Approaches

- Control Parallel: Each processor executes different instruction sets.

- Data Parallel: Each processor performs the same operation on different pieces of data.
1.6 Parallel Reduction

Combines values from multiple processors into a single result (e.g., sum, max).
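
On a shared-memory machine this is commonly expressed with an OpenMP reduction clause. A minimal C sketch (the array size and its contents are illustrative; compiler support for OpenMP is assumed):

    #include <stdio.h>

    int main(void) {
        enum { N = 1000000 };
        static double a[N];
        for (int i = 0; i < N; i++) a[i] = 1.0;   /* fill with sample data */

        double sum = 0.0;
        /* Each thread accumulates a private partial sum; OpenMP combines
           the partial sums into `sum` at the end of the loop. */
        #pragma omp parallel for reduction(+:sum)
        for (int i = 0; i < N; i++)
            sum += a[i];

        printf("sum = %f\n", sum);
        return 0;
    }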

1.7 Prefix Sums

An operation where each element in an array is replaced by the sum of all previous elements including itself (also called an inclusive scan).
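
For example, the prefix sums of [3, 1, 4, 1, 5] are [3, 4, 8, 9, 14].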

1.8 List Ranking

Determines the position of each node in a linked list from the head node.
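
A sketch of the pointer-jumping idea used for parallel list ranking, written as sequential C for clarity (the sample list is illustrative; in the PRAM version the loop over nodes runs in parallel, which is why each round reads from copies of the old arrays):

    #include <stdio.h>
    #include <string.h>

    #define N 6

    int main(void) {
        /* Sample list in order 2 -> 0 -> 4 -> 1 -> 5 -> 3; the tail (3) points to itself. */
        int next[N] = {4, 5, 0, 3, 1, 3};
        int rank[N];

        /* rank[i] starts as the distance to the next node: 0 for the tail, 1 otherwise. */
        for (int i = 0; i < N; i++)
            rank[i] = (next[i] == i) ? 0 : 1;

        /* Pointer jumping: after k rounds each node has summed up to 2^k hops,
           so about log2(N) rounds suffice. */
        for (int round = 0; (1 << round) < N; round++) {
            int new_rank[N], new_next[N];
            for (int i = 0; i < N; i++) {          /* runs in parallel over i on a PRAM */
                new_rank[i] = rank[i] + rank[next[i]];
                new_next[i] = next[next[i]];
            }
            memcpy(rank, new_rank, sizeof rank);
            memcpy(next, new_next, sizeof next);
        }

        /* rank[i] is now the distance from node i to the tail;
           the position from the head is (N - 1) - rank[i]. */
        for (int i = 0; i < N; i++)
            printf("node %d: position from head = %d\n", i, (N - 1) - rank[i]);
        return 0;
    }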

1.9 Preorder Tree Traversal

Parallel traversal of trees in preorder (root, left, right).

1.10 Merging Two Sorted Lists

Divide the task into chunks and merge them in parallel.

1.11 Graph Coloring

Assigning colors to vertices such that no two adjacent vertices have the same color.

1.12 Reducing Number of Processors

Optimizing task-to-processor mapping to minimize idle time and resource wastage.

1.13 Problems Defying Fast Solutions on PRAMs

Some problems inherently resist efficient parallelization due to dependencies.

Module 2: Parallel Architectures (10 hours)


2.1 MIMD Architectures (Multiple Instruction, Multiple Data)

- Each processor executes different instructions on different data.

- Includes both shared and distributed memory systems.

2.2 Multi-threaded Architectures

- Processors capable of managing multiple threads concurrently.

- Improves latency hiding and resource usage.

2.3 Distributed Memory Systems

- Each processor has its own private memory.

- Communication via message passing (e.g., MPI).
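
A minimal message-passing sketch in C using MPI (an MPI installation is assumed; compile with mpicc and run with at least two processes):

    #include <stdio.h>
    #include <mpi.h>

    int main(int argc, char *argv[]) {
        MPI_Init(&argc, &argv);

        int rank, size;
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);
        MPI_Comm_size(MPI_COMM_WORLD, &size);

        if (size >= 2) {
            if (rank == 0) {
                int value = 42;           /* data lives in rank 0's private memory */
                MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
            } else if (rank == 1) {
                int value;
                MPI_Recv(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
                printf("rank 1 received %d from rank 0\n", value);
            }
        }

        MPI_Finalize();
        return 0;
    }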

2.4 Shared Memory Systems

- All processors access a common memory space.

- Easier programming but more contention.
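
A minimal shared-memory sketch in C with OpenMP (array sizes and contents are illustrative). All threads operate directly on the same arrays, with no explicit messages:

    #include <stdio.h>

    int main(void) {
        enum { N = 8 };
        double x[N], y[N];
        for (int i = 0; i < N; i++) { x[i] = i; y[i] = 2 * i; }

        /* x and y are shared by all threads; each thread handles a slice of the indices. */
        #pragma omp parallel for
        for (int i = 0; i < N; i++)
            y[i] = y[i] + 3.0 * x[i];     /* a simple axpy-style update */

        for (int i = 0; i < N; i++)
            printf("y[%d] = %.1f\n", i, y[i]);
        return 0;
    }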

Module 3: Interconnection Networks (8 hours)

3.1 Static and Dynamic Interconnection Networks

- Static (direct) networks: processors are connected by fixed point-to-point links.

- Common static topologies include:

- Mesh

- Torus

- Hypercube

- Tree

- Ring

- Dynamic (indirect) networks: the path between two nodes is set up at run time through switching elements (e.g., bus, crossbar, multistage networks).

- Key Metrics: Bandwidth, Latency, Bisection width, Diameter.
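
For example, a hypercube with p nodes has diameter log2(p) and bisection width p/2, while a ring of p nodes has diameter about p/2 but bisection width only 2.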


Module 4: Mapping and Scheduling (8 hours)

4.1 Mapping

- Assigning processes or data to processors.

- Affects load balancing and communication cost.

4.2 Scheduling

- Static Scheduling: Tasks assigned before execution.

- Dynamic Scheduling: Tasks assigned during execution.

4.3 Load Balancing

- Equal distribution of workload among processors.

- Static Load Balancing: Predefined distribution.

- Dynamic Load Balancing: Adjusted at runtime.
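
A minimal C/OpenMP sketch of dynamic load balancing via loop scheduling (the work function is a hypothetical stand-in for an irregular workload; the chunk size 16 is an illustrative choice):

    #include <stdio.h>

    /* Hypothetical uneven workload: later iterations do more work. */
    static double work(int i) {
        double s = 0.0;
        for (int k = 0; k < i * 1000; k++)
            s += k * 1e-9;
        return s;
    }

    int main(void) {
        enum { N = 1000 };
        double total = 0.0;

        /* schedule(static) would give each thread a fixed block of iterations;
           schedule(dynamic) hands out chunks at run time, balancing uneven work. */
        #pragma omp parallel for schedule(dynamic, 16) reduction(+:total)
        for (int i = 0; i < N; i++)
            total += work(i);

        printf("total = %f\n", total);
        return 0;
    }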

4.4 Deadlock

- Occurs when processes wait indefinitely for resources.

- Solutions:

- Avoidance

- Detection and Recovery

- Prevention

Module 5: Parallel Programming & Algorithms (12 hours)

5.1 Programming Models

- OpenMP: For shared memory systems.


- MPI: For distributed memory systems.

- CUDA: For GPU programming.

5.2 Structure of Parallel Algorithms

- Divide the problem.

- Distribute tasks among processors.

- Compute in parallel.

- Combine the results.

5.3 Analysis of Parallel Algorithms

- Measure performance with speedup, efficiency, and scalability.

5.4 Elementary Parallel Algorithms

- Sum, Max, Min in parallel.

5.5 Matrix Algorithms

- Matrix addition, multiplication, inversion using parallel blocks.
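
A minimal sketch of parallel matrix multiplication in C with OpenMP, parallelizing over rows of the result (matrix size and contents are illustrative; block-distributed variants follow the same pattern at the level of sub-blocks):

    #include <stdio.h>

    #define N 4

    int main(void) {
        double a[N][N], b[N][N], c[N][N];
        for (int i = 0; i < N; i++)
            for (int j = 0; j < N; j++) {
                a[i][j] = i + j;                  /* sample data */
                b[i][j] = (i == j) ? 1.0 : 0.0;   /* identity, so c should equal a */
            }

        /* Rows of c are independent, so they can be computed in parallel. */
        #pragma omp parallel for
        for (int i = 0; i < N; i++)
            for (int j = 0; j < N; j++) {
                double sum = 0.0;
                for (int k = 0; k < N; k++)
                    sum += a[i][k] * b[k][j];
                c[i][j] = sum;
            }

        printf("c[0][0] = %.1f, c[N-1][N-1] = %.1f\n", c[0][0], c[N-1][N-1]);
        return 0;
    }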

5.6 Sorting Algorithms

- Parallel Merge Sort, Bitonic Sort.
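
A sketch of parallel merge sort in C using OpenMP tasks (the cutoff of 32 elements and the sample data are illustrative choices, not part of the algorithm; Bitonic sort would instead use a fixed network of compare-exchange steps):

    #include <stdio.h>
    #include <string.h>

    static void merge(int *a, int *tmp, int lo, int mid, int hi) {
        int i = lo, j = mid, k = lo;
        while (i < mid && j < hi) tmp[k++] = (a[i] <= a[j]) ? a[i++] : a[j++];
        while (i < mid) tmp[k++] = a[i++];
        while (j < hi)  tmp[k++] = a[j++];
        memcpy(a + lo, tmp + lo, (size_t)(hi - lo) * sizeof(int));
    }

    static void msort(int *a, int *tmp, int lo, int hi) {
        if (hi - lo < 2) return;
        int mid = lo + (hi - lo) / 2;
        /* The two halves are independent, so each can be sorted as a task. */
        #pragma omp task shared(a, tmp) if (hi - lo > 32)
        msort(a, tmp, lo, mid);
        #pragma omp task shared(a, tmp) if (hi - lo > 32)
        msort(a, tmp, mid, hi);
        #pragma omp taskwait
        merge(a, tmp, lo, mid, hi);
    }

    int main(void) {
        enum { N = 100 };
        int a[N], tmp[N];
        for (int i = 0; i < N; i++) a[i] = (7 * i) % N;   /* sample unsorted data */

        /* One thread starts the recursion; the tasks it spawns are shared among the team. */
        #pragma omp parallel
        #pragma omp single
        msort(a, tmp, 0, N);

        for (int i = 0; i < 10; i++) printf("%d ", a[i]);
        printf("...\n");
        return 0;
    }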

5.7 Graph Algorithms

- Parallel BFS, DFS, Graph coloring, Shortest Path (e.g., Dijkstra's in parallel).
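
A sketch of level-synchronous parallel BFS in C with OpenMP on a small illustrative graph (adjacency-matrix storage is chosen only to keep the example short):

    #include <stdio.h>
    #include <string.h>

    #define N 6

    int main(void) {
        /* Small sample undirected graph as an adjacency matrix. */
        int adj[N][N] = {
            {0,1,1,0,0,0},
            {1,0,0,1,0,0},
            {1,0,0,1,1,0},
            {0,1,1,0,0,1},
            {0,0,1,0,0,1},
            {0,0,0,1,1,0},
        };
        int dist[N], next[N];
        for (int v = 0; v < N; v++) dist[v] = -1;   /* -1 = not yet visited */
        dist[0] = 0;                                /* start BFS from vertex 0 */

        /* Level-synchronous BFS: every unvisited vertex checks in parallel whether
           it has a neighbour on the current frontier; updates go to a separate
           array so threads never write each other's data. */
        for (int level = 0, changed = 1; changed; level++) {
            changed = 0;
            memcpy(next, dist, sizeof dist);
            #pragma omp parallel for reduction(|:changed)
            for (int w = 0; w < N; w++) {
                if (dist[w] != -1) continue;
                for (int v = 0; v < N; v++)
                    if (adj[w][v] && dist[v] == level) {
                        next[w] = level + 1;
                        changed = 1;            /* bitwise OR reduction acts as a flag */
                        break;
                    }
            }
            memcpy(dist, next, sizeof dist);
        }

        for (int v = 0; v < N; v++)
            printf("dist(0 -> %d) = %d\n", v, dist[v]);
        return 0;
    }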
