
Advanced Parallel Computing for Scientific Applications
Autumn Term 2010

Prof. I. F. Sbalzarini          Prof. P. Arbenz
ETH Zentrum, CAB G34            ETH Zentrum, CAB H89
CH-8092 Zürich                  CH-8092 Zürich

Exercise 3
Release: 12 Oct. 2010
Due: 26 Oct. 2010

1 Practice in C/C++
The following two assignments illustrate the effects of caching in matrix operations. C uses a row-major memory layout for storing matrices and higher-dimensional arrays; hence, row-wise access of elements is more cache-efficient than column-wise access.
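To make the layout concrete: element (i, j) of an m × n matrix stored in a 1-D array sits at index i*n + j, so keeping j in the inner loop gives stride-1 access. A minimal illustration (the function RowWiseSum is ours, not part of the exercise files):

#include <vector>

// Sum all entries of an m-by-n matrix stored row-major in a 1-D array.
// With j in the inner loop, consecutive iterations touch consecutive
// addresses (stride 1), so every cache line loaded is used completely.
double RowWiseSum(const std::vector<double>& A, int m, int n) {
    double sum = 0.0;
    for (int i = 0; i < m; ++i)
        for (int j = 0; j < n; ++j)
            sum += A[i * n + j];  // element (i, j) in row-major layout
    return sum;
}

Swapping the two loops computes the same sum but jumps n doubles between consecutive accesses, which is exactly the pattern the following assignments ask you to avoid.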

Question 1: Matrix multiplication


The file matrixMult.cpp contains a program that measures the execution time of the matrix multiplication

C = A · B

Each matrix is stored as a 1-D array. The multiplication is performed in the method void
Multiply(...). To compute each element of C, the elements of A are accessed row-wise
and those of B column-wise, which causes many cache misses, especially for large
matrices. Better cache usage can be achieved if the matrix B is transposed first
and the multiplication is modified accordingly to give the same result as
before. Your task is to implement the methods void InPlaceTranspose(...) and void
MultiplyEfficient(...).
Compile your code using the default GNU compiler: g++ -o mult matrixMult.cpp
Do you observe better performance in the case of large matrices?
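One possible shape for the two methods, as a sketch only: the actual signatures in matrixMult.cpp may differ, and the transpose shown assumes a square n × n matrix.

#include <algorithm>  // std::swap

// Transpose a square n-by-n row-major matrix in place by swapping
// each element above the diagonal with its mirror below it.
void InPlaceTranspose(double* B, int n) {
    for (int i = 0; i < n; ++i)
        for (int j = i + 1; j < n; ++j)
            std::swap(B[i * n + j], B[j * n + i]);
}

// C = A * B, where Bt already holds the transpose of B. The inner loop
// over k now reads consecutive addresses from A and Bt alike, instead
// of striding through a column of B.
void MultiplyEfficient(const double* A, const double* Bt, double* C, int n) {
    for (int i = 0; i < n; ++i)
        for (int j = 0; j < n; ++j) {
            double sum = 0.0;
            for (int k = 0; k < n; ++k)
                sum += A[i * n + k] * Bt[j * n + k];  // row i of A, row j of Bt
            C[i * n + j] = sum;
        }
}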

Question 2: Matrix norm


You have to calculate the 1-norm and the infinity norm of an m × n matrix A, given by

$$ \|A\|_1 = \max_{1 \le j \le n} \sum_{i=1}^{m} |a_{ij}| $$

$$ \|A\|_\infty = \max_{1 \le i \le m} \sum_{j=1}^{n} |a_{ij}| $$

Implement the above equations in the corresponding methods double Norm_1() and double
Norm_Inf() in the file matrixNorm.cpp. Count the floating-point operations in the calcu-
lation and compute the Mflop/s rate for different matrix sizes n.
Compile your code using: g++ -o norm matrixNorm.cpp
Which of the above norms is calculated faster, and why?
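As a hint toward that question, here is a sketch of both norms for a row-major m × n matrix, written as free functions rather than the class methods of matrixNorm.cpp. Both perform the same number of floating-point operations; only the memory access pattern differs.

#include <algorithm>  // std::max
#include <cmath>      // std::fabs

// Infinity norm: maximum absolute row sum. The inner loop over j is
// stride-1 in a row-major layout, so it is cache-friendly.
double NormInf(const double* A, int m, int n) {
    double norm = 0.0;
    for (int i = 0; i < m; ++i) {
        double rowSum = 0.0;
        for (int j = 0; j < n; ++j)
            rowSum += std::fabs(A[i * n + j]);
        norm = std::max(norm, rowSum);
    }
    return norm;
}

// 1-norm: maximum absolute column sum. Implemented naively, the inner
// loop over i jumps n doubles between accesses (stride n), which causes
// far more cache misses once a row no longer fits in cache.
double Norm1(const double* A, int m, int n) {
    double norm = 0.0;
    for (int j = 0; j < n; ++j) {
        double colSum = 0.0;
        for (int i = 0; i < m; ++i)
            colSum += std::fabs(A[i * n + j]);
        norm = std::max(norm, colSum);
    }
    return norm;
}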

In each of the above examples, time is measured using the method double walltime(...),
which is implemented in the file walltime.h.
Please submit the jobs to the batch queue as follows:

bsub -o <op_file> ./<executable>

2 Introduction to OpenMP
1. OpenMP is an application programming interface that provides a parallel programming model for shared-memory and distributed shared-memory multiprocessors.

2. OpenMP is based on the fork/join execution model: an OpenMP program starts as a single thread (the master), and additional threads are created when the master hits a parallel region.

3. There is a standard include file omp.h for C/C++ OpenMP programs.

4. The number of threads is fixed a priori by the programmer using the environment variable OMP_NUM_THREADS.

5. omp_get_num_threads() and omp_get_thread_num() can be used to get the number of threads created and the local number assigned to each thread.

6. The directive #pragma omp parallel marks the beginning of a parallel section in the program.

7. The keywords used for distributing work among threads are for, sections, critical, etc. A minimal example follows this list.
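A minimal sketch tying points 2-6 together (the thread count is taken from OMP_NUM_THREADS, and the output order is nondeterministic). It compiles as C or C++ with the gcc line given in Question 3:

#include <stdio.h>
#include <omp.h>  // standard OpenMP include file

int main(void) {
    // Fork: the master thread spawns a team at the parallel region.
    #pragma omp parallel
    {
        int tid = omp_get_thread_num();   // local thread number
        int nth = omp_get_num_threads();  // size of the team
        printf("Thread %d of %d\n", tid, nth);  // printf keeps each line intact
    }  // Join: the team synchronizes; only the master continues.
    return 0;
}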

Question 3: First OpenMP program


Using the above information, write a simple program that creates n = 2, 4, 6 threads, where each thread displays one of the following messages along with its own thread number:

This is Advanced Parallel Computing tutorial.
This is the first OpenMP program.
This program uses n threads
Hello World

Compile the program using the GNU compiler as follows:

gcc -lgomp -fopenmp -o omp1 omp1.c

Question 4: Work sharing among OpenMP threads


The file vectorAdd.cpp contains the code for serial and parallel execution of the SAXPY
operation, along with time measurement.
a) Compile the program and execute it using, say, 4 threads. Why is there no speedup?
Modify the code in order to achieve an appreciable speedup.
b) Write code to calculate the dot product $\bar{a} \cdot \bar{b}$ in parallel (see the sketch below).
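For part b), the natural tool is OpenMP's reduction clause. A minimal sketch, where the vector length and contents are arbitrary placeholders:

#include <cstdio>
#include <vector>

int main() {
    const int n = 10000000;                    // arbitrary problem size
    std::vector<double> a(n, 1.0), b(n, 2.0);  // placeholder data

    double dot = 0.0;
    // Each thread accumulates a private partial sum over its share of
    // the iterations; the partials are combined into dot at the end.
    #pragma omp parallel for reduction(+:dot)
    for (int i = 0; i < n; ++i)
        dot += a[i] * b[i];

    std::printf("a.b = %f\n", dot);
    return 0;
}

Compile it with g++ -fopenmp and vary OMP_NUM_THREADS to check the scaling.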

You may submit the jobs to the batch queue as follows:

bsub -n N -o <op_file> ./<executable>


where N is the number of processors.

Do not forget to set the environment variable OMP_NUM_THREADS before execution.
