EE8218 Lab 2
ASSIGNMENT No. 02
*By signing above you attest that you have contributed to this submission and
confirm that all work you have contributed to this submission is your own work.
Any suspicion of copying or plagiarism in this work will result in an investigation
of Academic Misconduct and may result in a “0” on the work, an “F” in the
course, or possibly more severe penalties, as well as a Disciplinary Notice on
your academic record under the Student Code of Academic Conduct, which can
be found online at: http://www.ryerson.ca/senate/policies/pol60.pdf.
Objective:
This lab is about installing and getting familiar with the MPI software, and about visualizing the
performance improvement gained by running a program in parallel on a network of computers.
Introduction:
Message Passing Interface (MPI) is a portable message-passing standard usable from many
programming languages such as FORTRAN, C, C++, and Java. MPI software allows a computer
program to run in parallel on a network of computers, increasing the computational power of the
system. Parallel computing is especially important for handling big data, since a sequential
algorithm (single processor) has many limitations when it comes to big data; depending on the
data, it might take years to solve a problem. However, dividing the problem and solving the
pieces in parallel on a network of computers helps solve the problem much faster.
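As a minimal, self-contained illustration of this model, the sketch below starts several MPI
processes and has each one report its rank. The build and launch commands in the comment
assume a typical MPICH install like the one used in this lab, and the hostfile name is a
placeholder.
/* hello_mpi.c -- minimal MPI sketch: each process reports its rank.
 * Build and run (assumed MPICH-style commands; hostfile name is a placeholder):
 *   mpicc hello_mpi.c -o hello_mpi
 *   mpiexec -f hosts -n 4 ./hello_mpi
 */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char *argv[]) {
    int myid, numprocs;
    MPI_Init(&argc, &argv);                   /* start the MPI runtime */
    MPI_Comm_rank(MPI_COMM_WORLD, &myid);     /* this process's rank */
    MPI_Comm_size(MPI_COMM_WORLD, &numprocs); /* total number of processes */
    printf("Process %d of %d is alive\n", myid, numprocs);
    MPI_Finalize();                           /* shut down the MPI runtime */
    return 0;
}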
That being said, there are certain things one has to keep in mind for parallel computing. The data
has to be large and independent, since communication time dominates for small or dependent
data, resulting in poor performance on the network of computers. Similarly, the data has to be
divided equally across the network. If the data is not distributed equally, the process with the
smallest task will finish quickly and stay idle until the rest of the processors finish their tasks,
which results in a long overhead. In this report, these problems are explicitly discussed and
demonstrated with results; a sketch of an even split appears below.
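A minimal sketch of such an equal division is shown below; the function and variable names are
illustrative and not taken from the lab code.
/* Sketch (illustrative names): dividing n rows of work as evenly as
 * possible among p processes. When n does not divide evenly, the first
 * (n % p) processes take one extra row, so no process is left idle for
 * long waiting on a much larger task elsewhere. */
int rowsForRank(int rank, int n, int p) {
    int base = n / p;                    /* rows every process receives */
    int extra = (rank < n % p) ? 1 : 0;  /* spread the remainder evenly */
    return base + extra;
}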
Experiment:
In order to visualize and compare the performance of parallel computing, pairs of square matrices
of size 1000x1000, 3000x3000, 5000x5000, and 6000x6000 were initialized. The two matrices in
each pair were then multiplied together using a sequential algorithm as well as a parallel
algorithm on networks of 4 and 6 computers.
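For reference, the sequential baseline is the standard triple-loop matrix multiplication, sketched
below for an n x n case (the function name is illustrative; the parallel MPI version is listed in
Appendix A). Its cost grows with the cube of the matrix size, which is why the larger cases take
so much longer.
/* Sequential baseline: standard triple-loop multiply of two n x n
 * matrices on a single processor; roughly n^3 multiply-adds (C99 VLAs). */
void multiplySequential(int n, int a[n][n], int b[n][n], int c[n][n]) {
    int i, j, k;
    for (i = 0; i < n; i++)
        for (j = 0; j < n; j++) {
            c[i][j] = 0;
            for (k = 0; k < n; k++)
                c[i][j] += a[i][k] * b[k][j];
        }
}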
Size of Matrices Computational Time (seconds) using the sequential algorithm
1000 x 1000 10.36 Seconds
3000 x 3000 339.92 Seconds (~5.7 Minutes)
5000 x 5000 1577.78 Seconds (~26.3 Minutes)
6000 x 6000 2523.55 Seconds (~42 Minutes)
Table 1 above represents the size of the two square matrices used for the application and the time
the system took to multiply those two matrices using the sequential algorithm.
Size of Matrices Computational Time (seconds) in a network of 4 computers
1000 x 1000 3.44 Seconds
3000 x 3000 147.04 Seconds (~2.45 Minutes)
5000 x 5000 678.63 Seconds (~11.5 Minutes)
6000 x 6000 979.52 Seconds (~16.5 Minutes)
Table 2 above represents the size of two square matrices used for the application and the time the
system took to multiply those two matrices in a network of 4 computers.
Result Comparison:
As the matrices become bigger, there are more values to compute, and the sequential algorithm
waits for each command to complete before executing the next, which results in a very slow
response. The parallel algorithm, in contrast, divides the workload across the network of
computers and executes the pieces at the same time. After completing its share of the work, each
computer sends its result to the main host computer, which combines the results, finishing the
whole job much faster.
Size of Matrices Sequential 4 Computer Network 6 Computer Network
1000 x 1000 10.36 3.44 2.64
3000 x 3000 339.92 147.04 125.08
5000 x 5000 1577.78 678.63 525.6
6000 x 6000 2523.55 979.52 625.6
Table 3 above compares the computation time (in seconds) of the sequential algorithm, the
4-computer network, and the 6-computer network for each matrix size.
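The speedup of a parallel run can be read directly from Table 3 as speedup = sequential time /
parallel time. For example, for the 3000x3000 matrices the 4-computer network gives
339.92 / 147.04 ≈ 2.31x and the 6-computer network gives 339.92 / 125.08 ≈ 2.72x, well below
the ideal 4x and 6x.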
Hence, the graph below represents the computation time for all matrices, 1000x1000, 3000x3000,
5000x5000 and 6000x6000.
[Bar graph: computation time in seconds (y-axis, 0 to 3000) versus matrix size (1000X1000,
3000X3000, 5000X5000, 6000X6000) for the Sequential, 4 Computer network, and 6 computer
network runs.]
Conclusion:
Based on the above results we see that in parallel computing, even when using a 4-computer or a
6-computer network, we still do not get a response 4 or 6 times faster. There could be several
reasons for this. First, in this lab the maximum matrix size used was 6000x6000, which is still not
big enough. It could also be a result of the network speed, since we are communicating over a
network of computers; to achieve a better result, we would have to use a faster network, such as
fiber optics.
However, we do see from the above graph that as the matrix size increases, the time difference
between the sequential run, the 4-computer network, and the 6-computer network increases.
Appendix A:
Program source code used to verify that the program works
#include "/usr/local/mpich-3.1.4/include/mpi.h"
#include <stdio.h>
#include <math.h>
#define sizeOfMatrix 1000
int matrix1[sizeOfMatrix][sizeOfMatrix];
int matrix2[sizeOfMatrix][sizeOfMatrix];
int result[sizeOfMatrix][sizeOfMatrix];
int row, colum;
int n, myid, numprocs;
int tempArray[sizeOfMatrix * sizeOfMatrix];
int myid, numprocs, temp;
double startwtime = 0.0, endwtime;
int namelen, ierr, icount;
int i, j, k;
int columNumber = 0;
char processor_name[MPI_MAX_PROCESSOR_NAME];
MPI_Status status;
void initialize();
void display();
/* NOTE: the body of main() between the declarations above and the timing /
 * finalize code below was missing from the original listing. The section
 * marked "reconstructed sketch" is an assumption: a typical row-wise MPI
 * matrix multiply that fits the buffers declared above, assuming
 * sizeOfMatrix divides evenly by the number of processes. */
int main(int argc, char *argv[]) {
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &myid);
    MPI_Comm_size(MPI_COMM_WORLD, &numprocs);
    MPI_Get_processor_name(processor_name, &namelen);

    if (myid == 0) {
        initialize();             /* fill matrix1 and matrix2 with random values */
        startwtime = MPI_Wtime(); /* start the wall-clock timer on the master */
    }

    /* --- reconstructed sketch begins --- */
    int rowsPerProc = sizeOfMatrix / numprocs; /* rows handled by each process */

    /* every process needs all of matrix2 */
    MPI_Bcast(matrix2, sizeOfMatrix * sizeOfMatrix, MPI_INT, 0, MPI_COMM_WORLD);

    /* hand each process its block of rows of matrix1 */
    MPI_Scatter(matrix1, rowsPerProc * sizeOfMatrix, MPI_INT,
                tempArray, rowsPerProc * sizeOfMatrix, MPI_INT,
                0, MPI_COMM_WORLD);

    /* multiply the local rows by matrix2, writing each finished row back
     * over the input row (safe: output row i needs only input row i,
     * which is fully read before it is overwritten) */
    for (i = 0; i < rowsPerProc; i++) {
        int rowBuf[sizeOfMatrix];
        for (j = 0; j < sizeOfMatrix; j++) {
            temp = 0;
            for (k = 0; k < sizeOfMatrix; k++)
                temp += tempArray[i * sizeOfMatrix + k] * matrix2[k][j];
            rowBuf[j] = temp;
        }
        for (j = 0; j < sizeOfMatrix; j++)
            tempArray[i * sizeOfMatrix + j] = rowBuf[j];
    }

    /* collect the finished rows back into result on the master */
    MPI_Gather(tempArray, rowsPerProc * sizeOfMatrix, MPI_INT,
               result, rowsPerProc * sizeOfMatrix, MPI_INT,
               0, MPI_COMM_WORLD);
    /* --- reconstructed sketch ends --- */

    if (myid == 0) {
        // display();
        endwtime = MPI_Wtime();
        printf("wall clock time = %f\n", endwtime - startwtime);
    }
    MPI_Finalize();
    return 0;
}
void initialize() {
    for (row = 0; row < sizeOfMatrix; row++) {
        for (colum = 0; colum < sizeOfMatrix; colum++) {
            // matrix1[row][colum] = 1;
            // matrix2[row][colum] = 2;
            matrix1[row][colum] = rand() % 100;
            matrix2[row][colum] = rand() % 100;
        }
    }
}
void display() {
    int counter = 0;
    for (counter = 0; counter < 3; counter++) {
        for (row = 0; row < sizeOfMatrix; row++) {
            for (colum = 0; colum < sizeOfMatrix; colum++) {
                if (counter == 0)
                    printf(" %d ", matrix1[row][colum]);
                else if (counter == 1)
                    printf(" %d ", matrix2[row][colum]);
                else if (counter == 2)
                    printf(" %d ", result[row][colum]);
            }
            printf("\n");
        }
        printf("\n");
    }
}