Python + MPI
Hendrik Nolte
Recap - MPI
MPI is used for distributed-memory systems
Execution of an MPI Program
●
Launching an MPI-parallelized Python script (e.g. with mpirun, mpiexec, …) starts n Python interpreters
●
All processes run the same code; they are independent, identical processes
Image source: https://lsi.vc.ehu.eus/pablogn/docencia/manuales/linuxcourse.rutgers.edu/lessons/HPC_1/sec_4.php.html
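As a minimal sketch of this (not from the slides; the filename hello_mpi.py is an assumption), every one of the n interpreters runs the same script and reports its own rank:
from mpi4py import MPI
import socket
comm = MPI.COMM_WORLD
# Every process prints its own rank; the output order is not deterministic
print(f"Rank {comm.Get_rank()} of {comm.Get_size()} on {socket.gethostname()}")
$ mpirun -n 4 python3 hello_mpi.py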
Ranks and Communicators
●
Rank: Unique ID given to each process to distinguish between them
●
Communicator: Group of processes
– Communication always takes place within a certain communicator
– The rank of a process can differ between communicators
from mpi4py import MPI
comm = MPI.COMM_WORLD
size = comm.Get_size()
rank = comm.Get_rank()
if rank == 0:
    # do stuff that only process 0 should do
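As a small illustration of the last point (not from the slides), Comm.Split creates sub-communicators in which the same process generally has a different rank:
from mpi4py import MPI
comm = MPI.COMM_WORLD
world_rank = comm.Get_rank()
# Group even and odd world ranks into two separate sub-communicators
sub = comm.Split(color=world_rank % 2, key=world_rank)
# e.g. world rank 3 becomes rank 1 in the "odd" sub-communicator
print(f"world rank {world_rank} -> sub rank {sub.Get_rank()} of {sub.Get_size()}")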
Sending and Receiving Data - Example
from mpi4py import MPI
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
if rank == 0:
    data = {'a': 1, 'b': 2, 'c': 'test string'}
    comm.send(data, dest=1, tag=11)
elif rank == 1:
    data = comm.recv(source=0, tag=11)
    print(data)
$ mpirun -n 2 python3 mpi_example_1.py
{'a': 1, 'b': 2, 'c': 'test string'}
Sending and Receiving Data - Summary
●
Arbitrary Python objects can be sent and received without the user having to serialize them manually
– The mpi4py functions use pickle under the hood
●
send(data,dest,tag)
– data: Data, i.e. a Python object to send
– dest: Rank of the destination process
– tag: Arbitrary id for this message
●
recv(source,tag)
– source: Rank of the sending process
– tag: ID of the message, must match the tag in the send function
– The return value is the sent data
●
There are also the non-blocking functions isend and irecv (see the sketch below)
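A minimal sketch of the non-blocking variants (assuming the same two-process setup as above): both calls return a Request object immediately, and wait() completes the operation; for irecv, wait() also returns the received object.
from mpi4py import MPI
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
if rank == 0:
    req = comm.isend({'a': 1}, dest=1, tag=11)  # returns immediately
    # other work could overlap with the communication here
    req.wait()                                  # complete the send
elif rank == 1:
    req = comm.irecv(source=0, tag=11)
    data = req.wait()                           # returns the received object
    print(data)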
Sending and Receiving Data - Summary (cont.)
●
Objects need to be serialized to a byte stream when sending
●
Byte stream needs to be deserialized on the receiving process
– Additional overhead for communication
●
In scientific computing in particular, large amounts of data need to be exchanged efficiently
●
For this, contiguous NumPy arrays can be communicated with greatly reduced overhead
●
Use Send(data,dest,tag) and Recv(data,source,tag)
– Notice the capitalized Send and Recv
●
The data array has to be allocated beforehand on the receiving process
Sending and Receiving NumPy Arrays
from mpi4py import MPI
import numpy
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
if rank == 0:
    data = numpy.arange(100, dtype=numpy.float64)
    comm.Send(data, dest=1, tag=11)
elif rank == 1:
    data = numpy.empty(100, dtype=numpy.float64)
    comm.Recv(data, source=0, tag=11)
Summary:
➔
send/recv for all general Python objects, slow
➔
Send/Recv for contiguous arrays, fast
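The buffer can also be passed together with an explicit MPI datatype; this [array, MPI.TYPE] form reappears in the Dynamic Process Management example below. A short sketch, analogous to the example above:
from mpi4py import MPI
import numpy as np
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
if rank == 0:
    data = np.arange(100, dtype=np.float64)
    # Buffer given as [array, MPI datatype]; the count is taken from the array
    comm.Send([data, MPI.DOUBLE], dest=1, tag=11)
elif rank == 1:
    data = np.empty(100, dtype=np.float64)
    comm.Recv([data, MPI.DOUBLE], source=0, tag=11)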
Collective Communication - Broadcast
from mpi4py import MPI
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
if rank == 0:
    data = {'key1': [7, 2.72, 2+3j],
            'key2': ('abc', 'xyz')}
else:
    data = None
data = comm.bcast(data, root=0)
Collective Communication - Scattering
from mpi4py import MPI
comm = MPI.COMM_WORLD
size = comm.Get_size()
rank = comm.Get_rank()
if rank == 0:
    data = [(i+1)**2 for i in range(size)]
else:
    data = None
data = comm.scatter(data, root=0)
assert data == (rank+1)**2
Collective Communication - Gather
from mpi4py import MPI
comm = MPI.COMM_WORLD
size = comm.Get_size()
rank = comm.Get_rank()
data = (rank+1)**2
data = comm.gather(data, root=0)
if rank == 0:
    for i in range(size):
        assert data[i] == (i+1)**2
else:
    assert data is None
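Besides bcast, scatter, and gather there are also reduction collectives; the capitalized Reduce is used later in the Dynamic Process Management example. A small sketch of the lowercase reduce for Python objects (not from the slides):
from mpi4py import MPI
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
# Sum the value (rank+1) over all processes; only the root receives the result
total = comm.reduce(rank + 1, op=MPI.SUM, root=0)
if rank == 0:
    print(total)   # with n processes: 1 + 2 + ... + n
else:
    assert total is None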
Collective Communication - Broadcasting a NumPy Array
from mpi4py import MPI
import numpy as np
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
if rank == 0:
    data = np.arange(100, dtype='i')
else:
    data = np.empty(100, dtype='i')
comm.Bcast(data, root=0)
for i in range(100):
    assert data[i] == i
Collective Communication - Scattering a NumPy Array
from mpi4py import MPI
import numpy as np
comm = MPI.COMM_WORLD
size = comm.Get_size()
rank = comm.Get_rank()
sendbuf = None
if rank == 0:
    sendbuf = np.empty([size, 100], dtype='i')
    sendbuf.T[:, :] = range(size)
recvbuf = np.empty(100, dtype='i')
comm.Scatter(sendbuf, recvbuf, root=0)
assert np.allclose(recvbuf, rank)
Collective Communication - Gathering a NumPy Array
from mpi4py import MPI
import numpy as np
comm = MPI.COMM_WORLD
size = comm.Get_size()
rank = comm.Get_rank()
sendbuf = np.zeros(100, dtype='i') + rank
recvbuf = None
if rank == 0:
    recvbuf = np.empty([size, 100], dtype='i')
comm.Gather(sendbuf, recvbuf, root=0)
if rank == 0:
    for i in range(size):
        assert np.allclose(recvbuf[i, :], i)
Exceptions and Deadlocks
●
Upon import, mpi4py automatically initializes MPI
from mpi4py import MPI
assert MPI.COMM_WORLD.Get_size() > 1
rank = MPI.COMM_WORLD.Get_rank()
if rank == 0:
    1/0
    MPI.COMM_WORLD.send(None, dest=1, tag=42)
elif rank == 1:
    MPI.COMM_WORLD.recv(source=0, tag=42)
●
Running
$ mpirun -n 10 python3 deadlock_example.py
Traceback (most recent call last):
  File "deadlock_example.py", line 5, in <module>
    1/0
ZeroDivisionError: division by zero
gives a deadlock: rank 0 crashes before sending, so rank 1 waits in recv forever
●
Instead, run the script through the mpi4py runner, which aborts all processes on an unhandled exception:
$ mpirun -n 10 python3 -m mpi4py deadlock_example.py
Traceback (most recent call last):
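An alternative pattern (not on the slides) is to catch the exception and abort the whole job explicitly, so that no rank is left waiting in recv:
from mpi4py import MPI
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
try:
    if rank == 0:
        1/0
        comm.send(None, dest=1, tag=42)
    elif rank == 1:
        comm.recv(source=0, tag=42)
except Exception:
    # Terminate all processes in COMM_WORLD with a nonzero exit code
    comm.Abort(1)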
Dynamic Process Management I
●
Since MPI-2, the standard provides a process model that allows creating new processes and establishing communication between them and the existing MPI application
●
Useful for sequential applications built on top of parallel modules or
in a client/server model
from mpi4py import MPI
import numpy
import sys
comm = MPI.COMM_SELF.Spawn(sys.executable,
                           args=['cpi.py'],
                           maxprocs=5)
N = numpy.array(100, 'i')
comm.Bcast([N, MPI.INT], root=MPI.ROOT)
PI = numpy.array(0.0, 'd')
comm.Reduce(None, [PI, MPI.DOUBLE],
            op=MPI.SUM, root=MPI.ROOT)
print(PI)
comm.Disconnect()
Dynamic Process Management II
# cpi.py - worker script spawned by the parent on the previous slide
from mpi4py import MPI
import numpy
comm = MPI.Comm.Get_parent()
size = comm.Get_size()
rank = comm.Get_rank()
N = numpy.array(0, dtype='i')
comm.Bcast([N, MPI.INT], root=0)
h = 1.0 / N; s = 0.0
for i in range(rank, N, size):
    x = h * (i + 0.5)
    s += 4.0 / (1.0 + x**2)
PI = numpy.array(s * h, dtype='d')
comm.Reduce([PI, MPI.DOUBLE], None,
            op=MPI.SUM, root=0)
comm.Disconnect()
Q&A
Single most important source:
https://mpi4py.readthedocs.io/en/stable/index.html