homework 9 networking
Uploaded by nani chkhenkeli

Exercise 9

1. Computing forces

Celestial mechanics:

f(i, j) = G * m_i * m_j / |r_i - r_j|^2
Cost: the Euclidean distance between the two bodies dominates the per-pair cost. For a fixed dimension (e.g. 3D), the subtractions, squarings, and the single square root are all constant-time operations, and the constant factors G and the body masses add only negligible work. The overall cost of one force evaluation is therefore O(1).

Time: the brute-force method has time complexity O(n^2), because each body must interact with every other body: n*(n-1) evaluations. Exploiting the symmetry f(i, j) = -f(j, i) halves this to n*(n-1)/2, but the constant factor 1/2 does not change the asymptotic O(n^2) bound (and for large n it becomes insignificant).
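As an illustrative sketch (the 2D setting and all names are my own assumptions, not part of the exercise), the symmetric brute-force evaluation looks like this in Python:

```python
import math

G = 6.674e-11  # gravitational constant (SI units)

def brute_force(masses, positions):
    """O(n^2) pairwise gravitational forces in 2D, using the symmetry
    f(i, j) = -f(j, i) to halve the number of force evaluations."""
    n = len(masses)
    forces = [[0.0, 0.0] for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):                 # n*(n-1)/2 pairs
            dx = positions[j][0] - positions[i][0]
            dy = positions[j][1] - positions[i][1]
            dist = math.sqrt(dx * dx + dy * dy)   # one sqrt per pair: O(1)
            f = G * masses[i] * masses[j] / dist ** 2
            fx, fy = f * dx / dist, f * dy / dist
            forces[i][0] += fx
            forces[i][1] += fy
            forces[j][0] -= fx                    # Newton's third law
            forces[j][1] -= fy
    return forces
```

The inner loop starting at i + 1 is exactly the n*(n-1)/2 saving discussed above.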

Molecular dynamics:

Cost: similar per-pair structure to celestial mechanics, but with a higher constant factor because evaluating the interaction energy function is more expensive.

Time: the brute-force method again has time complexity O(n^2); systems with localized (short-range) interactions can be reduced to O(n), e.g. by only evaluating pairs within a cutoff radius.
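One standard way to exploit such locality is a cell list. The sketch below (pure Python, my own illustrative helper, not part of the exercise) bins particles into cells whose side equals the cutoff radius, so each particle is compared only with particles in its own and adjacent cells; with bounded particle density this gives O(n) instead of O(n^2):

```python
import math

def neighbor_pairs(positions, cutoff):
    """Cell-list neighbor search: return all index pairs (i, j), i < j,
    whose Euclidean distance is at most `cutoff`, in O(n) for bounded
    density instead of the O(n^2) all-pairs scan."""
    cells = {}
    for idx, (x, y) in enumerate(positions):
        cells.setdefault((int(x // cutoff), int(y // cutoff)), []).append(idx)
    pairs = []
    for (cx, cy), members in cells.items():
        for dx in (-1, 0, 1):
            for dy in (-1, 0, 1):
                for j in cells.get((cx + dx, cy + dy), []):
                    for i in members:
                        # i < j deduplicates: each unordered pair is seen once
                        if i < j and math.dist(positions[i], positions[j]) <= cutoff:
                            pairs.append((i, j))
    return sorted(pairs)
```

Particles in non-adjacent cells are never compared at all, which is where the asymptotic saving comes from.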

2. Heat dissipation

1) Serial time: t1(n) = O(n)

Parallel time: tp(n) = 2n/p + O(√(n/p))

Number of processors: p

Speedup: S = t1(n)/tp(n) = p/2 + O(1)

Efficiency: E = S/p = 1/2 + O(1/p)

2) t_stripe = (n²/p) * t_serial, where t_serial is the time for simulating a single grid point. Communication: O(n) per step (exchanging boundary rows of length n).

Total parallel runtime: t_parallelStripe = t_stripe + O(n)

Speedup: S = t_sequential / t_parallelStripe = n² * t_serial / ((n²/p) * t_serial + O(n)) = p*n / (n + O(1)) = p for large n.

Efficiency (squares): E = S/p = 1; the efficiency for stripes is asymptotically the same.
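Plugging numbers into this model (t_serial and the communication coefficient below are assumed, illustrative values, not given in the exercise) shows the stripe speedup approaching p as n grows:

```python
def stripe_speedup(n, p, t_serial=1.0, c_comm=1.0):
    """Speedup of stripe-partitioning an n x n heat grid over p processors.

    Model (matching the analysis above):
      sequential time = n^2 * t_serial
      parallel time   = (n^2 / p) * t_serial + c_comm * n
    """
    t_seq = n * n * t_serial
    t_par = (n * n / p) * t_serial + c_comm * n
    return t_seq / t_par

# As n grows with p fixed, the speedup approaches p:
for n in (16, 256, 4096):
    print(n, round(stripe_speedup(n, p=8), 3))
```

The communication term O(n) grows more slowly than the compute term n²/p, so its relative cost vanishes for large n.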
3. Matrix multiplication

Runtime: O(n^3).

Let a be the constant communication (serial) overhead and n the problem size.

Parallelizable part: O(n^3) - a.


Ideal speedup (Amdahl's law), writing f = (parallelizable part) / (total time):

S = 1 / ((1 - f) + f * (1/p))

Here f = (O(n^3) - a) / O(n^3) = 1 - a/O(n^3), so

S = 1 / (a/O(n^3) + (1/p) * (1 - a/O(n^3))) = O(n^3) * p / (O(n^3) + a * (p - 1))

As n grows, the serial fraction a/O(n^3) vanishes and S approaches p.

Each processor stores a portion of the input matrices A and B: if each processor stores n/p rows of A and n/p columns of B, the total number of stored elements per processor is 2n²/p + n.
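The speedup formula above can be checked numerically. In this sketch the concrete values of n, a, and p are my own illustrative choices:

```python
def amdahl_speedup(total, serial, p):
    """Amdahl's law: speedup on p processors when `serial` units of the
    `total` work cannot be parallelized."""
    f = (total - serial) / total          # parallelizable fraction
    return 1.0 / ((1.0 - f) + f / p)

# Illustrative numbers (assumed, not from the exercise): total work n^3
# with n = 100 and a fixed communication overhead a = 10_000.
n, a, p = 100, 10_000, 64
print(amdahl_speedup(n**3, a, p))      # below the ideal p = 64
```

Setting the serial overhead to zero recovers the ideal speedup S = p, matching the closed form p*n^3 / (n^3 + a*(p - 1)).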

4. Matrix multiplication

Each processor (i_p, j_p) holds its initial inputs, the submatrices A(i_p, j_p) and B(i_p, j_p), and a zero-initialized block of C.

Exchange blocks:

In each of √p communication rounds k, 0 ≤ k ≤ √p - 1, processor (i_p, j_p) receives block B(j_p, (j_p + k) % √p) from its owner, processor (j_p, (j_p + k) % √p). After √p rounds it holds the entire block row j_p of B.

Local computation:

The processor multiplies its submatrix A(i_p, j_p) with each received block B(j_p, l); each product A(i_p, j_p) * B(j_p, l) is one partial contribution to the output block C(i_p, l).

Communication for gathering:

In each of √p communication rounds k, 0 ≤ k ≤ √p - 1, processor (i_p, j_p) sends its partial block for C(i_p, (j_p + k) % √p) to processor (i_p, (j_p + k) % √p) and receives a partial block of C(i_p, j_p) from processor (i_p, (j_p - k + √p) % √p).

Local accumulation:

After √p communication rounds, each processor has received all √p partial blocks for C(i_p, j_p) and sums them into the final submatrix C(i_p, j_p).

Pseudocode would look like the following:

FUNCTION parallelMult(A, B, n, p):
    # Require: p is a perfect square and q = sqrt(p) divides n
    q = sqrt(p)

    # Decompose matrices A and B into q x q grids of (n/q) x (n/q) blocks
    blocksA = decomposeMatrix(A, n, q)
    blocksB = decomposeMatrix(B, n, q)

    # Initialize result blocks to zero
    blocksC = initializeMatrix(n, q)

    # Executed by every processor (i, j) in parallel
    FOR k = 0 TO q - 1:
        # Round k: receive block (j, (j + k) % q) of B from its owner
        j_recv = (j + k) % q
        Bblock = blocksB[j][j_recv]

        # Multiply and accumulate into the block of C owned by (i, j_recv)
        computeProduct(blocksA[i][j], Bblock, blocksC[i][j_recv])

    # Gather results
    RETURN gatherResults(blocksC)
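As a sanity check, the block algorithm above can be simulated sequentially. This is a minimal sketch in pure Python (the processor grid is replaced by loops, and the decompose/accumulate helpers from the pseudocode are inlined):

```python
def simulated_parallel_mult(A, B, q):
    """Sequentially simulate the q x q block algorithm above: 'processor'
    (i, j) multiplies its block of A with the blocks of block row j of B
    and accumulates each partial product into the owning block of C.
    A, B are n x n lists of lists; q is the processor-grid side length."""
    n = len(A)
    assert n % q == 0, "sqrt(p) must divide n"
    s = n // q                                    # block side length
    C = [[0.0] * n for _ in range(n)]
    for i in range(q):                            # processor row i
        for j in range(q):                        # processor column j
            for k in range(q):                    # communication round k
                l = (j + k) % q                   # received B-block (j, l)
                # partial product A(i, j) * B(j, l) -> C(i, l)
                for r in range(i * s, (i + 1) * s):
                    for c in range(l * s, (l + 1) * s):
                        C[r][c] += sum(A[r][m] * B[m][c]
                                       for m in range(j * s, (j + 1) * s))
    return C
```

Since every (j, l) pair occurs exactly once over the rounds k, each output block receives all √p partial contributions, and the result equals the ordinary product A * B.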
Exercise 5: routing for a grid

1.

(0,0) -- (1,0) -- (2,0) -- (3,0)

| | | |

(0,1) -- (1,1) -- (2,1) -- (3,1)

| | | |

(0,2) -- (1,2) -- (2,2) -- (3,2)

| | | |

(0,3) -- (1,3) -- (2,3) -- (3,3)

2.

(0,0) -> 0 (0,1) -> 1 (0,2) -> 2 (0,3) -> 3

(1,0) -> 4 (1,1) -> 5 (1,2) -> 6 (1,3) -> 7

(2,0) -> 8 (2,1) -> 9 (2,2) -> 10 (2,3) -> 11

(3,0) -> 12 (3,1) -> 13 (3,2) -> 14 (3,3) -> 15
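The numbering above is the row-major mapping (row, col) -> 4*row + col; a small Python helper (the function names are my own) makes it explicit:

```python
def node_id(row, col, width=4):
    """Row-major numbering of a width x width grid: (row, col) -> row*width + col."""
    return row * width + col

def coords(nid, width=4):
    """Inverse mapping: node id -> (row, col)."""
    return divmod(nid, width)
```

For the 4 x 4 grid this reproduces the table, e.g. (3, 2) -> 14.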

3. w=(0,1,4,5,8,9,12,13)

4. w=(0,1,4,5,8,9,12,13,3,7,11,15,2,6,10,14)

5.

The westward edges used by (0, 1, 4, 5, 8, 9, 12, 13) form a cycle, so these nodes do not conflict; the eastward edges used by (3, 7, 11, 15, 2, 6, 10, 14) likewise connect their nodes in a way that does not create conflicts.

6.
Route left: (0, 1, 4, 5, 8, 9, 12, 13)

Route right: (3, 7, 11, 15, 2, 6, 10, 14)

Exercise 6:

Case 1: cy(0) + th < e(0) - ts

The sender transmits bit x for a time greater than the clock skew ts. During the interval [e(0) - ts, e(0)], x is guaranteed to be stable on the bus before the receiver's first clock edge in cycle i: cy(k) - R_cy(k) < e(0) - ts for all k ∈ [0, 6].

Case 2: cy(0) + th ≥ e(0) - ts

The receiver might miss x in cycle i because of the clock alignment: cy(i) - R_cy(i) ≥ 0 (this applies only to cycle i). The stable sender transmission and the subsequent cycles guarantee correct sampling: cy(i + k) - R_cy(i + k) < e(0) - ts for k ∈ [1, 6].

In both cases, the receiver samples the correct x for at least 7 consecutive cycles, starting from cycle β = 0 (Case 1) or β = 0 or β = 1 (Case 2).
