CSE - 313
Computer Architecture
Faculty: Shoib Ahmed Shourav
United International University
Summer 2021

Parallel Processors from Client to Cloud

Introduction
• Goal: connecting multiple computers to get higher performance
  • Multiprocessors
  • Scalability, availability, power efficiency
• Task-level (process-level) parallelism
  • High throughput for independent jobs
• Parallel processing program
  • Single program run on multiple processors
• Multicore microprocessors
  • Chips with multiple processors (cores)

Hardware and Software
• Hardware
  • Serial: e.g., Pentium 4
  • Parallel: e.g., quad-core Xeon e5345
• Software
  • Sequential: e.g., matrix multiplication
  • Concurrent: e.g., operating system
• Sequential/concurrent software can run on serial/parallel hardware
• Challenge: making effective use of parallel hardware

Parallel Programming
• Parallel software is the problem
• Need to get significant performance improvement
  • Otherwise, just use a faster uniprocessor, since it’s easier!
• Difficulties (see the sketch after this list)
  • Partitioning
  • Coordination
  • Communications overhead
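The difficulties above are easiest to see in code. The following is a minimal Python sketch, not taken from the slides, of a parallel sum using the standard multiprocessing module; the function names (parallel_sum, partial_sum) and the choice of four workers are illustrative assumptions. Partitioning shows up as the chunking step, coordination as gathering the partial results, and communication overhead as the cost of shipping chunks to the worker processes.

```python
# Minimal sketch (not from the slides) of the listed difficulties.
from multiprocessing import Pool

def partial_sum(chunk):
    # Each worker handles one partition of the data.
    return sum(chunk)

def parallel_sum(data, n_workers=4):
    # Partitioning: split the data into one chunk per worker.
    size = (len(data) + n_workers - 1) // n_workers
    chunks = [data[i:i + size] for i in range(0, len(data), size)]
    # Communication + coordination: ship chunks to workers, gather results.
    with Pool(n_workers) as pool:
        partials = pool.map(partial_sum, chunks)
    return sum(partials)

if __name__ == "__main__":
    print(parallel_sum(list(range(1_000_000))))
```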
Amdahl’s Law
• Sequential part can limit speedup
• T_new = T_parallelizable / N + T_sequential
• Speedup = 1 / ((1 − F_parallelizable) + F_parallelizable / N)
• Here T is time, F is the fraction of the program that can be parallelized, and N is the number of processors
• Example: a 90× speedup on 100 processors requires solving 1 / ((1 − F) + F/100) = 90, which gives F ≈ 0.999
• Need sequential part to be 0.1% of original time

Scaling Example 1
• Workload: sum of 10 scalars, and 10 × 10 matrix sum
  • Speed up from 10 to 100 processors
• Single processor: Time = (10 + 100) × tadd
• 10 processors
  • Time = 10 × tadd + 100/10 × tadd = 20 × tadd
  • Speedup = 110/20 = 5.5 (55% of potential)
• 100 processors
  • Time = 10 × tadd + 100/100 × tadd = 11 × tadd
  • Speedup = 110/11 = 10 (10% of potential)
• Assumes load can be balanced across processors

Scaling Example 2
• What if matrix size is 100 × 100?
• Single processor: Time = (10 + 10000) × tadd
• 10 processors
  • Time = 10 × tadd + 10000/10 × tadd = 1010 × tadd
  • Speedup = 10010/1010 = 9.9 (99% of potential)
• 100 processors
  • Time = 10 × tadd + 10000/100 × tadd = 110 × tadd
  • Speedup = 10010/110 = 91 (91% of potential)
• Assuming load balanced (a short sketch reproducing these numbers follows)
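As a cross-check on the arithmetic above, this small Python sketch recomputes the times and speedups of Scaling Examples 1 and 2 in units of tadd. The function name summing_time and its structure are illustrative assumptions, not part of the slides.

```python
# Small sketch (not from the slides) recomputing Scaling Examples 1 and 2.

def summing_time(scalar_adds, matrix_side, processors):
    # The scalar additions are assumed to stay sequential; the matrix-sum
    # additions are divided evenly across the processors.
    return scalar_adds + (matrix_side * matrix_side) / processors

for side in (10, 100):
    single = summing_time(10, side, 1)
    for p in (10, 100):
        t = summing_time(10, side, p)
        speedup = single / t
        print(f"{side} x {side} matrix on {p} processors: "
              f"time = {t:g} x t_add, speedup = {speedup:.1f} "
              f"({100 * speedup / p:.0f}% of potential)")
```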
Strong vs Weak Scaling
• Strong scaling: problem size fixed, as in the examples above
• Weak scaling: problem size proportional to number of processors
  • 10 processors, 10 × 10 matrix
    • Time = 20 × tadd
  • 100 processors, 32 × 32 matrix
    • Time = 10 × tadd + 1000/100 × tadd = 20 × tadd
  • Constant performance in this example (see the sketch below)
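The weak-scaling case above can be checked the same way. In this minimal sketch, with an assumed helper name weak_scaling_time, the matrix grows roughly in proportion to the processor count, so the total time stays close to 20 × tadd.

```python
# Minimal sketch (not from the slides) of the weak-scaling case above.

def weak_scaling_time(scalar_adds, matrix_elements, processors):
    # The scalar additions stay sequential; the matrix additions are
    # divided evenly across the processors.
    return scalar_adds + matrix_elements / processors

for processors, side in ((10, 10), (100, 32)):
    t = weak_scaling_time(10, side * side, processors)
    print(f"{processors} processors, {side} x {side} matrix: "
          f"time = {t:.1f} x t_add")
```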
Any Questions?

Reference: Chapter 6, Computer Organization and Design, Fifth Edition: The Hardware/Software Interface (The Morgan Kaufmann Series in Computer Architecture and Design).