OPENMP Notes
Summary:
1. GPUs are better suited than CPUs for tasks that can be processed in parallel and require high throughput rather than low latency. GPUs have simpler control hardware that allows for more computational units, making them more power efficient for parallel workloads.
2. Programming for GPUs requires an explicitly parallel programming model like CUDA and optimizing for throughput. Data must be copied between CPU and GPU memory, and kernels launched on the GPU to perform computation on the device.
3. The CPU acts as the host, launching kernels on the GPU device and managing data transfer between CPU and GPU memory via APIs like cudaMemcpy. Kernels define code to run identically on many parallel threads to leverage the GPU's parallel architecture.
OPENMP
1. Turning up the clock speed increases power consumption.
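As background for why this is so (a standard first-order CMOS result, not stated in these notes): dynamic power scales roughly as
$P_{\text{dynamic}} \approx \alpha \, C \, V^{2} \, f$
where $\alpha$ is the switching activity factor, $C$ the switched capacitance, $V$ the supply voltage, and $f$ the clock frequency. Raising $f$ typically also requires raising $V$, so power grows faster than linearly with clock speed.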
2. Many smaller, simpler processors.
3. (Feature size) As transistor size decreases, transistors run faster, consume less power, and more of them fit on a chip.
4. (Clock frequency) Clock speed also increased over time, but in the last few years it has stagnated.
5. Processors are getting faster because we have more transistors available for computation, not because we are clocking those transistors faster; clock speeds have been roughly constant for the past few years.
6. Why don't we keep increasing the clock speed? It is not that we cannot make transistors smaller or clock them faster; the problem is heat. Smaller transistors individually take less space and consume less power, but combining millions of them produces a lot of heat, which is hard to dissipate. So the main design constraint today is power, and instead of flooding a single processor with ever more transistors, we are moving toward more processors to run programs faster.
7. What kind of processors will we build? (Major design constraint: power)
CPU: complex control hardware; flexibility and performance; expensive in terms of power.
GPU: simpler control hardware (+); more hardware for computation (+); potentially more power efficient, measured in operations/watt (+); more restrictive programming model (-).
8. Latency: time required to complete a task.
9. Throughput: work done per unit of time.
10. The CPU optimises for latency, while the GPU optimises for throughput.
11. CORE GPU DESIGN TENETS: lots of simple compute units; trade simple control for more compute; an explicitly parallel programming model; optimise for throughput, not latency. (GPUs are therefore most suitable for workloads where throughput is the important metric.)
12. GPUs from the point of view of a software developer: the importance of programming in parallel. Example: an 8-core Ivy Bridge (Intel) CPU with 8-wide AVX vector operations per core and 2 threads per core (HyperThreading) gives 8 x 8 x 2 = 128-way parallelism.
13. Computers are heterogeneous for this task of parallelism: they have two different processors in them, (i) the CPU (the "HOST") and (ii) the GPU (the "DEVICE").
14. A plain sequential program will only run on the CPU; to utilise the GPU we use the CUDA programming model, written in C with extensions.
15. CUDA treats the GPU as a coprocessor to the CPU and assumes each has its own separate memory (physically allocated to both in the form of DRAM).
16. The CPU is in charge: it tells the GPU what to do.
Tasks involve (a code sketch follows this list):
Moving data from CPU to GPU (done via cudaMemcpy)
Copying data back from GPU to CPU (done via cudaMemcpy)
Allocating GPU memory (cudaMalloc)
Launching kernels on the GPU (the host launches kernels on the device)
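A minimal sketch of these host-side calls in CUDA C (ARRAY_SIZE and the square kernel follow the example later in these notes; error checking omitted):

const int ARRAY_SIZE  = 64;                        // hypothetical array size
const int ARRAY_BYTES = ARRAY_SIZE * sizeof(float);

float h_in[ARRAY_SIZE], h_out[ARRAY_SIZE];         // host (CPU) arrays, h_ = host
float *d_in, *d_out;                               // device (GPU) pointers, d_ = device

cudaMalloc((void **) &d_in,  ARRAY_BYTES);         // allocate GPU memory
cudaMalloc((void **) &d_out, ARRAY_BYTES);

cudaMemcpy(d_in, h_in, ARRAY_BYTES, cudaMemcpyHostToDevice);   // move data CPU -> GPU

square<<<1, ARRAY_SIZE>>>(d_out, d_in);            // host launches kernel on device

cudaMemcpy(h_out, d_out, ARRAY_BYTES, cudaMemcpyDeviceToHost); // copy data back GPU -> CPU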
17. The GPU can do the following:
Respond to a CPU request to SEND data (GPU -> CPU)
Respond to a CPU request to RECEIVE data (CPU -> GPU)
Compute a kernel launched by the CPU
((Advanced) GPUs can also launch their own kernels and copy data from the CPU)
18. A typical GPU program:
CPU allocates storage on the GPU (cudaMalloc)
CPU copies input data from CPU -> GPU (cudaMemcpy)
CPU launches kernel(s) on the GPU to process the data (kernel launch)
CPU copies results back from GPU -> CPU (cudaMemcpy)
[The program must have a high ratio of computation to communication. If communication is high but the computation on that communicated data is low, parallelism fails, so we must focus on doing a lot of computation per unit of data communicated.]
19. DEFINING THE GPU COMPUTATION: Kernels look like serial programs. Write your program as if it will run on one thread; the GPU will run that program on MANY THREADS.
20. What is the GPU good at? Efficiently launching lots of threads, and running lots of threads in parallel.
21. cudaMemcpy(d_in, h_in, ARRAY_BYTES, cudaMemcpyHostToDevice);
22. cudaMemcpy(h_out, d_out, ARRAY_BYTES, cudaMemcpyDeviceToHost);
23. square<<<1, ARRAY_SIZE>>>(d_out, d_in); // launch the kernel named square on 1 block of 64 elements
24. threadIdx.x gives a thread's index within its block.
25. Configuring the kernel launch:
kernel<<<GRID OF BLOCKS, BLOCK OF THREADS>>>(...)
dim3(w, 1, 1) == dim3(w) == w
square<<<1, 64>>> == square<<<dim3(1,1,1), dim3(64,1,1)>>>
Each block can have a maximum of 512 or 1024 threads, depending on the GPU.
square<<<dim3(bx,by,bz), dim3(tx,ty,tz), shmem>>>(...)
dim3(bx,by,bz) = grid of bx * by * bz blocks
dim3(tx,ty,tz) = blocks of tx * ty * tz threads each
shmem = shared memory per block, in bytes
threadIdx: thread index within its block (threadIdx.x, threadIdx.y, threadIdx.z)
blockDim: size of a block (threads per block)
blockIdx: block index within the grid
gridDim: size of the grid (blocks per grid)
A complete example putting these pieces together follows below.
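Putting items 18-25 together, a minimal complete sketch of the square program these notes quote (the exact source program may differ slightly):

#include <stdio.h>

// Kernel: written as if it runs on one thread; the GPU runs it on many threads.
__global__ void square(float *d_out, float *d_in) {
    int idx = threadIdx.x;            // this thread's index within its block
    float f = d_in[idx];
    d_out[idx] = f * f;
}

int main(void) {
    const int ARRAY_SIZE  = 64;
    const int ARRAY_BYTES = ARRAY_SIZE * sizeof(float);

    // Generate the input array on the host.
    float h_in[ARRAY_SIZE], h_out[ARRAY_SIZE];
    for (int i = 0; i < ARRAY_SIZE; i++) h_in[i] = (float) i;

    // 1. CPU allocates storage on the GPU.
    float *d_in, *d_out;
    cudaMalloc((void **) &d_in,  ARRAY_BYTES);
    cudaMalloc((void **) &d_out, ARRAY_BYTES);

    // 2. CPU copies input data CPU -> GPU.
    cudaMemcpy(d_in, h_in, ARRAY_BYTES, cudaMemcpyHostToDevice);

    // 3. CPU launches the kernel: 1 block of 64 threads,
    //    equivalent to square<<<dim3(1,1,1), dim3(64,1,1)>>>(d_out, d_in).
    square<<<1, ARRAY_SIZE>>>(d_out, d_in);

    // 4. CPU copies results back GPU -> CPU.
    cudaMemcpy(h_out, d_out, ARRAY_BYTES, cudaMemcpyDeviceToHost);

    for (int i = 0; i < ARRAY_SIZE; i++)
        printf("%f\n", h_out[i]);

    cudaFree(d_in);                   // free GPU memory
    cudaFree(d_out);
    return 0;
}

With more than one block, each thread's global index would be computed from the built-ins in item 25 as blockIdx.x * blockDim.x + threadIdx.x.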