1 Intro To HPC Compressed 1 Part 1
Introduction to
High Performance Computing
University of Wyoming
May 22 – 26, 2017
1
What is Supercomputing or High
Performance Computing?
• The definition of supercomputing is constantly changing.
Today's supercomputers perform quadrillions of floating-point operations per
second (FLOPS), that is, petaFLOPS (PFLOPS).
• High-performance computing (HPC) utilizes parallel processing for
running large and advanced application programs efficiently. The term
applies especially to systems that function above a hundred teraflops.
The Top500 list has several multi-petaflop systems in the top 50.
• HPC aggregates computing power in a way that delivers much higher
performance than one could get out of a typical desktop computer or
workstation in order to solve large problems in science, engineering, or
business.
• Supercomputers were introduced in the 1960s. The first ones were designed by
Seymour Cray at Control Data Corporation, and Cray went on to lead the HPC
industry for decades.
• To me, personally, it is an ecosystem that provides users with a
high-performance computational, networking, storage, and analysis
platform, and the software stack needed to stitch these resources together.
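To make the FLOPS units above concrete, here is a minimal sketch of the usual back-of-the-envelope estimate of theoretical peak performance. The cluster size, clock rate, and FLOPs-per-cycle figure are hypothetical values chosen for illustration, not numbers from this course.

```python
# Unit prefixes for floating-point operations per second (FLOPS).
GIGA, TERA, PETA = 1e9, 1e12, 1e15

def peak_flops(nodes, cores_per_node, clock_hz, flops_per_cycle):
    """Theoretical peak = nodes x cores x clock rate x FLOPs per core per cycle."""
    return nodes * cores_per_node * clock_hz * flops_per_cycle

# Hypothetical cluster: 1,000 nodes, 32 cores each, 2.5 GHz, 16 FLOPs/cycle/core.
peak = peak_flops(nodes=1000, cores_per_node=32, clock_hz=2.5e9, flops_per_cycle=16)
print(f"{peak / TERA:.0f} TFLOPS = {peak / PETA:.2f} PFLOPS")  # 1280 TFLOPS = 1.28 PFLOPS
```

Real applications reach only a fraction of this peak, which is why measured benchmark results are what lists such as the Top500 report.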
2
Fastest Supercomputer vs. Moore
[Figure: performance of the fastest supercomputer compared with a Moore's-law
projection, 1993 to 2012, shown on logarithmic and linear scales.
x-axis: Year; y-axis: GFLOPs (billions of calculations per second).
Annotation: 1993: 1024 CPU cores.]

Year   Fastest (GFLOPs)   Moore (GFLOPs)
1993         59.7                60
1994        143.4
1995        170.4
1996        220.4               240
1997       1068
1998       1338
1999       2121.3               960
2000       2379
2001       7226
2002      35860                3840
2003      35860
2004      35860
2005     136800               15360
2006     280600
2007     280600
2008    1375780               61440
2009    1456700
2010    1759000
2011    8162000              245760
2012   16324750

Sources: www.top500.org, http://www.mooreslaw.org/
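The Moore values listed with this figure (60, 240, 960, 3840, 15360, 61440, 245760) grow fourfold at each step. That is consistent with a doubling roughly every 18 months from 60 GFLOPs in 1993. The sketch below reproduces the projection under that assumption and compares it with the fastest-system values. The 18-month period and the function name are inferred for illustration, not stated in the slides.

```python
def moore_gflops(year, base_year=1993, base_gflops=60.0, doubling_years=1.5):
    """Projected GFLOPs, assuming performance doubles every `doubling_years`."""
    return base_gflops * 2 ** ((year - base_year) / doubling_years)

# Fastest-system values (GFLOPs) as listed with the figure.
fastest = {1993: 59.7, 1996: 220.4, 1999: 2121.3, 2002: 35860,
           2005: 136800, 2008: 1375780, 2011: 8162000}

for year, measured in fastest.items():
    projected = moore_gflops(year)
    print(f"{year}: fastest {measured:>10.1f}  Moore {projected:>8.0f}  "
          f"ratio {measured / projected:5.1f}x")
```

By 2011 the fastest system is more than 30 times ahead of the projection. Top systems gain speed mainly by adding processors, not only by using faster ones.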
3
What is Supercomputing About?
Size
Speed
4
What is Supercomputing About?
5
What is HPC Used For?
• Simulation of physical phenomena by developing a model
that represents the key characteristics of the selected
physical or abstract system or process [1]. Areas where
simulation is heavily used:
• Weather forecasting
• Galaxy formation
• Oil reservoir management
6
Supercomputing Issues
• Scaling issues
• High-speed interconnect
• Software stack
• Facility
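One common way to reason about the scaling issue listed above is Amdahl's law: if only a fraction p of a program can run in parallel, the speedup on n workers is 1 / ((1 - p) + p / n). The sketch below evaluates it for a hypothetical code that is 95% parallel. The fraction is an assumption for illustration.

```python
def amdahl_speedup(parallel_fraction, workers):
    """Amdahl's law: speedup when only part of the work can be parallelized."""
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / workers)

# Hypothetical application in which 95% of the runtime is parallelizable.
for n in (1, 16, 256, 4096):
    print(f"{n:>5} cores: {amdahl_speedup(0.95, n):6.2f}x")
```

However many cores are added, the speedup here stays below 1 / 0.05 = 20x, because the serial 5% eventually dominates the runtime.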
7
What is a “Cluster”?
8
What Does a Cluster
Look Like?
Network View
9
10
What Does a Cluster Look Like?
11
Cluster Components
All Components Working Together
• Computational resources
• Management infrastructure
• High-speed interconnect
12
Cluster Components
All Components Working Together
User Applications
13
Computational Resources
14
Processor Type Examples
• x86 Architecture:
• x86 is the instruction set architecture (ISA) behind Intel's most successful line of processors.
• Examples: Xeon and Xeon Phi (many-core) from Intel, Opteron from AMD
• GPGPU or GPU:
• General-purpose computing on graphics processing units is the use of a graphics processing
unit (GPU), which typically handles computation only for computer graphics, to perform
computation in applications traditionally handled by the central processing unit (CPU).
• NVIDIA and AMD, among others, design GPUs used for GPGPU computing; board vendors such as
ASUS build cards around them.
• POWER (Performance Optimization With Enhanced RISC):
• IBM has a series of high-performance microprocessors called POWER.
• IBM launched the OpenPOWER Foundation for collaboration on its Power Architecture in
2013. Google, Tyan, Nvidia, and Mellanox are founding members.
• ARM (Advanced RISC Machines)
• CPUs based on the RISC (reduced instruction set computer) architecture developed by
Advanced RISC Machines (ARM).
• Companies using ARM cores on their chips are Qualcomm, Samsung Electronics, Texas
Instruments, and Cavium among others.
15
Storage and File System
16
Management Infrastructure
• Cluster Management
• Service Nodes
• Head/Login Nodes
• Facility
17
Cluster Management
• Fabric management
• Parallel shell
• One ring to rule them all, one ring to find them, one ring to
bring them all, and in the darkness bind them.
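A parallel shell runs the same command on many nodes at once and collects the results. The following is a minimal sketch of that idea, not any particular cluster tool. It assumes password-less `ssh` access to the nodes, and the host names in the usage note are hypothetical.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor

def run_on_host(host, command, runner=("ssh",)):
    """Run `command` on `host` via `runner`; return (host, exit code, stdout)."""
    result = subprocess.run([*runner, host, command],
                            capture_output=True, text=True)
    return host, result.returncode, result.stdout.strip()

def parallel_shell(hosts, command, runner=("ssh",), max_workers=8):
    """Run `command` on all hosts concurrently; results keep the host order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(lambda h: run_on_host(h, command, runner), hosts))
```

A call such as `parallel_shell(["node01", "node02"], "uptime")` would return one `(host, exit code, output)` tuple per node. The `runner` parameter exists only so that the sketch can be tried without `ssh`, for example by substituting a local command.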
18
Facility
• Power
• 120 V, 208 V, 440 V, etc.
• 3-Phase, DC
• N + N or N+1 Redundancy
• UPS, Generators
• Cooling
• Water cooled or air cooled
• Racks
19
HPC Software Stack
• Operating System
• File System
• Scheduler/Resource Manager
20
High-Speed Interconnect
21
HPC Applications and Workflow
• Parallelism:
• Speedup is not linear
• Dependencies
• Tuning:
• Race conditions
• Mutual exclusion
• Synchronization
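The tuning terms above can be illustrated with a small shared-counter sketch. Several threads update the same value. Without protection, the read-modify-write update can interleave and lose increments, which is a race condition. A lock provides mutual exclusion, and `join()` is the synchronization point. The thread count and iteration count are arbitrary choices for illustration.

```python
import threading

def add_many(counter, lock, n):
    """Increment the shared counter n times, one thread at a time."""
    for _ in range(n):
        with lock:                  # mutual exclusion around the update
            counter["value"] += 1   # read-modify-write on shared state

counter = {"value": 0}
lock = threading.Lock()
threads = [threading.Thread(target=add_many, args=(counter, lock, 10_000))
           for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()                        # synchronization: wait for every worker

print(counter["value"])  # 40000
```

The lock guarantees the correct total but serializes the update, which is one reason speedup is not linear: time spent waiting on shared state is time not spent computing.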
22