
Numerical Modelling in Fortran: day 8
Paul Tackley, 2017
Today's Goals
1. Introduction to parallel computing (applicable to Fortran or C; examples are in Fortran)
2. Finite Prandtl number convection
Motivation
To model the Earth, a huge number of grid points / cells / elements is needed!
• e.g., to fill the mantle volume:
– (8 km)³ cells -> 1.9 billion cells
– (2 km)³ cells -> 123 billion cells
Huge problems => huge computer

www.top500.org
Progress: an iPhone is faster than the fastest computer in 1976 (cost: $8 million)

(photo taken at NCAR museum)


In Switzerland
Each node: 12-core Intel CPU + GPU

Piz Dora
Each node: 2× 18-core Intel CPU

Shared memory: several CPUs (or cores) share the same memory. Parallelisation can often be done by the compiler (sometimes with help, e.g., OpenMP directives in the code).

Distributed memory: each CPU has its own memory. Parallelisation usually requires message-passing, e.g. using MPI (Message Passing Interface).
A brief history of supercomputers
• 1983-5: 4 CPUs, shared memory
• 1991: 512 CPUs, distributed memory
• 2010: 224,162 cores, distributed + shared memory (12 cores per node)
Another possibility: build your own ("Beowulf" cluster), using standard PC cases or rack-mounted cases.
MPI: Message Passing Interface
• A standard library for communicating between different tasks (CPUs)
– Pass messages (e.g., arrays)
– Global operations (e.g., sum, maximum)
– Tasks could be on different CPUs/cores of the same node, or on different nodes
• Works with Fortran and C
• Works on everything from a laptop to the largest supercomputers. Two implementations are:
– MPICH: http://www.mcs.anl.gov/research/projects/mpich2/
– Open MPI: http://www.open-mpi.org/
How to parallelise a code: worked example

Example: scalar Poisson equation ∇²u = f

Finite-difference approximation:

(1/h²) ( u(i+1,j,k) + u(i−1,j,k) + u(i,j+1,k) + u(i,j−1,k) + u(i,j,k+1) + u(i,j,k−1) − 6u(i,j,k) ) = f(i,j,k)

Use an iterative approach: start with u = 0, then sweep through the grid updating the u values according to

u_new(i,j,k) = u(i,j,k) + α (h²/6) R(i,j,k)

where R is the residue ("error"): R = ∇²u − f
Code
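The code itself appears on the original slide as an image. A minimal serial sketch of such an iteration (assuming an n×n×n grid of spacing h, u = 0 on the boundaries, and illustrative names throughout) might look like:

program poisson_sweep
  implicit none
  integer, parameter :: n = 33
  real, parameter :: alpha = 1.0      ! relaxation parameter
  real :: u(n,n,n), f(n,n,n), h, R
  integer :: i, j, k, iter

  h = 1.0/real(n-1)
  u = 0.0                             ! start with u = 0
  f = 1.0                             ! illustrative right-hand side

  do iter = 1, 500
     do k = 2, n-1
        do j = 2, n-1
           do i = 2, n-1
              ! residue R = del^2(u) - f at this point
              R = (u(i+1,j,k) + u(i-1,j,k) + u(i,j+1,k) + u(i,j-1,k) &
                 + u(i,j,k+1) + u(i,j,k-1) - 6.0*u(i,j,k))/h**2 - f(i,j,k)
              ! update u by alpha*h^2/6 * R (in place, Gauss-Seidel style)
              u(i,j,k) = u(i,j,k) + alpha*h**2/6.0*R
           end do
        end do
     end do
  end do
end program poisson_sweep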
Parallelisation: domain decomposition

(Figure: the same grid on a single CPU, and split among 8 CPUs labelled CPU 0 … CPU 7.)

Each CPU will do the same operations but on different parts of the domain.
You need to build parallelisation into the code using MPI
• Any scalar code will run on multiple CPUs, but will produce the same result on each CPU.
• The code must first set up the local grid in relation to the global grid, then handle communication
• Only a few MPI calls are needed:
– Initialisation (MPI_Init, MPI_Comm_size, MPI_Comm_rank)
– Global combinations (MPI_Allreduce)
– CPU-CPU communication (MPI_Send, MPI_Recv, …)
Boundaries
• When updating points at the edge of a subdomain, values on neighbouring subdomains are needed
• Hold copies of these locally using "ghost points"
• This minimizes the number of messages, because the ghost points can be updated all at once instead of individually

(Figure: ghost points surrounding each subdomain.)
Scalar grid
(Figure: red = boundary points (= 0); yellow = iterated/solved points (1…n−1).)

Parallel grids
(Figure: red = external boundaries; green = internal boundaries; yellow = iterated/solved points.)
First things the code has to do:
• Call MPI_Init(ierr)
• Find the number of CPUs using MPI_Comm_size
• Find which CPU it is, using MPI_Comm_rank (returns a number from 0 … #CPUs−1)
• Calculate which part of the global grid it is dealing with, and which other CPUs are handling neighbouring subdomains.
Example: “Hello world” program
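The program is shown as an image on the original slide; the standard MPI "hello world" in Fortran looks roughly like this (variable names are illustrative):

program hello
  use mpi
  implicit none
  integer :: ierr, nprocs, rank

  call MPI_Init(ierr)
  call MPI_Comm_size(MPI_COMM_WORLD, nprocs, ierr)   ! total number of tasks
  call MPI_Comm_rank(MPI_COMM_WORLD, rank, ierr)     ! this task's number (0 ... nprocs-1)
  print *, 'Hello world from rank', rank, 'of', nprocs
  call MPI_Finalize(ierr)
end program hello

Compiled with an MPI wrapper compiler (e.g. mpif90) and launched with e.g. mpirun -np 4 ./hello, each rank prints its own line.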
Moving forward
• Update values in the subdomain using 'ghost points' as the boundary condition, i.e.,
– Timestep (explicit), or
– Iteration (implicit)
• Update ghost points by communicating with other CPUs
• Works well for explicit or iterative approaches
Boundary communication
• Step 1: x-faces
• Step 2: y-faces (including corner values from step 1)
• [Step 3: z-faces (including corner values from steps 1 & 2)]

Doing the 3 directions sequentially avoids the need for additional messages for edges & corners (=> in 3D, 6 messages instead of 26).
Main changes
• Parallelisation hidden in set_up_parallelisation and update_sides
• Many new variables to store parallelisation information
• Loop limits depend on whether a boundary is a global domain boundary or an internal subdomain boundary
Simplest communication

Not optimal – uses blocking send/receive (MPI_Send/MPI_Recv).
Better: use non-blocking communication (MPI_Isend/MPI_Irecv).
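As an illustration (not the course's update_sides routine; array and neighbour names are assumed), a non-blocking exchange of the two ghost planes in the last (z) index, which is contiguous in Fortran memory, could look like this. The array a carries one layer of ghost points on each side, and below/above are the neighbouring ranks (MPI_PROC_NULL at external boundaries):

subroutine update_z_sides(a, nxl, nyl, nzl, below, above)
  use mpi
  implicit none
  integer, intent(in) :: nxl, nyl, nzl, below, above
  real, intent(inout) :: a(0:nxl+1, 0:nyl+1, 0:nzl+1)
  integer :: req(4), ierr, n

  n = (nxl+2)*(nyl+2)                    ! points in one z-plane (contiguous)
  ! post receives into the ghost planes ...
  call MPI_Irecv(a(0,0,0),     n, MPI_REAL, below, 1, MPI_COMM_WORLD, req(1), ierr)
  call MPI_Irecv(a(0,0,nzl+1), n, MPI_REAL, above, 2, MPI_COMM_WORLD, req(2), ierr)
  ! ... and send the outermost interior planes to the neighbours
  call MPI_Isend(a(0,0,nzl),   n, MPI_REAL, above, 1, MPI_COMM_WORLD, req(3), ierr)
  call MPI_Isend(a(0,0,1),     n, MPI_REAL, below, 2, MPI_COMM_WORLD, req(4), ierr)
  call MPI_Waitall(4, req, MPI_STATUSES_IGNORE, ierr)
end subroutine update_z_sides

The x- and y-faces are not contiguous in memory, so in practice they are either copied into temporary buffers or described with MPI derived datatypes before sending.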
Performance: theoretical analysis
How much time is spent communicating?
• Computation time ∝ volume (Nx³)
• Communication time ∝ surface area (Nx²)
• => Communication/Computation ∝ 1/Nx
• => Have as many points per CPU as possible!
Is it better to split 1D, 2D or 3D?
• E.g., 256×256×256 points on 64 CPUs
• 1D split: 256×256×4 points/CPU
– Area = 2×(256×256) = 131,072
• 2D split: 256×32×32 points/CPU
– Area = 4×(256×32) = 32,768
• 3D split: 64×64×64 points/CPU
– Area = 6×(64×64) = 24,576
• => 3D is best, but more messages are needed (see the sketch below)
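The slides do not show how the split is chosen in the code; one common way to set up a 3D decomposition and find the neighbouring ranks (an assumption here, not necessarily what the course code or StagYY does) is MPI's Cartesian topology routines:

program setup_decomposition
  use mpi
  implicit none
  integer :: dims(3), coords(3), comm_cart, nprocs, rank, ierr
  integer :: left, right, front, back, below, above
  logical :: periods(3)

  call MPI_Init(ierr)
  call MPI_Comm_size(MPI_COMM_WORLD, nprocs, ierr)
  dims = 0; periods = .false.
  call MPI_Dims_create(nprocs, 3, dims, ierr)              ! e.g. 64 CPUs -> 4x4x4
  call MPI_Cart_create(MPI_COMM_WORLD, 3, dims, periods, .true., comm_cart, ierr)
  call MPI_Comm_rank(comm_cart, rank, ierr)
  call MPI_Cart_coords(comm_cart, rank, 3, coords, ierr)   ! my position in the process grid
  ! neighbouring ranks in each direction (MPI_PROC_NULL at domain edges)
  call MPI_Cart_shift(comm_cart, 0, 1, left,  right, ierr)
  call MPI_Cart_shift(comm_cart, 1, 1, front, back,  ierr)
  call MPI_Cart_shift(comm_cart, 2, 1, below, above, ierr)
  call MPI_Finalize(ierr)
end program setup_decomposition

From coords and dims, each rank can then work out which slice of the global grid it owns.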
Model code performance (time per step or iteration)

Computation: t = a N³
Communication: t = n L + b N²/B   (L = latency, B = bandwidth)

TOTAL: t = a N³ + n L + b N²/B

Example: scalar Poisson equation ∇²u = f

t = a N³ + n L + b N²/B
Assume 15 operations/point/iteration & 1 Gflop/s performance
=> a = 15/1e9 = 1.5e-8
If 3D decomposition: n = 6, b = 6×4 (single precision)

Gigabit ethernet: L = 40e-6 s, B = 100 MB/s
Quadrics: L = 2e-6 s, B = 875 MB/s
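As an illustration (numbers not from the slides): with N = 64 points per side per CPU on gigabit ethernet, t ≈ 1.5e-8·64³ + 6·40e-6 + 24·64²/1e8 ≈ 3.9e-3 + 2.4e-4 + 1.0e-3 ≈ 5.2e-3 s per iteration, so communication is roughly a quarter of the total; with Quadrics the communication terms drop to about 1.2e-4 s.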
(Figures: time per iteration vs. number of CPUs for a Quadrics network, Gonzales-size cluster; scaling up to 2e5 CPUs with Quadrics communication; parallel efficiency.)
Now multigrid V-cycles
(Figure: a V-cycle. Smooth on 32×32×32, restrict the residues (= error) to 16×16×16, smooth, restrict to 8×8×8, smooth, solve exactly on 4×4×4, then pass corrections back up the levels.)
Application to StagYY (Cartesian or spherical)

(Figure: StagYY iterations, 3D Cartesian; note the change in scaling from same-node to cross-node communication.)

Simple-minded multigrid: very inefficient coarse levels! The exact coarse solution can take a long time!
New treatment: follow minima
• Keep #points/core above a minimum (tuned for the system)
• Different minima for on-node and cross-node communication

Multigrid – now (& before): yin-yang
(Figure: yin-yang grid scaling; 1.8 billion.)
Summary
• For very large-scale problems, the code needs to be parallelised using MPI
• For finite-difference codes, the best method is to assign different parts of the domain to different CPUs ("domain decomposition")
• The code looks similar to before, but with some added routines to take care of communication
• Multigrid scales fine on 1000s of CPUs if:
– Coarse grids are treated on subsets of the CPUs
– The total problem size is large enough
For more information
• https://computing.llnl.gov/tutorials/parallel_comp/
• http://en.wikipedia.org/wiki/Parallel_computing
• http://www.mcs.anl.gov/~itf/dbpp/
• http://en.wikipedia.org/wiki/Message_Passing_Interface
Programming:
Finite Prandtl number convection (i.e., almost any fluid)

Ludwig Prandtl (1875-1953)

Values of the Prandtl number Pr
Pr = ν/κ   (ν = viscous diffusivity, κ = thermal diffusivity)
• Liquid metals: 0.004-0.03
• Air: 0.7
• Water: 1.7-12
• Rock: ~10²⁴ !!! (effectively infinite)
Finite-Prandtl number convection
• The existing code assumes infinite Prandtl number
– also known as Stokes flow
– appropriate for highly viscous fluids like rock, honey, etc.
• Fluids like water, air and liquid metal have a lower Prandtl number, so the equations must be modified
Applications for finite Pr
• Outer core (geodynamo)
• Atmosphere
• Ocean
• Anything that’s not solid like the mantle
Equations
• Conservation of mass (= 'continuity')
• Conservation of momentum ('Navier-Stokes' equation: F=ma for a fluid)
• Conservation of energy

Claude Navier (1785-1836), Sir George Stokes (1819-1903)
Finite Pr Equations
Navier-Stokes equation (F = ma for a fluid):

ρ(∂v/∂t + v·∇v) = −∇P + ρν∇²v + 2ρΩ×v + gραT ŷ

The left-hand side is the "ma" term; 2ρΩ×v is the Coriolis force. Valid for constant viscosity only.

The continuity and energy equations are the same as before:

∇·v = 0
∂T/∂t + v·∇T = κ∇²T + Q

ρ = density, ν = kinematic viscosity, g = gravity, α = thermal expansivity
Non-dimensionalise the equations
• Reduces the number of parameters
• Makes it easier to identify the dynamical regime
• Facilitates comparison of systems with different scales but similar dynamics (e.g., analogue laboratory experiments compared to the core or mantle)
Non-dimensionalise to thermal diffusion scales
• Length scale: D (depth of domain)
• Temperature scale: ΔT (temperature drop over the domain)
• Time scale: D²/κ
• Velocity scale: κ/D
• Stress scale: ρνκ/D²
Non-dimensional equations

∇·v = 0
∂T/∂t + v·∇T = ∇²T
(1/Pr)(∂v/∂t + v·∇v) = −∇P + ∇²v + (Ω̂×v)/Ek + Ra T ŷ

Pr = ν/κ (Prandtl number),  Ek = ν/(2ΩD²) (Ekman number),  Ra = gαΔTD³/(νκ) (Rayleigh number)
As before, use the streamfunction:

vx = ∂ψ/∂y,   vy = −∂ψ/∂x

Also simplify by assuming 1/Ek = 0
Eliminating pressure
• Take the curl of the 2D momentum equation: the curl of a gradient is zero, so the pressure disappears
• Replace velocity by vorticity: ω = ∇×v
• In 2D only one component of vorticity is needed (the one perpendicular to the 2D plane), which satisfies ∇²ψ = ω

(1/Pr)(∂ω/∂t + vx ∂ω/∂x + vy ∂ω/∂y) = ∇²ω − Ra ∂T/∂x
=> the streamfunction-vorticity formulation:

(1/Pr)(∂ω/∂t + vx ∂ω/∂x + vy ∂ω/∂y) = ∇²ω − Ra ∂T/∂x
∇²ψ = ω
(vx, vy) = (∂ψ/∂y, −∂ψ/∂x)
∂T/∂t + v·∇T = ∇²T + Q
Note: effect of high Pr

(1/Pr)(∂ω/∂t + vx ∂ω/∂x + vy ∂ω/∂y) = ∇²ω − Ra ∂T/∂x

If Pr -> infinity, the left-hand side -> 0, so the equation becomes a Poisson equation like before:

∇²ω = Ra ∂T/∂x
Taking a timestep
(i) Calculate ψ from ω by solving ∇²ψ = ω
(ii) Calculate v from ψ: (vx, vy) = (∂ψ/∂y, −∂ψ/∂x)
(iii) Time-step ω and T using explicit finite differences:

∂T/∂t = −vx ∂T/∂x − vy ∂T/∂y + ∇²T
∂ω/∂t = −vx ∂ω/∂x − vy ∂ω/∂y + Pr ∇²ω − Ra Pr ∂T/∂x
The T time step is the same as before:

(T_new − T_old)/Δt = −vx ∂T_old/∂x − vy ∂T_old/∂y + ∇²T_old
T_new = T_old + Δt (∇²T_old − vx ∂T_old/∂x − vy ∂T_old/∂y)

ω must now be time-stepped in a similar way:

(ω_new − ω_old)/Δt = −vx ∂ω_old/∂x − vy ∂ω_old/∂y + Pr ∇²ω_old − Ra Pr ∂T_old/∂x
ω_new = ω_old + Δt (Pr ∇²ω_old − vx ∂ω_old/∂x − vy ∂ω_old/∂y − Ra Pr ∂T_old/∂x)
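A minimal sketch of these two updates with centred differences on a uniform grid of spacing h (the routine name and variable names T, w, vx, vy, Told, wold are illustrative, not the course code; boundary points are assumed to be handled separately):

subroutine explicit_step(T, w, vx, vy, nx, ny, h, dt, Ra, Pr)
  implicit none
  integer, intent(in) :: nx, ny
  real, intent(in) :: h, dt, Ra, Pr
  real, intent(inout) :: T(nx,ny), w(nx,ny)
  real, intent(in) :: vx(nx,ny), vy(nx,ny)
  real :: Told(nx,ny), wold(nx,ny)
  real :: delsqT, delsqW, dTdx, dTdy, dwdx, dwdy
  integer :: i, j

  Told = T                      ! copies of the old fields
  wold = w
  do j = 2, ny-1
     do i = 2, nx-1
        ! centred finite differences for del^2 and first derivatives
        delsqT = (Told(i+1,j) + Told(i-1,j) + Told(i,j+1) + Told(i,j-1) - 4.0*Told(i,j))/h**2
        delsqW = (wold(i+1,j) + wold(i-1,j) + wold(i,j+1) + wold(i,j-1) - 4.0*wold(i,j))/h**2
        dTdx = (Told(i+1,j) - Told(i-1,j))/(2.0*h)
        dTdy = (Told(i,j+1) - Told(i,j-1))/(2.0*h)
        dwdx = (wold(i+1,j) - wold(i-1,j))/(2.0*h)
        dwdy = (wold(i,j+1) - wold(i,j-1))/(2.0*h)
        ! explicit updates of T and omega (w)
        T(i,j) = Told(i,j) + dt*(delsqT - vx(i,j)*dTdx - vy(i,j)*dTdy)
        w(i,j) = wold(i,j) + dt*(Pr*delsqW - vx(i,j)*dwdx - vy(i,j)*dwdy - Ra*Pr*dTdx)
     end do
  end do
end subroutine explicit_step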
Stability condition

Diffusion: dtdiff = adiff · h² / max(Pr, 1)
Advection: dtadv = aadv · min( h/maxval(abs(vx)), h/maxval(abs(vy)) )
Combined: dt = min(dtdiff, dtadv)
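In Fortran this maps almost directly onto intrinsics; a sketch (adiff and aadv are safety factors below 1, and the small constant is an added guard against division by zero if the velocity is still zero):

dtdiff = adiff*h**2/max(Pr, 1.0)
dtadv  = aadv*min(h/max(maxval(abs(vx)), 1.0e-10), h/max(maxval(abs(vy)), 1.0e-10))
dt     = min(dtdiff, dtadv)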


Modification of the previous convection program
• Replace the Poisson calculation of ω with a time-step, done at the same time as the T time-step
• Get a compiling code!
• Make sure it is stable and convergent for values of Pr between 0.01 and 1e2
• Hand in your code, and your solutions to the test cases in the following slides
• Due date: 18 December (2 weeks from today)
Test cases
• All have nx=257, ny=65, Ra=1e5, total_time=0.1, and random initial T and ω fields, unless otherwise stated
• Due to the random start, your results will not look exactly like these, but they should look similar (i.e., similar widths of upwellings & downwellings & boundary layers, but different numbers and placements of upwellings/downwellings).
(Test-case figures:)
• Pr=10
• Pr=1
• Pr=0.1
• Pr=0.01
• Pr=0.01, time=1.0
• Pr=0.1, Ra=1e7