0% found this document useful (0 votes)

292 views

Accelerating Computational Science and Engineering

Uploaded by

Ayush kumar singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

292 views

Accelerating Computational Science and Engineering

Uploaded by

Ayush kumar singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

PARALLEL COMPUTING:

ACCELERATING COMPUTATIONAL
SCIENCE AND ENGINEERING (CSE)
Advances in Parallel Computing
This book series publishes research and development results on all aspects of parallel computing.
Topics may include one or more of the following: high-speed computing architectures (Grids,
clusters, Service Oriented Architectures, etc.), network technology, performance measurement,
system software, middleware, algorithm design, development tools, software engineering,
services and applications.

Series Editor:
Professor Dr. Gerhard R. Joubert

Volume 25
Recently published in this series
Vol. 24. E.H. D’Hollander, J.J. Dongarra, I.T. Foster, L. Grandinetti and G.R. Joubert (Eds.),
Transition of HPC Towards Exascale Computing
Vol. 23. C. Catlett, W. Gentzsch, L. Grandinetti, G. Joubert and J.L. Vazquez-Poletti (Eds.),
Cloud Computing and Big Data
Vol. 22. K. De Bosschere, E.H. D’Hollander, G.R. Joubert, D. Padua and F. Peters (Eds.),
Applications, Tools and Techniques on the Road to Exascale Computing
Vol. 21. J. Kowalik and T. Puźniakowski, Using OpenCL – Programming Massively Parallel
Computers
Vol. 20. I. Foster, W. Gentzsch, L. Grandinetti and G.R. Joubert (Eds.), High Performance
Computing: From Grids and Clouds to Exascale
Vol. 19. B. Chapman, F. Desprez, G.R. Joubert, A. Lichnewsky, F. Peters and T. Priol (Eds.),
Parallel Computing: From Multicores and GPU’s to Petascale
Vol. 18. W. Gentzsch, L. Grandinetti and G. Joubert (Eds.), High Speed and Large Scale
Scientific Computing
Vol. 17. F. Xhafa (Ed.), Parallel Programming, Models and Applications in Grid and P2P
Systems
Vol. 16. L. Grandinetti (Ed.), High Performance Computing and Grids in Action
Vol. 15. C. Bischof, M. Bücker, P. Gibbon, G.R. Joubert, T. Lippert, B. Mohr and F. Peters
(Eds.), Parallel Computing: Architectures, Algorithms and Applications

Volumes 1–14 published by Elsevier Science.

ISSN 0927-5452 (print)

ISSN 1879-808X (online)
Parallel Com
P mputing
g:
Acceleratin
ng Com
mputattional
S
Science
e and Engin
E neeringg (CSEE)

Edited by
y
Michael Baader
Technissche Universsität München
n, Munich, Germany
G

A
Arndt Bod
de
Leibnizz Supercomp
puting Centree, Munich, Germany
G

Hans-Jo
oachim Bungartz
B
Technissche Universsität München
n, Munich, Germany
G

Micchael Gerrndt
Technissche Universsität München
n, Munich, Germany
G

Gerhard R. Jo
oubert
Teechnical Univversity Claussthal, Germa
any
and
Frrans Peteers
Philips Research,
R Neetherlands

Amstterdam • Berrlin • Tokyo • Washington, DC

All rights reserved. No part of this book may be reproduced, stored in a retrieval system,
or transmitted, in any form or by any means, without prior written permission from the publisher.

ISBN 978-1-61499-380-3 (print)

ISBN 978-1-61499-381-0 (online)
Library of Congress Control Number: 2014932893

Publisher
IOS Press BV
Nieuwe Hemweg 6B
1013 BG Amsterdam
Netherlands
fax: +31 20 687 0019
e-mail: order@iospress.nl

Distributor in the USA and Canada

IOS Press, Inc.
4502 Rachael Manor Drive
Fairfax, VA 22032
USA
fax: +1 703 323 3668
e-mail: iosbooks@iospress.com

LEGAL NOTICE
The publisher is not responsible for the use which might be made of the following information.

PRINTED IN THE NETHERLANDS

Parallel Computing: Accelerating Computational Science and Engineering (CSE) v
M. Bader et al. (Eds.)
IOS Press, 2014
© 2014 The authors and IOS Press. All rights reserved.

Preface
This volume of the series “Advances in Parallel Computing” contains the proceedings
of the International Conference on Parallel Programming – ParCo 2013 – held from 10
to 13 September 2013 in Garching, Germany. The conference was hosted by the
Technische Universität München (Department of Informatics) and the Leibniz Super-
computing Centre.
With ParCo 2013, the biennial ParCo conference series now looks back at 30 years
of top-level research in parallel algorithms, architectures and applications. It has finally
entered an era in which parallel computing – for many years the enabling technology of
high-end machines – is now ubiquitous and the key for the efficient use of any kind of
computer architecture: from embedded and personal up to exascale systems.
The trend towards heterogeneous architectures, multiple levels of parallelism and
towards higher and higher core numbers of supercomputing platforms, which was al-
ready addressed in the previous ParCo instances, can now be seen in full bloom. Paral-
lel programming models for multi- and manycore CPUs, GPUs, FPGAs, and heteroge-
neous platforms have been one of the clear focal points at ParCo 2013. In addition,
performance engineering processes, including analysis, tools and metrics, must be
adapted to these new and innovative platforms. It also becomes apparent from the con-
tributions that novel numerical algorithms are required: for basic tasks in numerical
linear algebra as well as for adaptive or space-time parallel simulations. Most important,
all these aspects need to be combined in the parallelisation and optimisation of large-
scale applications, in order to make parallel computing – including the processing of
large data sets (“Big Data”) – a persistent driver of research in many fields of science
and engineering.
ParCo 2013 strongly profited from its 12 mini-symposia (including an industry
session and a special PhD Symposium), which represented and intensified the discus-
sion of current “hot topics” in high performance and parallel computing in an excellent
manner. At least three mini-symposia were dedicated to large-scale supercomputing, in
particular. Three mini-symposia focused on novel challenges arising from parallel ar-
chitectures (multi-/manycore, heterogeneous platforms, FPGAs). A further mini-
symposium hotspot was established by the “multi”-challenges: multi-level algorithms
as well as multi-scale, multi-physics and multi-dimensional problems.
We would like to express our sincerest thanks to ParCo’s four keynote speakers –
Pete Beckman, Sudip Dosanjh, Wolfgang Nagel and Martin Schulz – who, in their
presentations, gave an exciting overview of both promises and challenges for the age of
exascale and Big Data. We are equally obliged to all presenters at the conference, all
authors and co-authors who contributed to these proceedings, and of course to all at-
tendees at ParCo 2013 – all of them contributed to the excellent scientific quality of the
vi

conference and to its inspiring atmosphere. Last, but definitely not least, special thanks
go to all (co-)organisers, including the mini-symposium organisers, to the members of
the international programme committee, and to all persons who assisted during the con-
ference.

Michael Bader
Arndt Bode
Hans-Joachim Bungartz
Michael Gerndt
Gerhard R. Joubert
Frans Peters

Date: 2013-12-01
vii

Conference Committee
Gerhard Joubert (Germany/Netherlands) (Conference Chair)
Michael Bader (Germany)
Arndt Bode (Germany)
Hans-Joachim Bungartz (Germany)
Michael Gerndt (Germany)
Frans Peters (Netherlands)

Advisory Committee
Thomas Lippert (Germany)
Thierry Priol (France)
Koen De Bosschere (Belgium)
Jack Dongarra (USA)

Minisymposium Committee
Tobias Weinzierl (Germany)
Miriam Mehl (Germany)

Organising & Exhibition Committee

Michael Bader (Germany)
Arndt Bode (Germany)
Hans-Joachim Bungartz (Germany)
Michael Gerndt (Germany)
Houssam Haitof (Germany)
Herbert Huber (Germany)
Carsten Trinitis (Germany)
Josef Weidendorfer (Germany)

Finance Committee
Frans Peters (Netherlands)
viii

Conference Programme Committee

Arndt Bode (Germany) (Chair)
Michael Bader (Germany) (Chair)
Rosa Badia (Spain) (Co-Chair)

Peter Arbenz (Switzerland) Bettina Krammer (France)

Pete Beckman (USA) Dieter Kranzlmüller (Germany)
Mark Bull (UK) Herbert Kuchen (Germany)
Andrea Clematis (Italy) Alexey Lastovetsky (Ireland)
Luisa D’Amore (Italy) Jin-Fu Li (Taiwan)
Erik D’Hollander (Belgium) Bernd Mohr (Germany)
Michel Dayde (France) Wolfgang E. Nagel (Germany)
Bjorn De Sutter (Belgium) Victor Pankratius (USA)
Frank Dehne (Canada) Christian Pérez (France)
Paul Feautrier (France) Oscar Plata (Spain)
Basilio Fraguela (Spain) Sabri Pllana (Austria)
Franz Franchetti (USA) Thierry Priol (France)
Efstratios Gallopoulos (Greece) Enrique Quintana-Ort (Spain)
William Gropp (USA) J. (Ram) Ramanujam (USA)
David Ham (UK) Dirk Roose (Belgium)
Torsten Hoefler (Switzerland) Gudula Rünger (Germany)
Lei Huang (USA) Peter Sanders (Germany)
Thomas Huckle (Germany) Martin Schulz (USA)
Hai Jin (China) Dirk Stroobandt (Belgium)
Wolfgang Karl (Germany) Tor Sørevik (Norway)
Christoph Kessler (Sweden) Domenico Talia (Italy)
Harald Köstler (Germany) Paco Tirado (Spain)
Markus Kowarschik (Germany) Denis Trystram (France)
ix

Programme Committees of Mini-Symposia

ParCo 2013 PhD Symposium

Josef Weidendorfer (Symposium Chair) (Germany)

Michael Bader (Symposium Co-Chair) (Germany)

Jens Breitbart (Germany)

Carsten Burstedde (Germany)
Karl Fürlinger (Germany)
Rainer Keller (Germany)
Harald Köstler (Germany)
Dirk Pflüger (Germany)
Martin Schulz (USA)
Carsten Trinitis (Germany)

ParaFPGA-2013: Parallel Computing with FPGA’s

Erik H. D’Hollander (Symposium Chair) (Belgium)
Dirk Stroobandt (Programme Committee Chair) (Belgium)
Abdellah Touhafi (Programme Committee Co-Chair) (Belgium)

Abbes Amira (United Kingdom)

Georgi Gaydadjiev (Netherlands)
Mike Hutton (USA)
Tsutomu Maruyama, (Japan)
Dionisios Pnevmatikos (Greece)
Viktor Prasanna (USA)
Mazen A.R. Saghir (Qatar)
Donatella Sciuto (Italy)
Sascha Uhrig (Germany)
Sotirios G. Ziavras (USA)

High-Dimensional Meets Parallel – Algorithms and Applications

Dirk Pflüger (Symposium Chair) (Germany)

Hans-Joachim Bungartz (Symposium Co-Chair) (Germany)
Markus Hegland (Symposium Co-Chair) (Australia)
x

Application Autotuning for HPC (Architectures)

Siegfried Benkner (Austria)

Matthias Brehm (Germany)
Michael Gerndt (Germany)
Wolfram Hesse (Germany)
Anna Sikora (Spain)

Extreme Scaling on SuperMUC

Ferdinand Jamitzky (Symposium Chair) (Germany)

Nikolay Hammer (Symposium Co-chair) (Germany)
Helmut Satzger (Symposium Co-chair) (Germany)

Parallel Programming for Heterogeneous Architectures

Bettina Krammer (Symposium Chair) (Germany)
Hartmut Mix (Symposium Co-chair) (Germany)
Markus Geimer (Symposium Co-chair) (Germany)

DECI Minisymposium
(PRACE – Partnership for Advanced Computing in Europe)
Chris Johnson (Symposium Chair) (United Kingdom)

Efficient Highly Scalable Multi-level Preconditioners

for Linear Systems
Matthias Bolten (Symposium Chair) (Germany)

Performance Modeling and Engineering

for Multi-/Many-Core Architectures
Gerhard Wellein (Symposium Chair) (Germany)

Space-filling Curves in Parallel Computing

Dirk Roose (Symposium Chair) (Belgium)

Michael Bader (Germany)
Tobias Weinzierl (Germany)
xi

Interaction and HPC: Multi-Scale/Multi-Physics Applications

Ralf-Peter Mundani (Symposium Chair) (Germany)

ParCo2013 Sponsors

AMD – Advanced Micro Devices, Inc.

Bull
EUROTECH S.p.A.
EXTOLL GmbH
FUJITSU
IBM Deutschland GmbH
MEGWARE Computer Vertrieb und Service GmbH
NVIDIA Corporation
GWT-TUD GmbH (Vertriebspartner VAMPIR)
xiii

Contents
Preface v
Michael Bader, Arndt Bode, Hans-Joachim Bungartz, Michael Gerndt,
Gerhard R. Joubert and Frans Peters
Conference Organisation vii

Invited Talks

Extreme Data Science at the National Energy Research Scientific Computing

(NERSC) Center 3
Sudip Dosanjh, Shane Canon, Jack Deslippe, Kjiersten Fagnan,
Richard Gerber, Lisa Gerhardt, Jason Hick, Douglas Jacobsen,
David Skinner and Nicholas J. Wright
Performance Analysis Techniques for the Exascale Co-Design Process 19
Martin Schulz, Jim Belak, Abhinav Bhatele, Peer-Timo Bremer,
Greg Bronevetsky, Marc Casas, Todd Gamblin, Katherine E. Isaacs,
Ignacio Laguna, Joshua Levine, Valerio Pascucci, David Richards
and Barry Rountree

Parallel Programming Models

XMP-IO Function and Its Application to MapReduce on the K Computer 35

Tomotake Nakamura and Mitsuhisa Sato
POLCA – A Programming Model for Large Scale, Strongly Heterogeneous
Infrastructures 43
Lutz Schubert, Jan Kuper and José Gracia
Exploitation of Quality/Throughput Tradeoffs in Image Processing Through
Invasive Computing 53
Alexandru Tanase, Vahid Lari, Frank Hannig and Jürgen Teich
An Efficient Thread Mapping Strategy for Multiprogramming on Manycore
Processors 63
Ashkan Tousimojarad and Wim Vanderbauwhede
A Scalable Farm Skeleton for Heterogeneous Parallel Programming 72
Steffen Ernsting and Herbert Kuchen
Towards Truly Boolean Arrays in Data-Parallel Array Processing 82
Clemens Grelck and Hraban Luyat
Deep Packet Inspection on Commodity Hardware Using FastFlow 92
M. Danelutto, L. Deri, D. De Sensi and M. Torquati
xiv

Performance Analysis and Tools

Formalizing Bottlenecks in Task-Based OpenMP Applications 103

Shajulin Benedict, Michael Gerndt and Diana-Mihaela Gudu
Characterizing Performance of Applications on Blue Gene/Q 113
Paul F. Baumeister, Hans Boettiger, Thorsten Hater, Michael Knobloch,
Thilo Maurer, Andrea Nobile, Dirk Pleiter and Nicolas Vandenbergen
Specification of Periscope Tuning Framework Plugins 123
Robert Mijaković, Antonio Pimenta Soto, Isaías A. Comprés Ureña,
Michael Gerndt, Anna Sikora and Eduardo César

Parallel Numerical Linear Algebra

On Using Speculative Computations for Parallel Reduction to Tridiagonal Form 135

Sergey V. Kuznetsov
Fast Approximate Solution of the Non-Symmetric Generalized Eigenvalue
Problem on Multicore Architectures 143
Peter Benner, Martin Köhler and Jens Saak
Locality Optimization on a NUMA Architecture for Hybrid LU Factorization 153
Adrien Rémy, Marc Baboulin, Masha Sosonkina and Brigitte Rozoy
Variable Block Algebraic Recursive Multilevel Solver (VBARMS) for Sparse
Linear Systems 163
Bruno Carpentieri, Jia Liao and Masha Sosonkina
A Proposal of a Single-Synchronized Solver Suited to Large Scale Linear
Systems on Parallel Computers with Distributed Memory 173
Seiji Fujino, Keiichi Murakami and Kosuke Iwasato
Approximate Inverse Preconditioners for Krylov Methods on Heterogeneous
Parallel Computers 183
Daniele Bertaccini and Salvatore Filippone
Cache and Energy Efficiency of Sparse Matrix-Vector Multiplication
for Different BLAS Numerical Types with the RSB Format 193
Michele Martone
Heterogeneous Sparse Matrix Computations on Hybrid GPU/CPU Platforms 203
Valeria Cardellini, Alessandro Fanfarillo and Salvatore Filippone

Parallel Algorithms

MapReduce Streaming Algorithms for Laplace Relaxation on the Cloud 215

Atanas Radenski and Boyana Norris
Space Exploration Using Parallel Orbits: A Study in Parallel Symbolic
Computing 225
Vladimir Janjic, Christopher Brown, Max Neunhöffer, Kevin Hammond,
Steve Linton and Hans-Wolfgang Loidl
xv

SFC-Based Communication Metadata Encoding for Adaptive Mesh Refinement 233

Martin Schreiber, Tobias Weinzierl and Hans-Joachim Bungartz
Graph Repartitioning with Both Dynamic Load and Dynamic Processor
Allocation 243
Clément Vuchener and Aurélien Esnard
ForestClaw: Hybrid Forest-of-Octrees AMR for Hyperbolic Conservation Laws 253
Carsten Burstedde, Donna Calhoun, Kyle Mandli and Andy R. Terrel
A Space-Time Parallel Solver for the Three-Dimensional Heat Equation 263
Robert Speck, Daniel Ruprecht, Matthew Emmett, Matthias Bolten
and Rolf Krause
An Efficient Pipelined Implementation of Space-Time Parallel Applications 273
Toshiya Takami and Daiki Fukudome

GPU Computing and Applications

Efficient GPU-Based Optimization of Volume Meshes 285

Eric Shaffer, Zuofu Cheng, Raine Yeh, George Zagaris and Luke Olson
Fast Uniform Grid Construction on GPGPUs Using Atomic Operations 295
Davide Barbieri, Valeria Cardellini and Salvatore Filippone
Porting Large HPC Applications to GPU Clusters: The Codes GENE
and VERTEX 305
Tilman Dannert, Andreas Marek and Markus Rampp
Numerical Simulation of the Low Compressible Viscous Gas Flows
on GPU-Based Hybrid Supercomputers 315
Alexander A. Davydov and Evgeny V. Shilnikov
Simulation of Multiphase Flows in the Subsurface on GPU-Based
Supercomputers 324
Marina Trapeznikova, Natalia Churbanova, Anastasiya Lyupa
and Dmitry Morozov
Atomic Computing – A Different Perspective on Massively Parallel Problems 334
Andrew Brown, Rob Mills, Jeff Reeve, Kier Dugan and Steve Furber

Parallelisation and Optimisation of Large-Scale Applications

Accelerating SeisSol by Generating Vectorized Code for Sparse Matrix

Operators 347
Alexander Breuer, Alexander Heinecke, Michael Bader
and Christian Pelties
Experience with the MPI/STARSS Programming Model on a Large Production
Code 357
Dirk Brömmel, Paul Gibbon, Marta Garcia, Víctor López,
Vladimir Marjanović and Jesús Labarta
xvi

Exploiting Data- and Task-Parallelism in the Solution of Riccati Equations

on Multicore Servers and GPUs 367
P. Benner, P. Ezzatti, E.S. Quintana-Ortí and A. Remón
Testing and Implementing Some New Algorithms Using the FFTW Library
on Massively Parallel Supercomputers 375
Massimiliano Guarrasi, Ning Li, Sandro Frigio, Andrew Emerson
and Giovanni Erbacci
Performance Measurements of MHD Simulation for Planetary Magnetosphere
on Peta-Scale Computer FX10 387
Keiichiro Fukazawa, Takeshi Nanri and Takayuki Umeda
Parallel Simulations of Self-Propelled Microorganisms 395
Kristina Pickl, Matthias Hofmann, Tobias Preclik, Harald Köstler,
Ana-Sunčana Smith and Ulrich Rüde
Improving Communication Performance of Sparse Linear Algebra
for an Atomistic Simulation Application 405
Christiane Pousa, Jürg Hutter and Joost Vandevondele
NEMORB’s Fourier Filter and Distributed Matrix Transposition on Petaflop
Systems 415
Tiago Ribeiro and Matthieu Haefele
Parallel Computing Design for Exact Diagonalization Scheme on Multi-Band
Hubbard Cluster Models 427
Susumu Yamada, Toshiyuki Imamura and Masahiko Machida

ParCo PhD Symposium

ParCo 2013 PhD Symposium 439

Josef Weidendorfer and Michael Bader
Numerical Experiments with New Algorithms for Parallel Decomposition
of Large Computational Meshes 441
Evdokia Golovchenko, Elizaveta Dorofeeva, Irina Gasilova
and Alexey Boldarev
A Distributed Algorithm for the Permutation Flow Shop Problem –
An Empirical Analysis 451
Samia Kouki, Mohamed Jemni and Talel Ladhari
GPI2 for GPUs: A PGAS Framework for Efficient Communication in Hybrid
Clusters 461
Lena Oden
A Fault Tolerant Implementation of Multi-Level Monte Carlo Methods 471
Stefan Pauli, Manuel Kohler and Peter Arbenz
High Performance CPU/GPU Multiresolution Poisson Solver 481
Wim M. Van Rees, Diego Rossinelli, Panagiotis Hadjidoukas
and Petros Koumoutsakos
xvii

Mini-Symposium “Parallel Computing with FPGAs (ParaFPGA2013)”

ParaFPGA 2013: Harnessing Programs, Power and Performance in Parallel

FPGA Applications 493
Erik H. D’Hollander, Dirk Stroobandt and Abdellah Touhafi
High-Level Synthesis Revised: Generation of FPGA Accelerators
from a Domain-Specific Language Using the Polyhedron Model 497
Moritz Schmid, Frank Hannig, Alexandru Tanase and Jürgen Teich
Compiling a Dataflow-Based Language Abstraction onto an FPGA 507
Eva Burrows
Timing Driven C-Slow Retiming on RTL for MultiCores on FPGAs 515
Tobias Strauch
Performance and Resource Modeling for FPGAs Using High-Level Synthesis
Tools 523
Bruno Da Silva, An Braeken, Erik H. D’Hollander and Abdellah Touhafi
Interactive Graph Cuts Using FPGA 532
Daichi Kobori and Tsutomu Maruyama
An Image Filter System Based on Dynamic Partial Reconfiguration on FPGA 540
Hisaaki Kurita and Tsutomu Maruyama
Investigating Energy Consumption of an SRAM-Based FPGA for Duty-Cycle
Applications 548
Khurram Shahzad and Bengt Oelmann

Mini-Symposium “High-Dimensional Meets Parallel – Algorithms

and Applications”

High-Dimensional Meets Parallel: Algorithms and Applications 563

Hans-Joachim Bungartz, Dirk Pflüger and Markus Hegland
Global Communication Schemes for the Sparse Grid Combination Technique 564
Philipp Hupp, Riko Jacob, Mario Heene, Dirk Pflüger
and Markus Hegland
Load Balancing for Massively Parallel Computations with the Sparse Grid
Combination Technique 574
Mario Heene, Christoph Kowitz and Dirk Pflüger
A Parallel Fault Tolerant Combination Technique 584
Brendan Harding and Markus Hegland
Managing Complexity in the Parallel Sparse Grid Combination Technique 593
J.W. Larson, P.E. Strazdins, M. Hegland, B. Harding, S. Roberts, L. Stals,
A.P. Rendell, Md.M. Ali and J. Southern
Scalability and Fault Tolerance of the Alternating Direction Method
of Multipliers for Sparse Grids 603
Valeriy Khakhutskyy, Dirk Pflüger and Markus Hegland
xviii

Mini-Symposium “Application Autotuning for HPC (Architectures)”

Mini-Symposium on Application Autotuning for HPC 615

Siegfried Benkner, Matthias Brehm, Michael Gerndt, Wolfram Hesse
and Anna Sikora
Investigating Performance Benefits from OpenACC Kernel Directives 616
Benjamin Eagan, Gilles Civario and Renato Miceli
Application-Independent Autotuning for GPUs 626
Martin Tillmann, Thomas Karcher, Carsten Dachsbacher
and Walter F. Tichy
Autotuning of Pattern Runtimes for Accelerated Parallel Systems 636
Enes Bajrovic, Siegfried Benkner, Jiri Dokulil and Martin Sandrieser
Empirical Performance Modeling of GPU Kernels Using Active Learning 646
Prasanna Balaprakash, Karl Rupp, Azamat Mametjanov,
Robert B. Gramacy, Paul D. Hovland and Stefan M. Wild
Crowdtuning: Systematizing Auto-Tuning Using Predictive Modeling
and Crowdsourcing 656
Abdul Memon and Grigori Fursin
Autotuning the Energy Consumption 668
Carmen B. Navarrete, Carla Guillen, Wolfram Hesse and Matthias Brehm
Potentials and Limitations for Energy Efficiency Auto-Tuning 678
Robert Schöne, Andreas Knüpfer and Daniel Molka

Mini-Symposium “Extreme Scaling on SuperMUC”

Extreme Scaling Workshop at the LRZ 691

Momme Allalen, Gurvan Bazin, Christoph Bernau, Arndt Bode,
David Brayford, Matthias Brehm, Jürg Diemand, Klaus Dolag,
Jan Engels, Nicolay Hammer, Herbert Huber, Ferdinand Jamitzky,
Anupam Kamakar, Carsten Kutzner, Andreas Marek, Carmen Navarrete,
Helmut Satzger, Wolfram Schmidt and Philipp Trisjono
Extreme Scaling of Lattice Quantum Chromodynamics 698
David Brayford, Momme Allalen and Volker Weinberg
End-to-End Parallel Simulations with APES 703
Harald Klimach, Kartik Jain and Sabine Roller
Towards Petaflops Capability of the VERTEX Supernova Code 712
Andreas Marek, Markus Rampp, Florian Hanke and Hans-Thomas Janka
Scaling of the GROMACS 4.6 Molecular Dynamics Code on SuperMUC 722
Carsten Kutzner, Rossen Apostolov, Berk Hess and Helmut Grubmüller
xix

Mini-Symposium “Parallel Programming for Heterogeneous Architectures”

Parallel Programming for Heterogeneous Architectures 731

Bettina Krammer, Hartmut Mix and Markus Geimer
Execution Schemes for the NPB-MZ Benchmarks on Hybrid Architectures:
A Comparative Study 733
Jörg Dümmler and Gudula Rünger
Scilab on a Hybrid Platform 743
Victor Lomüller, Sylvestre Ledru and Henri-Pierre Charles
Divide and Conquer Parallelization of Finite Element Method Assembly 753
Loïc Thébault, Eric Petit, Marc Tchiboukdjian, Quang Dinh
and William Jalby
Cudagrind: A Valgrind Extension for CUDA 763
Thomas M. Baumann and José Gracia
Profiling Hybrid HMPP Applications with Score-P on Heterogeneous Hardware 773
Marc Schlütter, Peter Philippen, Laurent Morin, Markus Geimer
and Bernd Mohr
Binary Instrumentation for Scalable Performance Measurement of OpenMP
Applications 783
Julien Jaeger, Peter Philippen, Eric Petit, Andres Charif Rubial,
Christian Rössel, William Jalby and Bernd Mohr
A Case Study: Holistic Performance Analysis on Heterogeneous Architectures
Using the Vampir Toolchain 793
Robert Dietrich, Frank Winkler, Thomas William, Jonas Stolle,
Robert Henschel and Donald K. Berry

Further Mini-Symposium Contributions

PRACE DECI (Distributed European Computing Initiative) Minisymposium 805

Chris Johnson, Anastasia V. Bochenkova, Alexander A. Granovsky,
Peter J. Bond, Teresa Paramo, Tristan Glatard, William A. Romero R.,
Denis Friboulet, Stefan J. Zasada and Peter V. Coveney
A Generic Prototype to Benchmark Algorithms and Data Structures
for Hierarchical Hybrid Grids 813
Sebastian Kuckuk, Björn Gmeiner, Harald Köstler and Ulrich Rüde
Towards a Performance Engineering Workflow for OpenMP 4.0 823
Dirk Schmidl, Christian Iwainsky, Christian Terboven, Christian H. Bischof
and Matthias S. Müller
Theoretical Measures of Cache Efficiency for Tetrahedral Adaptive Meshes.
A Case Study with a Quasi Space-Filling Curve Order 833
Oliver Kunst and Jörn Behrens

Author Index 843

Astm D4491
No ratings yet
Astm D4491
8 pages
2023 ExAC References & Resources
No ratings yet
2023 ExAC References & Resources
4 pages
Nic 225296
No ratings yet
Nic 225296
830 pages
Nic Series Volume37
No ratings yet
Nic Series Volume37
215 pages
Parallel Computing 1st Edition G. R. Joubert download
100% (1)
Parallel Computing 1st Edition G. R. Joubert download
61 pages
Parallel Computing 1st Edition G. R. Joubert - The full ebook with all chapters is available for download
100% (4)
Parallel Computing 1st Edition G. R. Joubert - The full ebook with all chapters is available for download
50 pages
Parallel Computing 1st Edition G. R. Joubert download
No ratings yet
Parallel Computing 1st Edition G. R. Joubert download
52 pages
Big Data And High Performance Computing V Grandinetti L Joubert instant download
No ratings yet
Big Data And High Performance Computing V Grandinetti L Joubert instant download
47 pages
Full download (Ebook) Euro-Par 2020: Parallel Processing: 26th International Conference on Parallel and Distributed Computing, Warsaw, Poland, August 24–28, 2020, Proceedings by Maciej Malawski, Krzysztof Rzadca ISBN 9783030576745, 9783030576752, 3030576744, 3030576752 pdf docx
100% (9)
Full download (Ebook) Euro-Par 2020: Parallel Processing: 26th International Conference on Parallel and Distributed Computing, Warsaw, Poland, August 24–28, 2020, Proceedings by Maciej Malawski, Krzysztof Rzadca ISBN 9783030576745, 9783030576752, 3030576744, 3030576752 pdf docx
65 pages
Parallel And Distributed Computing Alberto Ros pdf download
No ratings yet
Parallel And Distributed Computing Alberto Ros pdf download
77 pages
Euro Par 2019 Parallel Processing 25th International Conference on Parallel and Distributed Computing Göttingen Germany August 26 30 2019 Proceedings Ramin Yahyapour - Download the ebook now to start reading without waiting
100% (1)
Euro Par 2019 Parallel Processing 25th International Conference on Parallel and Distributed Computing Göttingen Germany August 26 30 2019 Proceedings Ramin Yahyapour - Download the ebook now to start reading without waiting
41 pages
(Ebooks PDF) Download Parallel Computing 1st Edition G. R. Joubert Full Chapters
100% (10)
(Ebooks PDF) Download Parallel Computing 1st Edition G. R. Joubert Full Chapters
84 pages
Gallopoulos - Parallelism in Matrix Computations
No ratings yet
Gallopoulos - Parallelism in Matrix Computations
505 pages
Network and Parallel Computing 20th IFIP WG 10.3 International Conference, NPC 2024, Part II
No ratings yet
Network and Parallel Computing 20th IFIP WG 10.3 International Conference, NPC 2024, Part II
523 pages
Euro Par 2022 Parallel Processing 28th International Conference on Parallel and Distributed Computing Glasgow UK August 22 26 2022 Proceedings José Canoinstant download
100% (1)
Euro Par 2022 Parallel Processing 28th International Conference on Parallel and Distributed Computing Glasgow UK August 22 26 2022 Proceedings José Canoinstant download
45 pages
Stefano Markidis, Erwin Laure - Solving Software Challenges For Exascale 2015
No ratings yet
Stefano Markidis, Erwin Laure - Solving Software Challenges For Exascale 2015
154 pages
Download Complete (Ebook) Using OpenCL: Programming Massively Parallel Computers by J. Kowalik, T. Puzniakowski ISBN 9781614990291, 1614990298 PDF for All Chapters
100% (5)
Download Complete (Ebook) Using OpenCL: Programming Massively Parallel Computers by J. Kowalik, T. Puzniakowski ISBN 9781614990291, 1614990298 PDF for All Chapters
76 pages
Facing The Multicorechallenge Ii Aspects Of New Paradigms And Technologies In Parallel Computing Rainer Keller download
No ratings yet
Facing The Multicorechallenge Ii Aspects Of New Paradigms And Technologies In Parallel Computing Rainer Keller download
47 pages
Parallel Computing Advances And Current Issues Proceedings Of The International Conference Parco2001 1st Gerhard Robert Joubert instant download
No ratings yet
Parallel Computing Advances And Current Issues Proceedings Of The International Conference Parco2001 1st Gerhard Robert Joubert instant download
82 pages
High Performance Parallel I O 1st Edition I Foster All Chapters Instant Download
No ratings yet
High Performance Parallel I O 1st Edition I Foster All Chapters Instant Download
81 pages
(Ebook) Parallel Computing by G. R. Joubert, Italy) Parco200 (2001 Naples, Gerhard Joubert, Almerica Murli, Frans Peters ISBN 9781860943157, 9781860949630, 1860943152, 1860949630 - Download the ebook now for instant access to all chapters
100% (2)
(Ebook) Parallel Computing by G. R. Joubert, Italy) Parco200 (2001 Naples, Gerhard Joubert, Almerica Murli, Frans Peters ISBN 9781860943157, 9781860949630, 1860943152, 1860949630 - Download the ebook now for instant access to all chapters
59 pages
Parallel Processing and Applied Mathematics 11th International Conference PPAM 2015 Krakow Poland September 6 9 2015 Revised Selected Papers Part II 1st Edition Roman Wyrzykowski download
100% (7)
Parallel Processing and Applied Mathematics 11th International Conference PPAM 2015 Krakow Poland September 6 9 2015 Revised Selected Papers Part II 1st Edition Roman Wyrzykowski download
48 pages
Parallel Scientific Computation A Structured Approach Using Bsp 2nd Edition Rob H Bisseling instant download
No ratings yet
Parallel Scientific Computation A Structured Approach Using Bsp 2nd Edition Rob H Bisseling instant download
86 pages
Europar 2008 Parallel Processing 14th International Europar Conference Las Palmas De Gran Canaria Spain August 2629 2008 Proceedings 1st Edition Marios Dikaiakos download
100% (2)
Europar 2008 Parallel Processing 14th International Europar Conference Las Palmas De Gran Canaria Spain August 2629 2008 Proceedings 1st Edition Marios Dikaiakos download
83 pages
Full Download Euro Par 2019 Parallel Processing 25th International Conference on Parallel and Distributed Computing Göttingen Germany August 26 30 2019 Proceedings Ramin Yahyapour PDF DOCX
No ratings yet
Full Download Euro Par 2019 Parallel Processing 25th International Conference on Parallel and Distributed Computing Göttingen Germany August 26 30 2019 Proceedings Ramin Yahyapour PDF DOCX
55 pages
Parallel Algorithms 1st Edition Mh Alsuwaiyel instant download
No ratings yet
Parallel Algorithms 1st Edition Mh Alsuwaiyel instant download
89 pages
Instant download (Ebook) Computing with Foresight and Industry: 15th Conference on Computability in Europe, CiE 2019, Durham, UK, July 15–19, 2019, Proceedings by Florin Manea, Barnaby Martin, Daniël Paulusma, Giuseppe Primiero ISBN 9783030229955, 9783030229962, 9783319500621, 9783642569999, 3030229955, 3030229963, 3319500627, 3642569994 pdf all chapter
100% (9)
Instant download (Ebook) Computing with Foresight and Industry: 15th Conference on Computability in Europe, CiE 2019, Durham, UK, July 15–19, 2019, Proceedings by Florin Manea, Barnaby Martin, Daniël Paulusma, Giuseppe Primiero ISBN 9783030229955, 9783030229962, 9783319500621, 9783642569999, 3030229955, 3030229963, 3319500627, 3642569994 pdf all chapter
65 pages
Euro Par 2019 Parallel Processing 25th International Conference on Parallel and Distributed Computing Göttingen Germany August 26 30 2019 Proceedings Ramin Yahyapour download pdf
100% (3)
Euro Par 2019 Parallel Processing 25th International Conference on Parallel and Distributed Computing Göttingen Germany August 26 30 2019 Proceedings Ramin Yahyapour download pdf
65 pages
Europar 2015 Parallel Processing 21st International Conference On Parallel And Distributed Computing Vienna Austria August 2428 2015 Proceedings 1st Edition Jesper Larsson Trff pdf download
No ratings yet
Europar 2015 Parallel Processing 21st International Conference On Parallel And Distributed Computing Vienna Austria August 2428 2015 Proceedings 1st Edition Jesper Larsson Trff pdf download
90 pages
High_performance_cluster_computing_Book
No ratings yet
High_performance_cluster_computing_Book
2 pages
Facing The Multicorechallenge Aspects Of New Paradigms And Technologies In Parallel Computing 1st Edition David Bader Auth download
No ratings yet
Facing The Multicorechallenge Aspects Of New Paradigms And Technologies In Parallel Computing 1st Edition David Bader Auth download
54 pages
Facing The Multicorechallenge Aspects Of New Paradigms And Technologies In Parallel Computing 1st Edition David Bader Auth download
No ratings yet
Facing The Multicorechallenge Aspects Of New Paradigms And Technologies In Parallel Computing 1st Edition David Bader Auth download
57 pages
Parallel Scientific Computation A Structured Approach Using Bsp Rob H Bisseling instant download
100% (1)
Parallel Scientific Computation A Structured Approach Using Bsp Rob H Bisseling instant download
91 pages
High Performance Parallel I O 1st Edition I Foster - The latest ebook edition with all chapters is now available
100% (1)
High Performance Parallel I O 1st Edition I Foster - The latest ebook edition with all chapters is now available
80 pages
Schnorr 2014
No ratings yet
Schnorr 2014
3 pages
2018 Book IntroductionToParallelComputin PDF
100% (1)
2018 Book IntroductionToParallelComputin PDF
263 pages
Facing the Multicore-Challenge
No ratings yet
Facing the Multicore-Challenge
164 pages
Where can buy An Introduction to Parallel Programming 2nd Edition Peter Pacheco ebook with cheap price
100% (7)
Where can buy An Introduction to Parallel Programming 2nd Edition Peter Pacheco ebook with cheap price
40 pages
Fundamentals of Multicore Software Development PDF
No ratings yet
Fundamentals of Multicore Software Development PDF
322 pages
[Ebooks PDF] download Using OpenCL Programming Massively Parallel Computers J. Kowalik full chapters
100% (8)
[Ebooks PDF] download Using OpenCL Programming Massively Parallel Computers J. Kowalik full chapters
70 pages
Buy ebook Parallel Scientific Computation: A Structured Approach Using BSP 2nd Edition Rob H. Bisseling cheap price
100% (3)
Buy ebook Parallel Scientific Computation: A Structured Approach Using BSP 2nd Edition Rob H. Bisseling cheap price
50 pages
Complete Download (Ebook) Parallel Computing by G. R. Joubert, Italy) Parco200 (2001 Naples, Gerhard Joubert, Almerica Murli, Frans Peters ISBN 9781860943157, 9781860949630, 1860943152, 1860949630 PDF All Chapters
100% (1)
Complete Download (Ebook) Parallel Computing by G. R. Joubert, Italy) Parco200 (2001 Naples, Gerhard Joubert, Almerica Murli, Frans Peters ISBN 9781860943157, 9781860949630, 1860943152, 1860949630 PDF All Chapters
76 pages
Parallel Scientific Computation: A Structured Approach Using BSP 2nd Edition Rob H. Bisseling - The ebook in PDF/DOCX format is available for instant download
100% (3)
Parallel Scientific Computation: A Structured Approach Using BSP 2nd Edition Rob H. Bisseling - The ebook in PDF/DOCX format is available for instant download
70 pages
High Performance Parallel I O 1st Edition I Foster instant download
100% (2)
High Performance Parallel I O 1st Edition I Foster instant download
70 pages
Parallel Scientific Computation: A Structured Approach Using BSP 2nd Edition Rob H. Bisselinginstant download
100% (2)
Parallel Scientific Computation: A Structured Approach Using BSP 2nd Edition Rob H. Bisselinginstant download
24 pages
Instant download (Ebook) Algorithms and Architectures for Parallel Processing: 18th International Conference, ICA3PP 2018, Guangzhou, China, November 15-17, 2018, Proceedings, Part IV by Jaideep Vaidya, Jin Li ISBN 9783030050627, 9783030050634, 3030050629, 3030050637 pdf all chapter
100% (10)
Instant download (Ebook) Algorithms and Architectures for Parallel Processing: 18th International Conference, ICA3PP 2018, Guangzhou, China, November 15-17, 2018, Proceedings, Part IV by Jaideep Vaidya, Jin Li ISBN 9783030050627, 9783030050634, 3030050629, 3030050637 pdf all chapter
67 pages
Instant Access to Parallel Processing and Applied Mathematics 11th International Conference PPAM 2015 Krakow Poland September 6 9 2015 Revised Selected Papers Part II 1st Edition Roman Wyrzykowski ebook Full Chapters
No ratings yet
Instant Access to Parallel Processing and Applied Mathematics 11th International Conference PPAM 2015 Krakow Poland September 6 9 2015 Revised Selected Papers Part II 1st Edition Roman Wyrzykowski ebook Full Chapters
62 pages
[Ebooks PDF] download Parallel and Distributed Processing Techniques and Applications 1st Edition Hamid R. Arabnia full chapters
100% (16)
[Ebooks PDF] download Parallel and Distributed Processing Techniques and Applications 1st Edition Hamid R. Arabnia full chapters
77 pages
17965
No ratings yet
17965
70 pages
18820
No ratings yet
18820
84 pages
High Performance Computing for Computational Science – VECPAR 2018: 13th International Conference, São Pedro, Brazil, September 17-19, 2018, Revised Selected Papers Hermes Senger - The ebook in PDF and DOCX formats is ready for download now
100% (5)
High Performance Computing for Computational Science – VECPAR 2018: 13th International Conference, São Pedro, Brazil, September 17-19, 2018, Revised Selected Papers Hermes Senger - The ebook in PDF and DOCX formats is ready for download now
68 pages
Using OpenCL Programming Massively Parallel Computers J. Kowalik - The ebook in PDF/DOCX format is available for instant download
100% (1)
Using OpenCL Programming Massively Parallel Computers J. Kowalik - The ebook in PDF/DOCX format is available for instant download
43 pages
Full download (Ebook) Algorithms and Architectures for Parallel Processing: 13th International Conference, ICA3PP 2013, Vietri sul Mare, Italy, December 18-20, 2013, Proceedings, Part II by Peter Benner, Pablo Ezzatti, Enrique Quintana-Ortí, Alfredo Remón (auth.), Rocco Aversa, Joanna Kołodziej, Jun Zhang, Flora Amato, Giancarlo Fortino (eds.) ISBN 9783319038889, 9783319038896, 3319038885, 3319038893 pdf docx
100% (8)
Full download (Ebook) Algorithms and Architectures for Parallel Processing: 13th International Conference, ICA3PP 2013, Vietri sul Mare, Italy, December 18-20, 2013, Proceedings, Part II by Peter Benner, Pablo Ezzatti, Enrique Quintana-Ortí, Alfredo Remón (auth.), Rocco Aversa, Joanna Kołodziej, Jun Zhang, Flora Amato, Giancarlo Fortino (eds.) ISBN 9783319038889, 9783319038896, 3319038885, 3319038893 pdf docx
55 pages
Algorithms And Architectures For Parallel Processing 15th International Conference Ica3pp 2015 Zhangjiajie China November 1820 2015 Proceedings Part Ii 1st Edition Guojun Wang pdf download
No ratings yet
Algorithms And Architectures For Parallel Processing 15th International Conference Ica3pp 2015 Zhangjiajie China November 1820 2015 Proceedings Part Ii 1st Edition Guojun Wang pdf download
84 pages
Download Complete Parallel Scientific Computation: A Structured Approach Using BSP 2nd Edition Rob H. Bisseling PDF for All Chapters
100% (1)
Download Complete Parallel Scientific Computation: A Structured Approach Using BSP 2nd Edition Rob H. Bisseling PDF for All Chapters
40 pages
Algorithms and Computation 25th International Symposium ISAAC 2014 Jeonju Korea December 15 17 2014 Proceedings 1st Edition Hee-Kap Ahn - Download the ebook now to never miss important information
No ratings yet
Algorithms and Computation 25th International Symposium ISAAC 2014 Jeonju Korea December 15 17 2014 Proceedings 1st Edition Hee-Kap Ahn - Download the ebook now to never miss important information
66 pages
Download Full Formal Techniques for Distributed Objects Components and Systems 39th IFIP WG 6 1 International Conference FORTE 2019 Held as Part of the 14th International Federated Conference on Distributed Computing Techniques DisCoTec 2019 Kongens Lyngby Denm Jorge A. Pérez PDF All Chapters
100% (1)
Download Full Formal Techniques for Distributed Objects Components and Systems 39th IFIP WG 6 1 International Conference FORTE 2019 Held as Part of the 14th International Federated Conference on Distributed Computing Techniques DisCoTec 2019 Kongens Lyngby Denm Jorge A. Pérez PDF All Chapters
62 pages
(Ebook) Parallel Scientific Computation: A Structured Approach Using BSP by Rob H. Bisseling ISBN 9780191092572, 0191092576 - Download the ebook now for instant access to all chapters
100% (2)
(Ebook) Parallel Scientific Computation: A Structured Approach Using BSP by Rob H. Bisseling ISBN 9780191092572, 0191092576 - Download the ebook now for instant access to all chapters
83 pages
Download full Parallel Computing Hits the Power Wall Principles Challenges and a Survey of Solutions SpringerBriefs in Computer Science Arthur Francisco Lorenzon Antonio Carlos Schneider Beck Filho ebook all chapters
100% (1)
Download full Parallel Computing Hits the Power Wall Principles Challenges and a Survey of Solutions SpringerBriefs in Computer Science Arthur Francisco Lorenzon Antonio Carlos Schneider Beck Filho ebook all chapters
40 pages
Intelligent Computing: Kohei Arai Editor
No ratings yet
Intelligent Computing: Kohei Arai Editor
1,183 pages
Lecture Notes in Computer Science 1st Edition by Springer ISBN instant download
100% (2)
Lecture Notes in Computer Science 1st Edition by Springer ISBN instant download
73 pages
Value Sensitive Design: Shaping Technology with Moral Imagination
From Everand
Value Sensitive Design: Shaping Technology with Moral Imagination
Batya Friedman
5/5 (1)
Iso 20022 Guide
No ratings yet
Iso 20022 Guide
1 page
WM3500I+WM4000I+WM6000i Introduction
No ratings yet
WM3500I+WM4000I+WM6000i Introduction
7 pages
Lecture # 34: Motion Analysis (Particle Filters) : Muhammad Rzi Abbas
No ratings yet
Lecture # 34: Motion Analysis (Particle Filters) : Muhammad Rzi Abbas
34 pages
IT5443 Syllabus
No ratings yet
IT5443 Syllabus
3 pages
Packing Standard Masu U-2
No ratings yet
Packing Standard Masu U-2
15 pages
Mtr10ii Series
No ratings yet
Mtr10ii Series
290 pages
3545 Manual de Servicio
No ratings yet
3545 Manual de Servicio
8 pages
Eng8 U1 - U2 - T7 (HS) - Form 40 Câu
No ratings yet
Eng8 U1 - U2 - T7 (HS) - Form 40 Câu
3 pages
Intelligent Envelopes For High-Performance Buildings: Guedi Capeluto Carlos Ernesto Ochoa
No ratings yet
Intelligent Envelopes For High-Performance Buildings: Guedi Capeluto Carlos Ernesto Ochoa
140 pages
Sales Dataset Analysis
No ratings yet
Sales Dataset Analysis
28 pages
Deep Learning Crash Course For Beginners With Python Theory And Applications Stepbystep Using Tensorflow 20contains A Lot Of Exercises And Handson Projects Publishing download
100% (1)
Deep Learning Crash Course For Beginners With Python Theory And Applications Stepbystep Using Tensorflow 20contains A Lot Of Exercises And Handson Projects Publishing download
86 pages
extracted_text
No ratings yet
extracted_text
3 pages
Malware Reverse Engineering Part 1 Static Analysis
No ratings yet
Malware Reverse Engineering Part 1 Static Analysis
27 pages
CQI & Throughput
No ratings yet
CQI & Throughput
9 pages
Mad 19
No ratings yet
Mad 19
3 pages
ProveedoresAdjudicados 201911
No ratings yet
ProveedoresAdjudicados 201911
39 pages
Networking Assignment
No ratings yet
Networking Assignment
73 pages
Preparation For Loading
No ratings yet
Preparation For Loading
2 pages
UltraFlow CIP Systems BioPharm
No ratings yet
UltraFlow CIP Systems BioPharm
4 pages
R1 Plots
No ratings yet
R1 Plots
20 pages
Finale 2014 Win Read Me
No ratings yet
Finale 2014 Win Read Me
10 pages
Dampers Del y Post
No ratings yet
Dampers Del y Post
7 pages
User Guide Bitdefender
No ratings yet
User Guide Bitdefender
9 pages
Lighting Guide PDF
No ratings yet
Lighting Guide PDF
1 page
Examples 2
No ratings yet
Examples 2
9 pages
C 12 Maintenance
No ratings yet
C 12 Maintenance
45 pages
FMG Report
No ratings yet
FMG Report
9 pages
Iot3X 7 Level Iot Reference Model
No ratings yet
Iot3X 7 Level Iot Reference Model
9 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Accelerating Computational Science and Engineering

Uploaded by

Accelerating Computational Science and Engineering

Uploaded by

PARALLEL COMPUTING:

Volumes 1–14 published by Elsevier Science.

ISSN 0927-5452 (print)

Amstterdam • Berrlin • Tokyo • Washington, DC

ISBN 978-1-61499-380-3 (print)

Distributor in the USA and Canada

PRINTED IN THE NETHERLANDS

Organising & Exhibition Committee

Conference Programme Committee

Peter Arbenz (Switzerland) Bettina Krammer (France)

Programme Committees of Mini-Symposia

ParCo 2013 PhD Symposium

Josef Weidendorfer (Symposium Chair) (Germany)

Jens Breitbart (Germany)

ParaFPGA-2013: Parallel Computing with FPGA’s

Abbes Amira (United Kingdom)

High-Dimensional Meets Parallel – Algorithms and Applications

Dirk Pflüger (Symposium Chair) (Germany)

Application Autotuning for HPC (Architectures)

Siegfried Benkner (Austria)

Extreme Scaling on SuperMUC

Ferdinand Jamitzky (Symposium Chair) (Germany)

Parallel Programming for Heterogeneous Architectures

Efficient Highly Scalable Multi-level Preconditioners

Performance Modeling and Engineering

Space-filling Curves in Parallel Computing

Dirk Roose (Symposium Chair) (Belgium)

Interaction and HPC: Multi-Scale/Multi-Physics Applications

Ralf-Peter Mundani (Symposium Chair) (Germany)

AMD – Advanced Micro Devices, Inc.

Extreme Data Science at the National Energy Research Scientific Computing

Parallel Programming Models

XMP-IO Function and Its Application to MapReduce on the K Computer 35

Performance Analysis and Tools

Formalizing Bottlenecks in Task-Based OpenMP Applications 103

Parallel Numerical Linear Algebra

On Using Speculative Computations for Parallel Reduction to Tridiagonal Form 135

MapReduce Streaming Algorithms for Laplace Relaxation on the Cloud 215

SFC-Based Communication Metadata Encoding for Adaptive Mesh Refinement 233

GPU Computing and Applications

Efficient GPU-Based Optimization of Volume Meshes 285

Parallelisation and Optimisation of Large-Scale Applications

Accelerating SeisSol by Generating Vectorized Code for Sparse Matrix

Exploiting Data- and Task-Parallelism in the Solution of Riccati Equations

ParCo PhD Symposium

ParCo 2013 PhD Symposium 439

Mini-Symposium “Parallel Computing with FPGAs (ParaFPGA2013)”

ParaFPGA 2013: Harnessing Programs, Power and Performance in Parallel

Mini-Symposium “High-Dimensional Meets Parallel – Algorithms

High-Dimensional Meets Parallel: Algorithms and Applications 563

Mini-Symposium “Application Autotuning for HPC (Architectures)”

Mini-Symposium on Application Autotuning for HPC 615

Mini-Symposium “Extreme Scaling on SuperMUC”

Extreme Scaling Workshop at the LRZ 691

Mini-Symposium “Parallel Programming for Heterogeneous Architectures”

Parallel Programming for Heterogeneous Architectures 731

Further Mini-Symposium Contributions

PRACE DECI (Distributed European Computing Initiative) Minisymposium 805

Author Index 843

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.