APS DPF Elmer Future of Computing
Future of Computing
A story of technologies, collaborations and science driven challenges
Aleph@CERN CMS@CERN
BaBar@SLAC
1990s Technology Context
International dialing at reduced rates to participate in meetings with phone cards
Dec 1991 - first web server (later first DB) deployed in the US, at SLAC
Future of Computing?
You
Tell me about the future of computing in high energy physics
ChatGPT
The future of computing in high-energy physics holds exciting prospects driven by technological advancements and the
evolving needs of scientific research. Here are some trends and areas of development that could shape the future of
computing in this field:
Overall, the future of computing in high-energy physics is likely to be characterized by a combination of technological
innovation, collaborative research efforts, and a continued focus on addressing the computational challenges posed by
large-scale experiments and datasets.
CHEP 2023 Scientific Program
Track 1 - Data and Metadata Organization, Management and Access
Track 2 - Online Computing
Track 3 - Offline Computing
Track 4 - Distributed Computing
Track 5 - Sustainable and Collaborative Software Engineering
Track 6 - Physics Analysis Tools
Track 7 - Facilities and Virtualization
Track 8 - Collaboration, Reinterpretation, Outreach and Education
Track 9 - Artificial Intelligence and Machine Learning
Track 10 - Exascale Science
Track 11 - Heterogeneous Computing and Accelerators
Track 12 - Quantum Computing
https://www.jlab.org/conference/CHEP2023
CHEP 2019 - Scientific Program
Track 1 - Online and Real-time Computing
Track 2 - Offline Computing
Track 3 - Middleware and Distributed Computing
Track 4 - Data Organisation, Management and Access
Track 5 - Software Development
Track 6 - Physics Analysis
Track 7 - Facilities, Clouds and Containers
Track 8 - Collaboration, Education, Training and Outreach
Track 9 - Exascale Science
http://chep2019.org/
CHEP 2007 Scientific Program
Track 1 - Online Computing
Track 2 - Event Processing
Track 3 - Software components, tools and databases
Track 4 - Computer facilities, production grids and networking
Track 5 - Grid middleware and tools
Track 6 - Distributed data analysis and information management
Track 7 - Collaborative initiatives with other sciences
Track 8 - Collaborative tools
https://indico.cern.ch/event/3580/
CHEP 2000 - Scientific Program
Track 1 - Data Analysis: Algorithms and methods
Track 2 - Data Acquisition and Control System
Track 3 - Object Persistency and Data Handling
Track 4 - Network: Applications and Services
Track 5 - Commodity Hardware and Software and Integration in Farm and Large Systems
Track 6 - Data Analysis: Technology and Presentation
https://chep2000.pd.infn.it/
CHEP 1997 - “Future of HEP Computing” Panel
https://www.zeuthen.desy.de/CHEP97/chep97.htm
https://www.zeuthen.desy.de/CHEP97/slide/p518/p518.htm
CHEP 2023 Scientific Program
Track 1 - Data and Metadata Organization, Management and Access
Track 2 - Online Computing
Track 3 - Offline Computing
Track 4 - Distributed Computing
Track 5 - Sustainable and Collaborative Software Engineering
Track 6 - Physics Analysis Tools
Track 7 - Facilities and Virtualization
Track 8 - Collaboration, Reinterpretation, Outreach and Education
Track 9 - Artificial Intelligence and Machine Learning
Track 10 - Exascale Science
Track 11 - Heterogeneous Computing and Accelerators
Track 12 - Quantum Computing
https://www.jlab.org/conference/CHEP2023
CHEP 2000 - Scientific Program
Track 1 - Data Analysis: Algorithms and methods
Track 2 - Data Acquisition and Control System
Track 3 - Object Persistency and Data Handling
Track 4 - Network: Applications and Services
Track 5 - Commodity Hardware and Software and Integration in Farm and Large Systems
Track 6 - Data Analysis: Technology and Presentation
https://chep2000.pd.infn.it/
Some conclusions from exploring the CHEP history
The general trend is towards higher-level “system” concepts, while earlier technology tracks are now
“subsumed” into the system tracks. Some new “technology” tracks appear (e.g. AI/ML,
Quantum Computing). ChatGPT suggested more technology than systems.
A “Conway's Law” effect? The way an organization communicates and is structured will be
directly reflected in the systems and software it creates.
One consistently present track is “data analysis” and/or “analysis tools”. ChatGPT missed
this one.
Project Size Over Time
Community Structures Reflect Our Community Evolution/Needs
The Worldwide LHC Computing Grid (WLCG) project is a
global collaboration of around 170 computing centres in
more than 40 countries, linking up national and international
grid infrastructures. The mission of the WLCG project is to
provide global computing resources ... [2000’s era]
The HEP Software Foundation facilitates cooperation and common
efforts in High Energy Physics software and computing internationally. [2010’s era]
Community charge from the Worldwide LHC Computing Grid to the (then nascent)
HEP Software Foundation in July 2016:
Computer hardware is a consumable. Software is the actual "cyberinfrastructure".
RooFit (D.Kirkby, W.Verkerke) is now nearly 25(?) years old and played a key role in the Higgs
discoveries and LHC (and other) physics.
EvtGen (A.Ryd, D.Lange), originally developed at CLEO and further developed in BaBar, lives on in many
experiments.
xrootd (A.Hanushevsky, many others) is now around 20 years old. Initially planned to fix issues
with the Objectivity AMS, it was repurposed into a next generation file server for BaBar with effort
from INFN (A.Dorigo, F.Furano) and (later) CERN.
xrootd has grown into a collaboration of SLAC/CERN/UCSD and others. It is not only widely used
by the LHC experiments, but is also a key element of the OSG’s “Open Science Data Federation
(OSDF)” service, which broadly supports science in the US that needs high throughput
computing (next slide).
Snowmass CompF recommendation
APS DPF and Coordinating Panel for Software and Computing
all done in a reproducible & preservable way that can scale to HL-LHC requirements.
Begun as an integration exercise for IRIS-HEP, but has evolved as a community project
and benchmark reference bringing together different groups and experiments.
See also “Demonstrator Analysis 200 Gb/s” (B.Bockelman) at WLCG/HSF
Application Specific
Domain Specific
Technique Specific
Foundational
Jake Vanderplas 2017 keynote-style Python ecosystem, but with layer labeling a la “Array programming with NumPy”, Nature, 585, 357-362 (2020)
The “Future of Computing” is of course people to do it: software training!
Training and Education Model
How to build a sustainable and scalable training framework that grows skills
through multiple stages of people’s careers?
Training Survey
In early 2019, we did a survey of training needs (link for results summary): 334
people responded!
Multiple software, computing, data science and AI/ML
programs aiming at different career stages
● Undergraduate summer programs: US-CMS PURSUE program, US-ATLAS
SUPER program, IRIS-HEP Fellows program
● HSF/IRIS-HEP Training activities (materials and events) - 1600 students and 50
educators in the past few years
● A3D3 PostBac program
● Summer schools: IAIFI, CoDaS-HEP
● US-ATLAS and US-CMS postdoc R&D programs
● DOE CompHEP Traineeship projects: TAC-HEP, WATCHEP, C2-The-P2
● International: Fellows/mentoring in NSF-funded HSF-India project, HSF Google
Summer of Code, CERN summer students, etc.
HSF Software Training Center
Central repository of training materials + instructor community - modeled on The Carpentries
Plus more…
And growing…
CoDaS-HEP 2022
CoDaS-HEP 2019
IRIS-HEP Fellows
Connects undergraduates in physics and computer science with HEP mentors
active in developing HEP research software.
DOE Traineeships in Computational HEP
Although participation in experiment software is international, much of the core
software development comes primarily from the US and Europe.
HSF-India project: 5-year project: software training, researcher
visits and seeding of software collaborations
http://research-software-collaborations.org/ (OISE-2201990)
And the “Future of Computing”?
The Worldwide LHC Computing Grid (WLCG) project is a
global collaboration of around 170 computing centres in
more than 40 countries, linking up national and international
grid infrastructures. The mission of the WLCG project is to
provide global computing resources ... [2000’s era]