


default search action
26th CLUSTER 2024: Kobe, Japan
- IEEE International Conference on Cluster Computing, CLUSTER 2024, Kobe, Japan, September 24-27, 2024. IEEE 2024, ISBN 979-8-3503-5871-1
- Yutong Lu, Wuchun Feng, Mohamed Wahib:
Welcome Message from the IEEE Cluster 2024 Program Chairs. xiii - Lishan Yang, George Papadimitriou, Dimitris Sartzetakis, Adwait Jog, Evgenia Smirni, Dimitris Gizopoulos:
GPU Reliability Assessment: Insights Across the Abstraction Layers. 1-13 - Jiyu Luo, Tao Yan, Qingguo Xu, Jingwei Sun, Guangzhong Sun:
Siesta: Synthesizing Proxy Applications for MPI Programs. 14-26 - Xiang Fu, Shiman Meng, Weiping Zhang, Luanzheng Guo, Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz:
Distributed Order Recording Techniques for Efficient Record-and-Replay of Multi - Threaded Programs. 27-38 - Gan Sun, Jiang Zhou, Bo Li, Xiaoyan Gu, Weiping Wang, Shuibing He:
FTGraph: A Flexible Tree-Based Graph Store on Persistent Memory for Large-Scale Dynamic Graphs. 39-50 - Xiaohui Wei, Weikai Tang, Hao Qi, Hengshan Yue:
PGSampler: Accelerating GPU-Based Graph Sampling in GNN Systems via Workload Fusion. 51-61 - Aishwarya Sarkar, Sayan Ghosh, Nathan R. Tallent, Ali Jannesari:
MassiveGNN: Efficient Training via Prefetching for Massively Connected Distributed Graphs. 62-73 - Emile Cadorel, Dimitri Saingre:
A Protocol to Assess the Accuracy of Process-Level Power Models. 74-84 - Omri Mor, George Bosilca, Marc Snir:
Holistic Performance Analysis for Asynchronous Many-Task Runtimes. 85-96 - Tomé Maseda, Jonatan Enes, Roberto R. Expósito, Juan Touriño:
Automated Approach for Accurate CPU Power Modelling. 97-107 - Majid Salimi Beni, Biagio Cosenza, Sascha Hunold:
MPI Collective Algorithm Selection in the Presence of Process Arrival Patterns. 108-119 - Gerald Collom, Derek Schafer, Amanda Bienz, Patrick G. Bridges, Galen M. Shipman:
Optimizing Neighbor Collectives with Topology Objects. 120-130 - Hamed Sharifian, Amir Hossein Sojoodi, Ahmad Afsahi:
A Topology- and Load-Aware Design for Neighborhood Allgather. 131-142 - Petros Anastasiadis, Nikela Papadopoulou, Nectarios Koziris, Georgios I. Goumas:
Uncut-GEMMs: Communication-Aware Matrix Multiplication on Multi-GPU Nodes. 143-154 - Yifei He, Stefano Markidis:
High-Performance FFT Code Generation via MLIR Linalg Dialect and SIMD Micro-Kernels. 155-165 - Bo Fang, Xinyi Li, Harvey Dam, Cheng Tan, Siva Kumar Sastry Hari, Timothy Tsai, Ignacio Laguna, Dingwen Tao, Ganesh Gopalakrishnan, Prashant J. Nair, Kevin J. Barker, Ang Li:
Understanding Mixed Precision GEMM with MPGemmFI: Insights into Fault Resilience. 166-178 - Yang Zhou, Fang Wang, Zhan Shi, Dan Feng:
Parallelism or Fairness? How to Be Friendly for SSDs in Cloud Environments. 179-189 - Pierre Jacquet, Thomas Ledoux, Romain Rouvoy:
SlackVM: Packing Virtual Machines in Oversubscribed Cloud Infrastructures. 190-201 - Ranhao Jia, Zixiao Chen, Chentao Wu, Jie Li, Minyi Guo, Hongwen Huang:
RL-Cache: An Efficient Reinforcement Learning Based Cache Partitioning Approach for Multi-Tenant CDN Services. 202-213 - Hongjian Zhang, Akira Nukada, Qiucheng Liao:
FCUFS: Core-Level Frequency Tuning for Energy Optimization on Intel Processors. 214-225 - Sejeong Oh, Gordon Euhyun Moon, Sungyong Park:
ML-Based Dynamic Operator-Level Query Mapping for Stream Processing Systems in Heterogeneous Computing Environments. 226-237 - Yao Xu, Gene Cooperman:
Enabling Practical Transparent Checkpointing for MPI: A Topological Sort Approach. 238-249 - Md Rajib Hossen, Vanessa V. Sochat, Abhik Sarkar, Mohammad A. Islam, Daniel J. Milroy:
Enabling Workload-Driven Elasticity in MPI-based Ensembles. 250-262 - Mohammad Reza Hoseiny Farahabady, Albert Y. Zomaya:
Geo-Distributed Analytical Streaming Architecture for IoT Platforms. 263-274 - Jingwen Du, Fang Wang, Dan Feng, Dexin Zeng, Sheng Yi:
Seastar: A Cache-Efficient and Load-Balanced Key-Value Store on Disaggregated Memory. 275-285 - Reza Farahani, Narges Mehran, Sashko Ristov, Radu Prodan:
HEFTLess: A Bi-Objective Serverless Workflow Batch Orchestration on the Computing Continuum. 286-296 - Jie Li, George Michelogiannakis, Samuel Maloney, Brandon Cook, Estela Suarez, John Shalf, Yong Chen:
Job Scheduling in High Performance Computing Systems with Disaggregated Memory Resources. 297-309 - Mingtian Shao, Wenzhe Zhang, Ruibo Wang, Huijun Wu, Yiqin Dai, Kai Lu:
Fully Decentralized Data Distribution for Exascale-HPC: End of the Provider-Demander Matching Puzzle. 310-321 - Shixun Wu, Yitong Ding, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Huangliang Dai, Sheng Di, Bryan M. Wong, Zizhong Chen, Franck Cappello:
FT K-Means: A High-Performance K-Means on GPU with Fault Tolerance. 322-334 - Wenqing Lin, Hemeng Wang, Haodong Deng, Qingxiao Sun:
ScalFrag: Efficient Tiled-MTTKRP with Adaptive Launching on GPUs. 335-345 - Scott Levy, Whit Schonbein, Craig D. Ulmer:
Leveraging High-Performance Data Transfer to Offload Data Management Tasks to SmartNICs. 346-356 - Meng Tang, Jaime Cernuda, Jie Ye, Luanzheng Guo, Nathan R. Tallent, Anthony Kougkas, Xian-He Sun:
DaYu: Optimizing Distributed Scientific Workflows by Decoding Dataflow Semantics and Dynamics. 357-369 - Jonathan Bader, Fabian Skalski, Fabian Lehmann, Dominik Scheinert, Jonathan Will, Lauritz Thamsen, Odej Kao:
Sizey: Memory-Efficient Execution of Scientific Workflow Tasks. 370-381 - Jannis Klinkenberg, Clément Foyer, Pierre Clouzet, Brice Goglin, Emmanuel Jeannot, Christian Terboven, Anara Kozhokanova:
Phase-Based Data Placement Optimization in Heterogeneous Memory. 382-393 - Wenyang Zhao, Osamu Miyashita, Miki Nakano, Florence Tama:
Xphase3d: Memory-Distributed Phase Retrieval for Reconstructing Large-Scale 3D Density Maps of Biological Macromolecules. 394-402 - Chunhong Du, Shanjiang Tang, Song Meng, Jiekai Gou, Ce Yu, Yusen Li, Hao Fu, Ye Tian, Ding Yuan:
Accuracy-Efficiency Optimization for Multi-Stage Small Object Detection in Surveillance Video with Collaborative Frame Sampling. 403-413 - Keichi Takahashi, Takashi Abe, Akihiro Musa, Yoshihiko Sato, Yoichi Shimomura, Hiroyuki Takizawa, Shunichi Koshimura:
Modernizing an Operational Real-Time Tsunami Simulator to Support Diverse Hardware Platforms. 414-425 - Ahmad Tarraf, Javier Fernández Muñoz, David E. Singh, Taylan Özden, Jesús Carretero, Felix Wolf:
I/O Behind the Scenes: Bandwidth Requirements of HPC Applications with Asynchronous I/O. 426-439 - Sohei Koyama, Kohei Hiraga, Osamu Tatebe:
FINCHFS: Design of Ad-Hoc File System for I/O Heavy HPC Workloads. 440-450 - Yujie Shi, Yu Hua, Jianming Huang:
A High-Performance and Fast-Recovery Scheme for Secure Non-Volatile Memory Systems. 451-463

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.