skip to main content
10.1145/3472456.3472461acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicppConference Proceedingsconference-collections
research-article

gem5 + rtl: A Framework to Enable RTL Models Inside a Full-System Simulator

Published: 05 October 2021 Publication History

Abstract

In recent years there has been a surge of interest in designing custom accelerators for power-efficient high-performance computing. However, available tools to simulate low-level RTL designs often neglect the target system in which the design will operate. This hinders proper testing and debugging of functionalities, and does not allow co-designing the accelerator to obtain a balanced and efficient architecture.
In this paper, we introduce gem5 + rtl, a flexible framework that enables simulation of RTL models inside a full-system software simulator. We present the framework’s functionality that allows easy integration of RTL models on a simulated system-on-chip (SoC) that is able to boot Linux and run complex multi-threaded and multi-programmed workloads. We demonstrate the framework with two relevant use cases that integrate a multi-core SoC with a Performance Monitoring Unit (PMU) and the NVIDIA Deep Learning Accelerator (NVDLA), showcasing how the framework enables testing RTL model features and how it can enable co-design taking into account the entire SoC.

References

[1]
2018. NVDLA Github webpage. (2018). https://github.com/nvdla/hw
[2]
Anandtech. 2019. Hot Chips 31: Tesla Solution for Full Self Driving Car. (2019). https://www.anandtech.com/show/14766/hot-chips-31-live-blogs-tesla-solution-for-full-self-driving
[3]
Ehsan K. Ardestani and Jose Renau. 2013. ESESC: A fast multicore simulator using Time-Based Sampling. In High Performance Computer Architecture, 2013 IEEE 19th International Symposium on. 448–459.
[4]
Eduardo Argollo, Ayose Falcón, Paolo Faraboschi, Matteo Monchiero, and Daniel Ortega. 2009. COTSon: infrastructure for full system simulation. ACM SIGOPS Operating Systems Review 43, 1 (2009), 52–61.
[5]
ARM. 2011. AMBA AXI and ACE Protocol Specification.(2011). https://static.docs.arm.com/ihi0022/g/IHI0022G_amba_axi_protocol_spec.pdf
[6]
Owen Astrachan. 2003. Bubble Sort: An Archaeological Algorithmic Analysis. In Proceedings of the 34th SIGCSE Technical Symposium on Computer Science Education (Reno, Navada, USA).
[7]
J. Bachrach, H. Vo, B. Richards, Y. Lee, A. Waterman, R. Avižienis, J. Wawrzynek, and K. Asanović. 2012. Chisel: Constructing hardware in a Scala embedded language. In DAC Design Automation Conference 2012.
[8]
Nathan Binkert, Bradford Beckmann, Gabriel Black, Steven K. Reinhardt, Ali Saidi, Arkaprava Basu, Joel Hestness, Derek R. Hower, Tushar Krishna, Somayeh Sardashti, Rathijit Sen, Korey Sewell, Muhammad Shoaib, Nilay Vaish, Mark D. Hill, and David A. Wood. 2011. The Gem5 Simulator. SIGARCH Comput. Archit. News 39, 2 (Aug. 2011), 1–7.
[9]
Trevor E Carlson, Wim Heirman, and Lieven Eeckhout. 2011. Sniper: Exploring the level of abstraction for scalable and accurate parallel multi-core simulation. In High Performance Computing, Networking, Storage and Analysis, 2011 International Conference for. 1–12.
[10]
D. Chiou, D. Sunwoo, J. Kim, N. A. Patil, W. Reinhart, D. E. Johnson, J. Keefe, and H. Angepat. 2007. FPGA-Accelerated Simulation Technologies (FAST): Fast, Full-System, Cycle-Accurate Simulators. In MICRO.
[11]
Cobham-Gaisler. 2006. LEON3FT Fault-tolerant processor. (2006). https://www.gaisler.com/index.php/products/processors/leon3ft
[12]
J. Cong, Z. Fang, M. Gill, and G. Reinman. 2015. PARADE: A cycle-accurate full-system simulation Platform for Accelerator-Rich Architectural Design and Exploration. In ICCAD.
[13]
Embecosm. 2019. High Performance SoC Modeling with Verilator. (2019). https://www.embecosm.com/appnotes/ean6/embecosm-or1k-verilator-tutorial-ean6-issue-1.html
[14]
Farzad Farshchi, Qijing Huang, and Heechul Yun. 2019. Integrating nvidia deep learning accelerator (nvdla) with risc-v soc on firesim. arXiv preprint arXiv:1903.06495(2019).
[15]
L. Feng, H. Liang, S. Sinha, and W. Zhang. 2017. HeteroSim: A Heterogeneous CPU-FPGA Simulator. IEEE Comput. Archit. Lett. (Jan 2017).
[16]
Davy Genbrugge, Stijn Eyerman, and Lieven Eeckhout. 2010. Interval simulation: Raising the level of abstraction in architectural simulation. In High Performance Computer Architecture, 2010 IEEE 16th International Symposium on. 1–12.
[17]
Tristan Gingold. 2007. Ghdl. (2007). http://ghdl.free.fr/
[18]
Martin A. Goetz. 1963. Internal and Tape Sorting Using the Replacement-Selection Technique. (1963), 6 pages.
[19]
T. Grass, C. Allande, A. Armejach, A. Rico, E. Ayguade, J. Labarta, M. Valero, M. Casas, and M. Moreto. 2016. MUSA: A Multi-level Simulation Approach for Next-Generation HPC Machines. In SC ’16. 526–537.
[20]
C. A. R. Hoare. 1962. Quicksort. Comput. J. 5, 1 (01 1962), 10–16.
[21]
S. Karandikar, H. Mao, D. Kim, D. Biancolin, A. Amid, and D. Lee et al.2018. FireSim: FPGA-Accelerated Cycle-Exact Scale-Out System Simulation in the Public Cloud. In 45th ISCA. 29–42.
[22]
A. Khan, M. Vijayaraghavan, S. Boyd-Wickizer, and Arvind. 2012. Fast and cycle-accurate modeling of a multicore processor. In ISPASS.
[23]
Sheng Li, Jung Ho Ahn, Richard D. Strong, Jay B. Brockman, Dean M. Tullsen, and Norman P. Jouppi. 2009. McPAT: An Integrated Power, Area, and Timing Modeling Framework for Multicore and Manycore Architectures. In MICRO ’09.
[24]
T. Liang, L. Feng, S. Sinha, and W. Zhang. 2007. PAAS: A system level simulator for heterogeneous computing architectures. In FPL.
[25]
Jason Lowe-Power, Abdul Mutaal Ahmad, Ayaz Akram, Mohammad Alian, Rico Amslinger, 2020. The gem5 Simulator: Version 20.0+. CoRR abs/2007.03152(2020).
[26]
NVIDIA. 2018. Integrator’s Manual. (2018). http://nvdla.org/hw/v1/integration_guide.html
[27]
NVIDIA. 2019. Jetson AGX Xavier and the new era of autonomous machines. (2019). http://info.nvidia.com/rs/156-OFN-742/images/Jetson_AGX_Xavier_New_Era_Autonomous_Machines.pdf
[28]
Tetsuya Odajima, Yuetsu Kodama, Miwako Tsuji, Motohiko Matsuda, Yutaka Maruyama, and Mitsuhisa Sato. 2020. Preliminary Performance Evaluation of the Fujitsu A64FX Using HPC Applications. In 2020 IEEE International Conference on Cluster Computing (CLUSTER). 523–530.
[29]
M. Pellauer, M. Adler, M. Kinsy, A. Parashar, and J. Emer. 2011. HAsim: FPGA-based high-detail multicore simulation using time-division multiplexing. In 17th HPCA. 406–417.
[30]
Daniel Sanchez and Christos Kozyrakis. 2013. ZSim: fast and accurate microarchitectural simulation of thousand-core systems. In ACM SIGARCH Computer Architecture News, Vol. 41. 475–486.
[31]
Amazon Web Services. 2016. Amazon EC2 F1 instances. (2016). https://aws.amazon.com/ec2/instance-types/f1/
[32]
Yakun Sophia Shao, Brandon Reagen, Gu-Yeon Wei, and David Brooks. 2014. Aladdin: A Pre-RTL, Power-Performance Accelerator Simulator Enabling Large Design Space Exploration of Customized Architectures. In 41st ISCA.
[33]
Y. S. Shao, S. L. Xi, V. Srinivasan, G. Wei, and D. Brooks. 2012. Co-designing accelerators and SoC interfaces using gem5-Aladdin. In MICRO ’16.
[34]
W. Snyder. 2012. Verilator the fast free verilog simulator. (2012). http://www.veripool.org
[35]
Z. Tan, A. Waterman, R. Avizienis, Y. Lee, H. Cook, D. Patterson, and K. Asanovic. 2010. RAMP gold: An FPGA-based architecture simulator for multiprocessors. In DAC. 463–468.
[36]
Rafael Ubal, Byunghyun Jang, Perhaad Mistry, Dana Schaa, and David Kaeli. 2012. Multi2Sim: A Simulation Framework for CPU-GPU Computing. In PACT.
[37]
John Wawrzynek, David Patterson, Mark Oskin, Shih-Lien Lu, Christoforos Kozyrakis, James C Hoe, Derek Chiou, and Krste Asanovic. 2007. RAMP: Research accelerator for multiple processors. IEEE Micro 27, 2 (2007), 46–57.

Cited By

View all
  • (2024)gem5-NVDLA: A Simulation Framework for Compiling, Scheduling, and Architecture Evaluation on AI System-on-ChipsACM Transactions on Design Automation of Electronic Systems10.1145/366199729:5(1-20)Online publication date: 29-Apr-2024
  • (2024)Backward-Edge Control Flow Integrity Based on Return Address Encryption2024 IEEE 17th International Conference on Solid-State & Integrated Circuit Technology (ICSICT)10.1109/ICSICT62049.2024.10831556(1-3)Online publication date: 22-Oct-2024
  • (2024)Chimera: A co-simulation framework combining with gem5 and FPGA platform for efficient verification2024 34th International Conference on Field-Programmable Logic and Applications (FPL)10.1109/FPL64840.2024.00027(133-139)Online publication date: 2-Sep-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

ICPP '21: Proceedings of the 50th International Conference on Parallel Processing
August 2021
927 pages
ISBN:9781450390682
DOI:10.1145/3472456
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 October 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Accelerators
  2. GHDL
  3. Heterogeneous computing
  4. RTL
  5. Simulation
  6. System-On-Chip (SoC)
  7. Verilator
  8. gem5

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • ERDF Operational Program of Catalonia 2014-2020
  • Ministerio de Ciencia, Innovación y Universidades
  • Horizon 2020 Framework Programme
  • Juan de la Cierva postdoctoral fellowship
  • Ajuts per a la contractació de personal investigador novell
  • Ramón y Cajal fellowship

Conference

ICPP 2021

Acceptance Rates

Overall Acceptance Rate 91 of 313 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)215
  • Downloads (Last 6 weeks)31
Reflects downloads up to 20 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)gem5-NVDLA: A Simulation Framework for Compiling, Scheduling, and Architecture Evaluation on AI System-on-ChipsACM Transactions on Design Automation of Electronic Systems10.1145/366199729:5(1-20)Online publication date: 29-Apr-2024
  • (2024)Backward-Edge Control Flow Integrity Based on Return Address Encryption2024 IEEE 17th International Conference on Solid-State & Integrated Circuit Technology (ICSICT)10.1109/ICSICT62049.2024.10831556(1-3)Online publication date: 22-Oct-2024
  • (2024)Chimera: A co-simulation framework combining with gem5 and FPGA platform for efficient verification2024 34th International Conference on Field-Programmable Logic and Applications (FPL)10.1109/FPL64840.2024.00027(133-139)Online publication date: 2-Sep-2024
  • (2023)Accelerating Loop-Oriented RTL Simulation With Code InstrumentationIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2023.327693942:12(4985-4998)Online publication date: 17-May-2023
  • (2023)ERAS: A Flexible and Scalable Framework for Seamless Integration of RTL Models with Structural Simulation Toolkit2023 IEEE International Symposium on Workload Characterization (IISWC)10.1109/IISWC59245.2023.00038(196-200)Online publication date: 1-Oct-2023
  • (2023)Ethernet Emulation over PCIe for RISC-V Software Development Vehicles2023 38th Conference on Design of Circuits and Integrated Systems (DCIS)10.1109/DCIS58620.2023.10335994(1-6)Online publication date: 15-Nov-2023
  • (2023)Coupled Data Prefetch and Cache Partitioning Scheme for CPU-Accelerator System2023 IEEE 15th International Conference on ASIC (ASICON)10.1109/ASICON58565.2023.10396658(1-4)Online publication date: 24-Oct-2023
  • (2023)Characterization of a Coherent Hardware Accelerator Framework for SoCsEmbedded Computer Systems: Architectures, Modeling, and Simulation10.1007/978-3-031-46077-7_7(91-106)Online publication date: 2-Jul-2023
  • (2022)Build Automation Framework for Design Validation of Automotive Gateway Controllers2022 IFIP Networking Conference (IFIP Networking)10.23919/IFIPNetworking55013.2022.9829801(1-6)Online publication date: 13-Jun-2022
  • (2022)Prof5: A RISC-V profiler tool2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)10.1109/SBAC-PAD55451.2022.00031(201-210)Online publication date: Nov-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy