The gem5 Simulator: Version 20.0+

Abdul Mutaal Ahmad; Adri\`a Armejach; Adrian Herrera; Alec Roelke; Amin Farmahini-Farahani; Andrea Mondelli; Andreas Hansson; Andreas Sandberg; Anthony Gutierrez; Austin Harris

arxiv: 2007.03152 · v2 · pith:TDB4S4EHnew · submitted 2020-07-07 · 💻 cs.AR

The gem5 Simulator: Version 20.0+

Jason Lowe-Power , Abdul Mutaal Ahmad , Ayaz Akram , Mohammad Alian , Rico Amslinger , Matteo Andreozzi , Adri\`a Armejach , Nils Asmussen

show 70 more authors

Brad Beckmann Srikant Bharadwaj Gabe Black Gedare Bloom Bobby R. Bruce Daniel Rodrigues Carvalho Jeronimo Castrillon Lizhong Chen Nicolas Derumigny Stephan Diestelhorst Wendy Elsasser Carlos Escuin Marjan Fariborz Amin Farmahini-Farahani Pouya Fotouhi Ryan Gambord Jayneel Gandhi Dibakar Gope Thomas Grass Anthony Gutierrez Bagus Hanindhito Andreas Hansson Swapnil Haria Austin Harris Timothy Hayes Adrian Herrera Matthew Horsnell Syed Ali Raza Jafri Radhika Jagtap Hanhwi Jang Reiley Jeyapaul Timothy M. Jones Matthias Jung Subash Kannoth Hamidreza Khaleghzadeh Yuetsu Kodama Tushar Krishna Tommaso Marinelli Christian Menard Andrea Mondelli Miquel Moreto Tiago M\"uck Omar Naji Krishnendra Nathella Hoa Nguyen Nikos Nikoleris Lena E. Olson Marc Orr Binh Pham Pablo Prieto Trivikram Reddy Alec Roelke Mahyar Samani Andreas Sandberg Javier Setoain Boris Shingarov Matthew D. Sinclair Tuan Ta Rahul Thakur Giacomo Travaglini Michael Upton Nilay Vaish Ilias Vougioukas William Wang Zhengrong Wang Norbert Wehn Christian Weis David A. Wood Hongil Yoon \'Eder F. Zulian

This is my paper

classification 💻 cs.AR

keywords gem5simulatorcomputerarchitecturebeenfeaturesmodelrelease

0 comments

read the original abstract

The open-source and community-supported gem5 simulator is one of the most popular tools for computer architecture research. This simulation infrastructure allows researchers to model modern computer hardware at the cycle level, and it has enough fidelity to boot unmodified Linux-based operating systems and run full applications for multiple architectures including x86, Arm, and RISC-V. The gem5 simulator has been under active development over the last nine years since the original gem5 release. In this time, there have been over 7500 commits to the codebase from over 250 unique contributors which have improved the simulator by adding new features, fixing bugs, and increasing the code quality. In this paper, we give and overview of gem5's usage and features, describe the current state of the gem5 simulator, and enumerate the major changes since the initial release of gem5. We also discuss how the gem5 simulator has transitioned to a formal governance model to enable continued improvement and community support for the next 20 years of computer architecture research.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 17 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Speed Kills: Exploring Confused Deputy Attacks Through Edge AI Accelerators
cs.CR 2026-05 conditional novelty 8.0

An empirical security study shows confused deputy attacks are practical on most edge AI accelerators via a new LLM-assisted analysis framework, with vendor-confirmed impact on over 100 million devices.
CHIA: An open-source framework for principled, agentic AI-driven hardware/software co-design research
cs.AR 2026-06 unverdicted novelty 7.0

CHIA introduces a framework for building and deploying agentic AI co-design flows as CHIA loops with tool nodes, reliability mechanisms, and five case-study demonstrations.
CHIA: An open-source framework for principled, agentic AI-driven hardware/software co-design research
cs.AR 2026-06 unverdicted novelty 7.0

CHIA is an open-source framework for agentic AI-driven hardware/software co-design using CHIA loops as directed cyclic graphs, a tool library, and features for reliable experimentation, shown via five case studies.
OpenURMA: A Clean-Room Open Implementation of the Unified Bus Protocol
cs.AI 2026-05 unverdicted novelty 7.0

OpenURMA is the first clean-room open implementation of the Unified Bus transport and transaction layers, showing ~500 ns end-to-end latency for 64-byte remote loads versus 2186 ns for RoCEv2 RC.
Scalable Packed Layouts for Vector-Length-Agnostic ML Code Generation
cs.PF 2026-05 conditional novelty 7.0

Presents scalable packed layouts and extensions to tiling/fusion/vectorization in MLIR/IREE for VLA ML code generation on Arm SVE, achieving up to 1.45x speedup over NEON and outperforming PyTorch frameworks.
SPEC CPU: The Next Generation
cs.PF 2026-05 unverdicted novelty 7.0

SPEC CPU 2026 presents a new benchmark suite using open-source apps, expanded multithreading, and Rolling-Round-Robin Rate to address gaps in evaluating heterogeneous multiprogrammed CPU performance.
InjectV: Modeling Fault Injection Attacks in RISC-V Simulation Environment
cs.CR 2026-06 unverdicted novelty 6.0

InjectV is a gem5-based framework for precise fault injection in RISC-V that identifies attack points on FISSC security benchmarks with a claimed 95.8% time saving versus traditional methods.
Distributed Persistence Domain for Persistent Memory Pooling
cs.ET 2026-06 unverdicted novelty 6.0

Proposes Distributed Persistence Domain and Persistent CXL Switch to enable low-latency persistence operations at CXL switch level while maintaining crash consistency in disaggregated memory.
Throughput-Optimized Networks at Scale
cs.NI 2026-05 unverdicted novelty 6.0

TONS uses linear optimization and heuristics to synthesize deadlock-free network topologies and routing for datacenter AI training, reporting 2.1x and 1.6x geometric mean speedups over best TPU torus variants for unif...
HammerSim: A System-Level Tool to Model RowHammer
cs.CR 2026-05 unverdicted novelty 6.0

HammerSim is a gem5-based full-system framework for modeling RowHammer with probability-driven bitflip simulation, validated against real DDR4 DIMMs via JS divergence.
Scalable Packed Layouts for Vector-Length-Agnostic ML Code Generation
cs.PF 2026-05 unverdicted novelty 6.0

Packed layouts and extensions to tiling/fusion/vectorization in MLIR/IREE enable VLA ML code generation for SVE, achieving up to 1.45x speedup over NEON and outperforming PyTorch frameworks while scaling with vector length.
Understanding Simulated Architecture via gem5 Call-Stack Profiling
cs.AR 2026-05 unverdicted novelty 6.0

A specialized profiling tool using Linux perf_event samples gem5 call-stacks to expose simulated architecture behaviors such as TimingSimpleCPU inefficiencies and cache coherence deadlocks not visible in conventional stats.
PG-MDP: Profile-Guided Memory Dependence Prediction for Area-Constrained Cores
cs.PL 2026-04 unverdicted novelty 6.0

Profile-guided opcode labeling removes consistently independent loads from the MDP working set, cutting queries 79%, false dependencies 77%, and raising small-core IPC 1.47% on SPEC2017 intspeed.
DARTH-PUM: A Hybrid Processing-Using-Memory Architecture
cs.AR 2026-02 unverdicted novelty 6.0

DARTH-PUM integrates analog and Boolean PUM with optimized peripherals, coordination hardware, and a programming interface to run kernels like AES, CNNs, and LLMs fully in memory, achieving speedups of 59.4x, 14.8x, a...
ASTRA-sim 3.0: Next-Level Distributed Machine Learning Simulations via High-Fidelity GPU and Infrastructure Modeling
cs.DC 2026-06 unverdicted novelty 5.0

ASTRA-sim 3.0 introduces cache-line load-store simulation, a detailed GPU execution model, and InfraGraph to support high-fidelity distributed machine learning infrastructure simulations.
Akita: A High Usability Simulation Framework for Computer Architecture
cs.DC 2026-04 unverdicted novelty 5.0

Akita is a decoupled simulation engine that lets developers write simple single-threaded cycle-based code while automatically delivering event-driven performance, transparent parallel execution, and built-in tracing f...
Ramulator 2.1: A Composable Memory System Simulator for Modern DRAM Systems
cs.AR 2026-06 unverdicted novelty 4.0

Ramulator 2.1 is an updated open-source DRAM simulator adding support for recent memory standards, a Python modeling interface, and enhanced validation workflows.