CHIA: An open-source framework for principled, agentic AI-driven hardware/software co-design research

Angela Cui; Borivoje Nikolic; Chengyi Lux Zhang; Christopher W. Fletcher; Ella Schwarz; Ferran Hermida-Rivera; Jack Toubes; Jim Fang; Junha Kim; Raghav Gupta

arxiv: 2606.27350 · v1 · pith:U5JHFPMUnew · submitted 2026-06-25 · 💻 cs.AR

CHIA: An open-source framework for principled, agentic AI-driven hardware/software co-design research

Angela Cui , Ferran Hermida-Rivera , Jack Toubes , Raghav Gupta , Jim Fang , Chengyi Lux Zhang , Ella Schwarz , Junha Kim

show 4 more authors

Yakun Sophia Shao Borivoje Nikolic Christopher W. Fletcher Sagar Karandikar

This is my paper

Pith reviewed 2026-06-26 01:47 UTC · model grok-4.3

classification 💻 cs.AR

keywords hardware/software co-designagentic AICHIA frameworkCHIA loopsopen-sourcecomputer architectureAI-driven workflowsfault-tolerant execution

0 comments

The pith

CHIA turns hardware and software co-design into agentic AI loops with built-in reliability.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper presents CHIA, a framework that makes building and running AI-driven hardware and software co-design workflows the main goal rather than an afterthought. In CHIA, these workflows appear as CHIA loops, which are directed cyclic graphs connecting nodes that run tools such as simulators, CAD software, and AI models. The framework supplies ready-made nodes for popular tools and adds features like isolation of AI models from hardware tools, performance profiling, and automatic recovery from failures. These capabilities are meant to support research at scale across many different computing systems. A sympathetic reader would care because this could let AI assist with complex design tasks in a repeatable, measurable way instead of one-off experiments.

Core claim

CHIA treats the productive construction and scalable deployment of the co-design flow itself as a first-class objective. In CHIA, agentic AI-driven hardware and software design flows are expressed as CHIA loops: directed cyclic graphs whose nodes execute various system-on-chip design tools, microarchitectural simulators, software build systems, AI models, evolutionary coding agents, and more. The CHIA library provides node implementations for many popular tools, and the system supplies isolation, profiling, fault-tolerant execution, and reliability at scale.

What carries the argument

CHIA loops, directed cyclic graphs whose nodes execute design tools, simulators, AI models and other components, carrying the flow of agentic design.

If this is right

Five case studies demonstrate loops for RTL-to-simulator alignment, LLM-driven RTL changes, critical path optimization, evolutionary discovery, and GitHub issue resolution.
Research can move from small isolated demonstrations to workflows that run reliably on hundreds of heterogeneous systems.
Agentic methods become applicable to real tools like Chipyard, gem5, and commercial CAD flows.
The same structure supports both evolutionary coding agents and LLM-based agents in one framework.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

CHIA's isolation mechanisms could reduce risks when AI agents interact with sensitive hardware design tools.
Extending the node library might allow integration with additional domains such as compiler passes or operating system kernels.
Researchers could test whether the fault tolerance actually reduces the human effort needed for large co-design experiments.
The approach might generalize to other agentic AI applications in engineering fields beyond hardware.

Load-bearing premise

The library of nodes and the reliability features will be enough to handle the complexity of actual hardware and software co-design tasks without needing major extra work.

What would settle it

A measurement showing whether a CHIA loop for one of the case studies completes successfully across a large set of heterogeneous systems with minimal human intervention.

Figures

Figures reproduced from arXiv: 2606.27350 by Angela Cui, Borivoje Nikolic, Chengyi Lux Zhang, Christopher W. Fletcher, Ella Schwarz, Ferran Hermida-Rivera, Jack Toubes, Jim Fang, Junha Kim, Raghav Gupta, Sagar Karandikar, Yakun Sophia Shao.

**Figure 1.** Figure 1: Executive summary of this work. system-on-chip design frameworks [6, 14, 63, 87] or microarchitectural simulators [16, 20, 37, 60, 70, 89]) makes it challenging to enforce verification and validation requirements that are critical to successful hardware design while minimizing human review overhead. Though several design flows have been proposed (e.g., evolutionary coding agents, multi-agent collaborati… view at source ↗

**Figure 2.** Figure 2: Simple example CHIA workflow for turning a specification into an RTL description of hardware. The CHIA loop takes a specification [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: CHIA loop for automatically generating a representative [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Average difference in gem5 and Verilator simulation cycle [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

**Figure 8.** Figure 8: Wall-clock execution profile of the agentic RISC-V [PITH_FULL_IMAGE:figures/full_fig_p010_8.png] view at source ↗

**Figure 9.** Figure 9: OpenSSL speedup when executed with the Crypto extension [PITH_FULL_IMAGE:figures/full_fig_p011_9.png] view at source ↗

**Figure 11.** Figure 11: Timing optimization tree of results, showing maximum achievable frequency and area of each iteration when synthesized in [PITH_FULL_IMAGE:figures/full_fig_p012_11.png] view at source ↗

**Figure 12.** Figure 12: Architecture of an illustrative agentic discovery CHIA [PITH_FULL_IMAGE:figures/full_fig_p013_12.png] view at source ↗

**Figure 13.** Figure 13: Architecture of the CIRCT GitHub issue fixing and PR [PITH_FULL_IMAGE:figures/full_fig_p014_13.png] view at source ↗

read the original abstract

Agentic artificial intelligence shows great promise for radically improving the pace of innovation in hardware/software co-design research across computer architecture, systems, compilers, and VLSI. Thus far, however, applications of AI in these contexts have generally been demonstrated in isolated settings on small-scale problems, due to the difficulty of designing and deploying complex AI-infused hardware and software development workflows. This paper introduces CHIA, an open-source hardware/software co-design framework for agile and principled research on the application of AI to co-design. CHIA treats the productive construction and scalable deployment of the co-design flow itself as a first-class objective. In CHIA, agentic AI-driven hardware and software design flows are expressed as \textit{CHIA loops}: directed cyclic graphs whose nodes execute various system-on-chip design tools, microarchitectural simulators, software build systems, AI models, evolutionary coding agents, and more. The \textit{CHIA library} provides node implementations for many popular tools, including Chipyard, gem5, ChampSim, FireSim, Hammer (thus several commercial ASIC CAD tools), Vivado, AlphaEvolve, AdaEvolve, and many others. CHIA also provides a broad set of features to conduct principled science around these flows. These include isolation between AI models and hardware tools, profiling mechanisms, fault-tolerant execution, and reliability at scale across hundreds of heterogeneous systems (CPUs, FPGAs, GPUs, etc., across public cloud/on-prem.). To showcase CHIA, we present five CHIA loops as case studies: (1) automatic RTL-to-gem5 simulator alignment, (2) LLM-driven implementation of microarchitectural features in RTL, (3) agentic, IPC-aware critical path optimization, (4) evolutionary architectural discovery, and (5) maintainer-friendly agentic GitHub issue fixing.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

CHIA introduces a practical open-source framework for structuring AI-driven co-design as loops with tool integrations and reliability hooks, but the abstract supplies no measurements to support the scalability and fault-tolerance claims.

read the letter

The main thing to know is that CHIA is a new open-source framework that represents hardware/software co-design workflows as directed cyclic graphs called CHIA loops. Nodes in these loops call out to simulators, RTL tools, build systems, and AI agents, and the library ships implementations for Chipyard, gem5, FireSim, Hammer, Vivado, and several evolutionary coding agents. The authors also add isolation between AI components and hardware tools, plus profiling and fault-tolerant execution meant to run across hundreds of heterogeneous machines.

This framing and the bundled library are the clearest new pieces. Treating the construction and reliable deployment of the flow itself as a first-class goal is a reasonable engineering response to the problem of isolated, small-scale AI demos in architecture. The five case studies (RTL-to-gem5 alignment, LLM-driven microarchitectural changes, IPC-aware optimization, evolutionary discovery, and GitHub issue fixing) cover a useful spread of tasks.

The soft spot is the missing data. The abstract describes the features and lists the case studies but reports no success rates, recovery statistics, overhead numbers, or results across more than a handful of systems. The stress-test note about unmeasured assertions on isolation and fault tolerance therefore lands. Without those measurements it is hard to judge whether the reliability mechanisms actually enable principled research at scale.

Citation patterns look normal for a tools paper; the work references the simulators and CAD tools it wraps rather than over-claiming prior results. No circularity or invented entities appear in the argument.

This paper is aimed at computer-architecture researchers who want to experiment with agentic AI on real co-design problems without building the plumbing from scratch. A reader already working on similar flows would get concrete value from the node library and the loop abstraction. It deserves a serious referee because the engineering scope is substantial and the open-source release could be useful even if the current evaluation is mostly descriptive. I would send it out for review and expect the referees to ask for quantitative results on the case studies.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces CHIA, an open-source framework for agentic AI-driven hardware/software co-design. Design flows are expressed as CHIA loops (directed cyclic graphs whose nodes invoke tools such as Chipyard, gem5, ChampSim, FireSim, Hammer, Vivado, and evolutionary agents). The CHIA library supplies node implementations for these tools, while additional mechanisms provide isolation between AI models and hardware tools, profiling, fault-tolerant execution, and claimed reliability across hundreds of heterogeneous systems. Five case studies are presented: automatic RTL-to-gem5 alignment, LLM-driven microarchitectural RTL implementation, agentic IPC-aware critical-path optimization, evolutionary architectural discovery, and maintainer-friendly agentic GitHub issue fixing.

Significance. If the isolation, fault-tolerance, and scalability mechanisms prove effective in practice, CHIA could meaningfully accelerate systematic experimentation in AI-assisted co-design by reducing the engineering overhead of deploying complex, multi-tool workflows. The open-source release together with the broad node library for widely used tools (gem5, FireSim, Hammer, etc.) is a concrete strength that supports reproducibility and adoption.

major comments (2)

[Abstract] Abstract: the claim that the node library plus isolation/profiling/fault-tolerance mechanisms suffice for 'principled science' and 'reliability at scale across hundreds of heterogeneous systems' is asserted without any reported measurements of isolation effectiveness, fault-recovery rates, execution overhead, or success rates on >10 systems; this is load-bearing for the central sufficiency argument.
[Case studies] Case studies paragraph: the five CHIA loops are enumerated at a high level but the manuscript supplies no quantitative results, error analysis, or verification data, leaving the reader unable to assess whether the framework actually enables the claimed scalable and principled research.

minor comments (2)

The terms 'CHIA loops' and 'CHIA library' are introduced without a preceding formal definition or reference to a diagram; a short definitional sentence in the introduction would improve readability.
[Abstract] The abstract lists many tools (Chipyard, gem5, ChampSim, FireSim, Hammer, Vivado, AlphaEvolve, AdaEvolve) but does not indicate which subset is actually exercised in the five case studies; an explicit mapping would clarify coverage.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback, which identifies key areas where the manuscript's claims require stronger support or qualification. We address each major comment below.

read point-by-point responses

Referee: [Abstract] Abstract: the claim that the node library plus isolation/profiling/fault-tolerance mechanisms suffice for 'principled science' and 'reliability at scale across hundreds of heterogeneous systems' is asserted without any reported measurements of isolation effectiveness, fault-recovery rates, execution overhead, or success rates on >10 systems; this is load-bearing for the central sufficiency argument.

Authors: We agree that the abstract asserts sufficiency for principled science and reliability at scale without accompanying quantitative measurements of the mechanisms. The manuscript describes the isolation, profiling, and fault-tolerance features and states their intended purpose, but does not report empirical data such as recovery rates or overheads. We will revise the abstract to remove or qualify these unmeasured claims, limiting assertions to the framework design and the illustrative use in the case studies. revision: yes
Referee: [Case studies] Case studies paragraph: the five CHIA loops are enumerated at a high level but the manuscript supplies no quantitative results, error analysis, or verification data, leaving the reader unable to assess whether the framework actually enables the claimed scalable and principled research.

Authors: The case studies section presents the five loops at a high level to demonstrate how CHIA expresses and deploys co-design flows. We acknowledge that the current text provides no quantitative results, error analysis, or verification data, which limits the ability to evaluate effectiveness. We will expand this section in revision to include available quantitative outcomes, error rates, and verification steps from the experiments underlying each case study. revision: yes

Circularity Check

0 steps flagged

No circularity: framework description with no derivations or fitted predictions

full rationale

The paper introduces an open-source software framework (CHIA) for expressing co-design flows as directed graphs with library nodes and reliability features. It lists five case studies but contains no equations, parameter fits, predictions, or uniqueness theorems. No self-citation chains or ansatzes are invoked to justify core claims; the work is a descriptive systems paper whose assertions rest on implementation and case-study existence rather than any self-referential reduction. This matches the default non-circular outcome for framework papers.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 2 invented entities

The paper introduces new abstractions (CHIA loops and the CHIA library) as core contributions; no free parameters or background axioms are invoked beyond standard software engineering assumptions.

invented entities (2)

CHIA loops no independent evidence
purpose: Express agentic AI-driven hardware/software design flows as directed cyclic graphs whose nodes execute design tools and AI models
Core new abstraction introduced to treat workflow construction as first-class
CHIA library no independent evidence
purpose: Provide reusable node implementations for tools including Chipyard, gem5, FireSim, Hammer, and AI agents
New library component presented as part of the framework

pith-pipeline@v0.9.1-grok · 5918 in / 1334 out tokens · 34687 ms · 2026-06-26T01:47:08.223928+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

118 extracted references · 28 canonical work pages

[1]

SPEC CPU 2006 benchmark

2006. SPEC CPU 2006 benchmark. spec.org. https://www.spec.org/cpu2006/

2006
[2]

Murray, Benoit Steiner, Paul Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng

Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, Man- junath Kudlur, Josh Levenberg, Rajat Monga, Sherry Moore, Derek G. Murray, Benoit Steiner, Paul Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: A system f...

Pith/arXiv arXiv 2016
[3]

Stefan Abi-Karam and Cong Hao. 2025. HLS-Eval: A Benchmark and Frame- work for Evaluating LLMs on High-Level Synthesis Design Tasks. In2025 IEEE International Conference on LLM-Aided Design (ICLAD). 219–226. doi:10.1109/ ICLAD65226.2025.00021

arXiv 2025
[4]

Dimakis, Ion Stoica, Dan Klein, Matei Zaharia, and Omar Khattab

Lakshya A Agrawal, Shangyin Tan, Dilara Soylu, Noah Ziems, Rishi Khare, Krista Opsahl-Ong, Arnav Singhvi, Herumb Shandilya, Michael J Ryan, Meng Jiang, Christopher Potts, Koushik Sen, Alexandros G. Dimakis, Ion Stoica, Dan Klein, Matei Zaharia, and Omar Khattab. 2026. GEPA: Reflective Prompt Evo- lution Can Outperform Reinforcement Learning. arXiv:2507.19...

Pith/arXiv arXiv 2026
[5]

Elisavet Lydia Alvanaki, Kevin Lee, and Luca P. Carloni. 2025. SLDB: An End-To- End Heterogeneous System-on-Chip Benchmark Suite for LLM-Aided Design. arXiv:2507.06376 [cs.AR] https://arxiv.org/abs/2507.06376

arXiv 2025
[6]

Alon Amid, David Biancolin, Abraham Gonzalez, Daniel Grubb, Sagar Karandikar, Harrison Liew, Albert Magyar, Howard Mao, Albert Ou, Nathan Pemberton, Paul Rigge, Colin Schmidt, John Wright, Jerry Zhao, Yakun Sophia Shao, Krste Asanović, and Borivoje Nikolić. 2020. Chipyard: Integrated Design, Simulation, and Implementation Framework for Custom SoCs.IEEE Mi...

work page doi:10.1109/mm.2020.2996616 2020
[7]

2025.opencode

anomalyco. 2025.opencode. https://github.com/anomalyco/opencode

2025
[8]

Anonymous. 2025. ArchAgent: Agentic AI-driven Computer Architecture Discovery. OpenReview Anonymous Preprint. https://openreview.net/forum ?id=hcxN9l6zqZ Submission Number 714

2025
[9]

Anthropic. [n. d.].Claude Code. https://www.anthropic.com/claude-code
[10]

2015.airflow

apache. 2015.airflow. https://github.com/apache/airflow

2015
[11]

AWS. 2026. Amazon Bedrock. https://aws.amazon.com/bedrock/ Accessed: 2026-06-23

2026
[12]

AWS. 2026. S3. https://aws.amazon.com/s/idc-server-side-test/awswt-956-v2- template-s3/variant/ Accessed: 2026-06-23

2026
[13]

Yunsheng Bai, Ghaith Bany Hamad, Syed Suhaib, and Haoxing Ren. 2025. Asser- tionForge: Enhancing Formal Verification Assertion Generation with Structured Representation of Specifications and RTL. In2025 IEEE International Conference on LLM-Aided Design (ICLAD). 85–92. doi:10.1109/ICLAD65226.2025.00009

work page doi:10.1109/iclad65226.2025.00009 2025
[14]

Jonathan Balkind, Michael McKeown, Yaosheng Fu, Tri Nguyen, Yanqi Zhou, Alexey Lavrov, Mohammad Shahrad, Adi Fuchs, Samuel Payne, Xiaohua Liang, Matthew Matl, and David Wentzlaff. 2016. OpenPiton: An Open Source Many- core Research Framework. InProceedings of the Twenty-First International Con- ference on Architectural Support for Programming Languages an...

work page doi:10.1145/2872362.2872414 2016
[15]

Abhishek Bhandwaldar, Mihir Choudhury, Ruchir Puri, and Akash Srivastava
[16]

Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization? arXiv:2603.25719 [cs.AI] https: //arxiv.org/abs/2603.25719

Pith/arXiv arXiv
[17]

Reinhardt, Ali Saidi, Arkaprava Basu, Joel Hestness, Derek R

Nathan Binkert, Bradford Beckmann, Gabriel Black, Steven K. Reinhardt, Ali Saidi, Arkaprava Basu, Joel Hestness, Derek R. Hower, Tushar Krishna, Somayeh Sardashti, Rathijit Sen, Korey Sewell, Muhammad Shoaib, Nilay Vaish, Mark D. Hill, and David A. Wood. 2011. The gem5 simulator.SIGARCH Comput. Archit. News39, 2 (Aug. 2011), 1–7. doi:10.1145/2024716.2024718

work page doi:10.1145/2024716.2024718 2011
[18]

Black and J.P

B. Black and J.P. Shen. 1998. Calibration of microprocessor performance models. Computer31, 5 (1998), 59–65. doi:10.1109/2.675637

work page doi:10.1109/2.675637 1998
[19]

Alexander Blasberg, Vasilis Kypriotis, and Dimitrios Skarlatos. 2026. Agentic Architect: An Agentic AI Framework for Architecture Design Exploration and Optimization. arXiv:2604.25083 [cs.AI] https://arxiv.org/abs/2604.25083

Pith/arXiv arXiv 2026
[20]

Sriyash Caculo, Mahesh Madhav, and Jeff Baxter. 2025. Memory Access Vectors: Improving Sampling Fidelity for CPU Performance Simulations. arXiv:2506.02344 [cs.AR] https://arxiv.org/abs/2506.02344

arXiv 2025
[21]

Carlson, Wim Heirman, and Lieven Eeckhout

Trevor E. Carlson, Wim Heirman, and Lieven Eeckhout. 2011. Sniper: Exploring the level of abstraction for scalable and accurate parallel multi-core simulation. InSC ’11: Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis. 1–12. doi:10.1145/2063384.2063 454

work page doi:10.1145/2063384.2063 2011
[22]

2018.A Highly Productive Implementation of an Out-of- Order Processor Generator

Christopher Celio. 2018.A Highly Productive Implementation of an Out-of- Order Processor Generator. Ph. D. Dissertation. EECS Department, University of California, Berkeley. http://www2.eecs.berkeley.edu/Pubs/TechRpts/2018/EE CS-2018-151.html

2018
[23]

Mert Cemri, Shubham Agrawal, Akshat Gupta, Shu Liu, Audrey Cheng, Qiuyang Mang, Ashwin Naren, Lutfi Eren Erdogan, Koushik Sen, Matei Zaharia, Alex Dimakis, and Ion Stoica. 2026. AdaEvolve: Adaptive LLM Driven Zeroth-Order Optimization. arXiv:2602.20133 [cs.NE] https://arxiv.org/abs/2602.20133

arXiv 2026
[24]

Zhengrui Chen, Zixuan Song, Yu Li, Qi Sun, and Cheng Zhuo. 2026. FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA. arXiv:2603.25243 [cs.AR] https://arxiv.org/abs/2603.25243

arXiv 2026
[25]

Audrey Cheng, Shu Liu, Melissa Pan, Zhifei Li, Bowen Wang, Alex Krentsel, Tian Xia, Mert Cemri, Jongseok Park, Shuo Yang, Jeff Chen, Lakshya Agrawal, Aditya Desai, Jiarong Xing, Koushik Sen, Matei Zaharia, and Ion Stoica. 2025. Barbarians at the Gate: How AI is Upending Systems Research. (2025). arXiv:2510.06189 [cs.AI] https://arxiv.org/abs/2510.06189

arXiv 2025
[26]

CHIPS Alliance. 2026. riscv-dv: Random Instruction Generator for RISC-V Processor Verification. https://github.com/chipsalliance/riscv-dv. Accessed: 2026-06-09

2026
[27]

Clark, Vinay Vashishtha, Lucian Shifren, Aditya Gujja, Saurabh Sinha, Brian Cline, Chandarasekaran Ramamurthy, and Greg Yeric

Lawrence T. Clark, Vinay Vashishtha, Lucian Shifren, Aditya Gujja, Saurabh Sinha, Brian Cline, Chandarasekaran Ramamurthy, and Greg Yeric. 2016. ASAP7: A 7-nm finFET predictive process design kit.Microelectronics Journal53 (2016), 105–115. doi:10.1016/j.mejo.2016.04.006

work page doi:10.1016/j.mejo.2016.04.006 2016
[28]

Davis, Klaudiusz Rydzy, Srinivasan Ramesh, Aadit Nilay, Daniel Nichols, Swapna Raj, Nikhil Jain, and Abhinav Bhatele

Joshua H. Davis, Klaudiusz Rydzy, Srinivasan Ramesh, Aadit Nilay, Daniel Nichols, Swapna Raj, Nikhil Jain, and Abhinav Bhatele. 2026. KEET: Explaining Performance of GPU Kernels Using LLM Agents. arXiv:2605.04467 [cs.PF] https://arxiv.org/abs/2605.04467

Pith/arXiv arXiv 2026
[29]

Chenhui Deng, Zhongzhi Yu, Guan-Ting Liu, Nathaniel Pinckney, Brucek Khailany, and Haoxing Ren. 2026. ACE-RTL: When Agentic Context Evo- lution Meets RTL-Specialized LLMs. arXiv:2602.10218 [cs.AR] https://arxiv.or g/abs/2602.10218

arXiv 2026
[30]

Schuyler Eldridge, Prithayan Barua, Aliaksei Chapyzhenka, Adam Izraelevitz, Jack Koenig, Chris Lattner, Andrew Lenharth, George Leontiev, Fabian Schuiki, Ram Sunder, Andrew Young, and Richard Xia. 2021. MLIR as hardware compiler infrastructure. InWorkshop on Open-Source EDA Technology (WOSET), Vol. 3

2021
[31]

FireworksAI. 2026. FireworksAI. https://fireworks.ai/ Accessed: 2026-06-23

2026
[32]

Gaffney, Martin Prammer, Larry Brasfield, D

Kevin P. Gaffney, Martin Prammer, Larry Brasfield, D. Richard Hipp, Dan Kennedy, and Jignesh M. Patel. 2022. SQLite: past, present, and future.Proc. VLDB Endow.15, 12 (Aug. 2022), 3535–3547. doi:10.14778/3554821.3554842

work page doi:10.14778/3554821.3554842 2022
[33]

Jiahao Gai, Hao Chen, Zhican Wang, Hongyu Zhou, Wanru Zhao, Nicholas Lane, and Hongxiang Fan. 2025. Exploring Code Language Models for Automated HLS-based Hardware Generation: Benchmark, Infrastructure and Analysis. In Proceedings of the 30th Asia and South Pacific Design Automation Conference (Tokyo, Japan)(ASPDAC ’25). Association for Computing Machiner...

work page doi:10.1145/3658617.3697616 2025
[34]

Gansner and Stephen C

Emden R. Gansner and Stephen C. North. 2000. An open graph visualization system and its applications to software engineering.Softw. Pract. Exper.30, 11 (Sept. 2000), 1203–1233

2000
[35]

Bogdan Georgiev, Javier Gómez-Serrano, Terence Tao, and Adam Zsolt Wag- ner. 2025. Mathematical exploration and discovery at scale.arXiv preprint arXiv:2511.02864(2025)

Pith/arXiv arXiv 2025
[36]

Michael Gerstenhaber and Michael Bachman. 2026. Introducing Gemini Enter- prise Agent Platform, powering the next wave of agents. https://cloud.goog le.com/blog/products/ai-machine-learning/introducing-gemini-enterprise- agent-platform

2026
[37]

Github. [n. d.]. GitHub REST API documentation. https://docs.github.com/en/r est Accessed: 2026-06-23

2026
[38]

Gratz, Daniel A

Nathan Gober, Gino Chacon, Lei Wang, Paul V. Gratz, Daniel A. Jimenez, Elvira Teran, Seth Pugsley, and Jinchun Kim. 2022. The Championship Simulator: Ar- chitectural Simulation for Education and Competition. arXiv:2210.14324 [cs.AR] https://arxiv.org/abs/2210.14324

arXiv 2022
[39]

Google. 2026. Google Antigravity. https://antigravity.google/ Accessed: 2026-06-23

2026
[40]

2014.google-cloud-python

Google googleapis. 2014.google-cloud-python. https://github.com/googleapis/ google-cloud-python

2014
[41]

Groq. 2026. Groq. https://groq.com/ Accessed: 2026-06-23

2026
[42]

Ce Guo and Tong Zhao. 2025. ResBench: A Resource-Aware Benchmark for LLM- Generated FPGA Designs. InProceedings of the 15th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies (HEART ’25). Association for Computing Machinery, New York, NY, USA, 25–34. doi:10.114 5/3728179.3728192

arXiv 2025
[43]

Raghav Gupta, Akanksha Jain, Abraham Gonzalez, Alexander Novikov, Po-Sen Huang, Matej Balog, Marvin Eisenberger, Sergey Shirobokov, Ngân V˜u, Martin Dixon, Borivoje Nikolić, Parthasarathy Ranganathan, and Sagar Karandikar
[44]

arXiv:2602.22425 [cs.AI] https://arxiv.org/abs/2602.22425

ArchAgent: Agentic AI-driven Computer Architecture Discovery. arXiv:2602.22425 [cs.AI] https://arxiv.org/abs/2602.22425

arXiv
[45]

Zhuolun He, Haoyuan Wu, Xinyun Zhang, Xufeng Yao, Su Zheng, Haisheng Zheng, and Bei Yu. 2023. ChatEDA: A Large Language Model Powered Au- tonomous Agent for EDA. In2023 ACM/IEEE 5th Workshop on Machine Learning for CAD (MLCAD). 1–6. doi:10.1109/MLCAD58807.2023.10299852

work page doi:10.1109/mlcad58807.2023.10299852 2023
[46]

Charles Hong, Sahil Bhatia, Alvin Cheung, and Yakun Sophia Shao. 2025. Au- tocomp: A Powerful and Portable Code Optimizer for Tensor Accelerators. arXiv:2505.18574 [cs.PL] https://arxiv.org/abs/2505.18574

arXiv 2025
[47]

Izraelevitz, J

A. Izraelevitz, J. Koenig, P. Li, R. Lin, A. Wang, A. Magyar, D. Kim, C. Schmidt, C. Markley, J. Lawson, and J. Bachrach. 2017. Reusability is FIRRTL ground: Hardware construction languages, compiler frameworks, and transformations. In2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD). 209–216. doi:10.1109/ICCAD.2017.8203780

work page doi:10.1109/iccad.2017.8203780 2017
[48]

Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, and Karthik Narasimhan

Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, and Karthik Narasimhan. 2024. SWE-bench: Can Language Models Resolve Real-World GitHub Issues? arXiv:2310.06770 [cs.CL] https://arxiv.org/ abs/2310.06770

Pith/arXiv arXiv 2024
[49]

Sagar Karandikar, Howard Mao, Donggyu Kim, David Biancolin, Alon Amid, Dayeol Lee, Nathan Pemberton, Emmanuel Amaro, Colin Schmidt, Aditya Chopra, Qijing Huang, Kyle Kovacs, Borivoje Nikolic, Randy Katz, Jonathan Bachrach, and Krste Asanović. 2018. FireSim: FPGA-accelerated Cycle-exact Scale-out System Simulation in the Public Cloud. InProceedings of the ...

work page doi:10.1109/isca.2018.00014 2018
[50]

Steve Kosier. 2023. SKY’s the Limit with the SKY130 Open-Source PDK. https: //www.skywatertechnology.com/sky130-open-source-pdk/ Accessed: 2026-06-23

2023
[51]

Srivatsan Krishnan, Amir Yazdanbakhsh, Shvetank Prakash, Jason Jabbour, Ikechukwu Uchendu, Susobhan Ghosh, Behzad Boroujerdian, Daniel Richins, 17 Cui*, Hermida-Rivera*, Toubes*, et. al. Devashree Tripathy, Aleksandra Faust, and Vijay Janapa Reddi. 2023. ArchGym: An Open-Source Gymnasium for Machine Learning Assisted Architecture De- sign. InProceedings o...

work page doi:10.1145/3579371.3589049 2023
[52]

Gonzalez, Hao Zhang, and Ion Stoica

Woosuk Kwon, Zhuohan Li, Siyuan Zhuang, Ying Sheng, Lianmin Zheng, Cody Hao Yu, Joseph E. Gonzalez, Hao Zhang, and Ion Stoica. 2023. Efficient Memory Management for Large Language Model Serving with PagedAttention. arXiv:2309.06180 [cs.LG] https://arxiv.org/abs/2309.06180

Pith/arXiv arXiv 2023
[53]

2023.langgraph

langchain ai. 2023.langgraph. https://github.com/langchain-ai/langgraph

2023
[54]

2012.riscv-torture

Yunsup Lee and Henry Cook. 2012.riscv-torture. https://github.com/ucb- bar/riscv-torture

2012
[55]

Li, Adam M

Patrick S. Li, Adam M. Izraelevitz, and Jonathan Bachrach. 2016.Specification for the FIRRTL Language. Technical Report UCB/EECS-2016-9. EECS Department, University of California, Berkeley. http://www2.eecs.berkeley.edu/Pubs/Tech Rpts/2016/EECS-2016-9.html

2016
[56]

Xingquan Li, Simin Tao, Zengrong Huang, Shijian Chen, Zhisheng Zeng, Liwei Ni, Zhipeng Huang, Chunan Zhuang, Hongxi Wu, Weiguo Li1, Xueyan Zhao, He Liu, Shuaiying Long, Wei He, Bojun Liu, Sifeng Gan, Zihao Yu, Tong Liu, Yuchi Miao, Zhiyuan Yan, Hao Wang, Jie Zhao, Yifan Li, Ruizhi Liu, Xiaoze Lin, Bo Yang, Zhen Xue, Fuxing Huang, Zonglin Yang, Zhenggang W...

arXiv 2023
[57]

Harrison Liew, Daniel Grubb, John Wright, Colin Schmidt, Nayiri Krzysztofow- icz, Adam Izraelevitz, Edward Wang, Krste Asanović, Jonathan Bachrach, and Borivoje Nikolić. 2022. Hammer: a modular and reusable physical design flow tool: invited. InProceedings of the 59th ACM/IEEE Design Automation Conference (San Francisco, California)(DAC ’22). Association ...

work page doi:10.1145/3489517.3530672 2022
[58]

Pan, Alexander Du, Kurt Keutzer, Alvin Cheung, Alexandros G

Shu Liu, Shubham Agarwal, Monishwaran Maheswaran, Mert Cemri, Zhifei Li, Qiuyang Mang, Ashwin Naren, Ethan Boneh, Audrey Cheng, Melissa Z. Pan, Alexander Du, Kurt Keutzer, Alvin Cheung, Alexandros G. Dimakis, Koushik Sen, Matei Zaharia, and Ion Stoica. 2026. EvoX: Meta-Evolution for Automated Discovery. arXiv:2602.23413 [cs.LG] https://arxiv.org/abs/2602.23413

arXiv 2026
[59]

Dimakis, and Ion Stoica

Shu Liu, Mert Cemri, Shubham Agarwal, Alexander Krentsel, Ashwin Naren, Qiuyang Mang, Zhifei Li, Akshat Gupta, Monishwaran Maheswaran, Au- drey Cheng, Melissa Pan, Ethan Boneh, Kannan Ramchandran, Koushik Sen, Matei Zaharia, Alexandros G. Dimakis, and Ion Stoica. 2026.SkyDiscover: A Flexible, Adaptive Framework for AI-Driven Scientific and Algorithmic Dis...

work page doi:10.1145/3786335.3813221 2026
[60]

Shang Liu, Wenji Fang, Yao Lu, Qijun Zhang, Hongce Zhang, and Zhiyao Xie
[61]

In2024 IEEE LLM Aided Design Workshop (LAD)

RTLCoder: Outperforming GPT-3.5 in Design RTL Generation with Our Open-Source Dataset and Lightweight Solution. In2024 IEEE LLM Aided Design Workshop (LAD). IEEE, 1–5. doi:10.1109/lad62341.2024.10691788

work page doi:10.1109/lad62341.2024.10691788 2024
[62]

LLVM. 2026. LLVM AI Tool Use Policy. llvm.org. https://llvm.org/docs/AITool Policy.html Accessed: 2026-06-19

2026
[63]

Jason Lowe-Power, Abdul Mutaal Ahmad, Ayaz Akram, Mohammad Alian, Rico Amslinger, Matteo Andreozzi, Adrià Armejach, Nils Asmussen, Brad Beckmann, Srikant Bharadwaj, Gabe Black, Gedare Bloom, Bobby R. Bruce, Daniel Rodrigues Carvalho, Jeronimo Castrillon, Lizhong Chen, Nicolas Deru- migny, Stephan Diestelhorst, Wendy Elsasser, Carlos Escuin, Marjan Faribor...

arXiv 2020
[64]

Yao Lu, Shang Liu, Qijun Zhang, and Zhiyao Xie. 2024. RTLLM: An Open-Source Benchmark for Design RTL Generation with Large Language Model. In2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC). 722–727. doi:10.1109/ASP-DAC58780.2024.10473904

work page doi:10.1109/asp-dac58780.2024.10473904 2024
[65]

Yi-Chen Lu, Hao-Hsiang Hsiao, and Haoxing Ren. 2025. Invited Paper: LLM- Enhanced GPU-Optimized Physical Design at Scale. In2025 IEEE/ACM Interna- tional Conference On Computer Aided Design (ICCAD). 1–7. doi:10.1109/ICCA D66269.2025.11240986

work page doi:10.1109/icca 2025
[66]

Cota, Michele Petracca, Christian Pilato, and Luca P

Paolo Mantovani, Davide Giri, Giuseppe Di Guglielmo, Luca Piccolboni, Joseph Zuckerman, Emilio G. Cota, Michele Petracca, Christian Pilato, and Luca P. Carloni. 2020. Agile SoC development with open ESP. InProceedings of the 39th International Conference on Computer-Aided Design(Virtual Event, USA) (ICCAD ’20). Association for Computing Machinery, New Yor...

work page doi:10.1145/3400302.3415753 2020
[67]

Dirk Merkel. 2014. Docker: lightweight Linux containers for consistent devel- opment and deployment.Linux J.2014, 239, Article 2 (March 2014)

2014
[68]

Kaushal Mhapsekar, Azam Ghanbari, Bita Aslrousta, and Samira Mirbagher- Ajorpaz. 2026. CacheMind: From Miss Rates to Why - Natural-Language, Trace- Grounded Reasoning for Cache Replacement. InProceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2(USA)(ASPLOS ’26). Association...

work page doi:10.1145/3779212.3790136 2026
[69]

2025.agent-framework

Microsoft. 2025.agent-framework. https://github.com/microsoft/agent- framework

2025
[70]

Jordan, and Ion Stoica

Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, Richard Liaw, Eric Liang, Melih Elibol, Zongheng Yang, William Paul, Michael I. Jordan, and Ion Stoica. 2018. Ray: a distributed framework for emerging AI applications. InProceedings of the 13th USENIX Conference on Operating Systems Design and Implementation(Carlsbad, CA, USA)(OSDI’18). US...

2018
[71]

Alexander Novikov, Ngân V˜u, Marvin Eisenberger, Emilien Dupont, Po-Sen Huang, Adam Zsolt Wagner, Sergey Shirobokov, Borislav Kozlovskii, Francisco J. R. Ruiz, Abbas Mehrabian, M. Pawan Kumar, Abigail See, Swarat Chaud- huri, George Holland, Alex Davies, Sebastian Nowozin, Pushmeet Kohli, and Matej Balog. 2025. AlphaEvolve: A coding agent for scientific a...

Pith/arXiv arXiv 2025
[72]

2015.Microbench

Tony Nowatzki. 2015.Microbench. https://github.com/VerticalResearchGroup /microbench

2015
[73]

Surim Oh, Mingsheng Xu, Tanvir Ahmed Khan, Baris Kasikci, and Heiner Litz
[74]

InProceedings of the 51st International Symposium on Computer Architecture (ISCA) (ISCA 2024)

UDP: Utility-Driven Fetch Directed Instruction Prefetching. InProceedings of the 51st International Symposium on Computer Architecture (ISCA) (ISCA 2024)

2024
[75]

2023.ollama

ollama. 2023.ollama. https://github.com/ollama/ollama

2023
[76]

OpenAI. 2026. Codex. https://chatgpt.com/codex/ Accessed: 2026-06-23

2026
[77]

2026.openclaw

openclaw. 2026.openclaw. https://github.com/openclaw/openclaw

2026
[78]

OpenRouter. 2026. OpenRouter. https://openrouter.ai/ Accessed: 2026-06-23

2026
[79]

2013.openssl

openssl. 2013.openssl. https://github.com/openssl/openssl

2013
[80]

Anne Ouyang, Simon Guo, Simran Arora, Alex L Zhang, William Hu, Christo- pher Re, and Azalia Mirhoseini. 2025. KernelBench: Can LLMs Write Efficient GPU Kernels?. InForty-second International Conference on Machine Learning. https://openreview.net/forum?id=yeoN1iQT1x

2025

Showing first 80 references.

[1] [1]

SPEC CPU 2006 benchmark

2006. SPEC CPU 2006 benchmark. spec.org. https://www.spec.org/cpu2006/

2006

[2] [2]

Murray, Benoit Steiner, Paul Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng

Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, Man- junath Kudlur, Josh Levenberg, Rajat Monga, Sherry Moore, Derek G. Murray, Benoit Steiner, Paul Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: A system f...

Pith/arXiv arXiv 2016

[3] [3]

Stefan Abi-Karam and Cong Hao. 2025. HLS-Eval: A Benchmark and Frame- work for Evaluating LLMs on High-Level Synthesis Design Tasks. In2025 IEEE International Conference on LLM-Aided Design (ICLAD). 219–226. doi:10.1109/ ICLAD65226.2025.00021

arXiv 2025

[4] [4]

Dimakis, Ion Stoica, Dan Klein, Matei Zaharia, and Omar Khattab

Lakshya A Agrawal, Shangyin Tan, Dilara Soylu, Noah Ziems, Rishi Khare, Krista Opsahl-Ong, Arnav Singhvi, Herumb Shandilya, Michael J Ryan, Meng Jiang, Christopher Potts, Koushik Sen, Alexandros G. Dimakis, Ion Stoica, Dan Klein, Matei Zaharia, and Omar Khattab. 2026. GEPA: Reflective Prompt Evo- lution Can Outperform Reinforcement Learning. arXiv:2507.19...

Pith/arXiv arXiv 2026

[5] [5]

Elisavet Lydia Alvanaki, Kevin Lee, and Luca P. Carloni. 2025. SLDB: An End-To- End Heterogeneous System-on-Chip Benchmark Suite for LLM-Aided Design. arXiv:2507.06376 [cs.AR] https://arxiv.org/abs/2507.06376

arXiv 2025

[6] [6]

Alon Amid, David Biancolin, Abraham Gonzalez, Daniel Grubb, Sagar Karandikar, Harrison Liew, Albert Magyar, Howard Mao, Albert Ou, Nathan Pemberton, Paul Rigge, Colin Schmidt, John Wright, Jerry Zhao, Yakun Sophia Shao, Krste Asanović, and Borivoje Nikolić. 2020. Chipyard: Integrated Design, Simulation, and Implementation Framework for Custom SoCs.IEEE Mi...

work page doi:10.1109/mm.2020.2996616 2020

[7] [7]

2025.opencode

anomalyco. 2025.opencode. https://github.com/anomalyco/opencode

2025

[8] [8]

Anonymous. 2025. ArchAgent: Agentic AI-driven Computer Architecture Discovery. OpenReview Anonymous Preprint. https://openreview.net/forum ?id=hcxN9l6zqZ Submission Number 714

2025

[9] [9]

Anthropic. [n. d.].Claude Code. https://www.anthropic.com/claude-code

[10] [10]

2015.airflow

apache. 2015.airflow. https://github.com/apache/airflow

2015

[11] [11]

AWS. 2026. Amazon Bedrock. https://aws.amazon.com/bedrock/ Accessed: 2026-06-23

2026

[12] [12]

AWS. 2026. S3. https://aws.amazon.com/s/idc-server-side-test/awswt-956-v2- template-s3/variant/ Accessed: 2026-06-23

2026

[13] [13]

Yunsheng Bai, Ghaith Bany Hamad, Syed Suhaib, and Haoxing Ren. 2025. Asser- tionForge: Enhancing Formal Verification Assertion Generation with Structured Representation of Specifications and RTL. In2025 IEEE International Conference on LLM-Aided Design (ICLAD). 85–92. doi:10.1109/ICLAD65226.2025.00009

work page doi:10.1109/iclad65226.2025.00009 2025

[14] [14]

Jonathan Balkind, Michael McKeown, Yaosheng Fu, Tri Nguyen, Yanqi Zhou, Alexey Lavrov, Mohammad Shahrad, Adi Fuchs, Samuel Payne, Xiaohua Liang, Matthew Matl, and David Wentzlaff. 2016. OpenPiton: An Open Source Many- core Research Framework. InProceedings of the Twenty-First International Con- ference on Architectural Support for Programming Languages an...

work page doi:10.1145/2872362.2872414 2016

[15] [15]

Abhishek Bhandwaldar, Mihir Choudhury, Ruchir Puri, and Akash Srivastava

[16] [16]

Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization? arXiv:2603.25719 [cs.AI] https: //arxiv.org/abs/2603.25719

Pith/arXiv arXiv

[17] [17]

Reinhardt, Ali Saidi, Arkaprava Basu, Joel Hestness, Derek R

Nathan Binkert, Bradford Beckmann, Gabriel Black, Steven K. Reinhardt, Ali Saidi, Arkaprava Basu, Joel Hestness, Derek R. Hower, Tushar Krishna, Somayeh Sardashti, Rathijit Sen, Korey Sewell, Muhammad Shoaib, Nilay Vaish, Mark D. Hill, and David A. Wood. 2011. The gem5 simulator.SIGARCH Comput. Archit. News39, 2 (Aug. 2011), 1–7. doi:10.1145/2024716.2024718

work page doi:10.1145/2024716.2024718 2011

[18] [18]

Black and J.P

B. Black and J.P. Shen. 1998. Calibration of microprocessor performance models. Computer31, 5 (1998), 59–65. doi:10.1109/2.675637

work page doi:10.1109/2.675637 1998

[19] [19]

Alexander Blasberg, Vasilis Kypriotis, and Dimitrios Skarlatos. 2026. Agentic Architect: An Agentic AI Framework for Architecture Design Exploration and Optimization. arXiv:2604.25083 [cs.AI] https://arxiv.org/abs/2604.25083

Pith/arXiv arXiv 2026

[20] [20]

Sriyash Caculo, Mahesh Madhav, and Jeff Baxter. 2025. Memory Access Vectors: Improving Sampling Fidelity for CPU Performance Simulations. arXiv:2506.02344 [cs.AR] https://arxiv.org/abs/2506.02344

arXiv 2025

[21] [21]

Carlson, Wim Heirman, and Lieven Eeckhout

Trevor E. Carlson, Wim Heirman, and Lieven Eeckhout. 2011. Sniper: Exploring the level of abstraction for scalable and accurate parallel multi-core simulation. InSC ’11: Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis. 1–12. doi:10.1145/2063384.2063 454

work page doi:10.1145/2063384.2063 2011

[22] [22]

2018.A Highly Productive Implementation of an Out-of- Order Processor Generator

Christopher Celio. 2018.A Highly Productive Implementation of an Out-of- Order Processor Generator. Ph. D. Dissertation. EECS Department, University of California, Berkeley. http://www2.eecs.berkeley.edu/Pubs/TechRpts/2018/EE CS-2018-151.html

2018

[23] [23]

Mert Cemri, Shubham Agrawal, Akshat Gupta, Shu Liu, Audrey Cheng, Qiuyang Mang, Ashwin Naren, Lutfi Eren Erdogan, Koushik Sen, Matei Zaharia, Alex Dimakis, and Ion Stoica. 2026. AdaEvolve: Adaptive LLM Driven Zeroth-Order Optimization. arXiv:2602.20133 [cs.NE] https://arxiv.org/abs/2602.20133

arXiv 2026

[24] [24]

Zhengrui Chen, Zixuan Song, Yu Li, Qi Sun, and Cheng Zhuo. 2026. FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA. arXiv:2603.25243 [cs.AR] https://arxiv.org/abs/2603.25243

arXiv 2026

[25] [25]

Audrey Cheng, Shu Liu, Melissa Pan, Zhifei Li, Bowen Wang, Alex Krentsel, Tian Xia, Mert Cemri, Jongseok Park, Shuo Yang, Jeff Chen, Lakshya Agrawal, Aditya Desai, Jiarong Xing, Koushik Sen, Matei Zaharia, and Ion Stoica. 2025. Barbarians at the Gate: How AI is Upending Systems Research. (2025). arXiv:2510.06189 [cs.AI] https://arxiv.org/abs/2510.06189

arXiv 2025

[26] [26]

CHIPS Alliance. 2026. riscv-dv: Random Instruction Generator for RISC-V Processor Verification. https://github.com/chipsalliance/riscv-dv. Accessed: 2026-06-09

2026

[27] [27]

Clark, Vinay Vashishtha, Lucian Shifren, Aditya Gujja, Saurabh Sinha, Brian Cline, Chandarasekaran Ramamurthy, and Greg Yeric

Lawrence T. Clark, Vinay Vashishtha, Lucian Shifren, Aditya Gujja, Saurabh Sinha, Brian Cline, Chandarasekaran Ramamurthy, and Greg Yeric. 2016. ASAP7: A 7-nm finFET predictive process design kit.Microelectronics Journal53 (2016), 105–115. doi:10.1016/j.mejo.2016.04.006

work page doi:10.1016/j.mejo.2016.04.006 2016

[28] [28]

Davis, Klaudiusz Rydzy, Srinivasan Ramesh, Aadit Nilay, Daniel Nichols, Swapna Raj, Nikhil Jain, and Abhinav Bhatele

Joshua H. Davis, Klaudiusz Rydzy, Srinivasan Ramesh, Aadit Nilay, Daniel Nichols, Swapna Raj, Nikhil Jain, and Abhinav Bhatele. 2026. KEET: Explaining Performance of GPU Kernels Using LLM Agents. arXiv:2605.04467 [cs.PF] https://arxiv.org/abs/2605.04467

Pith/arXiv arXiv 2026

[29] [29]

Chenhui Deng, Zhongzhi Yu, Guan-Ting Liu, Nathaniel Pinckney, Brucek Khailany, and Haoxing Ren. 2026. ACE-RTL: When Agentic Context Evo- lution Meets RTL-Specialized LLMs. arXiv:2602.10218 [cs.AR] https://arxiv.or g/abs/2602.10218

arXiv 2026

[30] [30]

Schuyler Eldridge, Prithayan Barua, Aliaksei Chapyzhenka, Adam Izraelevitz, Jack Koenig, Chris Lattner, Andrew Lenharth, George Leontiev, Fabian Schuiki, Ram Sunder, Andrew Young, and Richard Xia. 2021. MLIR as hardware compiler infrastructure. InWorkshop on Open-Source EDA Technology (WOSET), Vol. 3

2021

[31] [31]

FireworksAI. 2026. FireworksAI. https://fireworks.ai/ Accessed: 2026-06-23

2026

[32] [32]

Gaffney, Martin Prammer, Larry Brasfield, D

Kevin P. Gaffney, Martin Prammer, Larry Brasfield, D. Richard Hipp, Dan Kennedy, and Jignesh M. Patel. 2022. SQLite: past, present, and future.Proc. VLDB Endow.15, 12 (Aug. 2022), 3535–3547. doi:10.14778/3554821.3554842

work page doi:10.14778/3554821.3554842 2022

[33] [33]

Jiahao Gai, Hao Chen, Zhican Wang, Hongyu Zhou, Wanru Zhao, Nicholas Lane, and Hongxiang Fan. 2025. Exploring Code Language Models for Automated HLS-based Hardware Generation: Benchmark, Infrastructure and Analysis. In Proceedings of the 30th Asia and South Pacific Design Automation Conference (Tokyo, Japan)(ASPDAC ’25). Association for Computing Machiner...

work page doi:10.1145/3658617.3697616 2025

[34] [34]

Gansner and Stephen C

Emden R. Gansner and Stephen C. North. 2000. An open graph visualization system and its applications to software engineering.Softw. Pract. Exper.30, 11 (Sept. 2000), 1203–1233

2000

[35] [35]

Bogdan Georgiev, Javier Gómez-Serrano, Terence Tao, and Adam Zsolt Wag- ner. 2025. Mathematical exploration and discovery at scale.arXiv preprint arXiv:2511.02864(2025)

Pith/arXiv arXiv 2025

[36] [36]

Michael Gerstenhaber and Michael Bachman. 2026. Introducing Gemini Enter- prise Agent Platform, powering the next wave of agents. https://cloud.goog le.com/blog/products/ai-machine-learning/introducing-gemini-enterprise- agent-platform

2026

[37] [37]

Github. [n. d.]. GitHub REST API documentation. https://docs.github.com/en/r est Accessed: 2026-06-23

2026

[38] [38]

Gratz, Daniel A

Nathan Gober, Gino Chacon, Lei Wang, Paul V. Gratz, Daniel A. Jimenez, Elvira Teran, Seth Pugsley, and Jinchun Kim. 2022. The Championship Simulator: Ar- chitectural Simulation for Education and Competition. arXiv:2210.14324 [cs.AR] https://arxiv.org/abs/2210.14324

arXiv 2022

[39] [39]

Google. 2026. Google Antigravity. https://antigravity.google/ Accessed: 2026-06-23

2026

[40] [40]

2014.google-cloud-python

Google googleapis. 2014.google-cloud-python. https://github.com/googleapis/ google-cloud-python

2014

[41] [41]

Groq. 2026. Groq. https://groq.com/ Accessed: 2026-06-23

2026

[42] [42]

Ce Guo and Tong Zhao. 2025. ResBench: A Resource-Aware Benchmark for LLM- Generated FPGA Designs. InProceedings of the 15th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies (HEART ’25). Association for Computing Machinery, New York, NY, USA, 25–34. doi:10.114 5/3728179.3728192

arXiv 2025

[43] [43]

Raghav Gupta, Akanksha Jain, Abraham Gonzalez, Alexander Novikov, Po-Sen Huang, Matej Balog, Marvin Eisenberger, Sergey Shirobokov, Ngân V˜u, Martin Dixon, Borivoje Nikolić, Parthasarathy Ranganathan, and Sagar Karandikar

[44] [44]

arXiv:2602.22425 [cs.AI] https://arxiv.org/abs/2602.22425

ArchAgent: Agentic AI-driven Computer Architecture Discovery. arXiv:2602.22425 [cs.AI] https://arxiv.org/abs/2602.22425

arXiv

[45] [45]

Zhuolun He, Haoyuan Wu, Xinyun Zhang, Xufeng Yao, Su Zheng, Haisheng Zheng, and Bei Yu. 2023. ChatEDA: A Large Language Model Powered Au- tonomous Agent for EDA. In2023 ACM/IEEE 5th Workshop on Machine Learning for CAD (MLCAD). 1–6. doi:10.1109/MLCAD58807.2023.10299852

work page doi:10.1109/mlcad58807.2023.10299852 2023

[46] [46]

Charles Hong, Sahil Bhatia, Alvin Cheung, and Yakun Sophia Shao. 2025. Au- tocomp: A Powerful and Portable Code Optimizer for Tensor Accelerators. arXiv:2505.18574 [cs.PL] https://arxiv.org/abs/2505.18574

arXiv 2025

[47] [47]

Izraelevitz, J

A. Izraelevitz, J. Koenig, P. Li, R. Lin, A. Wang, A. Magyar, D. Kim, C. Schmidt, C. Markley, J. Lawson, and J. Bachrach. 2017. Reusability is FIRRTL ground: Hardware construction languages, compiler frameworks, and transformations. In2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD). 209–216. doi:10.1109/ICCAD.2017.8203780

work page doi:10.1109/iccad.2017.8203780 2017

[48] [48]

Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, and Karthik Narasimhan

Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, and Karthik Narasimhan. 2024. SWE-bench: Can Language Models Resolve Real-World GitHub Issues? arXiv:2310.06770 [cs.CL] https://arxiv.org/ abs/2310.06770

Pith/arXiv arXiv 2024

[49] [49]

Sagar Karandikar, Howard Mao, Donggyu Kim, David Biancolin, Alon Amid, Dayeol Lee, Nathan Pemberton, Emmanuel Amaro, Colin Schmidt, Aditya Chopra, Qijing Huang, Kyle Kovacs, Borivoje Nikolic, Randy Katz, Jonathan Bachrach, and Krste Asanović. 2018. FireSim: FPGA-accelerated Cycle-exact Scale-out System Simulation in the Public Cloud. InProceedings of the ...

work page doi:10.1109/isca.2018.00014 2018

[50] [50]

Steve Kosier. 2023. SKY’s the Limit with the SKY130 Open-Source PDK. https: //www.skywatertechnology.com/sky130-open-source-pdk/ Accessed: 2026-06-23

2023

[51] [51]

Srivatsan Krishnan, Amir Yazdanbakhsh, Shvetank Prakash, Jason Jabbour, Ikechukwu Uchendu, Susobhan Ghosh, Behzad Boroujerdian, Daniel Richins, 17 Cui*, Hermida-Rivera*, Toubes*, et. al. Devashree Tripathy, Aleksandra Faust, and Vijay Janapa Reddi. 2023. ArchGym: An Open-Source Gymnasium for Machine Learning Assisted Architecture De- sign. InProceedings o...

work page doi:10.1145/3579371.3589049 2023

[52] [52]

Gonzalez, Hao Zhang, and Ion Stoica

Woosuk Kwon, Zhuohan Li, Siyuan Zhuang, Ying Sheng, Lianmin Zheng, Cody Hao Yu, Joseph E. Gonzalez, Hao Zhang, and Ion Stoica. 2023. Efficient Memory Management for Large Language Model Serving with PagedAttention. arXiv:2309.06180 [cs.LG] https://arxiv.org/abs/2309.06180

Pith/arXiv arXiv 2023

[53] [53]

2023.langgraph

langchain ai. 2023.langgraph. https://github.com/langchain-ai/langgraph

2023

[54] [54]

2012.riscv-torture

Yunsup Lee and Henry Cook. 2012.riscv-torture. https://github.com/ucb- bar/riscv-torture

2012

[55] [55]

Li, Adam M

Patrick S. Li, Adam M. Izraelevitz, and Jonathan Bachrach. 2016.Specification for the FIRRTL Language. Technical Report UCB/EECS-2016-9. EECS Department, University of California, Berkeley. http://www2.eecs.berkeley.edu/Pubs/Tech Rpts/2016/EECS-2016-9.html

2016

[56] [56]

Xingquan Li, Simin Tao, Zengrong Huang, Shijian Chen, Zhisheng Zeng, Liwei Ni, Zhipeng Huang, Chunan Zhuang, Hongxi Wu, Weiguo Li1, Xueyan Zhao, He Liu, Shuaiying Long, Wei He, Bojun Liu, Sifeng Gan, Zihao Yu, Tong Liu, Yuchi Miao, Zhiyuan Yan, Hao Wang, Jie Zhao, Yifan Li, Ruizhi Liu, Xiaoze Lin, Bo Yang, Zhen Xue, Fuxing Huang, Zonglin Yang, Zhenggang W...

arXiv 2023

[57] [57]

Harrison Liew, Daniel Grubb, John Wright, Colin Schmidt, Nayiri Krzysztofow- icz, Adam Izraelevitz, Edward Wang, Krste Asanović, Jonathan Bachrach, and Borivoje Nikolić. 2022. Hammer: a modular and reusable physical design flow tool: invited. InProceedings of the 59th ACM/IEEE Design Automation Conference (San Francisco, California)(DAC ’22). Association ...

work page doi:10.1145/3489517.3530672 2022

[58] [58]

Pan, Alexander Du, Kurt Keutzer, Alvin Cheung, Alexandros G

Shu Liu, Shubham Agarwal, Monishwaran Maheswaran, Mert Cemri, Zhifei Li, Qiuyang Mang, Ashwin Naren, Ethan Boneh, Audrey Cheng, Melissa Z. Pan, Alexander Du, Kurt Keutzer, Alvin Cheung, Alexandros G. Dimakis, Koushik Sen, Matei Zaharia, and Ion Stoica. 2026. EvoX: Meta-Evolution for Automated Discovery. arXiv:2602.23413 [cs.LG] https://arxiv.org/abs/2602.23413

arXiv 2026

[59] [59]

Dimakis, and Ion Stoica

Shu Liu, Mert Cemri, Shubham Agarwal, Alexander Krentsel, Ashwin Naren, Qiuyang Mang, Zhifei Li, Akshat Gupta, Monishwaran Maheswaran, Au- drey Cheng, Melissa Pan, Ethan Boneh, Kannan Ramchandran, Koushik Sen, Matei Zaharia, Alexandros G. Dimakis, and Ion Stoica. 2026.SkyDiscover: A Flexible, Adaptive Framework for AI-Driven Scientific and Algorithmic Dis...

work page doi:10.1145/3786335.3813221 2026

[60] [60]

Shang Liu, Wenji Fang, Yao Lu, Qijun Zhang, Hongce Zhang, and Zhiyao Xie

[61] [61]

In2024 IEEE LLM Aided Design Workshop (LAD)

RTLCoder: Outperforming GPT-3.5 in Design RTL Generation with Our Open-Source Dataset and Lightweight Solution. In2024 IEEE LLM Aided Design Workshop (LAD). IEEE, 1–5. doi:10.1109/lad62341.2024.10691788

work page doi:10.1109/lad62341.2024.10691788 2024

[62] [62]

LLVM. 2026. LLVM AI Tool Use Policy. llvm.org. https://llvm.org/docs/AITool Policy.html Accessed: 2026-06-19

2026

[63] [63]

Jason Lowe-Power, Abdul Mutaal Ahmad, Ayaz Akram, Mohammad Alian, Rico Amslinger, Matteo Andreozzi, Adrià Armejach, Nils Asmussen, Brad Beckmann, Srikant Bharadwaj, Gabe Black, Gedare Bloom, Bobby R. Bruce, Daniel Rodrigues Carvalho, Jeronimo Castrillon, Lizhong Chen, Nicolas Deru- migny, Stephan Diestelhorst, Wendy Elsasser, Carlos Escuin, Marjan Faribor...

arXiv 2020

[64] [64]

Yao Lu, Shang Liu, Qijun Zhang, and Zhiyao Xie. 2024. RTLLM: An Open-Source Benchmark for Design RTL Generation with Large Language Model. In2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC). 722–727. doi:10.1109/ASP-DAC58780.2024.10473904

work page doi:10.1109/asp-dac58780.2024.10473904 2024

[65] [65]

Yi-Chen Lu, Hao-Hsiang Hsiao, and Haoxing Ren. 2025. Invited Paper: LLM- Enhanced GPU-Optimized Physical Design at Scale. In2025 IEEE/ACM Interna- tional Conference On Computer Aided Design (ICCAD). 1–7. doi:10.1109/ICCA D66269.2025.11240986

work page doi:10.1109/icca 2025

[66] [66]

Cota, Michele Petracca, Christian Pilato, and Luca P

Paolo Mantovani, Davide Giri, Giuseppe Di Guglielmo, Luca Piccolboni, Joseph Zuckerman, Emilio G. Cota, Michele Petracca, Christian Pilato, and Luca P. Carloni. 2020. Agile SoC development with open ESP. InProceedings of the 39th International Conference on Computer-Aided Design(Virtual Event, USA) (ICCAD ’20). Association for Computing Machinery, New Yor...

work page doi:10.1145/3400302.3415753 2020

[67] [67]

Dirk Merkel. 2014. Docker: lightweight Linux containers for consistent devel- opment and deployment.Linux J.2014, 239, Article 2 (March 2014)

2014

[68] [68]

Kaushal Mhapsekar, Azam Ghanbari, Bita Aslrousta, and Samira Mirbagher- Ajorpaz. 2026. CacheMind: From Miss Rates to Why - Natural-Language, Trace- Grounded Reasoning for Cache Replacement. InProceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2(USA)(ASPLOS ’26). Association...

work page doi:10.1145/3779212.3790136 2026

[69] [69]

2025.agent-framework

Microsoft. 2025.agent-framework. https://github.com/microsoft/agent- framework

2025

[70] [70]

Jordan, and Ion Stoica

Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, Richard Liaw, Eric Liang, Melih Elibol, Zongheng Yang, William Paul, Michael I. Jordan, and Ion Stoica. 2018. Ray: a distributed framework for emerging AI applications. InProceedings of the 13th USENIX Conference on Operating Systems Design and Implementation(Carlsbad, CA, USA)(OSDI’18). US...

2018

[71] [71]

Alexander Novikov, Ngân V˜u, Marvin Eisenberger, Emilien Dupont, Po-Sen Huang, Adam Zsolt Wagner, Sergey Shirobokov, Borislav Kozlovskii, Francisco J. R. Ruiz, Abbas Mehrabian, M. Pawan Kumar, Abigail See, Swarat Chaud- huri, George Holland, Alex Davies, Sebastian Nowozin, Pushmeet Kohli, and Matej Balog. 2025. AlphaEvolve: A coding agent for scientific a...

Pith/arXiv arXiv 2025

[72] [72]

2015.Microbench

Tony Nowatzki. 2015.Microbench. https://github.com/VerticalResearchGroup /microbench

2015

[73] [73]

Surim Oh, Mingsheng Xu, Tanvir Ahmed Khan, Baris Kasikci, and Heiner Litz

[74] [74]

InProceedings of the 51st International Symposium on Computer Architecture (ISCA) (ISCA 2024)

UDP: Utility-Driven Fetch Directed Instruction Prefetching. InProceedings of the 51st International Symposium on Computer Architecture (ISCA) (ISCA 2024)

2024

[75] [75]

2023.ollama

ollama. 2023.ollama. https://github.com/ollama/ollama

2023

[76] [76]

OpenAI. 2026. Codex. https://chatgpt.com/codex/ Accessed: 2026-06-23

2026

[77] [77]

2026.openclaw

openclaw. 2026.openclaw. https://github.com/openclaw/openclaw

2026

[78] [78]

OpenRouter. 2026. OpenRouter. https://openrouter.ai/ Accessed: 2026-06-23

2026

[79] [79]

2013.openssl

openssl. 2013.openssl. https://github.com/openssl/openssl

2013

[80] [80]

Anne Ouyang, Simon Guo, Simran Arora, Alex L Zhang, William Hu, Christo- pher Re, and Azalia Mirhoseini. 2025. KernelBench: Can LLMs Write Efficient GPU Kernels?. InForty-second International Conference on Machine Learning. https://openreview.net/forum?id=yeoN1iQT1x

2025