Mage: A multi-agent engine for automated rtl code generation

MAGE: A Multi-Agent Engine for Automated RTL Code Generation · 2024 · arXiv 2412.07822

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

representative citing papers

RTL-BenchLS: A Large-Scale Benchmark for RTL Reasoning and Generation with Large Language Models

cs.AI · 2026-06-08 · unverdicted · novelty 8.0

RTL-BenchLS supplies a large-scale formally verified benchmark and three novel tasks that expose low performance of frontier LLMs on realistic RTL reasoning and generation.

CHIA: An open-source framework for principled, agentic AI-driven hardware/software co-design research

cs.AR · 2026-06-25 · unverdicted · novelty 7.0

CHIA introduces a framework for building and deploying agentic AI co-design flows as CHIA loops with tool nodes, reliability mechanisms, and five case-study demonstrations.

TimingLLM: A Two-Stage Retrieval-Augmented Framework for Pre-Synthesis Timing Prediction from Verilog

cs.AR · 2026-04-26 · conditional · novelty 7.0

TimingLLM uses a fine-tuned LLM to generate structural timing cues from Verilog followed by a retrieval-augmented regressor with a learned steering vector to predict WNS and TNS with R values of 0.91 and 0.97.

CASS-RTL: Correctness-Aware Subspace Steering for RTL Generation with LLMs

cs.PL · 2026-06-04 · unverdicted · novelty 6.0

CASS-RTL identifies correctness-linked attention heads, builds a steering subspace from them, and applies a geometry-aware intervention that raises pass@1/5/10 accuracy 10-20% on VerilogEval and 5% on CVDP across multiple LLMs without retraining or extra labels.

Verilog-Evolve: Feedback-Driven and Skill-Evolving Verilog Generation

cs.CL · 2026-05-26 · unverdicted · novelty 6.0

Verilog-Evolve uses executable feedback from simulation, synthesis, timing, and GEMM metrics to refine LLM-generated Verilog and evolves skills across tasks, improving functional success and downstream hardware quality on VerilogEval and mixed-precision GEMM benchmarks.

RTL-BenchMT: Dynamic Maintenance of RTL Generation Benchmark Through Agent-Assisted Analysis and Revision

cs.AI · 2026-05-15 · unverdicted · novelty 6.0

RTL-BenchMT is an agent-assisted framework for dynamically maintaining RTL generation benchmarks by fixing flaws and reducing overfitting in LLM-based EDA applications.

Agentic Hardware Design as Repository-Level Code Evolution

cs.AR · 2026-06-26 · unverdicted · novelty 4.0

HORIZON applies repository-level self-evolution to hardware design artifacts and reports 100% completion on ChipBench, RTLLM, Verilog-Eval, and nine CVDP categories using a hands-free agent loop.

citing papers explorer

Showing 6 of 6 citing papers after filters.

RTL-BenchLS: A Large-Scale Benchmark for RTL Reasoning and Generation with Large Language Models cs.AI · 2026-06-08 · unverdicted · none · ref 38
RTL-BenchLS supplies a large-scale formally verified benchmark and three novel tasks that expose low performance of frontier LLMs on realistic RTL reasoning and generation.
CHIA: An open-source framework for principled, agentic AI-driven hardware/software co-design research cs.AR · 2026-06-25 · unverdicted · none · ref 119
CHIA introduces a framework for building and deploying agentic AI co-design flows as CHIA loops with tool nodes, reliability mechanisms, and five case-study demonstrations.
CASS-RTL: Correctness-Aware Subspace Steering for RTL Generation with LLMs cs.PL · 2026-06-04 · unverdicted · none · ref 6
CASS-RTL identifies correctness-linked attention heads, builds a steering subspace from them, and applies a geometry-aware intervention that raises pass@1/5/10 accuracy 10-20% on VerilogEval and 5% on CVDP across multiple LLMs without retraining or extra labels.
Verilog-Evolve: Feedback-Driven and Skill-Evolving Verilog Generation cs.CL · 2026-05-26 · unverdicted · none · ref 28
Verilog-Evolve uses executable feedback from simulation, synthesis, timing, and GEMM metrics to refine LLM-generated Verilog and evolves skills across tasks, improving functional success and downstream hardware quality on VerilogEval and mixed-precision GEMM benchmarks.
RTL-BenchMT: Dynamic Maintenance of RTL Generation Benchmark Through Agent-Assisted Analysis and Revision cs.AI · 2026-05-15 · unverdicted · none · ref 20
RTL-BenchMT is an agent-assisted framework for dynamically maintaining RTL generation benchmarks by fixing flaws and reducing overfitting in LLM-based EDA applications.
Agentic Hardware Design as Repository-Level Code Evolution cs.AR · 2026-06-26 · unverdicted · none · ref 17
HORIZON applies repository-level self-evolution to hardware design artifacts and reports 100% completion on ChipBench, RTLLM, Verilog-Eval, and nine CVDP categories using a hands-free agent loop.

Mage: A multi-agent engine for automated rtl code generation

fields

years

verdicts

representative citing papers

citing papers explorer