QBugLM: An Agentic Benchmarking Framework for LLM-based Quantum Software Debugging

· 2026 · cs.SE · arXiv 2606.07314

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Quantum software bugs often yield silent, incorrect outputs rather than explicit errors, making them particularly difficult to detect and repair with conventional techniques. Although large language models (LLMs) have shown strong performance on classical software engineering tasks, their ability to debug quantum code remains largely unexplored. To bridge this gap, we propose QBugLM, a multi-agent framework that automates the quantum software debugging pipeline, from taxonomy-driven bug injection to LLM-based detection and repair, and finally to simulation-based validation, for framework-agnostic OpenQASM 3.0 programs. We further conduct a comprehensive case study using QBugLM to benchmark two LLMs, Claude 4.6 Sonnet and Qwen3 Coder Next, across different prompting strategies, bug categories, and quantum programs. Our results show that iterative feedback is critical, as a single retry raises Pass@1 from below 25% to above 80%. Moreover, simpler structured prompting can even outperform Chain-of-Thought and ReAct for reasoning-capable models under fixed-resource constraints. Our work takes initial steps toward benchmarking LLM capabilities for debugging quantum programs and offers practical insights to support future efforts in automated quantum software repair.

representative citing papers

Leveraging LLM-Based Agentic Systems to Generate Quantum Applications for Test Optimization

cs.SE · 2026-07-01 · unverdicted · novelty 3.0

QPipe deploys specialized LLM agents for parsing, formulation, code generation, review, execution and verification to produce quantum applications from 20 natural-language test-optimization requirements, reporting 100% compilation and 96.7% execution success with solutions that beat a genetic-algori

citing papers explorer

Showing 1 of 1 citing paper after filters.

Leveraging LLM-Based Agentic Systems to Generate Quantum Applications for Test Optimization cs.SE · 2026-07-01 · unverdicted · none · ref 21 · internal anchor
QPipe deploys specialized LLM agents for parsing, formulation, code generation, review, execution and verification to produce quantum applications from 20 natural-language test-optimization requirements, reporting 100% compilation and 96.7% execution success with solutions that beat a genetic-algori

QBugLM: An Agentic Benchmarking Framework for LLM-based Quantum Software Debugging

fields

years

verdicts

representative citing papers

citing papers explorer