A dynamic-circuit framework for multi-step quantum Markov decision processes reduces physical qubit count from O(T) to O(1) while preserving trajectory fidelity and applying Grover amplification for high-return paths.
Each run was configured for 32K shots, and the circuit was executed 30 times to obtain a statistically meaningful dis- tribution of outcomes
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
quant-ph 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Scalable Quantum Reinforcement Learning on NISQ Devices with Dynamic-Circuit Qubit Reuse and Grover Optimization
A dynamic-circuit framework for multi-step quantum Markov decision processes reduces physical qubit count from O(T) to O(1) while preserving trajectory fidelity and applying Grover amplification for high-return paths.