Each run was configured for 32K shots, and the circuit was executed 30 times to obtain a statistically meaningful dis- tribution of outcomes

Implementation, validation of optimal trajectory search on quantum hardware To further evaluate the proposed QRL framework, Grover’s search–based optimal trajectory identificati

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Scalable Quantum Reinforcement Learning on NISQ Devices with Dynamic-Circuit Qubit Reuse and Grover Optimization

quant-ph · 2025-09-19 · unverdicted · novelty 5.0

A dynamic-circuit framework for multi-step quantum Markov decision processes reduces physical qubit count from O(T) to O(1) while preserving trajectory fidelity and applying Grover amplification for high-return paths.

citing papers explorer

Showing 1 of 1 citing paper.

Scalable Quantum Reinforcement Learning on NISQ Devices with Dynamic-Circuit Qubit Reuse and Grover Optimization quant-ph · 2025-09-19 · unverdicted · none · ref 6
A dynamic-circuit framework for multi-step quantum Markov decision processes reduces physical qubit count from O(T) to O(1) while preserving trajectory fidelity and applying Grover amplification for high-return paths.

Each run was configured for 32K shots, and the circuit was executed 30 times to obtain a statistically meaningful dis- tribution of outcomes

fields

years

verdicts

representative citing papers

citing papers explorer