QED bounds cross-run KL divergence in Boltzmann policies by setting temperature proportional to Q-disagreement and reduces return variance by two orders of magnitude on 18 continuous-control tasks without performance loss.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2representative citing papers
SynthRAD2025 shows deep learning produces synthetic CTs with MAE 48-65 HU and high dosimetric gamma passing rates for radiotherapy, performing better on CBCT-to-CT than MRI-to-CT tasks.
citing papers explorer
-
Behavior-Consistent Deep Reinforcement Learning
QED bounds cross-run KL divergence in Boltzmann policies by setting temperature proportional to Q-disagreement and reduces return variance by two orders of magnitude on 18 continuous-control tasks without performance loss.
-
Generating synthetic computed tomography for radiotherapy: SynthRAD2025 challenge report
SynthRAD2025 shows deep learning produces synthetic CTs with MAE 48-65 HU and high dosimetric gamma passing rates for radiotherapy, performing better on CBCT-to-CT than MRI-to-CT tasks.