A T-estimation-based procedure for adaptive density estimation and optimal control in offline contextual MDPs without stationarity, providing oracle risk bounds under two loss functions and finite-sample cost guarantees.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Products of finite-dimensional quantum channels asymptotically forget input states under decay of the centered trace-Dobrushin coefficient, yielding unique replacement channels and convergence for deterministic and random inhomogeneous MPS.
citing papers explorer
-
Adaptive Estimation and Optimal Control in Offline Contextual MDPs without Stationarity
A T-estimation-based procedure for adaptive density estimation and optimal control in offline contextual MDPs without stationarity, providing oracle risk bounds under two loss functions and finite-sample cost guarantees.
-
Asymptotic Replacement for Quantum Channel Products with Applications to Inhomogeneous Matrix Product States
Products of finite-dimensional quantum channels asymptotically forget input states under decay of the centered trace-Dobrushin coefficient, yielding unique replacement channels and convergence for deterministic and random inhomogeneous MPS.