A doubly robust estimator is developed for quantile treatment effects on long-term outcomes by integrating randomized trial data with observational data under surrogate transportability, remaining consistent if either nuisance function is correctly estimated.
Journal of Machine Learning Research , volume=
2 Pith papers cite this work. Polarity classification is still indexing.
years
2026 2representative citing papers
DQPOPE estimates the entire return distribution in off-policy evaluation via deep quantile process regression, providing statistical advantages over standard single-value methods with equivalent sample sizes.
citing papers explorer
-
Double/debiased machine learning of quantile treatment effects on long-term outcomes in clinical trials
A doubly robust estimator is developed for quantile treatment effects on long-term outcomes by integrating randomized trial data with observational data under surrogate transportability, remaining consistent if either nuisance function is correctly estimated.
-
Distributional Off-Policy Evaluation with Deep Quantile Process Regression
DQPOPE estimates the entire return distribution in off-policy evaluation via deep quantile process regression, providing statistical advantages over standard single-value methods with equivalent sample sizes.