Wiley Interdisciplinary Reviews: Computational Statistics
2 Pith papers cite this work. Polarity classification is still indexing.
years: 2026 (2)
verdicts: UNVERDICTED (2)
citing papers explorer
- DISA: Offline Importance Sampling for Distribution-Matching LLM-RL
  DISA decouples partition function estimation using offline importance sampling for distribution-matching LLM-RL, matching or exceeding online baselines like FlowRL on math and code benchmarks while retaining more strategy diversity.
- BAMIFun: Bayesian Multiple Imputation for Functional Data
  BAMIFun provides Bayesian multiple imputation for functional data via low-rank penalized spline models, achieving accurate imputation and improved coverage in simulations and real datasets compared to single-imputation FPCA methods.