RLMM decouples person-level choice sensitivity from task-level value functions via a parametric RL model with Boltzmann choice and MAP estimation, outperforming tabular MDP-MM in simulations and linking person parameters to performance in real gameplay data.
Analyzing Group Differences and Measurement Fairness in Process Data: A Sequential Response Model With Covariates , volume =
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
stat.ME 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Reinforcement Learning Measurement Model
RLMM decouples person-level choice sensitivity from task-level value functions via a parametric RL model with Boltzmann choice and MAP estimation, outperforming tabular MDP-MM in simulations and linking person parameters to performance in real gameplay data.