Title resolution pending

ISSN 0091-1798, 2168-894X · arXiv stable/2959268

1 Pith paper cites this work. Polarity classification is still indexing.


Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Preference-Based Reward Learning under Partial Observability with Inexact Dynamics

math.OC · 2026-06-29 · unverdicted · novelty 6.0

Establishes stability of belief filters to model error in log-linear and neural-softmax POMDPs under mixing conditions and derives finite-sample guarantees for preference-based reward learning that decouple statistical error from model-mismatch bias.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Preference-Based Reward Learning under Partial Observability with Inexact Dynamics

math.OC · 2026-06-29 · unverdicted · none · ref 7
