InProceedings of the ACM Web Conference 2023

Offline Policy Evaluation in Large Action Spaces via Outcome-Oriented Action Grouping · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Offline Contextual Bandits in the Presence of New Actions

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

PONA integrates the LCPI estimator for new action selection with the DR estimator for existing actions to optimize policies in offline contextual bandits with evolving action spaces.

citing papers explorer

Showing 1 of 1 citing paper.

Offline Contextual Bandits in the Presence of New Actions cs.LG · 2026-05-18 · unverdicted · none · ref 21
PONA integrates the LCPI estimator for new action selection with the DR estimator for existing actions to optimize policies in offline contextual bandits with evolving action spaces.

InProceedings of the ACM Web Conference 2023

fields

years

verdicts

representative citing papers

citing papers explorer