pith. sign in

Title resolution pending

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

citation-role summary

dataset 2

citation-polarity summary

fields

cs.IR 2 cs.LG 2

years

2026 4

verdicts

UNVERDICTED 4

roles

dataset 2

polarities

use dataset 2

clear filters

representative citing papers

Offline Contextual Bandits in the Presence of New Actions

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

PONA integrates the LCPI estimator for new action selection with the DR estimator for existing actions to optimize policies in offline contextual bandits with evolving action spaces.

Off-Policy Learning with Limited Supply

cs.LG · 2026-03-19 · unverdicted · novelty 6.0 · 2 refs

OPLS is a new off-policy learning method for contextual bandits with limited supply that outperforms conventional greedy approaches by prioritizing items with relatively higher expected rewards compared to other users.

citing papers explorer

Showing 2 of 2 citing papers after filters.