Neural contextual bandits with deep representation and shallow exploration.arXiv preprint arXiv:2012.01780,

Pan Xu, Zheng Wen, Handong Zhao, Quanquan Gu · 2012 · arXiv 2012.01780

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration

cs.LG · 2025-06-02 · unverdicted · novelty 6.0

Variance-aware neural dueling bandit algorithms achieve sublinear regret of order O(d sqrt(sum sigma_t^2) + sqrt(d T)) for wide networks on nonlinear utilities.

Neural Exploitation and Exploration of Contextual Bandits

cs.LG · 2023-05-05 · unverdicted · novelty 6.0

EE-Net is a contextual bandit algorithm that pairs an exploitation neural net with a separate exploration neural net and proves an instance-dependent Õ(√T) regret bound while beating linear and neural baselines on real data.

citing papers explorer

Showing 2 of 2 citing papers.

Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration cs.LG · 2025-06-02 · unverdicted · none · ref 30
Variance-aware neural dueling bandit algorithms achieve sublinear regret of order O(d sqrt(sum sigma_t^2) + sqrt(d T)) for wide networks on nonlinear utilities.
Neural Exploitation and Exploration of Contextual Bandits cs.LG · 2023-05-05 · unverdicted · none · ref 9
EE-Net is a contextual bandit algorithm that pairs an exploitation neural net with a separate exploration neural net and proves an instance-dependent Õ(√T) regret bound while beating linear and neural baselines on real data.

Neural contextual bandits with deep representation and shallow exploration.arXiv preprint arXiv:2012.01780,

fields

years

verdicts

representative citing papers

citing papers explorer