Pac subset selection in stochastic multi-armed bandits

Kalyanakrishnan, S · 2012

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Pure Exploration Beyond Reward Feedback: The Role of Post-Action Context

cs.LG · 2025-02-05 · unverdicted · novelty 6.0

Introduces BAI with post-action context in fixed-confidence stochastic bandits, derives instance-dependent lower bounds, and gives asymptotically optimal algorithms for separator and non-separator cases.

RL4RLA: Teaching ML to Discover Randomized Linear Algebra Algorithms Through Curriculum Design and Graph-Based Search

cs.LG · 2026-05-18 · unverdicted · novelty 5.0

RL4RLA is a reinforcement learning framework that discovers interpretable symbolic randomized linear algebra algorithms by combining curriculum learning and graph-based search to overcome sparse rewards and large search spaces.

citing papers explorer

Showing 2 of 2 citing papers.

Pure Exploration Beyond Reward Feedback: The Role of Post-Action Context cs.LG · 2025-02-05 · unverdicted · none · ref 33
Introduces BAI with post-action context in fixed-confidence stochastic bandits, derives instance-dependent lower bounds, and gives asymptotically optimal algorithms for separator and non-separator cases.
RL4RLA: Teaching ML to Discover Randomized Linear Algebra Algorithms Through Curriculum Design and Graph-Based Search cs.LG · 2026-05-18 · unverdicted · none · ref 68
RL4RLA is a reinforcement learning framework that discovers interpretable symbolic randomized linear algebra algorithms by combining curriculum learning and graph-based search to overcome sparse rewards and large search spaces.

Pac subset selection in stochastic multi-armed bandits

fields

years

verdicts

representative citing papers

citing papers explorer