Odalric-Ambrym Maillard

Identifiers

No identifiers captured yet.

Papers (15)

  1. Pliable rejection sampling · stat.ML · 2026 · author #4
  2. Distribution-dependent and Time-uniform Bounds for Piecewise i.i.d Bandits · cs.LG · 2019 · author #2
  3. Practical Open-Loop Optimistic Planning · cs.LG · 2019 · author #2
  4. Budgeted Reinforcement Learning in Continuous State Space · cs.LG · 2019 · author #5
  5. Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs · stat.ML · 2018 · author #2
  6. Efficient tracking of a growing number of experts · stat.ML · 2017 · author #2
  7. Streaming kernel regression with provably adaptive mean, variance, and regularization · stat.ML · 2017 · author #2
  8. Boundary Crossing Probabilities for General Exponential Families · stat.ML · 2017 · author #1
  9. Random Shuffling and Resets for the Non-stationary Stochastic Bandit Problem · cs.AI · 2016 · author #3
  10. Low-rank Bandits with Latent Mixtures · cs.LG · 2016 · author #2
  11. Selecting Near-Optimal Approximate State Representations in Reinforcement Learning · cs.LG · 2014 · author #2
  12. Concentration inequalities for sampling without replacement · math.ST · 2013 · author #2
  13. Optimal Regret Bounds for Selecting the State Representation in Reinforcement Learning · cs.LG · 2013 · author #1
  14. Selecting the State-Representation in Reinforcement Learning · cs.LG · 2013 · author #1
  15. Kullback-Leibler upper confidence bounds for optimal sequential allocation · math.PR · 2012 · author #3

Mentions

No mention provenance yet.

Frequent Coauthors