Emma Brunskill

Identifiers

  • name variant Emma Brunskill 0.60 · backfill

Papers (39)

  1. When are LLMs Sufficient Policy Optimizers for Sequential RL Tasks? cs.LG · 2026 · author #2
  2. Active Learning for Stochastic Contextual Linear Bandits cs.LG · 2026 · author #1
  3. Improving Hybrid Human-AI Tutoring by Differentiating Human Tutor Roles Based on Student Needs cs.CY · 2026 · author #7
  4. Trading off rewards and errors in multi-armed bandits cs.LG · 2026 · author #4
  5. GIANTS: Generative Insight Anticipation from Scientific Literature cs.CL · 2026 · author #7
  6. Generative Experiences for Digital Mental Health Interventions: Evidence from a Randomized Study cs.HC · 2026 · author #5
  7. On the Opportunities and Risks of Foundation Models cs.LG · 2021 · author #10
  8. Directed Exploration for Reinforcement Learning cs.LG · 2019 · author #2
  9. PLOTS: Procedure Learning from Observations using Subtask Structure cs.LG · 2019 · author #3
  10. Off-Policy Policy Gradient with State Distribution Correction cs.LG · 2019 · author #4
  11. Separating value functions across time-scales cs.LG · 2019 · author #4
  12. Distilling Information from a Flood: A Possibility for the Use of Meta-Analysis and Systematic Review in Machine Learning Research cs.DL · 2018 · author #2
  13. Policy Certificates: Towards Accountable Reinforcement Learning cs.LG · 2018 · author #4
  14. Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters cs.LG · 2018 · author #7
  15. Fast Exploration with Simplified Models and Approximately Optimistic Planning in Model Based Reinforcement Learning cs.AI · 2018 · author #4
  16. When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms cs.LG · 2018 · author #2
  17. Representation Balancing MDPs for Off-Policy Policy Evaluation cs.LG · 2018 · author #7
  18. Generalized Grounding Graphs: A Probabilistic Framework for Understanding Grounded Commands cs.CL · 2017 · author #7
  19. On Ensuring that Intelligent Machines Are Well-Behaved cs.AI · 2017 · author #4
  20. Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines cs.AI · 2017 · author #2
  21. Decoupling Learning Rules from Representations cs.AI · 2017 · author #3
  22. Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning cs.LG · 2017 · author #3
  23. Sample Efficient Feature Selection for Factored MDPs cs.LG · 2017 · author #2
  24. Using Options and Covariance Testing for Long Horizon Off-Policy Policy Evaluation cs.AI · 2017 · author #3
  25. Sample Efficient Policy Search for Optimal Stopping Domains cs.AI · 2017 · author #3
  26. Importance Sampling with Unequal Support cs.LG · 2016 · author #2
  27. A PAC RL Algorithm for Episodic POMDPs cs.LG · 2016 · author #3
  28. Latent Contextual Bandits and their Application to Personalized Recommendations for New Users cs.LG · 2016 · author #2
  29. Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning cs.LG · 2016 · author #2
  30. Sample Complexity of Episodic Fixed-Horizon Reinforcement Learning stat.ML · 2015 · author #2
  31. The Online Coupon-Collector Problem and Its Application to Lifelong Reinforcement Learning cs.LG · 2015 · author #1
  32. Online Stochastic Optimization under Correlated Bandit Feedback stat.ML · 2014 · author #3
  33. Efficient Planning under Uncertainty with Macro-actions cs.AI · 2014 · author #2
  34. Sample Complexity of Multi-task Reinforcement Learning cs.LG · 2013 · author #1
  35. Sequential Transfer in Multi-armed Bandit with Finite Set of Models stat.ML · 2013 · author #3
  36. Regret Bounds for Reinforcement Learning with Policy Advice stat.ML · 2013 · author #3
  37. Incentive Decision Processes cs.GT · 2012 · author #2
  38. CORL: A Continuous-state Offset-dynamics Reinforcement Learner cs.LG · 2012 · author #1
  39. RAPID: A Reachable Anytime Planner for Imprecisely-sensed Domains cs.AI · 2012 · author #1

Mentions

  • 1510.08906 #2 · backfill · confidence 0.70 Emma Brunskill
  • 1402.0562 #3 · arxiv_oai · confidence 0.70 Emma Brunskill
  • 1506.03379 #1 · backfill · confidence 0.70 Emma Brunskill
  • 2605.30719 #2 · arxiv_oai · confidence 0.70 Emma Brunskill
  • 1402.0562 #3 · backfill · confidence 0.70 Emma Brunskill
  • 1401.3827 #2 · backfill · confidence 0.70 Emma Brunskill
  • 1309.6821 #1 · backfill · confidence 0.70 Emma Brunskill
  • 1307.6887 #3 · backfill · confidence 0.70 Emma Brunskill
  • 1305.1027 #3 · backfill · confidence 0.70 Emma Brunskill
  • 2605.24803 #1 · arxiv_oai · confidence 0.70 Emma Brunskill
  • 1210.4877 #2 · backfill · confidence 0.70 Emma Brunskill
  • 1206.3231 #1 · backfill · confidence 0.70 Emma Brunskill
  • 1203.3538 #1 · backfill · confidence 0.70 Emma Brunskill

Frequent Coauthors