Mohammad Ghavamzadeh

Identifiers

  • name variant Mohammad Ghavamzadeh 0.60 · backfill

Papers (34)

  1. MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving cs.RO · 2026 · author #11
  2. Bayesian policy gradient and actor-critic algorithms cs.LG · 2026 · author #1
  3. Maximum Entropy Semi-Supervised Inverse Reinforcement Learning cs.LG · 2026 · author #4
  4. Aligning Text-to-Image Models using Human Feedback cs.LG · 2023 · author #8
  5. Benchmarking Batch Deep Reinforcement Learning Algorithms cs.LG · 2019 · author #3
  6. Active Learning for Binary Classification with Abstention cs.LG · 2019 · author #2
  7. Binary Classification with Bounded Abstention Rate cs.LG · 2019 · author #2
  8. Lyapunov-based Safe Policy Optimization for Continuous Control cs.LG · 2019 · author #5
  9. Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits cs.LG · 2018 · author #5
  10. A Block Coordinate Ascent Algorithm for Mean-Variance Optimization cs.LG · 2018 · author #4
  11. Risk-Sensitive Generative Adversarial Imitation Learning cs.LG · 2018 · author #2
  12. A Lyapunov-based Approach to Safe Reinforcement Learning cs.LG · 2018 · author #4
  13. Optimizing over a Restricted Policy Class in Markov Decision Processes cs.LG · 2018 · author #3
  14. Path Consistency Learning in Tsallis Entropy Regularized MDPs cs.AI · 2018 · author #3
  15. More Robust Doubly Robust Off-policy Evaluation cs.AI · 2018 · author #3
  16. Disentangling Dynamics and Content for Control and Planning cs.LG · 2017 · author #4
  17. Robust Locally-Linear Controllable Embedding cs.LG · 2017 · author #3
  18. Online Learning to Rank in Stochastic Click Models cs.LG · 2017 · author #3
  19. Active Learning for Accurate Estimation of Linear Models stat.ML · 2017 · author #2
  20. Model-Independent Online Learning for Influence Maximization cs.LG · 2017 · author #4
  21. Bottleneck Conditional Density Estimation stat.ML · 2016 · author #3
  22. Conservative Contextual Linear Bandits stat.ML · 2016 · author #2
  23. Bayesian Reinforcement Learning: A Survey cs.AI · 2016 · author #1
  24. Safe Policy Improvement by Minimizing Robust Baseline Regret stat.ML · 2016 · author #3
  25. Personalized Advertisement Recommendation: A Ranking Approach to Address the Ubiquitous Click Sparsity Problem cs.LG · 2016 · author #3
  26. Graphical Model Sketch cs.DS · 2016 · author #3
  27. Risk-Constrained Reinforcement Learning with Percentile Risk Criteria cs.AI · 2015 · author #2
  28. Upper-Confidence-Bound Algorithms for Active Learning in Multi-Armed Bandits cs.LG · 2015 · author #3
  29. Robust Policy Optimization with Baseline Guarantees math.OC · 2015 · author #3
  30. Policy Gradient for Coherent Risk Measures cs.AI · 2015 · author #3
  31. Constrained Stochastic Optimal Control with a Baseline Performance Guarantee math.OC · 2014 · author #2
  32. Classification-based Approximate Policy Iteration: Experiments and Extended Discussions cs.LG · 2014 · author #4
  33. Algorithms for CVaR Optimization in MDPs cs.AI · 2014 · author #2
  34. Variance-Constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs cs.LG · 2014 · author #2

Mentions

  • 1502.03919 #3 · backfill · confidence 0.70 Mohammad Ghavamzadeh
  • 1410.2726 #2 · backfill · confidence 0.70 Mohammad Ghavamzadeh
  • 1407.0449 #4 · backfill · confidence 0.70 Mohammad Ghavamzadeh
  • 1406.3339 #2 · backfill · confidence 0.70 Mohammad Ghavamzadeh
  • 1403.6530 #2 · backfill · confidence 0.70 Mohammad Ghavamzadeh
  • 2605.14201 #11 · arxiv_oai · confidence 0.70 Mohammad Ghavamzadeh
  • 1910.01708 #3 · arxiv_oai · confidence 0.70 Mohammad Ghavamzadeh

Frequent Coauthors