Mohammad Ghavamzadeh
Identifiers
- name variant Mohammad Ghavamzadeh 0.60 · backfill
Papers (34)
- MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving cs.RO · 2026 · author #11
- Bayesian policy gradient and actor-critic algorithms cs.LG · 2026 · author #1
- Maximum Entropy Semi-Supervised Inverse Reinforcement Learning cs.LG · 2026 · author #4
- Aligning Text-to-Image Models using Human Feedback cs.LG · 2023 · author #8
- Benchmarking Batch Deep Reinforcement Learning Algorithms cs.LG · 2019 · author #3
- Active Learning for Binary Classification with Abstention cs.LG · 2019 · author #2
- Binary Classification with Bounded Abstention Rate cs.LG · 2019 · author #2
- Lyapunov-based Safe Policy Optimization for Continuous Control cs.LG · 2019 · author #5
- Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits cs.LG · 2018 · author #5
- A Block Coordinate Ascent Algorithm for Mean-Variance Optimization cs.LG · 2018 · author #4
- Risk-Sensitive Generative Adversarial Imitation Learning cs.LG · 2018 · author #2
- A Lyapunov-based Approach to Safe Reinforcement Learning cs.LG · 2018 · author #4
- Optimizing over a Restricted Policy Class in Markov Decision Processes cs.LG · 2018 · author #3
- Path Consistency Learning in Tsallis Entropy Regularized MDPs cs.AI · 2018 · author #3
- More Robust Doubly Robust Off-policy Evaluation cs.AI · 2018 · author #3
- Disentangling Dynamics and Content for Control and Planning cs.LG · 2017 · author #4
- Robust Locally-Linear Controllable Embedding cs.LG · 2017 · author #3
- Online Learning to Rank in Stochastic Click Models cs.LG · 2017 · author #3
- Active Learning for Accurate Estimation of Linear Models stat.ML · 2017 · author #2
- Model-Independent Online Learning for Influence Maximization cs.LG · 2017 · author #4
- Bottleneck Conditional Density Estimation stat.ML · 2016 · author #3
- Conservative Contextual Linear Bandits stat.ML · 2016 · author #2
- Bayesian Reinforcement Learning: A Survey cs.AI · 2016 · author #1
- Safe Policy Improvement by Minimizing Robust Baseline Regret stat.ML · 2016 · author #3
- Personalized Advertisement Recommendation: A Ranking Approach to Address the Ubiquitous Click Sparsity Problem cs.LG · 2016 · author #3
- Graphical Model Sketch cs.DS · 2016 · author #3
- Risk-Constrained Reinforcement Learning with Percentile Risk Criteria cs.AI · 2015 · author #2
- Upper-Confidence-Bound Algorithms for Active Learning in Multi-Armed Bandits cs.LG · 2015 · author #3
- Robust Policy Optimization with Baseline Guarantees math.OC · 2015 · author #3
- Policy Gradient for Coherent Risk Measures cs.AI · 2015 · author #3
- Constrained Stochastic Optimal Control with a Baseline Performance Guarantee math.OC · 2014 · author #2
- Classification-based Approximate Policy Iteration: Experiments and Extended Discussions cs.LG · 2014 · author #4
- Algorithms for CVaR Optimization in MDPs cs.AI · 2014 · author #2
- Variance-Constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs cs.LG · 2014 · author #2
Mentions
- 1502.03919 #3 · backfill · confidence 0.70 Mohammad Ghavamzadeh
- 1410.2726 #2 · backfill · confidence 0.70 Mohammad Ghavamzadeh
- 1407.0449 #4 · backfill · confidence 0.70 Mohammad Ghavamzadeh
- 1406.3339 #2 · backfill · confidence 0.70 Mohammad Ghavamzadeh
- 1403.6530 #2 · backfill · confidence 0.70 Mohammad Ghavamzadeh
- 2605.14201 #11 · arxiv_oai · confidence 0.70 Mohammad Ghavamzadeh
- 1910.01708 #3 · arxiv_oai · confidence 0.70 Mohammad Ghavamzadeh
Frequent Coauthors
- Yinlam Chow 12 shared papers
- Branislav Kveton 4 shared papers
- Alessandro Lazaric 3 shared papers
- Ershad Banijamali 3 shared papers
- Ofir Nachum 3 shared papers
- Zheng Wen 3 shared papers
- Ali Ghodsi 2 shared papers
- Aviv Tamar 2 shared papers
- Csaba Szepesvari 2 shared papers
- Edgar Duenez-Guzman 2 shared papers
- Georgios Theocharous 2 shared papers
- Hung Bui 2 shared papers
- Joelle Pineau 2 shared papers
- Marco Pavone 2 shared papers
- Marek Petrik 2 shared papers
- Michal Valko 2 shared papers
- Rui Shu 2 shared papers
- Sharan Vaswani 2 shared papers
- Shie Mannor 2 shared papers
- Shubhanshu Shekhar 2 shared papers