Mohammad Ghavamzadeh — Pith Author Registry

Identifiers

name variant: Mohammad Ghavamzadeh · 0.60 · backfill

Papers (34)

MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving · cs.RO · 2026 · author #11
Bayesian policy gradient and actor-critic algorithms · cs.LG · 2026 · author #1
Maximum Entropy Semi-Supervised Inverse Reinforcement Learning · cs.LG · 2026 · author #4
Aligning Text-to-Image Models using Human Feedback · cs.LG · 2023 · author #8
Benchmarking Batch Deep Reinforcement Learning Algorithms · cs.LG · 2019 · author #3
Active Learning for Binary Classification with Abstention · cs.LG · 2019 · author #2
Binary Classification with Bounded Abstention Rate · cs.LG · 2019 · author #2
Lyapunov-based Safe Policy Optimization for Continuous Control · cs.LG · 2019 · author #5
Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits · cs.LG · 2018 · author #5
A Block Coordinate Ascent Algorithm for Mean-Variance Optimization · cs.LG · 2018 · author #4
Risk-Sensitive Generative Adversarial Imitation Learning · cs.LG · 2018 · author #2
A Lyapunov-based Approach to Safe Reinforcement Learning · cs.LG · 2018 · author #4
Optimizing over a Restricted Policy Class in Markov Decision Processes · cs.LG · 2018 · author #3
Path Consistency Learning in Tsallis Entropy Regularized MDPs · cs.AI · 2018 · author #3
More Robust Doubly Robust Off-policy Evaluation · cs.AI · 2018 · author #3
Disentangling Dynamics and Content for Control and Planning · cs.LG · 2017 · author #4
Robust Locally-Linear Controllable Embedding · cs.LG · 2017 · author #3
Online Learning to Rank in Stochastic Click Models · cs.LG · 2017 · author #3
Active Learning for Accurate Estimation of Linear Models · stat.ML · 2017 · author #2
Model-Independent Online Learning for Influence Maximization · cs.LG · 2017 · author #4
Bottleneck Conditional Density Estimation · stat.ML · 2016 · author #3
Conservative Contextual Linear Bandits · stat.ML · 2016 · author #2
Bayesian Reinforcement Learning: A Survey · cs.AI · 2016 · author #1
Safe Policy Improvement by Minimizing Robust Baseline Regret · stat.ML · 2016 · author #3
Personalized Advertisement Recommendation: A Ranking Approach to Address the Ubiquitous Click Sparsity Problem · cs.LG · 2016 · author #3
Graphical Model Sketch · cs.DS · 2016 · author #3
Risk-Constrained Reinforcement Learning with Percentile Risk Criteria · cs.AI · 2015 · author #2
Upper-Confidence-Bound Algorithms for Active Learning in Multi-Armed Bandits · cs.LG · 2015 · author #3
Robust Policy Optimization with Baseline Guarantees · math.OC · 2015 · author #3
Policy Gradient for Coherent Risk Measures · cs.AI · 2015 · author #3
Constrained Stochastic Optimal Control with a Baseline Performance Guarantee · math.OC · 2014 · author #2
Classification-based Approximate Policy Iteration: Experiments and Extended Discussions · cs.LG · 2014 · author #4
Algorithms for CVaR Optimization in MDPs · cs.AI · 2014 · author #2
Variance-Constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs · cs.LG · 2014 · author #2

Mentions

1502.03919 #3 · backfill · confidence 0.70 · Mohammad Ghavamzadeh
1410.2726 #2 · backfill · confidence 0.70 · Mohammad Ghavamzadeh
1407.0449 #4 · backfill · confidence 0.70 · Mohammad Ghavamzadeh
1406.3339 #2 · backfill · confidence 0.70 · Mohammad Ghavamzadeh
1403.6530 #2 · backfill · confidence 0.70 · Mohammad Ghavamzadeh
2605.14201 #11 · arxiv_oai · confidence 0.70 · Mohammad Ghavamzadeh
1910.01708 #3 · arxiv_oai · confidence 0.70 · Mohammad Ghavamzadeh

Frequent Coauthors

Yinlam Chow 12 shared papers
Branislav Kveton 4 shared papers
Alessandro Lazaric 3 shared papers
Ershad Banijamali 3 shared papers
Ofir Nachum 3 shared papers
Zheng Wen 3 shared papers
Ali Ghodsi 2 shared papers
Aviv Tamar 2 shared papers
Csaba Szepesvari 2 shared papers
Edgar Duenez-Guzman 2 shared papers
Georgios Theocharous 2 shared papers
Hung Bui 2 shared papers
Joelle Pineau 2 shared papers
Marco Pavone 2 shared papers
Marek Petrik 2 shared papers
Michal Valko 2 shared papers
Rui Shu 2 shared papers
Sharan Vaswani 2 shared papers
Shie Mannor 2 shared papers
Shubhanshu Shekhar 2 shared papers