John Schulman
Identifiers
- name variant John Schulman 0.60 · backfill
Papers (37)
- Reasoning Models Don't Always Say What They Think cs.CL · 2025 · author #6
- Measuring short-form factuality in large language models cs.CL · 2024 · author #7
- GPT-4o System Card cs.CL · 2024 · author #193
- Let's Verify Step by Step cs.LG · 2023 · author #8
- GPT-4 Technical Report cs.CL · 2023 · author #215
- Scaling Laws for Reward Model Overoptimization cs.LG · 2022 · author #2
- Efficient Training of Language Models to Fill in the Middle cs.CL · 2022 · author #4
- Training language models to follow instructions with human feedback cs.CL · 2022 · author #11
- WebGPT: Browser-assisted question-answering with human feedback cs.CL · 2021 · author #18
- Training Verifiers to Solve Math Word Problems cs.LG · 2021 · author #12
- Unsolved Problems in ML Safety cs.LG · 2021 · author #3
- Scaling Laws for Autoregressive Generative Modeling cs.LG · 2020 · author #17
- Policy Gradient Search: Online Planning and Expert Iteration without Search Trees cs.LG · 2019 · author #5
- Semi-Supervised Learning by Label Gradient Alignment cs.LG · 2019 · author #2
- Quantifying Generalization in Reinforcement Learning cs.LG · 2018 · author #5
- Model-Based Reinforcement Learning via Meta-Policy Optimization cs.LG · 2018 · author #3
- Gotta Learn Fast: A New Benchmark for Generalization in RL cs.LG · 2018 · author #5
- On First-Order Meta-Learning Algorithms cs.LG · 2018 · author #3
- Meta Learning Shared Hierarchies cs.LG · 2017 · author #5
- Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations cs.LG · 2017 · author #5
- Proximal Policy Optimization Algorithms cs.LG · 2017 · author #1
- Teacher-Student Curriculum Learning cs.LG · 2017 · author #4
- UCB Exploration via Q-Ensembles cs.LG · 2017 · author #4
- Equivalence Between Policy Gradients and Soft Q-Learning cs.LG · 2017 · author #1
- #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning cs.AI · 2016 · author #7
- RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning cs.AI · 2016 · author #2
- Variational Lossy Autoencoder cs.LG · 2016 · author #6
- Concrete Problems in AI Safety cs.AI · 2016 · author #5
- InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets cs.LG · 2016 · author #4
- OpenAI Gym cs.LG · 2016 · author #5
- VIME: Variational Information Maximizing Exploration cs.LG · 2016 · author #4
- Kahler-Einstein and Kahler scalar flat supermanifolds hep-th · 2016 · author #3
- Theano: A Python framework for fast computation of mathematical expressions cs.SC · 2016 · author #89
- Benchmarking Deep Reinforcement Learning for Continuous Control cs.LG · 2016 · author #4
- Gradient Estimation Using Stochastic Computation Graphs cs.LG · 2015 · author #1
- High-Dimensional Continuous Control Using Generalized Advantage Estimation cs.LG · 2015 · author #1
- Trust Region Policy Optimization cs.LG · 2015 · author #1
Mentions
- 1502.05477 #1 · backfill · confidence 0.70 John Schulman
- 2210.10760 #2 · arxiv_oai · confidence 0.70 John Schulman
- 2207.14255 #4 · arxiv_oai · confidence 0.70 John Schulman
- 2109.13916 #3 · arxiv_oai · confidence 0.70 John Schulman
- 2303.08774 #215 · backfill · confidence 0.70 John Schulman
Frequent Coauthors
- Pieter Abbeel 13 shared papers
- Xi Chen 8 shared papers
- Ilya Sutskever 6 shared papers
- Yan Duan 6 shared papers
- Christopher Hesse 5 shared papers
- Heewoo Jun 5 shared papers
- Jan Leike 5 shared papers
- Mark Chen 5 shared papers
- Alec Radford 4 shared papers
- Jacob Hilton 4 shared papers
- Karl Cobbe 4 shared papers
- Long Ouyang 4 shared papers
- Prafulla Dhariwal 4 shared papers
- Rein Houthooft 4 shared papers
- Aditya Ramesh 3 shared papers
- Alex Nichol 3 shared papers
- Chong Zhang 3 shared papers
- Chris Hallacy 3 shared papers
- Christina Kim 3 shared papers
- Christine McLeavey 3 shared papers