John Schulman — Pith Author Registry

Identifiers

name variant John Schulman 0.60 · backfill

Papers (37)

Reasoning Models Don't Always Say What They Think cs.CL · 2025 · author #6
Measuring short-form factuality in large language models cs.CL · 2024 · author #7
GPT-4o System Card cs.CL · 2024 · author #193
Let's Verify Step by Step cs.LG · 2023 · author #8
GPT-4 Technical Report cs.CL · 2023 · author #215
Scaling Laws for Reward Model Overoptimization cs.LG · 2022 · author #2
Efficient Training of Language Models to Fill in the Middle cs.CL · 2022 · author #4
Training language models to follow instructions with human feedback cs.CL · 2022 · author #11
WebGPT: Browser-assisted question-answering with human feedback cs.CL · 2021 · author #18
Training Verifiers to Solve Math Word Problems cs.LG · 2021 · author #12
Unsolved Problems in ML Safety cs.LG · 2021 · author #3
Scaling Laws for Autoregressive Generative Modeling cs.LG · 2020 · author #17
Policy Gradient Search: Online Planning and Expert Iteration without Search Trees cs.LG · 2019 · author #5
Semi-Supervised Learning by Label Gradient Alignment cs.LG · 2019 · author #2
Quantifying Generalization in Reinforcement Learning cs.LG · 2018 · author #5
Model-Based Reinforcement Learning via Meta-Policy Optimization cs.LG · 2018 · author #3
Gotta Learn Fast: A New Benchmark for Generalization in RL cs.LG · 2018 · author #5
On First-Order Meta-Learning Algorithms cs.LG · 2018 · author #3
Meta Learning Shared Hierarchies cs.LG · 2017 · author #5
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations cs.LG · 2017 · author #5
Proximal Policy Optimization Algorithms cs.LG · 2017 · author #1
Teacher-Student Curriculum Learning cs.LG · 2017 · author #4
UCB Exploration via Q-Ensembles cs.LG · 2017 · author #4
Equivalence Between Policy Gradients and Soft Q-Learning cs.LG · 2017 · author #1
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning cs.AI · 2016 · author #7
RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning cs.AI · 2016 · author #2
Variational Lossy Autoencoder cs.LG · 2016 · author #6
Concrete Problems in AI Safety cs.AI · 2016 · author #5
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets cs.LG · 2016 · author #4
OpenAI Gym cs.LG · 2016 · author #5
VIME: Variational Information Maximizing Exploration cs.LG · 2016 · author #4
Kahler-Einstein and Kahler scalar flat supermanifolds hep-th · 2016 · author #3
Theano: A Python framework for fast computation of mathematical expressions cs.SC · 2016 · author #89
Benchmarking Deep Reinforcement Learning for Continuous Control cs.LG · 2016 · author #4
Gradient Estimation Using Stochastic Computation Graphs cs.LG · 2015 · author #1
High-Dimensional Continuous Control Using Generalized Advantage Estimation cs.LG · 2015 · author #1
Trust Region Policy Optimization cs.LG · 2015 · author #1

Mentions

1502.05477 #1 · backfill · confidence 0.70 John Schulman
2210.10760 #2 · arxiv_oai · confidence 0.70 John Schulman
2207.14255 #4 · arxiv_oai · confidence 0.70 John Schulman
2109.13916 #3 · arxiv_oai · confidence 0.70 John Schulman
2303.08774 #215 · backfill · confidence 0.70 John Schulman

Frequent Coauthors

Pieter Abbeel 13 shared papers
Xi Chen 8 shared papers
Ilya Sutskever 6 shared papers
Yan Duan 6 shared papers
Christopher Hesse 5 shared papers
Heewoo Jun 5 shared papers
Jan Leike 5 shared papers
Mark Chen 5 shared papers
Alec Radford 4 shared papers
Jacob Hilton 4 shared papers
Karl Cobbe 4 shared papers
Long Ouyang 4 shared papers
Prafulla Dhariwal 4 shared papers
Rein Houthooft 4 shared papers
Aditya Ramesh 3 shared papers
Alex Nichol 3 shared papers
Chong Zhang 3 shared papers
Chris Hallacy 3 shared papers
Christina Kim 3 shared papers
Christine McLeavey 3 shared papers