Ilya Sutskever
Identifiers
- name variant Ilya Sutskever 0.60 · backfill
Papers (53)
- OpenAI o1 System Card cs.AI · 2024 · author #102
- GPT-4o System Card cs.CL · 2024 · author #159
- Scaling and evaluating sparse autoencoders cs.LG · 2024 · author #7
- Let's Verify Step by Step cs.LG · 2023 · author #9
- GPT-4 Technical Report cs.CL · 2023 · author #233
- Consistency Models cs.LG · 2023 · author #4
- Robust Speech Recognition via Large-Scale Weak Supervision eess.AS · 2022 · author #6
- GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models cs.CV · 2021 · author #7
- Evaluating Large Language Models Trained on Code cs.LG · 2021 · author #57
- Learning Transferable Visual Models From Natural Language Supervision cs.CV · 2021 · author #12
- Zero-Shot Text-to-Image Generation cs.CV · 2021 · author #8
- Generative Language Modeling for Automated Theorem Proving cs.LG · 2020 · author #2
- Language Models are Few-Shot Learners cs.CL · 2020 · author #30
- Jukebox: A Generative Model for Music eess.AS · 2020 · author #6
- Dota 2 with Large Scale Deep Reinforcement Learning cs.LG · 2019 · author #22
- Generating Long Sequences with Sparse Transformers cs.LG · 2019 · author #4
- FFJORD: Free-form Continuous Dynamics for Scalable Reversible Generative Models cs.LG · 2018 · author #4
- GamePad: A Learning Environment for Theorem Proving cs.LG · 2018 · author #4
- Some Considerations on Learning to Explore via Meta-Reinforcement Learning cs.AI · 2018 · author #8
- Emergent Complexity via Multi-Agent Competition cs.AI · 2017 · author #4
- Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments cs.LG · 2017 · author #4
- An online sequence-to-sequence model for noisy speech recognition cs.CL · 2017 · author #6
- Learning to Generate Reviews and Discovering Sentiment cs.LG · 2017 · author #3
- One-Shot Imitation Learning cs.AI · 2017 · author #6
- Evolution Strategies as a Scalable Alternative to Reinforcement Learning stat.ML · 2017 · author #5
- RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning cs.AI · 2016 · author #5
- Variational Lossy Autoencoder cs.LG · 2016 · author #7
- Extensions and Limitations of the Neural GPU cs.NE · 2016 · author #3
- Learning Online Alignments with Continuous Rewards Policy Gradient cs.LG · 2016 · author #4
- Improving Variational Inference with Inverse Autoregressive Flow cs.LG · 2016 · author #5
- InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets cs.LG · 2016 · author #5
- TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems cs.DC · 2016 · author #29
- Continuous Deep Q-Learning with Model-based Acceleration cs.LG · 2016 · author #3
- Neural GPUs Learn Algorithms cs.LG · 2015 · author #2
- Adding Gradient Noise Improves Learning for Very Deep Networks stat.ML · 2015 · author #4
- Towards Principled Unsupervised Learning cs.LG · 2015 · author #1
- Neural Random-Access Machines cs.LG · 2015 · author #3
- Multi-task Sequence to Sequence Learning cs.LG · 2015 · author #3
- MuProp: Unbiased Backpropagation for Stochastic Neural Networks cs.LG · 2015 · author #3
- A Neural Transducer cs.LG · 2015 · author #5
- Neural Programmer: Inducing Latent Programs with Gradient Descent cs.LG · 2015 · author #3
- Reinforcement Learning Neural Turing Machines - Revised cs.LG · 2015 · author #2
- Grammar as a Foreign Language cs.CL · 2014 · author #5
- Move Evaluation in Go Using Deep Convolutional Neural Networks cs.LG · 2014 · author #3
- Addressing the Rare Word Problem in Neural Machine Translation cs.CL · 2014 · author #2
- Learning to Execute cs.NE · 2014 · author #2
- Sequence to Sequence Learning with Neural Networks cs.CL · 2014 · author #1
- Recurrent Neural Network Regularization cs.NE · 2014 · author #2
- Intriguing properties of neural networks cs.CV · 2013 · author #3
- Learning Factored Representations in a Deep Mixture of Experts cs.LG · 2013 · author #3
- Distributed Representations of Words and Phrases and their Compositionality cs.CL · 2013 · author #2
- Exploiting Similarities among Languages for Machine Translation cs.CL · 2013 · author #3
- Improving neural networks by preventing co-adaptation of feature detectors cs.NE · 2012 · author #4
Mentions
- 1207.0580 #4 · backfill · confidence 0.70 Ilya Sutskever
- 2009.03393 #2 · arxiv_oai · confidence 0.70 Ilya Sutskever
- 2303.08774 #233 · backfill · confidence 0.70 Ilya Sutskever
Frequent Coauthors
- Alec Radford 11 shared papers
- Wojciech Zaremba 11 shared papers
- Mark Chen 8 shared papers
- Oriol Vinyals 8 shared papers
- Lukasz Kaiser 7 shared papers
- Prafulla Dhariwal 7 shared papers
- Quoc V. Le 7 shared papers
- Scott Gray 7 shared papers
- Aditya Ramesh 6 shared papers
- Greg Brockman 6 shared papers
- John Schulman 6 shared papers
- Pieter Abbeel 6 shared papers
- Xi Chen 6 shared papers
- Bob McGrew 5 shared papers
- Gabriel Goh 5 shared papers
- Jakub Pachocki 5 shared papers
- Jan Leike 5 shared papers
- Jie Tang 5 shared papers
- Jong Wook Kim 5 shared papers
- Mikhail Pavlov 5 shared papers