Ilya Sutskever

Identifiers

  • name variant Ilya Sutskever 0.60 · backfill

Papers (53)

  1. OpenAI o1 System Card cs.AI · 2024 · author #102
  2. GPT-4o System Card cs.CL · 2024 · author #159
  3. Scaling and evaluating sparse autoencoders cs.LG · 2024 · author #7
  4. Let's Verify Step by Step cs.LG · 2023 · author #9
  5. GPT-4 Technical Report cs.CL · 2023 · author #233
  6. Consistency Models cs.LG · 2023 · author #4
  7. Robust Speech Recognition via Large-Scale Weak Supervision eess.AS · 2022 · author #6
  8. GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models cs.CV · 2021 · author #7
  9. Evaluating Large Language Models Trained on Code cs.LG · 2021 · author #57
  10. Learning Transferable Visual Models From Natural Language Supervision cs.CV · 2021 · author #12
  11. Zero-Shot Text-to-Image Generation cs.CV · 2021 · author #8
  12. Generative Language Modeling for Automated Theorem Proving cs.LG · 2020 · author #2
  13. Language Models are Few-Shot Learners cs.CL · 2020 · author #30
  14. Jukebox: A Generative Model for Music eess.AS · 2020 · author #6
  15. Dota 2 with Large Scale Deep Reinforcement Learning cs.LG · 2019 · author #22
  16. Generating Long Sequences with Sparse Transformers cs.LG · 2019 · author #4
  17. FFJORD: Free-form Continuous Dynamics for Scalable Reversible Generative Models cs.LG · 2018 · author #4
  18. GamePad: A Learning Environment for Theorem Proving cs.LG · 2018 · author #4
  19. Some Considerations on Learning to Explore via Meta-Reinforcement Learning cs.AI · 2018 · author #8
  20. Emergent Complexity via Multi-Agent Competition cs.AI · 2017 · author #4
  21. Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments cs.LG · 2017 · author #4
  22. An online sequence-to-sequence model for noisy speech recognition cs.CL · 2017 · author #6
  23. Learning to Generate Reviews and Discovering Sentiment cs.LG · 2017 · author #3
  24. One-Shot Imitation Learning cs.AI · 2017 · author #6
  25. Evolution Strategies as a Scalable Alternative to Reinforcement Learning stat.ML · 2017 · author #5
  26. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning cs.AI · 2016 · author #5
  27. Variational Lossy Autoencoder cs.LG · 2016 · author #7
  28. Extensions and Limitations of the Neural GPU cs.NE · 2016 · author #3
  29. Learning Online Alignments with Continuous Rewards Policy Gradient cs.LG · 2016 · author #4
  30. Improving Variational Inference with Inverse Autoregressive Flow cs.LG · 2016 · author #5
  31. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets cs.LG · 2016 · author #5
  32. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems cs.DC · 2016 · author #29
  33. Continuous Deep Q-Learning with Model-based Acceleration cs.LG · 2016 · author #3
  34. Neural GPUs Learn Algorithms cs.LG · 2015 · author #2
  35. Adding Gradient Noise Improves Learning for Very Deep Networks stat.ML · 2015 · author #4
  36. Towards Principled Unsupervised Learning cs.LG · 2015 · author #1
  37. Neural Random-Access Machines cs.LG · 2015 · author #3
  38. Multi-task Sequence to Sequence Learning cs.LG · 2015 · author #3
  39. MuProp: Unbiased Backpropagation for Stochastic Neural Networks cs.LG · 2015 · author #3
  40. A Neural Transducer cs.LG · 2015 · author #5
  41. Neural Programmer: Inducing Latent Programs with Gradient Descent cs.LG · 2015 · author #3
  42. Reinforcement Learning Neural Turing Machines - Revised cs.LG · 2015 · author #2
  43. Grammar as a Foreign Language cs.CL · 2014 · author #5
  44. Move Evaluation in Go Using Deep Convolutional Neural Networks cs.LG · 2014 · author #3
  45. Addressing the Rare Word Problem in Neural Machine Translation cs.CL · 2014 · author #2
  46. Learning to Execute cs.NE · 2014 · author #2
  47. Sequence to Sequence Learning with Neural Networks cs.CL · 2014 · author #1
  48. Recurrent Neural Network Regularization cs.NE · 2014 · author #2
  49. Intriguing properties of neural networks cs.CV · 2013 · author #3
  50. Learning Factored Representations in a Deep Mixture of Experts cs.LG · 2013 · author #3
  51. Distributed Representations of Words and Phrases and their Compositionality cs.CL · 2013 · author #2
  52. Exploiting Similarities among Languages for Machine Translation cs.CL · 2013 · author #3
  53. Improving neural networks by preventing co-adaptation of feature detectors cs.NE · 2012 · author #4

Mentions

  • 1207.0580 #4 · backfill · confidence 0.70 Ilya Sutskever
  • 2009.03393 #2 · arxiv_oai · confidence 0.70 Ilya Sutskever
  • 2303.08774 #233 · backfill · confidence 0.70 Ilya Sutskever

Frequent Coauthors