Jascha Sohl-Dickstein
Identifiers
- name variant Jascha Sohl-Dickstein 0.60 · backfill
Papers (48)
- AI Organizations are More Effective but Less Aligned than Individual Agents cs.AI · 2026 · author #8
- The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity? cs.AI · 2026 · author #5
- Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models cs.CL · 2022 · author #182
- Score-Based Generative Modeling through Stochastic Differential Equations cs.LG · 2020 · author #2
- Using learned optimizers to make models robust to input noise cs.LG · 2019 · author #4
- The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study cs.LG · 2019 · author #2
- A Mean Field Theory of Batch Normalization cs.NE · 2019 · author #4
- Eliminating all bad Local Minima from Loss Landscapes without even adding an Extra Unit stat.ML · 2019 · author #1
- Measuring the Effects of Data Parallelism on Neural Network Training cs.LG · 2018 · author #4
- Understanding and correcting pathologies in the training of learned optimizers cs.NE · 2018 · author #5
- Guided evolutionary strategies: Augmenting random search with surrogate gradients cs.NE · 2018 · author #5
- Stochastic natural gradient descent draws posterior samples in function space cs.LG · 2018 · author #5
- PCA of high dimensional random walks with comparison to neural network training stat.ML · 2018 · author #2
- Dynamical Isometry and a Mean Field Theory of CNNs: How to Train 10,000-Layer Vanilla Convolutional Neural Networks stat.ML · 2018 · author #3
- Meta-Learning Update Rules for Unsupervised Representation Learning cs.LG · 2018 · author #4
- Sensitivity and Generalization in Neural Networks: an Empirical Study stat.ML · 2018 · author #5
- Generalizing Hamiltonian Monte Carlo with Neural Networks stat.ML · 2017 · author #3
- Deep Neural Networks as Gaussian Processes stat.ML · 2017 · author #6
- A Correspondence Between Random Neural Networks and Statistical Field Theory stat.ML · 2017 · author #3
- SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability stat.ML · 2017 · author #4
- REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models cs.LG · 2017 · author #5
- Learned Optimizers that Scale and Generalize cs.LG · 2017 · author #7
- Improved generator objectives for GANs cs.LG · 2016 · author #3
- Capacity and Trainability in Recurrent Neural Networks stat.ML · 2016 · author #2
- Input Switched Affine Networks: An RNN Architecture Designed for Interpretability cs.AI · 2016 · author #4
- Survey of Expressivity in Deep Neural Networks stat.ML · 2016 · author #5
- Unrolled Generative Adversarial Networks cs.LG · 2016 · author #4
- Deep Information Propagation stat.ML · 2016 · author #4
- Exponential expressivity in deep neural networks through transient chaos stat.ML · 2016 · author #4
- On the Expressive Power of Deep Neural Networks stat.ML · 2016 · author #5
- Density estimation using Real NVP cs.LG · 2016 · author #2
- A universal tradeoff between power, precision and speed in physical communication cond-mat.stat-mech · 2016 · author #2
- A Markov Jump Process for More Efficient Hamiltonian Monte Carlo stat.ML · 2015 · author #4
- Deep Knowledge Tracing cs.AI · 2015 · author #7
- Note on Equivalence Between Recurrent Neural Network Time Series Models and Variational Bayesian Models cs.LG · 2015 · author #1
- Deep Unsupervised Learning using Nonequilibrium Thermodynamics cs.LG · 2015 · author #1
- Hamiltonian Monte Carlo Without Detailed Balance stat.CO · 2014 · author #1
- Analyzing noise in autoencoders and deep networks cs.NE · 2014 · author #2
- Fast large-scale optimization by unifying stochastic gradient and quasi-Newton methods cs.LG · 2013 · author #1
- Higher Order Correlations within Cortical Layers Dominate Functional Connectivity in Microcolumns q-bio.NC · 2013 · author #2
- Minimum and maximum entropy distributions for binary systems with known means and pairwise correlations physics.bio-ph · 2012 · author #3
- Efficient Methods for Unsupervised Learning of Probabilistic Models cs.LG · 2012 · author #1
- Hamiltonian Monte Carlo with Reduced Momentum Flips physics.data-an · 2012 · author #1
- Hamiltonian Annealed Importance Sampling for partition function estimation cs.LG · 2012 · author #1
- The Natural Gradient by Analogy to Signal Whitening, and Recipes and Tricks for its Use cs.LG · 2012 · author #1
- Efficient and optimal binary Hopfield associative memory storage using minimum probability flow nlin.AO · 2012 · author #2
- An Unsupervised Algorithm For Learning Lie Group Transformations cs.CV · 2010 · author #1
- Minimum Probability Flow Learning cs.LG · 2009 · author #1
Mentions
- 1301.0050 #2 · backfill · confidence 0.70 Jascha Sohl-Dickstein
- 1209.3744 #3 · backfill · confidence 0.70 Jascha Sohl-Dickstein
- 1205.4295 #1 · backfill · confidence 0.70 Jascha Sohl-Dickstein
- 1205.1939 #1 · backfill · confidence 0.70 Jascha Sohl-Dickstein
- 1205.1925 #1 · backfill · confidence 0.70 Jascha Sohl-Dickstein
- 1205.1828 #1 · backfill · confidence 0.70 Jascha Sohl-Dickstein
- 1204.2916 #2 · backfill · confidence 0.70 Jascha Sohl-Dickstein
- 1001.1027 #1 · backfill · confidence 0.70 Jascha Sohl-Dickstein
- 0906.4779 #1 · backfill · confidence 0.70 Jascha Sohl-Dickstein
Frequent Coauthors
- Surya Ganguli 9 shared papers
- Ben Poole 8 shared papers
- Luke Metz 6 shared papers
- Niru Maheswaranathan 6 shared papers
- Samuel S. Schoenholz 6 shared papers
- Jeffrey Pennington 5 shared papers
- Maithra Raghu 4 shared papers
- Michael R. DeWeese 4 shared papers
- Yasaman Bahri 4 shared papers
- Jaehoon Lee 3 shared papers
- Justin Gilmer 3 shared papers
- Roman Novak 3 shared papers
- Bruno A. Olshausen 2 shared papers
- Christopher Hillar 2 shared papers
- Daniel Levy 2 shared papers
- David Sussillo 2 shared papers
- Diederik P. Kingma 2 shared papers
- George Tucker 2 shared papers
- Henry Sleight 2 shared papers
- Jason Yosinski 2 shared papers