Jascha Sohl-Dickstein — Pith Author Registry

Identifiers

name variant Jascha Sohl-Dickstein 0.60 · backfill

Papers (48)

AI Organizations are More Effective but Less Aligned than Individual Agents cs.AI · 2026 · author #8
The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity? cs.AI · 2026 · author #5
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models cs.CL · 2022 · author #182
Score-Based Generative Modeling through Stochastic Differential Equations cs.LG · 2020 · author #2
Using learned optimizers to make models robust to input noise cs.LG · 2019 · author #4
The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study cs.LG · 2019 · author #2
A Mean Field Theory of Batch Normalization cs.NE · 2019 · author #4
Eliminating all bad Local Minima from Loss Landscapes without even adding an Extra Unit stat.ML · 2019 · author #1
Measuring the Effects of Data Parallelism on Neural Network Training cs.LG · 2018 · author #4
Understanding and correcting pathologies in the training of learned optimizers cs.NE · 2018 · author #5
Guided evolutionary strategies: Augmenting random search with surrogate gradients cs.NE · 2018 · author #5
Stochastic natural gradient descent draws posterior samples in function space cs.LG · 2018 · author #5
PCA of high dimensional random walks with comparison to neural network training stat.ML · 2018 · author #2
Dynamical Isometry and a Mean Field Theory of CNNs: How to Train 10,000-Layer Vanilla Convolutional Neural Networks stat.ML · 2018 · author #3
Meta-Learning Update Rules for Unsupervised Representation Learning cs.LG · 2018 · author #4
Sensitivity and Generalization in Neural Networks: an Empirical Study stat.ML · 2018 · author #5
Generalizing Hamiltonian Monte Carlo with Neural Networks stat.ML · 2017 · author #3
Deep Neural Networks as Gaussian Processes stat.ML · 2017 · author #6
A Correspondence Between Random Neural Networks and Statistical Field Theory stat.ML · 2017 · author #3
SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability stat.ML · 2017 · author #4
REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models cs.LG · 2017 · author #5
Learned Optimizers that Scale and Generalize cs.LG · 2017 · author #7
Improved generator objectives for GANs cs.LG · 2016 · author #3
Capacity and Trainability in Recurrent Neural Networks stat.ML · 2016 · author #2
Input Switched Affine Networks: An RNN Architecture Designed for Interpretability cs.AI · 2016 · author #4
Survey of Expressivity in Deep Neural Networks stat.ML · 2016 · author #5
Unrolled Generative Adversarial Networks cs.LG · 2016 · author #4
Deep Information Propagation stat.ML · 2016 · author #4
Exponential expressivity in deep neural networks through transient chaos stat.ML · 2016 · author #4
On the Expressive Power of Deep Neural Networks stat.ML · 2016 · author #5
Density estimation using Real NVP cs.LG · 2016 · author #2
A universal tradeoff between power, precision and speed in physical communication cond-mat.stat-mech · 2016 · author #2
A Markov Jump Process for More Efficient Hamiltonian Monte Carlo stat.ML · 2015 · author #4
Deep Knowledge Tracing cs.AI · 2015 · author #7
Note on Equivalence Between Recurrent Neural Network Time Series Models and Variational Bayesian Models cs.LG · 2015 · author #1
Deep Unsupervised Learning using Nonequilibrium Thermodynamics cs.LG · 2015 · author #1
Hamiltonian Monte Carlo Without Detailed Balance stat.CO · 2014 · author #1
Analyzing noise in autoencoders and deep networks cs.NE · 2014 · author #2
Fast large-scale optimization by unifying stochastic gradient and quasi-Newton methods cs.LG · 2013 · author #1
Higher Order Correlations within Cortical Layers Dominate Functional Connectivity in Microcolumns q-bio.NC · 2013 · author #2
Minimum and maximum entropy distributions for binary systems with known means and pairwise correlations physics.bio-ph · 2012 · author #3
Efficient Methods for Unsupervised Learning of Probabilistic Models cs.LG · 2012 · author #1
Hamiltonian Monte Carlo with Reduced Momentum Flips physics.data-an · 2012 · author #1
Hamiltonian Annealed Importance Sampling for partition function estimation cs.LG · 2012 · author #1
The Natural Gradient by Analogy to Signal Whitening, and Recipes and Tricks for its Use cs.LG · 2012 · author #1
Efficient and optimal binary Hopfield associative memory storage using minimum probability flow nlin.AO · 2012 · author #2
An Unsupervised Algorithm For Learning Lie Group Transformations cs.CV · 2010 · author #1
Minimum Probability Flow Learning cs.LG · 2009 · author #1

Mentions

1301.0050 #2 · backfill · confidence 0.70 Jascha Sohl-Dickstein
1209.3744 #3 · backfill · confidence 0.70 Jascha Sohl-Dickstein
1205.4295 #1 · backfill · confidence 0.70 Jascha Sohl-Dickstein
1205.1939 #1 · backfill · confidence 0.70 Jascha Sohl-Dickstein
1205.1925 #1 · backfill · confidence 0.70 Jascha Sohl-Dickstein
1205.1828 #1 · backfill · confidence 0.70 Jascha Sohl-Dickstein
1204.2916 #2 · backfill · confidence 0.70 Jascha Sohl-Dickstein
1001.1027 #1 · backfill · confidence 0.70 Jascha Sohl-Dickstein
0906.4779 #1 · backfill · confidence 0.70 Jascha Sohl-Dickstein

Frequent Coauthors

Surya Ganguli 9 shared papers
Ben Poole 8 shared papers
Luke Metz 6 shared papers
Niru Maheswaranathan 6 shared papers
Samuel S. Schoenholz 6 shared papers
Jeffrey Pennington 5 shared papers
Maithra Raghu 4 shared papers
Michael R. DeWeese 4 shared papers
Yasaman Bahri 4 shared papers
Jaehoon Lee 3 shared papers
Justin Gilmer 3 shared papers
Roman Novak 3 shared papers
Bruno A. Olshausen 2 shared papers
Christopher Hillar 2 shared papers
Daniel Levy 2 shared papers
David Sussillo 2 shared papers
Diederik P. Kingma 2 shared papers
George Tucker 2 shared papers
Henry Sleight 2 shared papers
Jason Yosinski 2 shared papers