Predicting Neural Network Accuracy from Weights

Daniel Keysers; Ilya Tolstikhin; Olivier Bousquet; Sylvain Gelly; Thomas Unterthiner

arxiv: 2002.11448 · v4 · pith:ZYSVPC5Fnew · submitted 2020-02-26 · 📊 stat.ML · cs.LG

Thomas Unterthiner, Daniel Keysers, Sylvain Gelly, Olivier Bousquet, Ilya Tolstikhin

keywords: neural, accuracy, different, network, networks, trained, weights, able

We show experimentally that the accuracy of a trained neural network can be predicted surprisingly well by looking only at its weights, without evaluating it on input data. We motivate this task and introduce a formal setting for it. Even when using simple statistics of the weights, the predictors are able to rank neural networks by their performance with very high accuracy (R² score above 0.98). Furthermore, the predictors are able to rank networks trained on different, unobserved datasets and with different architectures. We release a collection of 120k convolutional neural networks trained on four different datasets to encourage further research in this area, with the goal of understanding network training and performance better.
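To make the idea of predicting accuracy from "simple statistics of the weights" concrete, here is a minimal sketch. It is an illustration only, not the authors' pipeline: the abstract does not say which statistics or which predictor are used, so the per-layer mean, variance, and percentiles, the least-squares linear model, and the function names `weight_statistics`, `fit_linear_predictor`, and `predict` are all assumptions, and the demo data is synthetic.

```python
import numpy as np


def weight_statistics(layers):
    """Flatten each layer and collect simple summary statistics.

    Assumed features per layer (hypothetical choice): mean, variance,
    and the 0th/25th/50th/75th/100th percentiles.
    """
    features = []
    for w in layers:
        flat = np.asarray(w, dtype=np.float64).ravel()
        features.extend([flat.mean(), flat.var()])
        features.extend(np.percentile(flat, [0, 25, 50, 75, 100]))
    return np.array(features)


def fit_linear_predictor(features, accuracies):
    """Least-squares linear fit (with bias) from features to accuracy.

    A linear model is an assumption made for brevity here.
    """
    X = np.column_stack([features, np.ones(len(features))])
    coef, *_ = np.linalg.lstsq(X, accuracies, rcond=None)
    return coef


def predict(coef, features):
    """Apply the fitted linear predictor to new feature rows."""
    X = np.column_stack([features, np.ones(len(features))])
    return X @ coef


if __name__ == "__main__":
    # Synthetic stand-in for a collection of trained networks:
    # each "network" is a list of two random weight arrays.
    rng = np.random.default_rng(0)
    nets = [
        [rng.normal(scale=s, size=(8, 8)), rng.normal(scale=s, size=(8,))]
        for s in rng.uniform(0.1, 1.0, size=50)
    ]
    feats = np.stack([weight_statistics(net) for net in nets])
    # Made-up "accuracies", used only so the example runs end to end.
    accs = rng.uniform(0.1, 0.9, size=len(nets))
    coef = fit_linear_predictor(feats[:40], accs[:40])
    print(predict(coef, feats[40:]).shape)
```

The point of the sketch is that the predictor never sees input data: its only inputs are numbers derived from the weights themselves.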

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Beyond Structural Symmetries: Linear Mode Connectivity via Neuron Identifiability
cs.LG 2026-06 unverdicted novelty 7.0

Neural networks admit large families of approximately equivalent solutions via neuron identifiability even without structural symmetry, enabling linear low-loss merging paths without prior alignment.

ModelLens: Finding the Best for Your Task from Myriads of Models
cs.LG 2026-05 unverdicted novelty 6.0

ModelLens learns a performance-aware latent space from 1.62M leaderboard records to rank unseen models on unseen datasets without forward passes on the target.

Evaluation without Generation: Non-Generative Assessment of Harmful Model Specialization with Applications to CSAM
cs.LG 2026-04 unverdicted novelty 6.0

Gaussian probing infers harmful model specialization from parameter perturbations and internal representation responses to Gaussian latent ensembles rather than from generated outputs.

Dynamic Neural Graph Encoding of Inference Processes in Deep Weight Space
cs.LG 2026-07 unverdicted novelty 5.0

DNG-Encoder represents NN weights as dynamic graphs to preserve sequential inference and powers INR2JLS, which raises INR classification accuracy by ~10% on CIFAR-100-INR.

What Linear Probes Miss: Multi-View Probing for Weight-Space Learning
cs.LG 2026-05 unverdicted novelty 5.0

MVProbe is a multi-perspective probing framework for weight-space learning that combines first-order and Gram-based views and outperforms ProbeX on the Model Jungle benchmark.

Towards Learning Representations of Policies in Two-Player Zero-Sum Imperfect-Information Games
cs.LG 2026-07 unverdicted novelty 4.0

Basic dataset creation, embedding learning, and evaluation tasks on Kuhn and Leduc Poker demonstrate that useful behavioral representations appear in the learned embeddings.