pith. sign in

arxiv: 2604.16875 · v2 · submitted 2026-04-18 · 💻 cs.LG · q-bio.NC

Untrained CNNs Match Backpropagation at V1: A Systematic RSA Comparison of Four Learning Rules Against Human fMRI

Pith reviewed 2026-05-10 06:35 UTC · model grok-4.3

classification 💻 cs.LG q-bio.NC
keywords convolutional neural networksrepresentational similarity analysisvisual cortexlearning rulesbackpropagationfMRIuntrained modelsTHINGS dataset
0
0 comments X

The pith

Untrained CNNs align better with human V1 than backpropagation-trained models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper compares convolutional networks trained under four different learning rules against human fMRI data from the THINGS dataset using representational similarity analysis across visual areas. It shows that a network with random untrained weights produces higher similarity to V1 and V2 activity than backpropagation, feedback alignment, predictive coding, or STDP. This indicates that the convolutional architecture itself, not the training procedure, primarily determines early visual alignment. Learning rules produce distinguishable results mainly at intermediate areas such as LOC, while all conditions converge at the highest level of the hierarchy.

Core claim

At V1/V2 the untrained random-weights baseline exceeds backpropagation in RSA correlation with human fMRI (rho = 0.076 vs. rho = 0.034), STDP reaches the highest alignment among trained rules, only backpropagation reliably exceeds baseline at LOC, and all five conditions (including untrained) show statistically indistinguishable alignment at IT.

What carries the argument

Representational Similarity Analysis (RSA) applied to identical CNN architectures under four learning rules plus an untrained baseline, measured against fMRI responses in V1, V2, LOC, and IT.

Load-bearing premise

Representational similarity analysis on the THINGS-fMRI dataset with 720 stimuli provides a reliable and unbiased measure of alignment between model representations and human visual cortex activity.

What would settle it

Retraining the same architectures on a different or larger image set and obtaining significantly higher RSA correlations for backpropagation than for the untrained baseline at V1 would falsify the architecture-driven claim.

Figures

Figures reproduced from arXiv: 2604.16875 by Nils Leutenegger.

Figure 1
Figure 1. Figure 1: Model architecture and layer-to-ROI mapping. [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Training dynamics. Loss (left) and accuracy (right) over 40 epochs on CIFAR-10 (8,000 samples) for all [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Brain alignment across ROIs (main result). Spearman [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Hierarchical gradient of brain alignment. For [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Pairwise condition comparisons (permutation test). Horizontal bars show [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Subject-level RSA scores per learning rule and ROI (sub-01: blue, sub-02: orange, sub-03: green). Sub-03 [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: RSA scores (mean ± std across 5 random seeds, 224 × 224 input). Error bars reflect variability across initializations. At V1/V2, between-rule differences are large relative to within-rule seed variability, confirming that rankings are stable across initializations. At IT, seed variability overlaps across rules, consistent with the convergence finding. are dissociated throughout the hierarchy in our small￾s… view at source ↗
Figure 8
Figure 8. Figure 8: Partial RSA controlling for pixel similarity. Left: standard RSA scores. Right: partial Spearman [PITH_FULL_IMAGE:figures/full_fig_p009_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Conv1 filter analysis for all five conditions. (a) [PITH_FULL_IMAGE:figures/full_fig_p009_9.png] view at source ↗
read the original abstract

A central question in computational neuroscience is whether the learning rule used to train a neural network determines how well its internal representations align with those of the human visual cortex. We present a systematic comparison of four learning rules (backpropagation (BP), feedback alignment (FA), predictive coding (PC), and spike-timing-dependent plasticity (STDP)) applied to identical convolutional architectures and evaluated against human fMRI data from the THINGS-fMRI dataset (720 stimuli, 3 subjects) using Representational Similarity Analysis (RSA). All models process stimuli at 224 x 224 resolution; results are averaged across 5 random seeds. Crucially, we include an untrained random-weights baseline that reveals the dominant role of architecture. At V1/V2, the untrained baseline exceeds backpropagation (rho = 0.076 vs. rho = 0.034; Delta-rho = +0.044, p < 0.001), and STDP achieves the highest V1 alignment among trained rules (rho = 0.064). At LOC, only BP reliably exceeds the random baseline (rho = 0.012 vs. -0.005, p < 0.001). At IT, all five conditions converge (rho = 0.008-0.014) with no significant pairwise differences among trained rules (p > 0.05, FDR-corrected). FA consistently produces the lowest alignment at V1, V2, and LOC (rho = 0.012 at V1, below all other conditions). Partial RSA confirms all effects survive pixel-similarity control. Seed variability is small relative to between-rule differences at V1/V2. These results demonstrate that early visual alignment is architecture-driven, learning rules differentiate only at intermediate areas, and all rules converge at the highest levels of the hierarchy.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The paper claims that a systematic RSA comparison of four learning rules (BP, FA, PC, STDP) plus an untrained random-weights baseline on identical CNNs against THINGS-fMRI data (720 stimuli, 3 subjects) shows architecture dominates early visual alignment: at V1/V2 the untrained baseline exceeds BP (rho=0.076 vs 0.034, p<0.001), STDP is strongest among trained rules, only BP exceeds random at LOC, and all conditions converge at IT with no significant differences among trained rules.

Significance. If the results hold, the work is significant because it provides a controlled empirical demonstration that architectural inductive biases, rather than the choice of learning rule, primarily determine alignment with human V1/V2 representations. The inclusion of an untrained baseline, multiple alternative rules, seed averaging, and a partial RSA pixel-similarity control strengthens the design and yields a clear hierarchical pattern (differentiation at intermediate areas, convergence at IT) that can guide future modeling.

major comments (1)
  1. Results (V1/V2 comparison and Abstract): The central claim that untrained exceeds BP at V1/V2 (rho = 0.076 vs. rho = 0.034; Delta-rho = +0.044, p < 0.001) to conclude architecture-driven alignment rests on group-level brain RDMs from only 3 subjects. No noise-ceiling, split-half reliability, or inter-subject consistency metrics for the V1/V2 RDMs are reported, leaving open the possibility that the difference and its p-value reflect sampling variability or low SNR rather than a robust effect.
minor comments (2)
  1. Abstract: The statement that 'partial RSA confirms all effects survive pixel-similarity control' is underspecified; the exact control variable, how partial correlations are computed, and which effects survive should be stated more precisely.
  2. Methods: The correspondence between model layers and cortical areas (V1/V2, LOC, IT), the precise RSA implementation (distance metric, RDM construction from fMRI and activations), and stimulus preprocessing details for the 224x224 inputs are not fully elaborated, limiting reproducibility.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback and for emphasizing the need to demonstrate robustness of the V1/V2 results given the limited number of subjects. We address the major comment below and will revise the manuscript accordingly.

read point-by-point responses
  1. Referee: Results (V1/V2 comparison and Abstract): The central claim that untrained exceeds BP at V1/V2 (rho = 0.076 vs. rho = 0.034; Delta-rho = +0.044, p < 0.001) to conclude architecture-driven alignment rests on group-level brain RDMs from only 3 subjects. No noise-ceiling, split-half reliability, or inter-subject consistency metrics for the V1/V2 RDMs are reported, leaving open the possibility that the difference and its p-value reflect sampling variability or low SNR rather than a robust effect.

    Authors: We agree that reliability metrics are important to report when using group-level RDMs from a small number of subjects (here, the three subjects available in THINGS-fMRI). In the revised manuscript we will add split-half reliability (Spearman-Brown corrected correlation between RDMs computed on odd/even stimulus splits), noise-ceiling estimates (average correlation of each subject's RDM with the group RDM), and inter-subject consistency (average pairwise correlation between individual subject RDMs) for V1/V2 and the other ROIs. These will be presented in the Results section alongside the existing RSA values. We will also clarify that the reported p-values were obtained via stimulus-label permutation tests (10,000 permutations) and will include supplementary subject-level RSA analyses to show that the pattern of untrained > BP at V1/V2 holds in individual subjects where detectable. While the small sample size remains a limitation of the dataset, these additions will allow readers to evaluate whether the observed differences exceed what would be expected from sampling variability or low SNR. We do not expect the central conclusions to change. revision: yes

Circularity Check

0 steps flagged

No circularity: purely empirical RSA comparisons on external fMRI benchmark

full rationale

The paper reports direct empirical measurements of Spearman rho between model RDMs (from CNNs trained with BP, FA, PC, STDP or left untrained) and human brain RDMs from the THINGS-fMRI dataset. No equations, derivations, fitted parameters, or predictions are defined; the reported values (e.g., rho = 0.076 untrained vs. 0.034 BP at V1/V2) are computed outputs of standard RSA on fixed architectures and independent neural data. No self-citations, uniqueness theorems, or ansatzes are invoked to support the central claims. The analysis is self-contained against the external benchmark, with seed averaging and pixel-similarity controls providing independent checks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that RSA distance matrices from CNN layers are directly comparable to fMRI RDMs and that the chosen CNN architecture is a fair model of visual cortex processing.

axioms (1)
  • domain assumption Representational similarity analysis provides a valid proxy for comparing model and brain representations
    Invoked throughout the RSA comparisons in the abstract

pith-pipeline@v0.9.0 · 5641 in / 1280 out tokens · 37735 ms · 2026-05-10T06:35:16.787700+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Cross-Species RSA Reveals Conserved Early Visual Alignment but Divergent Higher-Area Rankings Across Human fMRI and Macaque Electrophysiology

    cs.LG 2026-05 unverdicted novelty 5.0

    Cross-species RSA finds conserved early visual cortex alignment across learning rules but divergent IT rankings, with model capacity and training data limiting higher-area matches more than the specific rule.

Reference graph

Works this paper leans on

14 extracted references · 14 canonical work pages · cited by 1 Pith paper

  1. [1]

    and Poo, M.-m

    Bi, G.-q. and Poo, M.-m. (1998). Synaptic modifications inculturedhippocampalneurons: dependenceonspike timing, synaptic strength, and postsynaptic cell type. Journal of Neuroscience, 18:10464–10472

  2. [2]

    Crick, F.(1989).Therecentexcitementaboutneuralnet- works.Nature, 337:129–132

  3. [3]

    Gifford, A. T. and Cichy, R. M. (2022). THINGS-fMRI: A large-scale dataset of fMRI brain responses to 22,248 images from the THINGS object concept database. NeuroImage, 264:119628

  4. [4]

    Corriveau, A., Van Wicklin, C., and Baker, C. I. 9 (2019). THINGS: A database of 1,854 object con- cepts and more than 26,000 naturalistic object images. PLOS ONE, 14:e0223792

  5. [5]

    Kriegeskorte, N., Mur, M., and Bandettini, P. (2008). Representational similarity analysis – connecting the branches of systems neuroscience.Frontiers in Systems Neuroscience, 2:4

  6. [6]

    P., Cownden, D., Tweed, D

    Lillicrap, T. P., Cownden, D., Tweed, D. B., and Aker- man, C. J. (2016). Random synaptic feedback weights support error backpropagation for deep learning.Na- ture Communications, 7:13276

  7. [7]

    P., Santoro, A., Marris, L., Akerman, C

    Lillicrap, T. P., Santoro, A., Marris, L., Akerman, C. J., andHinton, G.(2020).Backpropagationandthebrain. Nature Reviews Neuroscience, 21:335–346

  8. [8]

    Olshausen, B. A. and Field, D. J. (1996). Emergence of simple-cell receptive field properties by learning a sparse code for natural images.Nature, 381:607–609

  9. [9]

    Rao, R. P. N. and Ballard, D. H. (1999). Predictive cod- ing in the visual cortex: a functional interpretation of some extra-classical receptive-field effects.Nature Neuroscience, 2:79–87

  10. [10]

    M., Koh, P

    Saxe, A. M., Koh, P. W., Chen, Z., Bhand, M., Suresh, B., and Ng, A. Y. (2011). On random weights and unsupervised feature learning. InProceedings of the 28th International Conference on Machine Learning

  11. [11]

    DiCarlo, J. J. (2020). Brain-Score: Which artificial neural network for object recognition is most brain- like?bioRxiv

  12. [12]

    and Cusack, R

    Truzzi, A. and Cusack, R. (2025). Neural responses in early visual cortex are well predicted by random- weight CNNs.bioRxiv

  13. [13]

    Whittington, J. C. R. and Bogacz, R. (2017). An approx- imation of the error backpropagation algorithm in a predictive coding network with local Hebbian synap- tic plasticity.Neural Computation, 29:1229–1262

  14. [14]

    Yamins, D. L. K. and DiCarlo, J. J. (2016). Using goal- drivendeeplearningmodelstounderstandsensorycor- tex.Nature Neuroscience, 19:356–365. 10