Revisiting the Platonic Representation Hypothesis: An Aristotelian View

Fabian Gr\"oger; Maria Brbi\'c; Shuo Wen

arxiv: 2602.14486 · v2 · pith:FYWH4DCOnew · submitted 2026-02-16 · 💻 cs.LG · cs.AI· cs.CV· cs.NE

Revisiting the Platonic Representation Hypothesis: An Aristotelian View

Fabian Gr\"oger , Shuo Wen , Maria Brbi\'c This is my paper

classification 💻 cs.LG cs.AIcs.CVcs.NE

keywords hypothesisrepresentationsimilaritylocalplatonicrepresentationalaristoteliancalibration

0 comments

read the original abstract

The Platonic Representation Hypothesis suggests that representations from neural networks are converging to a common statistical model of reality. We show that the existing metrics used to measure representational similarity are confounded by network scale: increasing model depth or width can systematically inflate representational similarity scores. To correct these effects, we introduce a permutation-based null-calibration framework that transforms any representational similarity metric into a calibrated score with statistical guarantees. We revisit the Platonic Representation Hypothesis with our calibration framework, which reveals a nuanced picture: the apparent convergence reported by global spectral measures largely disappears after calibration, while local neighborhood similarity, but not local distances, retains significant agreement across different modalities. Based on these findings, we propose the Aristotelian Representation Hypothesis: representations in neural networks are converging to shared local neighborhood relationships.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 10 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Unifying Framework for Concept-Based Representational Similarity
cs.LG 2026-06 unverdicted novelty 7.0

A unifying framework decomposes concept alignment into instance-wise and distributional translation and concept consistency, introduces the InterVenchA benchmark, and shows that joint optimization via CoSAE recovers s...
Convergence Without Understanding: When Language Models Agree on Representations but Disagree on Reasoning
cs.CL 2026-05 unverdicted novelty 7.0

Representational convergence across 16 LLMs on 800 reasoning problems is stronger for failed tasks and pre-decision stages but shows minimal causal influence on predictions, pointing to shared processing constraints o...
Capability $\neq$ Interpretability: Human Interpretability of Vision Foundation Models
cs.CV 2026-05 conditional novelty 7.0

Foundation models yield less human-interpretable features than supervised vision transformers, with interpretability tied to activation locality and coarse semantic alignment rather than task performance.
Better Together: Evaluating the Complementarity of Earth Embedding Models
cs.CV 2026-05 unverdicted novelty 7.0

Fusing embeddings from four Earth models (AlphaEarth, Tessera, GeoCLIP, SatCLIP) outperforms the best single model on four of six tasks, with gains depending on task and location.
Characterizing Universal Object Representations Across Vision Models
cs.CV 2026-05 unverdicted novelty 7.0

Vision models converge on universal object dimensions that are semantically interpretable and align more closely with biological vision than model-specific ones.
Geometry-Aware CLIP Retrieval via Local Cross-Modal Alignment and Steering
cs.CV 2026-04 unverdicted novelty 7.0

Neighborhood re-ranking via Hungarian matching and query-conditioned local steering improve CLIP retrieval on attribute-binding and compositional tasks by addressing local geometric inconsistencies.
Back into Plato's Cave: Examining Cross-modal Representational Convergence at Scale
cs.CV 2026-04 unverdicted novelty 6.0

Evidence for cross-modal representational convergence weakens substantially at scale and in realistic many-to-many settings, indicating models learn rich but distinct representations.
Order Is Not Control
cs.LG 2026-06 unverdicted novelty 5.0

Order is distinct from control, where control is defined as a local receiver-gated response law demonstrated across biological circuits and LLM response panels with reported prediction accuracies of 72-84%.
A quantitative analysis of semantic information in deep representations of text and images
cs.CL 2025-05 unverdicted novelty 5.0

Semantic information in deep representations is distributed across many tokens and concentrated in specific layers, with directed predictability strongest in middle layers for text and varying by modality and language.
There Will Be a Scientific Theory of Deep Learning
stat.ML 2026-04 unverdicted novelty 2.0

A mechanics of the learning process is emerging in deep learning theory, characterized by dynamics, coarse statistics, and falsifiable predictions across idealized settings, limits, laws, hyperparameters, and universa...