
arxiv: 2504.20197 · v1 · pith:IKJQSSTHnew · submitted 2025-04-28 · 💻 cs.LG · cond-mat.dis-nn · cs.AI

Representation Learning on a Random Lattice

classification 💻 cs.LG · cond-mat.dis-nn · cs.AI
keywords features · learned · data · distribution · lattice · model · random · adopt

Decomposing a deep neural network's learned representations into interpretable features could greatly enhance its safety and reliability. To better understand features, we adopt a geometric perspective, viewing them as a learned coordinate system for mapping an embedded data distribution. We motivate a model of a generic data distribution as a random lattice and analyze its properties using percolation theory. Learned features are categorized into context, component, and surface features. The model is qualitatively consistent with recent findings in mechanistic interpretability and suggests directions for future research.
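The abstract does not spell out the lattice construction, so the following is only a generic illustration of the kind of question percolation theory answers, not the paper's model. It is a minimal sketch assuming site percolation on a two-dimensional square lattice with nearest-neighbour connectivity: each site is open independently with probability p, and the function (a hypothetical helper named `largest_cluster_fraction`) reports the share of all sites that belong to the largest connected cluster of open sites.

```python
import random
from collections import deque


def largest_cluster_fraction(n, p, seed=0):
    """Fraction of the n*n sites that lie in the largest open cluster.

    Assumption for illustration: site percolation on a square lattice,
    where each site is open independently with probability p.
    """
    rng = random.Random(seed)
    open_site = [[rng.random() < p for _ in range(n)] for _ in range(n)]
    seen = [[False] * n for _ in range(n)]
    best = 0
    for i in range(n):
        for j in range(n):
            if not open_site[i][j] or seen[i][j]:
                continue
            # Breadth-first search over nearest-neighbour open sites.
            size = 0
            queue = deque([(i, j)])
            seen[i][j] = True
            while queue:
                x, y = queue.popleft()
                size += 1
                for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    u, v = x + dx, y + dy
                    if (0 <= u < n and 0 <= v < n
                            and open_site[u][v] and not seen[u][v]):
                        seen[u][v] = True
                        queue.append((u, v))
            best = max(best, size)
    return best / (n * n)


if __name__ == "__main__":
    # Sweep the occupation probability and watch the largest cluster grow.
    for p in (0.3, 0.5, 0.6, 0.7, 0.8):
        print(p, round(largest_cluster_fraction(60, p), 3))
```

On a square lattice the site-percolation threshold is roughly p ≈ 0.593, so in a sweep like the one above the largest cluster would be expected to stay small well below that value and to span a large share of the lattice well above it. How this maps onto embedded data distributions and learned features is the subject of the paper itself.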

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Critical Percolation as a Synthetic Data Model for Interpretability

    cs.LG 2026-06 unverdicted novelty 6.0

    Critical percolation clusters embedded in high dimensions, combined with taxonomic latent variables, form an analytically tractable synthetic data model whose ground-truth hierarchy can be linearly decoded from networ...