pith. sign in

Feature visualization

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

roles

background 1

polarities

background 1

representative citing papers

Toy Models of Superposition

cs.LG · 2022-09-21 · accept · novelty 8.0

Toy models demonstrate that polysemanticity arises when neural networks store more sparse features than neurons via superposition, producing a phase transition tied to polytope geometry and increased adversarial vulnerability.

From Mechanistic to Compositional Interpretability

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

Compositional interpretability defines explanations as commuting syntactic-semantic mapping pairs grounded in compositionality and minimum description length, with compressive refinement and a parsimony theorem guaranteeing concise human-aligned decompositions.

Feature Visualization Recovers Known Cortical Selectivity from TRIBE v2

q-bio.NC · 2026-05-13 · unverdicted · novelty 6.0

Feature visualization on TRIBE v2 brain encoders recovers the known ventral visual hierarchy from V1 to V4 and produces distinctive patterns for MT, FFA, and PPA, with optimized stimuli driving ~4x higher activation than natural images.

citing papers explorer

Showing 6 of 6 citing papers.

  • Toy Models of Superposition cs.LG · 2022-09-21 · accept · none · ref 10

    Toy models demonstrate that polysemanticity arises when neural networks store more sparse features than neurons via superposition, producing a phase transition tied to polytope geometry and increased adversarial vulnerability.

  • From Mechanistic to Compositional Interpretability cs.LG · 2026-05-09 · unverdicted · none · ref 203

    Compositional interpretability defines explanations as commuting syntactic-semantic mapping pairs grounded in compositionality and minimum description length, with compressive refinement and a parsimony theorem guaranteeing concise human-aligned decompositions.

  • Feature Visualization Recovers Known Cortical Selectivity from TRIBE v2 q-bio.NC · 2026-05-13 · unverdicted · none · ref 14

    Feature visualization on TRIBE v2 brain encoders recovers the known ventral visual hierarchy from V1 to V4 and produces distinctive patterns for MT, FFA, and PPA, with optimized stimuli driving ~4x higher activation than natural images.

  • HOLE: Homological Observation of Latent Embeddings for Neural Network Interpretability cs.LG · 2025-12-08 · unverdicted · none · ref 61

    HOLE applies persistent homology to latent embeddings in neural networks and uses visualizations such as cluster flow diagrams to reveal patterns of class separation, feature disentanglement, and robustness.

  • NeuroViz: Real-time Interactive Visualization of Forward and Backward Passes in Neural Network Training cs.LG · 2026-05-03 · unverdicted · none · ref 34

    NeuroViz offers interactive real-time visualization of neural network forward and backward passes, achieving top usability scores in a study with 31 participants compared to existing tools.

  • Open Problems in Mechanistic Interpretability cs.LG · 2025-01-27 · unverdicted · none · ref 9

    A review paper that organizes conceptual, practical, and socio-technical open problems in mechanistic interpretability.