Representation biases: will we achieve com- plete understanding by analyzing represen- tations?arXiv preprint arXiv:2507.22216, 2025

Andrew Kyle Lampinen, Stephanie CY Chan, Yuxuan Li, Katherine Hermann · 2025 · arXiv 2507.22216

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

A framework for analyzing concept representations in neural models

cs.CL · 2026-05-02 · unverdicted · novelty 7.0

A new framework shows concept subspaces are not unique, estimator choice affects containment and disentanglement, LEACE works well but generalizes poorly, and HuBERT encodes phone info as contained and disentangled from speaker info while speaker info resists compact containment.

The Homogenization Problem in LLMs: Towards Meaningful Diversity in AI Safety

cs.AI · 2026-01-03 · unverdicted · novelty 6.0 · 2 refs

Proposes a value-encoding framework to characterize and counter homogenization in LLMs by formalizing it via normativity from queer theory and introducing xeno-reproduction tasks from feminist theory, illustrated with a gender-bias experiment on Claude 3.5 Haiku.

Stimulus symmetries can confound representational similarity analyses

q-bio.NC · 2026-05-20 · unverdicted · novelty 5.0

Stimulus symmetries render many neural representations functionally equivalent yet produce qualitatively different RSMs, including drifting ones from SGD or regularization in image-encoding networks.

citing papers explorer

Showing 3 of 3 citing papers.

A framework for analyzing concept representations in neural models cs.CL · 2026-05-02 · unverdicted · none · ref 77
A new framework shows concept subspaces are not unique, estimator choice affects containment and disentanglement, LEACE works well but generalizes poorly, and HuBERT encodes phone info as contained and disentangled from speaker info while speaker info resists compact containment.
The Homogenization Problem in LLMs: Towards Meaningful Diversity in AI Safety cs.AI · 2026-01-03 · unverdicted · none · ref 76 · 2 links
Proposes a value-encoding framework to characterize and counter homogenization in LLMs by formalizing it via normativity from queer theory and introducing xeno-reproduction tasks from feminist theory, illustrated with a gender-bias experiment on Claude 3.5 Haiku.
Stimulus symmetries can confound representational similarity analyses q-bio.NC · 2026-05-20 · unverdicted · none · ref 31
Stimulus symmetries render many neural representations functionally equivalent yet produce qualitatively different RSMs, including drifting ones from SGD or regularization in image-encoding networks.

Representation biases: will we achieve com- plete understanding by analyzing represen- tations?arXiv preprint arXiv:2507.22216, 2025

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer