pith. sign in

Representation biases: will we achieve com- plete understanding by analyzing represen- tations?arXiv preprint arXiv:2507.22216, 2025

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

years

2026 3

verdicts

UNVERDICTED 3

roles

background 1

polarities

background 1

representative citing papers

A framework for analyzing concept representations in neural models

cs.CL · 2026-05-02 · unverdicted · novelty 7.0

A new framework shows concept subspaces are not unique, estimator choice affects containment and disentanglement, LEACE works well but generalizes poorly, and HuBERT encodes phone info as contained and disentangled from speaker info while speaker info resists compact containment.

The Homogenization Problem in LLMs: Towards Meaningful Diversity in AI Safety

cs.AI · 2026-01-03 · unverdicted · novelty 6.0 · 2 refs

Proposes a value-encoding framework to characterize and counter homogenization in LLMs by formalizing it via normativity from queer theory and introducing xeno-reproduction tasks from feminist theory, illustrated with a gender-bias experiment on Claude 3.5 Haiku.

citing papers explorer

Showing 3 of 3 citing papers.

  • A framework for analyzing concept representations in neural models cs.CL · 2026-05-02 · unverdicted · none · ref 77

    A new framework shows concept subspaces are not unique, estimator choice affects containment and disentanglement, LEACE works well but generalizes poorly, and HuBERT encodes phone info as contained and disentangled from speaker info while speaker info resists compact containment.

  • The Homogenization Problem in LLMs: Towards Meaningful Diversity in AI Safety cs.AI · 2026-01-03 · unverdicted · none · ref 76 · 2 links

    Proposes a value-encoding framework to characterize and counter homogenization in LLMs by formalizing it via normativity from queer theory and introducing xeno-reproduction tasks from feminist theory, illustrated with a gender-bias experiment on Claude 3.5 Haiku.

  • Stimulus symmetries can confound representational similarity analyses q-bio.NC · 2026-05-20 · unverdicted · none · ref 31

    Stimulus symmetries render many neural representations functionally equivalent yet produce qualitatively different RSMs, including drifting ones from SGD or regularization in image-encoding networks.