Transformer Circuits Thread2(2023)

Bricken, T · 2023

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Contrastive Semantic Projection: Faithful Neuron Labeling with Contrastive Examples

cs.CV · 2026-04-24 · unverdicted · novelty 6.0

Using contrastive examples with vision-language models and a new CLIP-based scoring method called CSP produces more faithful and granular neuron labels than prior activation-only approaches.

In your own words: computationally identifying interpretable themes in free-text survey data

cs.CY · 2026-03-27 · unverdicted · novelty 6.0

A computational framework identifies more coherent themes in free-text survey data on race, gender, and sexual orientation than previous methods, with applications for survey design, explaining variation, and detecting identity discordance.

citing papers explorer

Showing 2 of 2 citing papers.

Contrastive Semantic Projection: Faithful Neuron Labeling with Contrastive Examples
cs.CV · 2026-04-24 · unverdicted · none · ref 4
In your own words: computationally identifying interpretable themes in free-text survey data
cs.CY · 2026-03-27 · unverdicted · none · ref 104
