Goodfire Research

Gorton, Liv, Wang, Nicholas, Nguyen, Nam, Deng, Myra, Ho, Eric, Balsam, Daniel · DOI 10.5281/zenodo.14895891

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

VFUSE: Virulent Feature Understanding with Sparse autoEncoders

cs.LG · 2026-06-08 · unverdicted · novelty 7.0

VFUSE applies sparse autoencoders to diffusion-transformer activations in RoseTTAFold3 and RFDiffusion3 to find monosemantic features that detect hazardous protein designs with AUROC up to 0.84.

citing papers explorer

VFUSE: Virulent Feature Understanding with Sparse autoEncoders

cs.LG · 2026-06-08 · unverdicted · none · ref 40