pith. sign in

Enhancing neural network interpretability with feature-aligned sparse autoencoders

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

fields

cs.LG 2 cs.CE 1

years

2026 3

verdicts

UNVERDICTED 3

representative citing papers

Improving Sparse Autoencoder with Dynamic Attention

cs.LG · 2026-04-16 · unverdicted · novelty 7.0

A cross-attention SAE with sparsemax attention achieves lower reconstruction loss and higher-quality concepts than fixed-sparsity baselines by making activation counts data-dependent.

citing papers explorer

Showing 3 of 3 citing papers.