Generative or Discriminative? Revisiting Text Classification in the Era of Transformers

Kasa, Siva Rajesh, Gupta, Karan, Roychowdhury, Sumegh, Kumar, Ashutosh, Biruduraju, Yaswanth, Kasa, Santhosh Kumar · 2025 · DOI 10.18653/v1/2025.emnlp-main.486

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

LLM Safety From Within: Detecting Harmful Content with Internal Representations

cs.AI · 2026-04-20 · unverdicted · novelty 6.0

SIREN identifies safety neurons via linear probing on internal LLM layers and combines them with adaptive weighting to detect harm, outperforming prior guard models with 250x fewer parameters.

citing papers explorer

Showing 1 of 1 citing paper.

LLM Safety From Within: Detecting Harmful Content with Internal Representations cs.AI · 2026-04-20 · unverdicted · none · ref 47
SIREN identifies safety neurons via linear probing on internal LLM layers and combines them with adaptive weighting to detect harm, outperforming prior guard models with 250x fewer parameters.

Generative or Discriminative? Revisiting Text Classification in the Era of Transformers

fields

years

verdicts

representative citing papers

citing papers explorer