Title resolution pending

Sanhanat Sivapiromrat, Caiqi Zhang, Marco Basaldella, Nigel Collier · 2025 · arXiv 2507.11112

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Shared Latent Structures Enable Unified Backdoor Detection and Mitigation in LLMs

cs.AI · 2026-06-06 · unverdicted · novelty 6.0

Sparse autoencoders identify shared latent features across diverse backdoor attacks in LLMs that enable unified detection via classifiers, causal control via steering, and mitigation via ablation fine-tuning.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Shared Latent Structures Enable Unified Backdoor Detection and Mitigation in LLMs cs.AI · 2026-06-06 · unverdicted · none · ref 83
Sparse autoencoders identify shared latent features across diverse backdoor attacks in LLMs that enable unified detection via classifiers, causal control via steering, and mitigation via ablation fine-tuning.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer