Injecting undetectable backdoors in obfuscated neural networks and language models.Advances in Neural Information Processing Systems, 37:21537–21571, 2024

Alkis Kalavasis, Amin Karbasi, Argyris Oikonomou, Katerina Sotiraki, Grigoris Velegkas, Manolis Zampetakis · 2024

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Backdoor Channels Hidden in Latent Space: Cryptographic Undetectability in Modern Neural Networks

cs.CR · 2026-05-13 · unverdicted · novelty 7.0

Backdoors can be realized as statistically natural latent directions in modern neural networks, achieving high attack success with negligible clean accuracy loss and resisting existing defenses.

Undetectable Backdoors in Model Parameters: Hiding Sparse Secrets in High Dimensions

cs.CR · 2026-05-05 · unverdicted · novelty 7.0

Sparse Backdoor plants a provably undetectable backdoor in neural network weights via structured sparse perturbations and isotropic Gaussian dithering, with detection hardness reduced to Sparse PCA.

citing papers explorer

Showing 2 of 2 citing papers.

Backdoor Channels Hidden in Latent Space: Cryptographic Undetectability in Modern Neural Networks cs.CR · 2026-05-13 · unverdicted · none · ref 18
Backdoors can be realized as statistically natural latent directions in modern neural networks, achieving high attack success with negligible clean accuracy loss and resisting existing defenses.
Undetectable Backdoors in Model Parameters: Hiding Sparse Secrets in High Dimensions cs.CR · 2026-05-05 · unverdicted · none · ref 25
Sparse Backdoor plants a provably undetectable backdoor in neural network weights via structured sparse perturbations and isotropic Gaussian dithering, with detection hardness reduced to Sparse PCA.

Injecting undetectable backdoors in obfuscated neural networks and language models.Advances in Neural Information Processing Systems, 37:21537–21571, 2024

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer