Are Modern Speech Enhancement Systems Vulnerable to Adversarial Attacks?

2025 · eess.AS · arXiv 2509.21087

1 Pith paper cites this work.

abstract

Machine learning approaches for speech enhancement are becoming increasingly expressive, enabling ever more powerful modifications of input signals. In this paper, we demonstrate that this expressiveness introduces a vulnerability: advanced speech enhancement models can be susceptible to adversarial attacks. Specifically, we show that adversarial noise, carefully crafted and psychoacoustically masked by the original input, can be injected such that the enhanced speech output conveys an entirely different semantic meaning. We experimentally verify that contemporary predictive speech enhancement models can indeed be manipulated in this way. Furthermore, we highlight that diffusion models with stochastic samplers exhibit inherent robustness to such adversarial attacks by design.

representative citing papers

Are Modern Speech Enhancement Systems Vulnerable to Adversarial Attacks?

eess.AS · 2025-09-25 · unverdicted · novelty 7.0

Predictive speech enhancement models can be manipulated by psychoacoustically masked adversarial noise to alter output semantics, while diffusion models exhibit inherent robustness.

