A Large-Scale Per-Speaker Analysis of Re-identification Risk in Speech Anonymization

Emmanuel Vincent; Mickael Rouvier; Orane Dufour; Paul Magron

arxiv: 2606.07210 · v1 · pith:D5QXQB7Dnew · submitted 2026-06-05 · 💻 cs.SD · cs.CR

A Large-Scale Per-Speaker Analysis of Re-identification Risk in Speech Anonymization

Orane Dufour , Paul Magron , Mickael Rouvier , Emmanuel Vincent This is my paper

classification 💻 cs.SD cs.CR

keywords acrossanonymizationattackerre-identificationspeechanalysisanonymizerevaluated

0 comments

read the original abstract

Speech anonymization is commonly evaluated using averagecase metrics such as the equal error rate, which can hide large disparities in re-identification risks across individuals. In this paper, we conduct a large-scale per-speaker privacy analysis using a linkability-based metric under a worst-case scenario. Nearly 5,000 speakers are evaluated across multiple anonymization systems, attacker architectures, and conversation lengths. While linkability scores are highly polarized at the speaker level, the sets of easy to re-identify and hard to re-identify speakers vary substantially across configurations. We show that no single factor explains speaker vulnerability. Instead, the re-identification risk emerges from the interaction between the attacker, the anonymizer, and the amount of available speech. These results challenge the notion of intrinsic speaker-level privacy risks and emphasize the need for evaluation protocols that are explicitly conditioned on the attacker and anonymizer.

This paper has not been read by Pith yet.

A Large-Scale Per-Speaker Analysis of Re-identification Risk in Speech Anonymization

discussion (0)