DiffAnon introduces the first diffusion model for voice anonymization that supplies structured, continuous, inference-time control over prosody preservation via classifier-free guidance on RVQ semantic embeddings.
Available: https://arxiv.org/abs/2601.11846
2 Pith papers cite this work. Polarity classification is still indexing.
Fields: eess.AS (2)
Years: 2026 (2)
Verdicts: UNVERDICTED (2)
Representative citing papers:
SRD provides a threshold-independent, representation-level privacy assessment for voice anonymization that reveals system weaknesses not detected by equal error rate evaluation.
citing papers explorer
- DiffAnon: Diffusion-based Prosody Control for Voice Anonymization
  DiffAnon introduces the first diffusion model for voice anonymization that supplies structured, continuous, inference-time control over prosody preservation via classifier-free guidance on RVQ semantic embeddings.
- Evaluating voice anonymisation using similarity rank disclosure
  SRD provides a threshold-independent, representation-level privacy assessment for voice anonymization that reveals system weaknesses not detected by equal error rate evaluation.