Introduces forensic similarity for speech deepfakes via a Siamese feature extractor and similarity network to verify shared forensic traces and source models between audio segments.
The LJ Speech Dataset
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.SD 2verdicts
UNVERDICTED 2representative citing papers
MOS-Bench benchmark shows that existing SSQA models struggle with out-of-domain generalization and that training on multiple diverse datasets improves robustness.
citing papers explorer
-
Forensic Similarity for Speech Deepfakes
Introduces forensic similarity for speech deepfakes via a Siamese feature extractor and similarity network to verify shared forensic traces and source models between audio segments.
-
MOS-Bench: Benchmarking Generalization Abilities of Subjective Speech Quality Assessment Models
MOS-Bench benchmark shows that existing SSQA models struggle with out-of-domain generalization and that training on multiple diverse datasets improves robustness.