Human perception of audio deepfakes

Nicolas M · 2022 · arXiv 2466.355653

3 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

extension 1

citation-polarity summary

extend 1

representative citing papers

Eroding Trust in Real Speech: A Large-Scale Study of Human Audio Deepfake Perception

cs.SD · 2026-05-21 · unverdicted · novelty 6.0

Large-scale listening study of 35,532 judgments finds human accuracy on real audio fell from 72.7% to 64.1% since 2021 while fake detection remained stable, indicating a skepticism shift toward genuine speech.

APEX: Audio Prototype EXplanations for Classification Tasks

cs.SD · 2026-05-11 · unverdicted · novelty 6.0

APEX generates four types of prototype-based explanations for pre-trained audio classifiers that preserve output invariance and target acoustic properties better than gradient methods applied to spectrograms.

MeloDISinger: Melody-Aware & Duration-Preserving Singing Voice Editing with Audio Infilling

eess.AS · 2026-06-29 · unverdicted · novelty 4.0

Proposes MeloDISinger, a flow-matching SVE model with MeloDRP for melody-aware duration-preserving editing and audio infilling, claiming SOTA results.

citing papers explorer

Showing 3 of 3 citing papers.

Eroding Trust in Real Speech: A Large-Scale Study of Human Audio Deepfake Perception

cs.SD · 2026-05-21 · unverdicted · none · ref 22

Large-scale listening study of 35,532 judgments finds human accuracy on real audio fell from 72.7% to 64.1% since 2021 while fake detection remained stable, indicating a skepticism shift toward genuine speech.

APEX: Audio Prototype EXplanations for Classification Tasks

cs.SD · 2026-05-11 · unverdicted · none · ref 6

APEX generates four types of prototype-based explanations for pre-trained audio classifiers that preserve output invariance and target acoustic properties better than gradient methods applied to spectrograms.

MeloDISinger: Melody-Aware & Duration-Preserving Singing Voice Editing with Audio Infilling

eess.AS · 2026-06-29 · unverdicted · none · ref 25

Proposes MeloDISinger, a flow-matching SVE model with MeloDRP for melody-aware duration-preserving editing and audio infilling, claiming SOTA results.
