pith. machine review for the scientific record. sign in

arxiv: 1904.03365 · v2 · submitted 2019-04-06 · 💻 cs.CR · cs.LG· cs.SD· eess.AS

Recognition: unknown

ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems

Authors on Pith no claims yet
classification 💻 cs.CR cs.LGcs.SDeess.AS
keywords databasesystemsvoicerecordingscommandsconditionscontainsdifferent
0
0 comments X
read the original abstract

This paper introduces a new database of voice recordings with the goal of supporting research on vulnerabilities and protection of voice-controlled systems (VCSs). In contrast to prior efforts, the proposed database contains both genuine voice commands and replayed recordings of such commands, collected in realistic VCSs usage scenarios and using modern voice assistant development kits. Specifically, the database contains recordings from four systems (each with a different microphone array) in a variety of environmental conditions with different forms of background noise and relative positions between speaker and device. To the best of our knowledge, this is the first publicly available database that has been specifically designed for the protection of state-of-the-art voice-controlled systems against various replay attacks in various conditions and environments.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Alethia: A Foundational Encoder for Voice Deepfakes

    cs.SD 2026-04 unverdicted novelty 6.0

    Alethia is a pretrained audio encoder using continuous embedding prediction and generative flow-matching reconstruction that outperforms existing speech foundation models on voice deepfake tasks with better robustness...