and McVicar, Matt and Battenberg, Eric and Nieto, Oriol , title =

Brian McFee, Colin Raffel, Dawen Liang, Daniel P W Ellis, Matt McVicar, Eric Battenberg, Oriol Nieto · 2015 · DOI 10.25080/majora-7b98e3ed-003

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

open at publisher browse 6 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

A multimodal dataset of photoplethysmography and continuous behavioral responses to ASMR and nature videos

cs.LG · 2026-05-30 · unverdicted · novelty 8.0

Introduces REST-ASMR multimodal dataset of PPG, stimuli, and continuous annotations for ASMR research, validated with 97% responder rate, significant agreement, PPG deceleration, and BiLSTM achieving 75.51% frame-level accuracy under strict subject-video independent 4-fold CV.

DirectorBench: Diagnosing Long-Form Video Generation with Personalized Multi-Agent Evaluation

cs.CL · 2026-05-28 · unverdicted · novelty 7.0

DirectorBench is a profile-aware diagnostic benchmark that localizes bottlenecks in long-form video generation workflows using structured checkpoints and multi-agent evaluation.

Voice "Cloning" is Style Transfer

cs.SD · 2026-05-15 · conditional · novelty 6.0 · 2 refs

Voice cloning applies systematic style transfer rather than faithful replication, producing voices rated higher on authority and trust with reduced variance in accent and rate.

Communicating Sound Through Natural Language

cs.LG · 2026-05-09 · unverdicted · novelty 6.0

Lexical acoustic coding lets LLMs transmit audio waveforms as editable natural-language sentences that another LLM can parse and reconstruct into sound.

Improving Automatic Speech Recognition for Speakers Treated for Oral Cancer using Data Augmentation and LLM Error Correction

eess.AS · 2026-05-15 · conditional · novelty 5.0

TTS data augmentation and LLM error correction together cut relative WER by 40-50% on ASR models for oral cancer speech.

Hardware-Software Co-Design of Scalable, Energy-Efficient Analog Recurrent Computations

cs.AR · 2026-05-12 · unverdicted · novelty 5.0 · 2 refs

BMRUs enable analog recurrent neural network hardware via discrete outputs that suppress noise 20-fold, with one-to-one parameter-to-circuit mapping and linear power scaling for recurrence.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Voice "Cloning" is Style Transfer cs.SD · 2026-05-15 · conditional · none · ref 62 · 2 links
Voice cloning applies systematic style transfer rather than faithful replication, producing voices rated higher on authority and trust with reduced variance in accent and rate.
Improving Automatic Speech Recognition for Speakers Treated for Oral Cancer using Data Augmentation and LLM Error Correction eess.AS · 2026-05-15 · conditional · none · ref 49
TTS data augmentation and LLM error correction together cut relative WER by 40-50% on ASR models for oral cancer speech.

and McVicar, Matt and Battenberg, Eric and Nieto, Oriol , title =

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer