Introduces REST-ASMR multimodal dataset of PPG, stimuli, and continuous annotations for ASMR research, validated with 97% responder rate, significant agreement, PPG deceleration, and BiLSTM achieving 75.51% frame-level accuracy under strict subject-video independent 4-fold CV.
and McVicar, Matt and Battenberg, Eric and Nieto, Oriol , title =
6 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 6roles
background 1polarities
background 1representative citing papers
DirectorBench is a profile-aware diagnostic benchmark that localizes bottlenecks in long-form video generation workflows using structured checkpoints and multi-agent evaluation.
Voice cloning applies systematic style transfer rather than faithful replication, producing voices rated higher on authority and trust with reduced variance in accent and rate.
Lexical acoustic coding lets LLMs transmit audio waveforms as editable natural-language sentences that another LLM can parse and reconstruct into sound.
TTS data augmentation and LLM error correction together cut relative WER by 40-50% on ASR models for oral cancer speech.
BMRUs enable analog recurrent neural network hardware via discrete outputs that suppress noise 20-fold, with one-to-one parameter-to-circuit mapping and linear power scaling for recurrence.
citing papers explorer
-
Voice "Cloning" is Style Transfer
Voice cloning applies systematic style transfer rather than faithful replication, producing voices rated higher on authority and trust with reduced variance in accent and rate.
-
Improving Automatic Speech Recognition for Speakers Treated for Oral Cancer using Data Augmentation and LLM Error Correction
TTS data augmentation and LLM error correction together cut relative WER by 40-50% on ASR models for oral cancer speech.