UnityShots uses fixed LTM and STM memory slots with boundary-conditioned gating and speaker tokens to achieve coherent multi-shot audio-video generation, leading open-source baselines on cross-shot coherence metrics.
Beat this! accurate beat tracking without dbn postprocessing
4 Pith papers cite this work. Polarity classification is still indexing.
4
Pith papers citing it
citation-role summary
method 1
citation-polarity summary
years
2026 4roles
method 1polarities
use method 1representative citing papers
Rubato model with InterMo representation outperforms cascade methods in generating timestamped piano sheet music from audio, even when cascades receive ground-truth MIDI.
State-of-the-art beat trackers fail on SMC tracks via octave errors, continuity errors, and complete failures, with the DBN's 55 BPM minimum forcing double-tempo predictions on 21% of slow tracks.
The paper presents an LLM-based system architecture that fuses speech transcription, gesture recognition, and beat detection to generate and execute robotic action sequences.
citing papers explorer
No citing papers match the current filters.