No voice agent tops 0.5 on both accuracy and experience
EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents
EVA-Bench shows 12 systems diverge sharply on reliability and accent-noise robustness
Sound
Covers all aspects of computing with sound, and sound as an information channel. Includes models of sound, analysis and synthesis, audio user interfaces, sonification of data, computer music, and sound signal processing. Includes ACM Subject Class H.5.5, and intersects with H.1.2, H.5.1, H.5.2, I.2.7, I.5.4, I.6.3, J.5, K.4.2.
EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents
EVA-Bench shows 12 systems diverge sharply on reliability and accent-noise robustness
PHALAR: Phasors for Learned Musical Audio Representations
Learned pooling and complex processing enforce musical equivariances for faster, lighter stem matching across three datasets.
full image
PHALAR: Phasors for Learned Musical Audio Representations
Contrastive model using learned spectral pooling and complex head sets new benchmarks on stem retrieval while capturing beat and chord info.
full image
PHALAR: Phasors for Learned Musical Audio Representations
A contrastive model adds pitch and phase equivariance through spectral pooling and complex heads, improving musical stem matching and zero-
full image