Musan: A music, speech, and noise corpus

David Snyder, Guoguo Chen, Daniel Povey, “Musan: A music, speech, noise corpus,” · 2015

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Hyperbolic Additive Margin Softmax with Hierarchical Information for Speaker Verification

cs.SD · 2026-01-27 · unverdicted · novelty 7.0

Hyperbolic Softmax and HAM-Softmax in hyperbolic space reduce equal error rates by 27.84% and 14.23% on average versus standard Softmax and AM-Softmax by modeling hierarchical speaker features.

XM-ALIGN: Unified Cross-Modal Embedding Alignment for Face-Voice Association

cs.SD · 2025-12-07 · unverdicted · novelty 3.0

XM-ALIGN improves face-voice association performance by jointly optimizing embeddings from separate encoders with MSE alignment loss and data augmentation on the MAV-Celeb dataset.

citing papers explorer

Showing 2 of 2 citing papers.

Hyperbolic Additive Margin Softmax with Hierarchical Information for Speaker Verification cs.SD · 2026-01-27 · unverdicted · none · ref 35
Hyperbolic Softmax and HAM-Softmax in hyperbolic space reduce equal error rates by 27.84% and 14.23% on average versus standard Softmax and AM-Softmax by modeling hierarchical speaker features.
XM-ALIGN: Unified Cross-Modal Embedding Alignment for Face-Voice Association cs.SD · 2025-12-07 · unverdicted · none · ref 16
XM-ALIGN improves face-voice association performance by jointly optimizing embeddings from separate encoders with MSE alignment loss and data augmentation on the MAV-Celeb dataset.

Musan: A music, speech, and noise corpus

fields

years

verdicts

representative citing papers

citing papers explorer