A fused self-supervised encoder and learned DP decoder for word alignment outperforms MFA on English datasets and generalizes to unseen languages.
The IFA corpus: A phonemically segmented Dutch open source speech database,
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Presents an end-to-end differentiable neural model for phoneme forced alignment that claims to outperform prior methods on English benchmarks and generalize to unseen languages.
citing papers explorer
-
Multilingual Word-Level Forced Alignment with Self-Supervised Representations and Learned Dynamic Programming
A fused self-supervised encoder and learned DP decoder for word alignment outperforms MFA on English datasets and generalizes to unseen languages.