AMR dynamically routes audio (W2V-BERT 2.0) and face (IResNet-18) embeddings via adapters and a KL-supervised router, reaching 99.07% average accuracy on POLY-SIM 2026 protocols and beating the FOP baseline by 32.73%.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
AMR: Adaptive Modality Routing for Multimodal Polyglot Speaker Identification
AMR dynamically routes audio (W2V-BERT 2.0) and face (IResNet-18) embeddings via adapters and a KL-supervised router, reaching 99.07% average accuracy on POLY-SIM 2026 protocols and beating the FOP baseline by 32.73%.