YoNER supplies a multi-domain Yoruba NER corpus of 5k sentences plus OyoBERT, showing African-centric models beat multilingual baselines in-domain while cross-domain performance drops sharply for blogs and movies.
Our results show that for both OyoBERT and AfroXLMR-large, the ORG entity type consis- tently has the lowest F1 score
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
ACCEPT 1representative citing papers
citing papers explorer
-
YoNER: A New Yor\`ub\'a Multi-domain Named Entity Recognition Dataset
YoNER supplies a multi-domain Yoruba NER corpus of 5k sentences plus OyoBERT, showing African-centric models beat multilingual baselines in-domain while cross-domain performance drops sharply for blogs and movies.