YoNER supplies a multi-domain Yoruba NER corpus of 5k sentences plus OyoBERT, showing African-centric models beat multilingual baselines in-domain while cross-domain performance drops sharply for blogs and movies.
Three of these domains contain little to no ORG entities due to their nature, and none include DA TE entities
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
ACCEPT 1representative citing papers
citing papers explorer
-
YoNER: A New Yor\`ub\'a Multi-domain Named Entity Recognition Dataset
YoNER supplies a multi-domain Yoruba NER corpus of 5k sentences plus OyoBERT, showing African-centric models beat multilingual baselines in-domain while cross-domain performance drops sharply for blogs and movies.