BioUNER is a gold-standard Urdu biomedical NER dataset of 153K tokens with 0.78 inter-annotator agreement, created from health texts and tested on SVM, LSTM, mBERT, and XLM-RoBERTa models.
UmrinderPal Singh, Vishal Goyal, and Gurpreet Singh Lehal
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
BioUNER: A Benchmark Dataset for Clinical Urdu Named Entity Recognition
BioUNER is a gold-standard Urdu biomedical NER dataset of 153K tokens with 0.78 inter-annotator agreement, created from health texts and tested on SVM, LSTM, mBERT, and XLM-RoBERTa models.