Nsanku benchmark shows current LLMs achieve only modest zero-shot translation scores on 43 Ghanaian languages, with no model reaching both high average performance and high cross-language consistency.
Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages
5 Pith papers cite this work, alongside 64 external citations. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 5roles
background 2representative citing papers
MUDIDI introduces a two-stage LLM pipeline for multilingual dictionary digitization, releases a human-annotated dataset from 30 dictionaries, and shows LLMs outperforming OCR and VLMs on character recognition, markup, and entry segmentation.
Biaffine LSTM outperforms transformer parsers like AfroXLMR and RemBERT in low-resource dependency parsing, with transformers gaining advantage as data increases and morphological complexity as a secondary predictor.
Ethnographic study of feminist civic-tech data work argues reparative AI dataset production requires resetting accountability ties to center those harmed by current practices.
AI integration in newsrooms drives internal deferral of judgment to LLMs and external shifts of power to platforms, making fairness, accountability, and transparency harder to sustain unless participatory mechanisms redistribute authority.
citing papers explorer
-
MUDIDI: A Two-Stage Framework for Multilingual Dictionary Digitization with Language Models
MUDIDI introduces a two-stage LLM pipeline for multilingual dictionary digitization, releases a human-annotated dataset from 30 dictionaries, and shows LLMs outperforming OCR and VLMs on character recognition, markup, and entry segmentation.
-
Dependency Parsing Across the Resource Spectrum: Evaluating Architectures on High and Low-Resource Languages
Biaffine LSTM outperforms transformer parsers like AfroXLMR and RemBERT in low-resource dependency parsing, with transformers gaining advantage as data increases and morphological complexity as a secondary predictor.
-
Can Data Work be Reparative?
Ethnographic study of feminist civic-tech data work argues reparative AI dataset production requires resetting accountability ties to center those harmed by current practices.
-
FAccT-Checked: A Narrative Review of Authority Reconfigurations and Retention in AI-Mediated Journalism
AI integration in newsrooms drives internal deferral of judgment to LLMs and external shifts of power to platforms, making fairness, accountability, and transparency harder to sustain unless participatory mechanisms redistribute authority.