Controlled experiments across nine datasets show Arabic HTR maintains a 5-7 CER gap over Latin even at full data scale, with Arabic requiring more samples due to heavier-tailed character distributions and more visually similar confusions.
Understanding Cross-Language Transfer Improvements in Low-Resource HTR: The Role of Sequence Modeling
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
Handwritten Text Recognition (HTR) for Arabic-script languages benefits from cross-language joint training under low-resource conditions, particularly when using CRNN-based models that combine convolutional encoders with sequence modeling. However, it remains unclear whether these improvements are better explained by shared visual representations or sequence-level dependencies. In this work, we conduct a controlled architectural study of line-level Arabic-script HTR, comparing CNN-only models with CTC decoding and CRNN models under identical single-script and multi-script training regimes. Experiments are performed on Arabic (KHATT), Urdu (NUST-UHWR), and Persian (PHTD) datasets under low-resource settings (K in {100, 500, 1000}). Our results show a clear divergence in transfer behavior: while CNN-only models exhibit limited or unstable improvements, CRNN models achieve better performance under multi-script training, particularly in the most data-constrained regimes. Focusing on transfer improvements (delta CER) rather than absolute performance, we find that cross-language improvements are associated with sequence-level modeling, while sharing visual representations learned by the CNN encoder, corresponding to similarities in character shapes across scripts, alone appears to be insufficient. This finding suggests that contextual modeling plays an important role in enabling effective transfer in low-resource scenarios, and that similar behavior may extend to other low-resource language settings.
fields
cs.CV 1years
2026 1verdicts
CONDITIONAL 1representative citing papers
citing papers explorer
-
Performance Gap Analysis between Latin and Arabic Scripts HTR
Controlled experiments across nine datasets show Arabic HTR maintains a 5-7 CER gap over Latin even at full data scale, with Arabic requiring more samples due to heavier-tailed character distributions and more visually similar confusions.