Understanding Cross-Language Transfer Improvements in Low-Resource HTR: The Role of Sequence Modeling

· 2026 · cs.CV · arXiv 2605.05900

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Handwritten Text Recognition (HTR) for Arabic-script languages benefits from cross-language joint training under low-resource conditions, particularly when using CRNN-based models that combine convolutional encoders with sequence modeling. However, it remains unclear whether these improvements are better explained by shared visual representations or sequence-level dependencies. In this work, we conduct a controlled architectural study of line-level Arabic-script HTR, comparing CNN-only models with CTC decoding and CRNN models under identical single-script and multi-script training regimes. Experiments are performed on Arabic (KHATT), Urdu (NUST-UHWR), and Persian (PHTD) datasets under low-resource settings (K in {100, 500, 1000}). Our results show a clear divergence in transfer behavior: while CNN-only models exhibit limited or unstable improvements, CRNN models achieve better performance under multi-script training, particularly in the most data-constrained regimes. Focusing on transfer improvements (delta CER) rather than absolute performance, we find that cross-language improvements are associated with sequence-level modeling, while sharing visual representations learned by the CNN encoder, corresponding to similarities in character shapes across scripts, alone appears to be insufficient. This finding suggests that contextual modeling plays an important role in enabling effective transfer in low-resource scenarios, and that similar behavior may extend to other low-resource language settings.

representative citing papers

Performance Gap Analysis between Latin and Arabic Scripts HTR

cs.CV · 2026-06-17 · conditional · novelty 6.0

Controlled experiments across nine datasets show Arabic HTR maintains a 5-7 CER gap over Latin even at full data scale, with Arabic requiring more samples due to heavier-tailed character distributions and more visually similar confusions.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Performance Gap Analysis between Latin and Arabic Scripts HTR cs.CV · 2026-06-17 · conditional · none · ref 3 · internal anchor
Controlled experiments across nine datasets show Arabic HTR maintains a 5-7 CER gap over Latin even at full data scale, with Arabic requiring more samples due to heavier-tailed character distributions and more visually similar confusions.

Understanding Cross-Language Transfer Improvements in Low-Resource HTR: The Role of Sequence Modeling

fields

years

verdicts

representative citing papers

citing papers explorer