Corpus Arièja: Building an Annotated Corpus with Variation in Occitan
2 Pith papers cite this work. Polarity classification is still indexing.
Citation-role summary: dataset (1)
Fields: cs.CL (2)
Years: 2026 (2)
Roles: dataset (1)
Polarities: use dataset (1)
Representative citing papers
- From Traditional Taggers to LLMs: A Comparative Study of POS Tagging for Medieval Romance Languages
  LLM-based POS tagging outperforms traditional taggers on medieval Occitan, Catalan, and French, with fine-tuning and cross-lingual transfer providing the largest gains for under-resourced varieties.
- Lost in Translation? Exploring the Shift in Grammatical Gender from Latin to Occitan