Functional dependencies average 1.71 tokens and stay invariant across languages; lexical dependencies average 2.87 tokens and vary with typology, with the pattern holding in SUD.
The Grammar Does the Work: Functional vs. Lexical Dependency Length Minimization Across Universal Dependencies
abstract
Dependency length minimization (DLM) is a well-documented processing universal, but previous studies report a single mean dependency distance (MDD) per language, obscuring variation across syntactic relation types. We analyze 122 languages in UD and SUD (version 2.17), showing that DLM operates on two distinct levels. Grammar-driven optimization targets functional dependencies (det, case, aux), which are universally short (mean 1.71, $\sigma$ = 0.33) and invariant across typologically diverse languages. Processing-driven optimization operates on lexical dependencies (nsubj, obj, obl), which are longer (mean 2.87), highly variable ($\sigma$ = 0.63), and constrained by word-order typology. This asymmetry holds in SUD despite reversed head direction (r = 0.92). We conclude that "the grammar does the work" of minimization by scaffolding sentences with local functional attachments, leaving processing pressures to determine the ordering of lexical heads.
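The per-relation-type measurement described in the abstract can be illustrated with a minimal sketch. It assumes CoNLL-U input, measures dependency length as the absolute difference between a token's ID and its head's ID, and uses only the relations named in the abstract (det, case, aux; nsubj, obj, obl). The paper's full relation sets, its treatment of relation subtypes, and any filtering it applies are not stated here, so the function name `mean_dependency_distance`, the two sets, and the sample sentence are illustrative assumptions rather than the authors' pipeline.

```python
from collections import defaultdict

# Assumption: only the relations named in the abstract; the paper's sets may be larger.
FUNCTIONAL = {"det", "case", "aux"}
LEXICAL = {"nsubj", "obj", "obl"}


def mean_dependency_distance(conllu_text):
    """Return the mean |token id - head id| per relation class for a CoNLL-U string."""
    lengths = defaultdict(list)
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) < 8:
            continue
        tok_id, head, deprel = cols[0], cols[6], cols[7]
        # Skip multiword-token ranges ("1-2") and empty nodes ("3.1").
        if not tok_id.isdigit() or not head.isdigit():
            continue
        if head == "0":
            continue  # the root has no head, so no dependency length
        base = deprel.split(":")[0]  # assumption: subtypes fold into the base relation
        if base in FUNCTIONAL:
            cls = "functional"
        elif base in LEXICAL:
            cls = "lexical"
        else:
            continue
        lengths[cls].append(abs(int(tok_id) - int(head)))
    return {cls: sum(v) / len(v) for cls, v in lengths.items()}


# Hand-annotated toy sentence (not taken from any treebank).
SAMPLE = "\n".join([
    "# text = The dog chased the cat in the garden",
    "1\tThe\tthe\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tdog\tdog\tNOUN\t_\t_\t3\tnsubj\t_\t_",
    "3\tchased\tchase\tVERB\t_\t_\t0\troot\t_\t_",
    "4\tthe\tthe\tDET\t_\t_\t5\tdet\t_\t_",
    "5\tcat\tcat\tNOUN\t_\t_\t3\tobj\t_\t_",
    "6\tin\tin\tADP\t_\t_\t8\tcase\t_\t_",
    "7\tthe\tthe\tDET\t_\t_\t8\tdet\t_\t_",
    "8\tgarden\tgarden\tNOUN\t_\t_\t3\tobl\t_\t_",
])

result = mean_dependency_distance(SAMPLE)
print(result)
```

On this toy sentence the functional dependencies have lengths 1, 1, 2, 1 and the lexical ones 1, 2, 5, so the functional mean is lower than the lexical mean, matching the direction of the asymmetry the abstract reports. A single sentence says nothing about the cross-linguistic figures themselves.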
fields: cs.CL
years: 2026