How can we effectively expand the vocabulary of LLMs with 0.01 GB of target language text? Computational Linguistics, pp. 1–40, 11 2025.

Atsuki Yamaguchi, Aline Villavicencio, Nikolaos Aletras · 2026 · DOI 10.1162/coli.a.581

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

MultiHashFormer: Hash-based Generative Language Models

cs.CL · 2026-06-26 · unverdicted · novelty 7.0

MultiHashFormer enables hash-based autoregression in LMs by encoding tokens as multi-hash signatures, outperforming standard Transformers at 100M-3B scales while keeping parameter count constant for multilingual expansion.

Multilinguality of Large Language Models From a Structural Perspective

cs.CL · 2026-06-01 · unverdicted · novelty 6.0

Low-resource languages are structurally more different from English in LLMs than high- or mid-resource ones, and language-specific post-training alters structures while preserving inter-language relationships.

Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates

cs.CL · 2025-12-04 · conditional · novelty 6.0

SSU mitigates catastrophic forgetting in low-resource LLM target-language adaptation by scoring and column-wise freezing source-critical parameters, reducing source degradation to ~3% versus ~20% for full fine-tuning while matching target performance.

citing papers explorer

MultiHashFormer: Hash-based Generative Language Models · cs.CL · 2026-06-26 · unverdicted · none · ref 55
Multilinguality of Large Language Models From a Structural Perspective · cs.CL · 2026-06-01 · unverdicted · none · ref 39
