OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining

Liu, Yihong, Lin, Peiqin, Wang, Mingyang, Schuetze, Hinrich · 2024 · DOI 10.18653/v1/2024.findings-naacl.68

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

TokAlign++: Advancing Vocabulary Adaptation via Better Token Alignment

cs.CL · 2026-05-13 · unverdicted · novelty 7.0

TokAlign++ learns token alignments between LLM vocabularies from monolingual representations to enable faster adaptation, better text compression, and effective token-level distillation across 15 languages with minimal steps.

Defragmenting Language Models: An Interpretability-based Approach for Vocabulary Expansion

cs.CL · 2026-04-17 · unverdicted · novelty 7.0

Interpretability-based selection of vocabulary items plus FragMend initialization reduces token over-fragmentation and improves performance for non-Latin script languages by roughly 20 points over baselines.

citing papers explorer

Showing 2 of 2 citing papers.

TokAlign++: Advancing Vocabulary Adaptation via Better Token Alignment · cs.CL · 2026-05-13 · unverdicted · none · ref 103
Defragmenting Language Models: An Interpretability-based Approach for Vocabulary Expansion · cs.CL · 2026-04-17 · unverdicted · none · ref 4
