Language ID in the Wild: Unexpected Challenges on the Path to a Thousand-Language Web Text Corpus
2 Pith papers cite this work. Polarity classification is still indexing.
Fields: cs.CL (2)
Years: 2026 (2)
Verdicts: UNVERDICTED (2)
citing papers explorer
- Dango: A Strictly L1-Only Large Language Model for Studying Second Language Acquisition
Introduces Dango, a 1.8B strictly L1-only LLM using corpus filtering and lesson fine-tuning to simulate Japanese-to-English SLA and produce human-like L2 output patterns.
- Are you speaking my languages? On spoken language adherence in multimodal LLMs
Defines language adherence failures in multimodal ASR LLMs and compares soft prompting, SFT, and CoT strategies for reducing violations across languages.