A survey that unifies prior code-switching research for LLMs into a taxonomy of data, modeling, and evaluation and distills it into actionable recommendations for practitioners.
Krutrim llm: A novel tokenization strategy for multilingual in- dic languages with petabyte-scale data processing.arXiv preprint arXiv:2407.12481,
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Code Mixologist : A Practitioner's Guide to Building Code-Mixed LLMs
A survey that unifies prior code-switching research for LLMs into a taxonomy of data, modeling, and evaluation and distills it into actionable recommendations for practitioners.