pith. sign in

arxiv: 1904.00784 · v3 · pith:U4PYTEASnew · submitted 2019-03-25 · 💻 cs.CL · cs.LG· stat.ML

A Survey of Code-switched Speech and Language Processing

classification 💻 cs.CL cs.LGstat.ML
keywords languageprocessingcode-switchedspeechcode-switchingcommunitiesmultilingualsurvey
0
0 comments X
read the original abstract

Code-switching, the alternation of languages within a conversation or utterance, is a common communicative phenomenon that occurs in multilingual communities across the world. This survey reviews computational approaches for code-switched Speech and Natural Language Processing. We motivate why processing code-switched text and speech is essential for building intelligent agents and systems that interact with users in multilingual communities. As code-switching data and resources are scarce, we list what is available in various code-switched language pairs with the language processing tasks they can be used for. We review code-switching research in various Speech and NLP applications, including language processing tools and end-to-end systems. We conclude with future directions and open problems in the field.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.