Code-switching in text and speech challenges information-theoretic speaker design
Pith reviewed 2026-05-23 21:58 UTC · model grok-4.3
The pith
Code-switching to English occurs even when English is less predictable than equivalent Chinese phrases, rejecting purely speaker-driven accounts.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Low predictability in the primary language (Chinese) correlates with switches to the secondary language (English), yet the English material actually produced has lower predictability than meaning-equivalent Chinese alternatives; therefore the switches do not reduce production difficulty and the purely speaker-driven account of code-switching is rejected for both written and spoken data.
What carries the argument
Language-model perplexity on primary-language text, used to quantify predictability at potential switch points and to compare the actual secondary-language insertion against a meaning-matched primary-language continuation.
If this is right
- Code-switching in both writing and speech serves communicative goals beyond reducing speaker effort.
- Speakers may insert the secondary language to direct listener attention rather than to simplify their own output.
- The same rejection of a purely speaker-driven account holds for both online forum posts and spontaneous speech transcripts.
- Information-theoretic models of speaker design must incorporate listener-oriented or social functions to explain observed switching patterns.
Where Pith is reading between the lines
- Similar comparisons could be run on other language pairs to test whether the pattern is specific to Chinese-English bilinguals.
- Listener comprehension studies could check whether the harder-to-predict English insertions actually improve understanding or signal emphasis.
- The results leave open the possibility that code-switching frequency changes with audience expectations or conversational setting.
Load-bearing premise
Language-model perplexity on the primary language serves as a valid stand-in for the real-time production difficulty a bilingual speaker faces when choosing which language to use next.
What would settle it
A dataset of Chinese-English code-switches in which the English insertions show higher predictability than their meaning-equivalent Chinese alternatives would contradict the central observation.
Figures
read the original abstract
In this work, we use language modeling to investigate the factors that influence insertional code-switching. Code-switching occurs when a speaker alternates between one language variety (the primary language) and another (the secondary language), and is widely observed in multilingual contexts. Recent work has shown that code-switching is often correlated with areas of low predictability in the primary language, but it is unclear whether low primary language predictability only makes the secondary language relatively easier to produce at code-switching points - that is, purely speaker-driven code-switching - or whether code-switching is additionally used by speakers for other purposes, for instance to signal the need for greater attention on the part of listeners. In this paper, we use bilingual Chinese-English online forum posts and transcripts of spontaneous Chinese-English speech to replicate prior findings that low primary language (Chinese) predictability is correlated with insertional switches to the secondary language (English). We then demonstrate that the predictability of the English productions is even lower than that of meaning-equivalent Chinese alternatives, and these are therefore not easier to produce, rejecting the purely speaker-driven theory of code-switching in both writing and speech.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper uses language modeling on Chinese-English bilingual online forum posts and spontaneous speech transcripts to replicate that low primary-language (Chinese) predictability correlates with insertional switches to English. It then claims that the English productions exhibit even lower predictability than meaning-equivalent Chinese alternatives and are therefore not easier to produce, rejecting a purely speaker-driven information-theoretic account of code-switching in both text and speech.
Significance. If the cross-lingual predictability comparison holds after proper calibration, the result would meaningfully challenge speaker-centric accounts by showing that switches do not occur at points of relative production ease. The replication of the primary-language predictability correlation plus the use of meaning-equivalent alternatives across both written and spoken modalities constitute a clear empirical contribution via new corpus measurements.
major comments (2)
- [Abstract and Results] Abstract and central results: the directional claim that English productions have lower predictability than meaning-equivalent Chinese alternatives (and are therefore not easier) rests on direct comparison of perplexities from separate monolingual LMs. No bits-per-character normalization, joint multilingual modeling, or other calibration for differing baseline entropies and tokenization is described, so the inequality does not follow from the reported numbers. This comparison is load-bearing for the rejection of the speaker-driven theory.
- [Methods] Methods and analysis sections: details on LM training (architecture, data, tokenization), the procedure for constructing and aligning meaning-equivalent Chinese alternatives, and any statistical controls or significance tests on the perplexity differences are not supplied. Without these, the robustness of the key negative result cannot be evaluated.
minor comments (2)
- [Abstract] The abstract and introduction could more explicitly flag the cross-lingual comparability issue and how it is addressed (or why separate monolingual perplexities suffice).
- Notation for predictability/perplexity should be defined once and used consistently when switching between Chinese and English contexts.
Simulated Author's Rebuttal
We thank the referee for the thoughtful and constructive report. The two major comments identify important gaps in the presentation of our cross-lingual predictability comparison and in the methodological documentation. We address each point below and commit to a major revision that supplies the missing details and strengthens the empirical claim.
read point-by-point responses
-
Referee: [Abstract and Results] Abstract and central results: the directional claim that English productions have lower predictability than meaning-equivalent Chinese alternatives (and are therefore not easier) rests on direct comparison of perplexities from separate monolingual LMs. No bits-per-character normalization, joint multilingual modeling, or other calibration for differing baseline entropies and tokenization is described, so the inequality does not follow from the reported numbers. This comparison is load-bearing for the rejection of the speaker-driven theory.
Authors: We agree that the manuscript does not describe any cross-lingual calibration and that this omission weakens the central negative result. In the revision we will (1) report bits-per-character normalized perplexities for both languages, (2) train and evaluate a single multilingual model on the combined data to provide a joint baseline, and (3) include an explicit discussion of why the directional inequality survives these controls. If the calibrated comparison no longer supports the claim, we will qualify or retract the rejection of the purely speaker-driven account. We therefore treat this as a required methodological correction rather than a minor clarification. revision: yes
-
Referee: [Methods] Methods and analysis sections: details on LM training (architecture, data, tokenization), the procedure for constructing and aligning meaning-equivalent Chinese alternatives, and any statistical controls or significance tests on the perplexity differences are not supplied. Without these, the robustness of the key negative result cannot be evaluated.
Authors: We acknowledge that the current Methods section is insufficiently detailed. The revised manuscript will add: (a) full LM specifications (architecture, training corpora, tokenizers, and hyper-parameters), (b) the exact procedure used to generate and align meaning-equivalent Chinese alternatives (including any translation or parallel-corpus resources), and (c) statistical tests (paired Wilcoxon signed-rank tests with effect sizes and confidence intervals) on the perplexity differences. These additions will make the negative result reproducible and allow readers to assess its robustness directly. revision: yes
Circularity Check
No circularity: empirical perplexity measurements on new corpora are independent of inputs
full rationale
The paper's chain consists of (1) replicating a correlation between low Chinese predictability and English inserts using language-model perplexity on bilingual forum posts and speech transcripts, then (2) comparing perplexity of observed English tokens against meaning-equivalent Chinese alternatives. Both steps are direct computations on held-out data; no parameter is fitted to a subset and then relabeled as a prediction, no equation reduces to its own definition, and no load-bearing premise rests on a self-citation whose content is itself unverified. The central rejection of speaker-driven code-switching therefore follows from external measurements rather than from any self-referential construction.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Language-model perplexity on the primary language accurately reflects the production difficulty experienced by bilingual speakers.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We then demonstrate that the predictability of the English productions is even lower than that of meaning-equivalent Chinese alternatives
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
surprisal(wi) = −log2P(wi|wi−1, ..., wi−t)
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in ":" * " " * FUNCTION f...
-
[2]
author Ansaldo, A.I. , author Marcotte, K. , author Scherer, L. , author Raboyeau, G. , year 2008 . title Language therapy and bilingual aphasia: Clinical implications of psycholinguistic and neuroimaging research . journal Journal of Neurolinguistics volume 21 , pages 539--557 . https://www.sciencedirect.com/science/article/pii/S0911604408000146, :https:...
-
[3]
author Beatty-Martínez, A.L. , author Navarro-Torres, C.A. , author Dussias, P.E. , year 2020 . title Codeswitching: A bilingual toolkit for opportunistic speech planning . journal Frontiers in Psychology volume 11 . https://www.frontiersin.org/article/10.3389/fpsyg.2020.01699, :10.3389/fpsyg.2020.01699
-
[4]
author Bell, A. , year 1984 . title Language style as audience design . journal Language in Society volume 13 , pages 145--204 . http://www.jstor.org/stable/4167516
-
[5]
author Broersma, M. , year 2009 . title Triggered codeswitching between cognate languages . journal Bilingualism: Language and Cognition volume 12 , pages 447--462 . :10.1017/S1366728909990204
-
[6]
author Calvillo, J. , author Fang, L. , author Cole, J. , author Reitter, D. , year 2020 . title Surprisal predicts code-switching in C hinese- E nglish bilingual text , in: booktitle Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) , publisher Association for Computational Linguistics , address Online . pp. p...
-
[7]
author Canavan, A. , author Graff, D. , author Zipperlen, G. , year 1997 . title CALLHOME A merican E nglish S peech
work page 1997
-
[8]
author Consortium, L.D. , year 2018 . title HUB 5 M andarin T elephone S peech and T ranscripts S econd E dition LDC 2018 S 18
work page 2018
-
[9]
author Dahl, K.L. , year 2009 . title Audience design and code-switching in Bayside, Texas . Ph.D. thesis. The University of Texas at Austin
work page 2009
-
[10]
author D'Amico, S. , author Devescovi, A. , author Bates, E. , year 2001 . title Picture naming and lexical access in italian children and adults . journal Journal of Cognition and Development - J COGN DEV volume 2 . :10.1207/S15327647JCD0201_4
-
[11]
author Dornic, S. , year 1978 . title The Bilingual's Performance: Language Dominance, Stress, and Individual Differences . publisher Springer US , address Boston, MA . chapter chapter null . pp. pages 259--271 . https://doi.org/10.1007/978-1-4615-9077-4_23, :10.1007/978-1-4615-9077-4_23
-
[12]
author Forster, K.I. , author Chambers, S.M. , year 1973 . title Lexical access and naming time . journal Journal of Verbal Learning and Verbal Behavior volume 12 , pages 627--635 . https://www.sciencedirect.com/science/article/pii/S0022537173800428, :https://doi.org/10.1016/S0022-5371(73)80042-8
-
[13]
author Gardner-Chloros, P. , year 2009 . title Code-switching . publisher Cambridge University Press
work page 2009
-
[14]
author Hassan, A.A. , year 2019 . title English- A rabic C ode- S witching of the A rabic L anguage S peakers in I nstant M essaging: M otivations and S tructure . journal Cairo Studies in English volume 2019 , pages 39--59 . :10.21608/cse.2019.66656
-
[15]
author Hofweber, J. , author Marinis, T. , year 2023 . title What sentence repetition tasks can reveal about the processing effort associated with different types of code-switching . journal Languages volume 8 . https://www.mdpi.com/2226-471X/8/1/70, :10.3390/languages8010070
-
[16]
author Isbister, T. , author Carlsson, F. , author Sahlgren, M. , year 2021 . title Should we stop training more monolingual models, and simply use machine translation instead? , in: booktitle Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa) , publisher Link \"o ping University Electronic Press, Sweden , address Reykjavik,...
work page 2021
-
[17]
author Joshi, A.K. , year 1982 . title Processing of sentences with intra-sentential code-switching , in: booktitle C oling 1982: Proceedings of the N inth I nternational C onference on C omputational L inguistics , p. pages null . https://aclanthology.org/C82-1023
work page 1982
-
[18]
author Liu, H. , author Liu, Z. , author Yuan, M. , author Chen, T. , year 2023 . title The effect of cognitive load on code-switching . journal International Journal of Bilingualism volume 0 , pages 13670069231170142 . https://doi.org/10.1177/13670069231170142, :10.1177/13670069231170142, http://arxiv.org/abs/https://doi.org/10.1177/13670069231170142 arX...
-
[19]
author Liu, K. , year 2021 . title Language choices as audience design strategies in C hinese multilingual speakers’ W echat posts . journal Global Media and China volume 6 , pages 391--415 . https://doi.org/10.1177/20594364211035201, :10.1177/20594364211035201
-
[20]
author Lyu, D.C. , author Tan, T.P. , author Chng, E. , author Li, H. , year 2010 . title Mandarin– E nglish code-switching speech corpus in S outh- E ast A sia: SEAME , in: booktitle Language Resources and Evaluation , pp. pages 1986--1989 . :10.1007/s10579-015-9303-x
-
[21]
author Meisel, J.M. , year 1994 . title Code-switching in young bilingual children: The acquisition of grammatical constraints . journal Studies in Second Language Acquisition volume 16 , pages 413--439 . http://www.jstor.org/stable/44487780
-
[22]
author Misra, K. , year 2022 . title minicons: Enabling flexible behavioral and representational analyses of transformer language models . journal arXiv preprint arXiv:2203.13112
-
[23]
author Muysken, P. , year 2000 . title Bilingual speech: a typology of code-mixing . publisher Cambridge University Press
work page 2000
-
[24]
author Myers-Scotton, C. , year 1993 . title Duelling Languages . publisher Oxford: Clarendon Press
work page 1993
-
[25]
author Mysl \'i n, M. , author Levy, R.P. , year 2015 . title Codeswitching and predictability of meaning in discourse . journal Language volume 91 , pages 871 -- 905
work page 2015
-
[26]
author Owens, J. , year 2005 . title Bare forms and lexical insertions in code-switching: A processing-based account . journal Bilingualism: Language and Cognition volume 8 , pages 23--38
work page 2005
-
[27]
author Poplack, S. , year 1980 . title Sometimes I 'll start a sentence in S panish Y TERMINO EN ESPAÑOL : toward a typology of code-switching . journal Linguistics volume 18 , pages 581--618 . :10.1515/ling.1980.18.7-8.581
-
[28]
author Radford, A. , author Wu, J. , author Child, R. , author Luan, D. , author Amodei, D. , author Sutskever, I. , year 2019 . title Language models are unsupervised multitask learners . journal null
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.