Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks
Pith reviewed 2026-05-24 09:05 UTC · model grok-4.3
The pith
Unsupervised convolutional networks trained only on single spoken words begin generating concatenated two- and three-word sequences.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Trained exclusively on single-word acoustic data, the models generate novel outputs consisting of two or three concatenated words; networks trained on two words produce embeddings into unobserved combinations; the concatenated outputs contain precursors to compositionality. The authors formalize a neural mechanism called disinhibition that outlines a possible pathway toward concatenation and compositionality in both artificial and biological systems.
What carries the argument
Spontaneous concatenation, the emergence of multi-word outputs from networks whose training data contained only isolated words.
If this is right
- Basic syntactic operations such as concatenation can emerge from raw acoustic input without any supervised multi-word examples.
- Networks learn to place words into novel combinations never observed during training.
- Concatenated outputs already exhibit precursors to compositionality.
- The disinhibition mechanism supplies a concrete artificial and biological pathway that can be used to generate testable predictions about spoken-language processing.
Where Pith is reading between the lines
- If spontaneous concatenation generalizes beyond the reported architectures, unsupervised exposure to single-word speech alone may suffice to bootstrap initial syntactic structure in both machines and biological systems.
- The same disinhibition-style mechanism could be searched for in other generative models that operate on raw audio.
- Neuroimaging or behavioral experiments could test whether disinhibition-like processes appear during human processing of concatenated speech.
Load-bearing premise
Acoustic inspection of the generated waveforms reliably distinguishes true word concatenations from synthesis artifacts or other training effects.
What would settle it
Quantitative acoustic comparison or blind listening tests that determine whether the generated waveforms match the spectral and temporal signatures of two separate words spoken in sequence versus blended or artifactual sounds.
Figures
read the original abstract
Computational models of syntax are predominantly text-based. Here we propose that the most basic first step in the evolution of syntax can be modeled directly from raw speech in a fully unsupervised way. We focus on one of the most ubiquitous and elementary suboperations of syntax -- concatenation. We introduce \textit{spontaneous concatenation}: a phenomenon where a ciwGAN/fiwGAN models (based on convolutional neural networks) trained on acoustic recordings of individual words start generating outputs with two or even three words concatenated without ever accessing data with multiple words in the training data. We replicate this finding in several independently trained models with different hyperparameters and training data. Additionally, networks trained on two words learn to embed words into novel unobserved word combinations. We also show that the concatenated outputs contain precursors to compositionality. To our knowledge, this is a previously unreported property of CNNs trained in the ciwGAN/fiwGAN setting on raw speech and has implications both for our understanding of how these architectures learn as well as for modeling syntax and its evolution in the brain from raw acoustic inputs. We also propose and formalize a neural mechanism called \textit{disinhibition} that outlines a possible artificial and biological neural pathway towards concatenation and compositionality and suggests our modeling is useful for generating testable predictions for biological and artificial neural processing of spoken language.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that ciwGAN/fiwGAN models (CNN-based GANs) trained unsupervised solely on acoustic recordings of individual words spontaneously generate outputs containing two or three concatenated words, without any exposure to multi-word training data. This phenomenon is replicated across independently trained models with varying hyperparameters; models trained on pairs of words further embed them into novel unobserved combinations, and the outputs exhibit precursors to compositionality. The authors propose and formalize a 'disinhibition' neural mechanism as a possible pathway to concatenation and compositionality, with implications for modeling syntax evolution from raw speech.
Significance. If the central observation is quantitatively verified, the result would constitute a novel, previously unreported property of these architectures when applied to raw speech, providing a computational model of basic syntactic concatenation arising from unsupervised learning. The replication across models and the proposed disinhibition mechanism could generate testable predictions for both artificial and biological neural processing of spoken language.
major comments (3)
- [Abstract] Abstract: The central claim that generated waveforms contain identifiable sequences of two or three distinct words rests on qualitative acoustic inspection. No quantitative validation (forced alignment, ASR transcription accuracy, phonetic boundary metrics, or controls for spectral leakage/vocoder artifacts) is reported to establish that these are true lexical concatenations rather than duration extensions or spurious formant transitions produced by the convolutional generator.
- [Abstract] Abstract: The claim that training data consisted strictly of single-word recordings is presented without verification against possible co-articulation, recording-session structure, or latent cues that could introduce implicit multi-word information, undermining the assertion that concatenation arises purely from single-word exposure.
- [Abstract] Abstract: While replication across several models is stated, the absence of error analysis, prevalence statistics, or artifact controls in the reported findings leaves the load-bearing observation without the quantitative grounding needed to support the syntactic-evolution interpretation.
minor comments (1)
- The formalization of the 'disinhibition' mechanism would benefit from explicit comparison to existing neural concepts (e.g., gating or disinhibitory circuits in the literature) to clarify its distinct contribution.
Simulated Author's Rebuttal
We thank the referee for their constructive comments, which identify key areas where additional quantitative support would strengthen the manuscript. We address each point below and commit to revisions that provide the requested validation without altering the core claims.
read point-by-point responses
-
Referee: [Abstract] Abstract: The central claim that generated waveforms contain identifiable sequences of two or three distinct words rests on qualitative acoustic inspection. No quantitative validation (forced alignment, ASR transcription accuracy, phonetic boundary metrics, or controls for spectral leakage/vocoder artifacts) is reported to establish that these are true lexical concatenations rather than duration extensions or spurious formant transitions produced by the convolutional generator.
Authors: We agree that the current presentation relies primarily on qualitative inspection and that quantitative metrics are needed to rule out artifacts. In the revised manuscript we will report ASR transcription accuracy on the generated outputs, forced-alignment boundary metrics, and controls comparing concatenated generations against single-word baselines and vocoder-reconstructed artifacts. These additions will be placed in a new results subsection with accompanying statistics. revision: yes
-
Referee: [Abstract] Abstract: The claim that training data consisted strictly of single-word recordings is presented without verification against possible co-articulation, recording-session structure, or latent cues that could introduce implicit multi-word information, undermining the assertion that concatenation arises purely from single-word exposure.
Authors: The datasets used are standard single-word corpora (e.g., isolated-word subsets of TIMIT-style recordings) where each file is documented as containing one word. To address the concern, the revision will include an explicit verification subsection that inspects file-level metadata, checks for session-level co-articulation via waveform inspection, and reports controls confirming no multi-word cues are present in the training distribution. revision: yes
-
Referee: [Abstract] Abstract: While replication across several models is stated, the absence of error analysis, prevalence statistics, or artifact controls in the reported findings leaves the load-bearing observation without the quantitative grounding needed to support the syntactic-evolution interpretation.
Authors: We acknowledge the need for prevalence statistics and error analysis. The revised version will report the proportion of generated samples exhibiting clear two- and three-word concatenations across all trained models, include failure-case analysis, and add artifact-control experiments. These quantitative results will directly support the interpretation while preserving the replication already performed. revision: yes
Circularity Check
No circularity: claims rest on direct model training and output inspection
full rationale
The paper reports an empirical observation: ciwGAN/fiwGAN CNNs trained exclusively on single-word acoustic recordings generate outputs containing two or three concatenated words. This finding is replicated across independent runs with varying hyperparameters and data. No derivation chain, equations, or fitted parameters are presented that reduce the claimed spontaneous concatenation to the training inputs by construction. The proposed 'disinhibition' mechanism is introduced as a formalization of a possible pathway, not as a load-bearing mathematical step whose validity depends on self-citation or redefinition. The central result is therefore self-contained against external benchmarks (model outputs) and receives no circularity flags.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Training corpora consist exclusively of isolated single-word recordings with no multi-word sequences present.
invented entities (1)
-
disinhibition neural mechanism
no independent evidence
Reference graph
Works this paper leans on
-
[1]
" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in ":" * " " * FUNCTION f...
-
[2]
" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in ":" * " " * FUNCTION f...
-
[3]
" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in ":" * " " * FUNCTION f...
-
[4]
author Andreas, J. , year 2019 . title Measuring compositionality in representation learning , in: booktitle International Conference on Learning Representations . https://openreview.net/forum?id=HJz05o0qK7
work page 2019
-
[5]
author Arjovsky, M. , author Chintala, S. , author Bottou, L. , year 2017 . title W asserstein generative adversarial networks , in: editor Precup, D. , editor Teh, Y.W. (Eds.), booktitle Proceedings of the 34th International Conference on Machine Learning , publisher PMLR , address International Convention Centre, Sydney, Australia . pp. pages 214--223 ....
work page 2017
-
[6]
author Bates, E. , author Marchman, V. , author Thal, D. , author Fenson, L. , author Dale, P. , author Reznick, J.S. , author Reilly, J. , author Hartung, J. , year 1994 . title Developmental and stylistic variation in the composition of early vocabulary . journal Journal of Child Language volume 21 , pages 85–123 . :10.1017/S0305000900008680
-
[7]
author Begu s , G. , author Zhou, A. , author Zhao, T.C. , year 2023 . title Encoding of speech in convolutional layers and the brain stem based on language experience . journal Scientific Reports volume 13 , pages 6480 . https://doi.org/10.1038/s41598-023-33384-9, :10.1038/s41598-023-33384-9
-
[8]
author Begu s , G. , year 2020 . title Generative adversarial phonology: Modeling unsupervised phonetic and phonological learning with neural networks . journal Frontiers in Artificial Intelligence volume 3 , pages 44 . https://www.frontiersin.org/article/10.3389/frai.2020.00044, :10.3389/frai.2020.00044
-
[9]
author Begu s , G. , year 2021 a. title CiwGAN and fiwGAN : Encoding information in acoustic data to model lexical learning with generative adversarial networks . journal Neural Networks volume 139 , pages 305--325 . https://www.sciencedirect.com/science/article/pii/S0893608021001052, :https://doi.org/10.1016/j.neunet.2021.03.017
-
[10]
author Begu s , G. , year 2021 b. title Identity-based patterns in deep convolutional networks: Generative adversarial phonology and reduplication . journal Transactions of the Association for Computational Linguistics volume 9 , pages 1180--1196 . https://aclanthology.org/2021.tacl-1.70, :10.1162/tacl_a_00421
-
[11]
author Begu s , G. , author Zhou, A. , year 2022 a. title Interpreting intermediate convolutional layers in unsupervised acoustic word classification , in: booktitle ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , pp. pages 8207--8211 . :10.1109/ICASSP43922.2022.9746849
-
[12]
author Begu s , G. , author Zhou, A. , year 2022 b. title Interpreting intermediate convolutional layers of generative CNN s trained on waveforms . journal IEEE/ACM Transactions on Audio, Speech, and Language Processing volume 30 , pages 3214--3229 . :10.1109/TASLP.2022.3209938
-
[13]
author Begu s , G. , author Zhou, A. , year 2022 c. title Modeling speech recognition and synthesis simultaneously: Encoding and decoding lexical and sublexical semantic information into speech with no direct access to speech data , in: booktitle Proc. Interspeech 2022 , pp. pages 5298--5302 . :10.21437/Interspeech.2022-11219
-
[14]
author Beguš, G. , year 2022 . title Local and non-local dependency learning and emergence of rule-like representations in speech data by deep convolutional generative adversarial networks . journal Computer Speech & Language volume 71 , pages 101244 . https://www.sciencedirect.com/science/article/pii/S0885230821000516, :https://doi.org/10.1016/j.csl.2021.101244
-
[15]
author Berent, I. , author Bat-El, O. , author Brentari, D. , author Dupuis, A. , author Vaknin-Nusbaum, V. , year 2016 . title The double identity of linguistic doubling . journal Proceedings of the National Academy of Sciences volume 113 , pages 13702--13707 . https://www.pnas.org/doi/abs/10.1073/pnas.1613749113, :10.1073/pnas.1613749113, http://arxiv.o...
-
[16]
author Berk, S. , author Lillo-Martin, D. , year 2012 . title The two-word stage: Motivated by linguistic or cognitive constraints? journal Cognitive Psychology volume 65 , pages 118--140 . https://www.sciencedirect.com/science/article/pii/S001002851200014X, :https://doi.org/10.1016/j.cogpsych.2012.02.002
-
[17]
author Berwick, R.C. , author Chomsky, N. , year 2019 . title All or nothing: No half-merge and the evolution of syntax . journal PLOS Biology volume 17 , pages 1--5 . https://doi.org/10.1371/journal.pbio.3000539, :10.1371/journal.pbio.3000539
-
[18]
author Berwick, R.C. , author Okanoya, K. , author Beckers, G.J. , author Bolhuis, J.J. , year 2011 . title Songs to syntax: the linguistics of birdsong . journal Trends in Cognitive Sciences volume 15 , pages 113--121 . https://www.sciencedirect.com/science/article/pii/S1364661311000039, :https://doi.org/10.1016/j.tics.2011.01.002
-
[19]
author Boersma, P. , author Weenink, D. , year 2015 . title Praat: doing phonetics by computer [computer program]. version 5.4.06. howpublished Retrieved 21 February 2015 from http://www.praat.org/
work page 2015
-
[20]
author Chaabouni, R. , author Kharitonov, E. , author Bouchacourt, D. , author Dupoux, E. , author Baroni, M. , year 2020 . title Compositionality and generalization in emergent languages , in: editor Jurafsky, D. , editor Chai, J. , editor Schluter, N. , editor Tetreault, J. (Eds.), booktitle Proceedings of the 58th Annual Meeting of the Association for ...
-
[21]
author Chen, X. , author Duan, Y. , author Houthooft, R. , author Schulman, J. , author Sutskever, I. , author Abbeel, P. , year 2016 . title Infogan: Interpretable representation learning by information maximizing generative adversarial nets , in: editor Lee, D.D. , editor Sugiyama, M. , editor Luxburg, U.V. , editor Guyon, I. , editor Garnett, R. (Eds.)...
work page 2016
-
[22]
author Chomsky, N. , year 2014 . title The Minimalist Program . publisher The MIT Press . https://doi.org/10.7551/mitpress/9780262527347.001.0001, :10.7551/mitpress/9780262527347.001.0001
-
[23]
author Chomsky, N. , year 2020 . title The ucla lectures . https://ling.auf.net/lingbuzz/005485. note unpublished; reference: lingbuzz/005485
work page 2020
-
[24]
author Christiansen, M.H. , author Dale, R. , year 2003 . title Language evolution and change , in: editor Arbib, M.A. (Ed.), booktitle The Handbook of Brain Theory and Neural Networks . edition 2nd ed.. publisher MIT Press , address Cambridge, MA and London , pp. pages 604--606
work page 2003
-
[25]
author Christiansen, M.H. , author Kirby, S. , year 2003 . title Language evolution: consensus and controversies . journal Trends in Cognitive Sciences volume 7 , pages 300--307 . https://www.sciencedirect.com/science/article/pii/S1364661303001360, :https://doi.org/10.1016/S1364-6613(03)00136-0
-
[26]
author Clark, R. , year 1977 . title What’s the use of imitation? journal Journal of Child Language volume 4 , pages 341–358 . :10.1017/S0305000900001732
-
[27]
author Diessel, H. , author Tomasello, M. , year 2005 . title A new look at the acquisition of relative clauses . journal Language volume 81 , pages 882--906 . https://doi.org/10.1353/lan.2005.0169, :10.1353/lan.2005.0169
-
[28]
author Dolatian, H. , author Heinz, J. , year 2020 . title Computing and classifying reduplication with 2-way finite-state transducers . journal Journal of Language Modelling volume 8 , pages 179--250 . https://jlm.ipipan.waw.pl/index.php/JLM/article/view/245, :10.15398/jlm.v8i1.245
-
[29]
author Donahue, C. , author McAuley, J.J. , author Puckette, M.S. , year 2019 . title Adversarial audio synthesis , in: booktitle 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019 , publisher OpenReview.net . pp. pages 1--16 . https://openreview.net/forum?id=ByMVTsR5KQ
work page 2019
-
[30]
author Fitch, W. , year 2000 . title The evolution of speech: a comparative review . journal Trends in Cognitive Sciences volume 4 , pages 258--267 . https://www.sciencedirect.com/science/article/pii/S1364661300014947, :https://doi.org/10.1016/S1364-6613(00)01494-7
-
[31]
author Fox, J. , year 2003 . title Effect displays in R for generalised linear models . journal Journal of Statistical Software volume 8 , pages 1--27 . :10.18637/jss.v008.i15
-
[32]
author Fox, J. , author Weisberg, S. , year 2019 . title An R Companion to Applied Regression . edition 3rd ed., publisher Sage , address Thousand Oaks CA . https://socialsciences.mcmaster.ca/jfox/Books/Companion/index.html
work page 2019
-
[33]
author Garofolo, J.S. , author Lamel, L. , author M Fisher, W. , author Fiscus, J. , author S. Pallett, D. , author L. Dahlgren, N. , author Zue, V. , year 1993 . title TIMIT acoustic-phonetic continuous speech corpus . journal Linguistic Data Consortium
work page 1993
-
[34]
author Giraud, A.L. , author Poeppel, D. , year 2012 . title Cortical oscillations and speech processing: emerging computational principles and operations . journal Nature Neuroscience volume 15 , pages 511--517 . https://doi.org/10.1038/nn.3063, :10.1038/nn.3063
-
[35]
author Griffiths, T.L. , author Kalish, M.L. , year 2007 . title Language evolution by iterated learning with bayesian agents . journal Cognitive Science volume 31 , pages 441--480 . https://onlinelibrary.wiley.com/doi/abs/10.1080/15326900701326576, :https://doi.org/10.1080/15326900701326576, http://arxiv.org/abs/https://onlinelibrary.wiley.com/doi/pdf/10...
-
[36]
author Hahne, A. , author Eckstein, K. , author Friederici, A.D. , year 2004 . title Brain Signatures of Syntactic and Semantic Processes during Children's Language Development . journal Journal of Cognitive Neuroscience volume 16 , pages 1302--1318 . https://doi.org/10.1162/0898929041920504, :10.1162/0898929041920504
-
[37]
author Jackendoff, R. , year 1999 . title Possible stages in the evolution of the language capacity . journal Trends in Cognitive Sciences volume 3 , pages 272--279 . https://www.sciencedirect.com/science/article/pii/S1364661399013339, :https://doi.org/10.1016/S1364-6613(99)01333-9
-
[38]
author Kirby, S. , year 2000 . title Syntax without natural selection: How compositionality emerges from vocabulary in a population of learners , in: editor Knight, C. , editor Studdert-Kennedy, M. , editor Hurford, J. (Eds.), booktitle The Evolutionary Emergence of Language: Social Function and the Origins of Linguistic Form . publisher Cambridge Univers...
work page 2000
-
[39]
author Kirby, S. , year 2001 . title Spontaneous evolution of linguistic structure-an iterated learning model of the emergence of regularity and irregularity . journal IEEE Transactions on Evolutionary Computation volume 5 , pages 102--110 . :10.1109/4235.918430
-
[40]
author Kuhl, P.K. , author Meltzoff, A.N. , year 1996 . title Infant vocalizations in response to speech: Vocal imitation and developmental change . journal The Journal of the Acoustical Society of America volume 100 , pages 2425--2438 . https://doi.org/10.1121/1.417951, :10.1121/1.417951
-
[41]
author Lai, C.I. , author Shi, F. , author Peng, P. , author Kim, Y. , author Gimpel, K. , author Chang, S. , author Chuang, Y.S. , author Bhati, S. , author Cox, D.D. , author Harwath, D. , author Zhang, Y. , author Livescu, K. , author Glass, J.R. , year 2023 . title Textless phrase structure induction from visually-grounded speech . https://openreview....
work page 2023
-
[42]
author Lakhotia, K. , author Kharitonov, E. , author Hsu, W.N. , author Adi, Y. , author Polyak, A. , author Bolte, B. , author Nguyen, T.A. , author Copet, J. , author Baevski, A. , author Mohamed, A. , author Dupoux, E. , year 2021 . title On generative spoken language modeling from raw audio . journal Transactions of the Association for Computational L...
-
[43]
author Lin, R. , year 2022 . title Analysis on the selection of the appropriate batch size in cnn neural network , in: booktitle 2022 International Conference on Machine Learning and Knowledge Engineering (MLKE) , pp. pages 106--109 . :10.1109/MLKE55170.2022.00026
-
[44]
author Luuk, E. , author Luuk, H. , year 2014 . title The evolution of syntax: Signs, concatenation and embedding . journal Cognitive Systems Research volume 27 , pages 1--10 . https://www.sciencedirect.com/science/article/pii/S1389041713000028, :https://doi.org/10.1016/j.cogsys.2013.01.001
-
[45]
author Marcolli, M. , author Chomsky, N. , author Berwick, R. , year 2023 . title Mathematical structure of syntactic merge . journal arXiv https://arxiv.org/abs/2305.18278, http://arxiv.org/abs/2305.18278 arXiv:2305.18278
-
[46]
author Masur, E.F. , author Rodemaker, J.E. , year 1999 . title Mothers' and infants' spontaneous vocal, verbal, and action imitation during the second year . journal Merrill-Palmer Quarterly volume 45 , pages 392 -- 412 . https://www.scopus.com/inward/record.uri?eid=2-s2.0-0033235026&partnerID=40&md5=6d43c615c751588167621ec9057eca48. note cited by: 41
work page 1999
-
[47]
author Nowak, M.A. , author Komarova, N.L. , year 2001 . title Towards an evolutionary theory of language . journal Trends in Cognitive Sciences volume 5 , pages 288--295 . https://www.sciencedirect.com/science/article/pii/S1364661300016831, :https://doi.org/10.1016/S1364-6613(00)01683-1
-
[48]
author Nowak, M.A. , author Krakauer, D.C. , year 1999 . title The evolution of language . journal Proceedings of the National Academy of Sciences volume 96 , pages 8028--8033 . https://www.pnas.org/doi/abs/10.1073/pnas.96.14.8028, :10.1073/pnas.96.14.8028, http://arxiv.org/abs/https://www.pnas.org/doi/pdf/10.1073/pnas.96.14.8028 arXiv:https://www.pnas.or...
-
[49]
author Okanoya, K. , year 2013 . title Finite-State Song Syntax in Bengalese Finches: Sensorimotor Evidence, Developmental Processes, and Formal Procedures for Syntax Extraction , in: booktitle Birdsong, Speech, and Language: Exploring the Evolution of Mind and Brain . publisher The MIT Press . https://doi.org/10.7551/mitpress/9322.003.0016, :10.7551/mitp...
-
[50]
author O’Grady, C. , author Smith, K. , year 2018 . title 899Models of Language Evolution , in: booktitle The Oxford Handbook of Psycholinguistics . publisher Oxford University Press , p. pages 899–914 . https://doi.org/10.1093/oxfordhb/9780198786825.013.38, :10.1093/oxfordhb/9780198786825.013.38
-
[51]
author Patel, A.D. , year 2007 . title Syntax , in: booktitle Music, Language, and the Brain . publisher Oxford University Press . :10.1093/acprof:oso/9780195123753.003.0005
work page doi:10.1093/acprof:oso/9780195123753.003.0005 2007
-
[52]
author Perfors, A. , author Navarro, D.J. , year 2014 . title Language evolution can be shaped by the structure of the world . journal Cognitive Science volume 38 , pages 775--793 . https://onlinelibrary.wiley.com/doi/abs/10.1111/cogs.12102, :https://doi.org/10.1111/cogs.12102, http://arxiv.org/abs/https://onlinelibrary.wiley.com/doi/pdf/10.1111/cogs.1210...
-
[53]
author Pfeffer, C.K. , author Xue, M. , author He, M. , author Huang, Z.J. , author Scanziani, M. , year 2013 . title Inhibition of inhibition in visual cortex: the logic of connections between molecularly distinct interneurons . journal Nature Neuroscience volume 16 , pages 1068--1076 . https://doi.org/10.1038/nn.3446, :10.1038/nn.3446
-
[54]
author Pi, H.J. , author Hangya, B. , author Kvitsiani, D. , author Sanders, J.I. , author Huang, Z.J. , author Kepecs, A. , year 2013 . title Cortical interneurons that specialize in disinhibitory control . journal Nature volume 503 , pages 521--524 . https://doi.org/10.1038/nature12676, :10.1038/nature12676
-
[55]
author Progovac, L. , year 2015 . title Evolutionary syntax . Oxford studies in the evolution of language ; 20. edition first edition ed., publisher Oxford University Press , address Oxford, United Kingdom ; New York, NY
work page 2015
-
[56]
Robust Speech Recognition via Large-Scale Weak Supervision
author Radford, A. , author Kim, J.W. , author Xu, T. , author Brockman, G. , author McLeavey, C. , author Sutskever, I. , year 2022 . title Robust speech recognition via large-scale weak supervision . http://arxiv.org/abs/2212.04356 arXiv:2212.04356
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[57]
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
author Radford, A. , author Metz, L. , author Chintala, S. , year 2015 . title Unsupervised representation learning with deep convolutional generative adversarial networks . journal arXiv preprint arXiv:1511.06434
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[58]
author Rasilo, H. , author R \"a s \"a nen, O. , year 2017 . title An online model for vowel imitation learning . journal Speech Communication volume 86 , pages 1--23 . https://www.sciencedirect.com/science/article/pii/S0167639315300728, :https://doi.org/10.1016/j.specom.2016.10.010
-
[59]
author Rita, M. , author Michel, P. , author Chaabouni, R. , author Pietquin, O. , author Dupoux, E. , author Strub, F. , year 2024 . title Language evolution with deep learning . journal arXiv https://arxiv.org/abs/2403.11958, http://arxiv.org/abs/2403.11958 arXiv:2403.11958
-
[60]
author Rita, M. , author Tallec, C. , author Michel, P. , author Grill, J.B. , author Pietquin, O. , author Dupoux, E. , author Strub, F. , year 2022 . title Emergent communication: Generalization and overfitting in lewis games , in: editor Oh, A.H. , editor Agarwal, A. , editor Belgrave, D. , editor Cho, K. (Eds.), booktitle Advances in Neural Informatio...
work page 2022
-
[61]
author Singla, Y.K. , author Shah, J. , author Chen, C. , author Shah, R.R. , year 2022 . title What do audio transformers hear? probing their representations for language delivery & structure , in: booktitle 2022 IEEE International Conference on Data Mining Workshops (ICDMW) , pp. pages 910--925 . :10.1109/ICDMW58026.2022.00120
-
[62]
author Skeide, M.A. , author Brauer, J. , author Friederici, A.D. , year 2014 . title Syntax gradually segregates from semantics in the developing brain . journal NeuroImage volume 100 , pages 106--111 . https://www.sciencedirect.com/science/article/pii/S1053811914004650, :https://doi.org/10.1016/j.neuroimage.2014.05.080
-
[63]
author Youngblood, M. , year 2024 . title Language-like efficiency and structure in house finch song . journal Proceedings of the Royal Society B: Biological Sciences volume 291 , pages 20240250 . https://royalsocietypublishing.org/doi/abs/10.1098/rspb.2024.0250, :10.1098/rspb.2024.0250, http://arxiv.org/abs/https://royalsocietypublishing.org/doi/pdf/10.1...
-
[64]
author Zuidema, W. , author de Boer , B. , year 2009 . title The evolution of combinatorial phonology . journal Journal of Phonetics volume 37 , pages 125--144 . https://www.sciencedirect.com/science/article/pii/S0095447008000624, :https://doi.org/10.1016/j.wocn.2008.10.003
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.