Fixation-related potentials reveal that confusing program code elicits a late frontal positivity

Annabelle Bergum; Anna-Maria Maurer; Axel Mecklinger; Janet Siegmund; Norman Peitek; Regine Bader; Sven Apel; Vera Demberg

arxiv: 2412.10099 · v4 · submitted 2024-12-13 · 💻 cs.SE

Fixation-related potentials reveal that confusing program code elicits a late frontal positivity

Annabelle Bergum , Anna-Maria Maurer , Norman Peitek , Regine Bader , Axel Mecklinger , Vera Demberg , Janet Siegmund , Sven Apel This is my paper

Pith reviewed 2026-05-23 07:34 UTC · model grok-4.3

classification 💻 cs.SE

keywords fixation-related potentialsatoms of confusionprogram comprehensionevent-related potentialsnatural language processingsoftware engineering

0 comments

The pith

Confusing program code elicits a late frontal positivity 400-700 ms after fixation, resembling responses to unexpected words in sentences.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether confusing code patterns known as atoms of confusion produce distinct brain signals during natural reading. Using fixation-related potentials, it compares brain activity on confusing code against matched clean code. Confusing code produces a late frontal positivity in the 400-700 ms window after first fixation on the confusing element. This pattern matches an established response in natural language processing to words that are unexpected yet plausible in context. The authors interpret the result as evidence that the brain applies similar mechanisms to update a situation model when processing both program code and language.

Core claim

Relative to clean counterparts in program code without an atom of confusion, confusing code elicits a late frontal positivity of about 400 to 700 ms after first looking at the atom of confusion. This frontal positivity resembles an event-related potential component found during natural language processing that is elicited by unexpected but plausible words in sentence context. Thus, the brain engages similar neurocognitive mechanisms in response to unexpected and informative inputs in program code and in natural language, updating a comprehender's situation model essential for information extraction from quickly unfolding input.

What carries the argument

Fixation-related potentials recorded while programmers read code snippets containing atoms of confusion versus clean matched counterparts.

If this is right

The brain applies comparable mechanisms to update situation models for unexpected inputs in both code and natural language.
Situation-model updating supports information extraction from rapidly unfolding sequential input in programming.
The results carry implications for how programmers understand and maintain code.
The work opens routes for collaboration between software engineering and psycholinguistics.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Designers of programming languages or tools might reduce cognitive cost by minimizing atoms of confusion.
The neural similarity raises the possibility that methods from language comprehension research could be tested for improving code reading.
The finding could be extended to measure how other code properties affect the same frontal positivity component.

Load-bearing premise

The late frontal positivity arises specifically from the confusing nature of the atom rather than from any other differences between the code snippets or reading conditions.

What would settle it

An experiment that presents perfectly matched confusing and clean code snippets and records no difference in the 400-700 ms frontal positivity would falsify the central claim.

Figures

Figures reproduced from arXiv: 2412.10099 by Annabelle Bergum, Anna-Maria Maurer, Axel Mecklinger, Janet Siegmund, Norman Peitek, Regine Bader, Sven Apel, Vera Demberg.

**Figure 1.** Figure 1: A pair of corresponding code snippets with one containing an atom of confusion (left) and the other containing functionally equivalent code that is easy to understand (right). due to the atom of confusion7 . While such subtle differences may seem insignificant at first glance, they have caused severe bugs in the past, such as a mismatching indentation in Apple’s iOS operating system rendering its secure ne… view at source ↗

**Figure 2.** Figure 2: Overview of the experiment design (EEG Icon by Freepik). The experiment included three blocks with 24 trials each: fixation cross (5 s to calibrate); snippet presentation (3–30 s, terminated by participant; the duration of snippet presentation is measured as comprehension time); answer correctness (the participant has to submit the output, i.e., the final value in variable R); subjective difficulty rating … view at source ↗

**Figure 3.** Figure 3: Results of the FRP analysis. Part a shows the FRP elicited by confusing (orange) and clean (blue) program code at nine scalp electrodes. The zero time points denote the onset of the fixation and positive voltages are plotted upwards. Part b portrays the topographic distribution of the amplitude difference between confusing and clean code in consecutive 50 ms time intervals. The onset of the time interval i… view at source ↗

read the original abstract

As software pervades more and more areas of our professional and personal lives, there is an ever-increasing need to maintain software and for programmers to efficiently write and understand program code. In the first study of its kind, we analyze fixation-related potentials (FRPs) to explore the online processing of program code patterns that are confusing to programmers, but not to the computer (so-called atoms of confusion), and their underlying neurocognitive mechanisms in an ecologically valid setting. Relative to clean counterparts in program code without an atom of confusion, confusing code elicits a late frontal positivity of about 400 to 700 ms after first looking at the atom of confusion. This frontal positivity resembles an event-related potential (ERP) component found during natural language processing that is elicited by unexpected but plausible words in sentence context. Thus, we suggest that the brain engages similar neurocognitive mechanisms in response to unexpected and informative inputs in program code and in natural language. In both domains, these inputs update a comprehender's situation model, which is essential for information extraction from a quickly unfolding input. Our results have far-reaching implications for programming and pave the way for interdisciplinary collaborations between software engineering and psycholinguistics.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

First FRP study on atoms of confusion finds a late frontal positivity resembling language ERPs, but abstract leaves matching and stats details out.

read the letter

The paper's main result is that code containing atoms of confusion triggers a late frontal positivity in fixation-related potentials between 400 and 700 milliseconds after the first fixation on the confusing part. This response looks like the one seen in natural language when people encounter unexpected but plausible words, and the authors suggest it reflects similar mechanisms for updating a mental model of the input. What is new here is the application of FRPs to atoms of confusion specifically, and the direct comparison to language ERP components. The study uses an ecologically valid setup with programmers reading actual code snippets, which is a step beyond typical lab tasks. They position it as the first study of its kind, which seems accurate based on the citations. The paper does a reasonable job connecting software engineering questions to psycholinguistic findings without overclaiming. The idea that both domains involve handling unexpected input in a similar way is a fair hypothesis to test. The soft spots are in the methods details visible so far. The abstract mentions comparing to clean counterparts but gives no information on how those were constructed or matched for length, complexity, or visual properties. That leaves open the possibility that the positivity comes from other differences rather than the confusion itself. There are also no participant counts, statistical values, or exclusion criteria reported in the abstract, which makes it hard to judge the strength of the effect. If the full paper has those, it would help a lot. This kind of work is for researchers at the intersection of code comprehension and cognitive neuroscience. Someone looking for new methods to study programming difficulty might find it useful, but only if the controls check out. I would recommend sending it for peer review. The topic is timely and the approach is fresh, even if the current evidence level is preliminary.

Referee Report

2 major / 2 minor

Summary. The manuscript reports the first use of fixation-related potentials (FRPs) in an ecologically valid setting to study neurocognitive responses to 'atoms of confusion' in program code. It claims that, relative to clean counterparts, confusing code elicits a late frontal positivity (approximately 400-700 ms after first fixation on the atom), resembling the frontal positivity ERP component observed in natural-language processing for unexpected but plausible words; the authors interpret this as evidence that the brain uses similar mechanisms to update situation models during code and language comprehension.

Significance. If the central result is robust, the work opens an interdisciplinary bridge between software engineering and psycholinguistics by demonstrating measurable, time-locked neural signatures of code confusion during natural reading. The ecological-validity framing and the explicit link to an established language ERP component are strengths; successful replication could inform both theories of program comprehension and practical guidelines for code readability.

major comments (2)

[Methods] Methods (stimulus construction): the manuscript does not report quantitative matching criteria or statistical checks confirming that clean counterparts differ from confusing snippets only in the presence of the atom of confusion. Variables such as token count, line length, indentation depth, syntactic complexity, and visual salience must be shown to be balanced (or covaried) before the late frontal positivity can be attributed specifically to confusability rather than to any other systematic difference between the two sets of snippets.
[Results] Results (FRP analysis): while the abstract states a directional effect in the 400-700 ms window, the manuscript must supply participant N, trial counts after exclusion, exact statistical tests (including any correction for multiple comparisons across electrodes/time windows), and effect-size or peak-amplitude measures with confidence intervals. Without these, the claim that the positivity is reliably elicited by the atom remains unverifiable.

minor comments (2)

[Introduction] The abstract and introduction repeatedly use the phrase 'first study of its kind'; a brief literature note on prior EEG/ERP work on code comprehension would clarify the precise novelty.
[Figures] Figure captions should explicitly state the electrode montage, reference, and time window used for the grand-average waveforms shown.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive review. The comments highlight important issues of stimulus control and statistical reporting that strengthen the manuscript. We address each point below and have revised the manuscript to incorporate the requested details.

read point-by-point responses

Referee: [Methods] Methods (stimulus construction): the manuscript does not report quantitative matching criteria or statistical checks confirming that clean counterparts differ from confusing snippets only in the presence of the atom of confusion. Variables such as token count, line length, indentation depth, syntactic complexity, and visual salience must be shown to be balanced (or covaried) before the late frontal positivity can be attributed specifically to confusability rather than to any other systematic difference between the two sets of snippets.

Authors: We agree that explicit quantitative matching is necessary to isolate the effect of the atom of confusion. The original stimulus set was constructed by starting from published atoms of confusion and creating minimal clean variants that differ only in the targeted syntactic or semantic feature; however, the manuscript did not include formal balance checks. In the revision we will add a new table (or supplementary table) reporting means, standard deviations, and statistical comparisons (independent-samples t-tests or Wilcoxon tests as appropriate) for token count, line length, indentation depth, syntactic complexity (e.g., cyclomatic complexity or number of AST nodes), and visual salience (e.g., pixel-level contrast or saliency-map metrics) between confusing and clean snippets. Any variables showing reliable differences will be entered as covariates in the FRP models or discussed as potential confounds. revision: yes
Referee: [Results] Results (FRP analysis): while the abstract states a directional effect in the 400-700 ms window, the manuscript must supply participant N, trial counts after exclusion, exact statistical tests (including any correction for multiple comparisons across electrodes/time windows), and effect-size or peak-amplitude measures with confidence intervals. Without these, the claim that the positivity is reliably elicited by the atom remains unverifiable.

Authors: We accept that the current version under-reports key sample and inferential statistics. The revised manuscript will explicitly state the final participant N after artifact and behavioral exclusion criteria, the number of trials per condition retained for analysis, the precise statistical procedure (including the electrode/time-window selection method and any correction for multiple comparisons such as cluster-based permutation testing or FDR), and effect-size estimates (Cohen’s d or partial eta-squared) together with 95% confidence intervals for the amplitude difference in the 400–700 ms frontal window. These values were computed during analysis but were not fully documented in the submitted text; they will now appear in the Results section and a new supplementary table. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical observation of FRPs with no derivation or fitting chain

full rationale

The paper reports results from an ERP/FRP experiment comparing brain responses to confusing code atoms versus clean counterparts. No equations, parameters, models, or derivations are present in the provided text. The central claim rests on measured electrophysiological data rather than any self-definitional, fitted-prediction, or self-citation reduction. Self-citations, if present, are not load-bearing for any claimed derivation because none exists. This matches the default case of a self-contained empirical study.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The paper is an empirical neuroscience study; it relies on standard assumptions of ERP averaging and statistical comparison but introduces no new free parameters, axioms beyond domain norms, or invented entities.

axioms (2)

standard math Standard assumptions in ERP/FRP analysis such as trial averaging and baseline correction hold for the recorded signals
Implicit in any FRP study; not stated explicitly but required for the reported positivity
domain assumption The experimental reading task approximates real-world program comprehension
Abstract claims ecological validity without detailing controls for task differences

pith-pipeline@v0.9.0 · 5762 in / 1393 out tokens · 20065 ms · 2026-05-23T07:34:18.957106+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

66 extracted references · 66 canonical work pages

[1]

& DeLine, R

LaToza, T., Venolia, G. & DeLine, R. ”Maintaining mental models: A study of developer work habits”. In Osterweil, L. J., Rombach, H. D. & Soffa, M. L. (eds.) Proc Int Conf Softw Eng, 492–501 (ACM, 2006)

work page 2006
[2]

What programmers really do: An observational study

Tiarks, R. What programmers really do: An observational study. Softwaretechnik-Trends 31, 36–37 (2011)

work page 2011
[3]

& Lanza, M

Minelli, R., Mocci, A. & Lanza, M. ”I know what you did last summer - an investigation of how developers spend their time”. In Lucia, A. D., Bird, C. & Oliveto, R. (eds.) Proceedings of the 2015 IEEE 23rd International Conference on Program Comprehension ICPC, 25–35 (IEEE, 2015)

work page 2015
[4]

Meyer, A. N. et al. Detecting developers’ task switches and types. IEEE Transactions on Software Engineering 48, 225–240 (2022)

work page 2022
[5]

N., Barr, E

Meyer, A. N., Barr, E. T., Bird, C. & Zimmermann, T. Today was a good day: The daily life of software developers. IEEE Transactions on Software Engineering 47, 863–880 (2021)

work page 2021
[6]

N., Barton, L

Meyer, A. N., Barton, L. E., Murphy, G. C., Zimmermann, T. & Fritz, T. The work life of developers: Activities, switches and perceived productivity. IEEE Transactions on Software Engineering 43, 1178–1193 (2017)

work page 2017
[7]

Gopstein, D. et al. ”Understanding misunderstandings in source code”. In Bodden, E., Sch ¨afer, W., van Deursen, A. & Zisman, A. (eds.) Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering ESEC/FSE, 129–139 (ACM, 2017)

work page 2017
[8]

H., Frankl, P

Gopstein, D., Zhou, H. H., Frankl, P. & Cappos, J. ”Prevalence of confusing code in software projects”. In Zaidman, A., Kamei, Y . & Hill, E. (eds.)Proceedings of the 15th International Conference on Mining Software Repositories MSR(2018)

work page 2018
[9]

Medeiros, F. et al. An investigation of misunderstanding code patterns in C open-source software projects. Empir Softw Eng 24, 1693–1726 (2019)

work page 2019
[10]

& Viana, W

Pinheiro, O., Rocha, L. & Viana, W. ”How they relate and leave: Understanding atoms of confusion in open-source Java projects”. In Moonen, L., Newman, C. D. & Gorla, A. (eds.) 2023 IEEE 23rd International Working Conference on Source Code Analysis and Manipulation SCAM, 119–130 (IEEE, 2023)

work page 2023
[11]

& Cappos, J

Gopstein, D., Fayard, A.-L., Apel, S. & Cappos, J. ”Thinking aloud about confusing code: A qualitative investigation of program comprehension and atoms of confusion”. In Devanbu, P., Cohen, M. B. & Zimmermann, T. (eds.) 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering ESEC/FSE, 605—-616 (Asso...

work page 2020
[12]

de Oliveira, B. et al. ”Atoms of confusion: The eyes do not lie”. In Cavalcante, E., Dantas, F. & Batista, T. (eds.) 34th Brazilian Symposium on Software Engineering SBES, 243–252 (ACM, 2020)

work page 2020
[13]

& Aniche, M

Langhout, C. & Aniche, M. ”Atoms of confusion in Java”. In O’Conner, L. (ed.)29th IEEE/ACM International Conference on Program Comprehension ICPC, 25–35 (2021)

work page 2021
[14]

The apple goto fail vulnerability: lessons learned

Wheeler, D. The apple goto fail vulnerability: lessons learned. https://dwheeler.com/essays/apple-goto-fail.html (2014). Accessed: 2025-01-20

work page 2014
[15]

& Watson, T

Boyes, H., Norris, P., Bryant, I. & Watson, T. Trustworthy software: lessons from goto fail & heartbleed bugs. In 9th IET International Conference on System Safety and Cyber Security (2014) (IET, 2014)

work page 2014
[16]

& Zhuang, Y

Yeh, M., Gopstein, D., Yan, Y . & Zhuang, Y . ”Detecting and comparing brain activity in short program comprehension using EEG”. In 2017 IEEE Frontiers in Education Conference FIE, 1–5 (IEEE, 2017)

work page 2017
[17]

da Costa, J. A. S. et al. Seeing confusion through a new lens: On the impact of atoms of confusion on novices’ code comprehension. Empir Softw Eng 28, 81 (2023)

work page 2023
[18]

K.-C., Yan, Y ., Zhuang, Y

Yeh, M. K.-C., Yan, Y ., Zhuang, Y . & DeLong, L. A. Identifying program confusion using electroencephalogram measurements. Behaviour @AND@ Information Technology 41, 2528–2545 (2022)

work page 2022
[19]

Siegmund, J. et al. ”Understanding understanding source code with functional magnetic resonance imaging”. In Jalote, P., C., L. & van der Hoek, A. (eds.) Proc Int Conf Softw Eng, 378–389 (ACM, 2014)

work page 2014
[20]

Peitek, N. et al. A look into programmers’ heads. IEEE Transactions on Software Engineering 46, 442–462 (2020)

work page 2020
[21]

& Castelo-Branco, M

Dur˜aes, J., Madeira, H., Castelhano, J., Duarte, I. & Castelo-Branco, M. ”WAP: Understanding the brain at software debugging”. In Bradbury, J. (ed.) 27th IEEE International Symposium on Software Reliability Engineering ISSRE, 87–92 (IEEE, 2016)

work page 2016
[22]

Castelhano, J. et al. The role of the insula in intuitive expert bug detection in computer code: An fMRI study. Brain Imaging Behav 13, 623–637 (2019)

work page 2019
[23]

& Weimer, W

Floyd, B., Santander, T. & Weimer, W. ”Decoding the representation of code in the brain: An fMRI study of code review and expertise”. In Uchitel, S., Orso, A. & Robillard, M. P. (eds.) Proc Int Conf Softw Eng, 175–186 (IEEE, 2017)

work page 2017
[24]

An introduction to the event-related potential technique (MIT press, 2014)

Luck, S. An introduction to the event-related potential technique (MIT press, 2014)

work page 2014
[25]

Y ., Ledoux, K., Camblin, C

Swaab, T. Y ., Ledoux, K., Camblin, C. C. & Boudewyn, M. A.The Oxford handbook of event-related potential components, chap. ”Language-related ERP components” (Oxford University Press, 2011)

work page 2011
[26]

Thornhill, D. E. & Petten, C. V . Lexical versus conceptual anticipation during sentence processing: Frontal positivity and N400 ERP components. Int J Psychophysiol 83, 382–392 (2012)

work page 2012
[27]

R., Brothers, T

Kuperberg, G. R., Brothers, T. & Wlotko, E. W. A tale of two positivities and the N400: Distinct neural signatures are evoked by confirmed and violated predictions at different levels of representation. J Cogn Neurosci 32, 12–35 (2020)

work page 2020
[28]

& Hillyard, S

Kutas, M. & Hillyard, S. A. Reading senseless sentences: Brain potentials reflect semantic incongruity. Science 207, 203–205 (1980)

work page 1980
[29]

& Steinhauer, K

Kasparian, K. & Steinhauer, K. Confusing similar words: ERP correlates of lexical-semantic processing in first language attrition and late second language acquisition. Neuropsychologia 93, 200–217 (2016)

work page 2016
[30]

& Honrubia-Serrano, M

N´u˜nez-Pe˜na, M. & Honrubia-Serrano, M. P600 related to rule violation in an arithmetic task. Brain Res Cogn Brain Res 18, 130–141 (2004)

work page 2004
[31]

& Holcomb, P

Osterhout, L. & Holcomb, P. J. Event-related brain potentials elicited by syntactic anomaly. J Mem Lang 31, 785–806 (1992)

work page 1992
[32]

D., Gunter, T

Friederici, A. D., Gunter, T. C., Hahne, A. & Mauth, K. The relative timing of syntactic and semantic processes in sentence comprehension. Neuroreport 15, 165–169 (2004)

work page 2004
[33]

& Kuperberg, G

Paczynski, M. & Kuperberg, G. R. Multiple influences of semantic memory on sentence processing: Distinct effects of semantic relatedness on violations of real-world event/state knowledge and animacy selection restrictions. J Mem Lang 67, 426–448 (2012)

work page 2012
[34]

W., Warnke, L

Brothers, T., Wlotko, E. W., Warnke, L. & Kuperberg, G. R. Going the extra mile: Effects of discourse context on two late positivities during language comprehension. Neurobiol Lang (Camb) 1, 135–160 (2020)

work page 2020
[35]

Kuperberg, G. R. Separate streams or probabilistic inference? What the N400 can tell us about the comprehension of events. Lang Cogn Neurosci 31, 602–616 (2016). 12/15

work page 2016
[36]

& Luka, B

Van Petten, C. & Luka, B. J. Prediction during language comprehension: Benefits, costs, and ERP components. Int J Psychophysiol 83, 176–190 (2012)

work page 2012
[37]

Kuperberg, G. R. et al. Multimodal neuroimaging evidence for looser lexico-semantic networks in schizophrenia: Evidence from masked indirect semantic priming. Neuropsychologia 124, 337–349 (2019)

work page 2019
[38]

& Apel, S

Peitek, N., Siegmund, J. & Apel, S. ”What drives the reading order of programmers? An eye tracking study”, 342–353 (ACM, 2020)

work page 2020
[39]

& Maletic, J

Aljehane, S., Sharif, B. & Maletic, J. ”Determining differences in reading behavior between experts and novices by investigating eye movement on source code constructs during a bug fixing task”. In Bulling, A. et al. (eds.) Proc Eye Track Res Appl Symp, 1–6 (ACM, 2021)

work page 2021
[40]

Busjahn, T. et al. ”Eye movements in code reading: Relaxing the linear order”. In Lucia, A. D., Bird, C. & Oliveto, R. (eds.) Proceedings of the 2015 IEEE 23rd International Conference on Program Comprehension ICPC, 255–265 (IEEE, 2015)

work page 2015
[41]

Peitek, N. et al. ”Correlates of programmer efficacy and their link to experience: A combined EEG and eye-tracking study”. In Roychoudhury, A., Cadar, C. & Kim, M. (eds.) Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 120–131 (2022)

work page 2022
[42]

& Prat, C

Kuo, C.-H. & Prat, C. Computer programmers show distinct, expertise-dependent brain responses to violations in form and meaning when reading code. Sci Rep 14 (2024)

work page 2024
[43]

& Grainger, J

Wen, Y ., Mirault, J. & Grainger, J. Fast syntax in the brain: Electrophysiological evidence from the rapid parallel visual presentation paradigm (RPVP). J Exp Psychol Learn Mem Cogn 47, 99 (2021)

work page 2021
[44]

Hutzler, F. et al. Welcome to the real world: Validating fixation-related brain potentials for ecologically valid settings. Brain Res 1172, 124–129 (2007)

work page 2007
[45]

Federmeier, K. D. Thinking ahead: The role and roots of prediction in language comprehension. Psychophysiology 44, 491–505 (2007)

work page 2007
[46]

& Mecklinger, A

H¨oltje, G. & Mecklinger, A. Benefits and costs of predictive processing: How sentential constraint and word expectedness affect memory formation. Brain Res 1788, 147942 (2022)

work page 2022
[47]

Hubbard, R. J. & Federmeier, K. D. The impact of linguistic prediction violations on downstream recognition memory and sentence recall. J Cogn Neurosci 36, 1–23 (2024)

work page 2024
[48]

Friederici, A. D. Towards a neural basis of auditory sentence processing. Trends Cogn Sci 6, 78–84 (2002)

work page 2002
[49]

In the company of other words: Electrophysiological evidence for single-word and sentence context effects

Kutas, M. In the company of other words: Electrophysiological evidence for single-word and sentence context effects. Lang Cogn Process 8, 533–572 (1993)

work page 1993
[50]

& Meltzer-Asscher, A

Ness, T. & Meltzer-Asscher, A. Lexical inhibition due to failed prediction: Behavioral evidence and ERP correlates. J Exp Psychol Learn Mem Cogn 44, 1269 (2018)

work page 2018
[51]

bag-of-arguments

Chow, W.-Y ., Smith, C., Lau, E. & Phillips, C. A “bag-of-arguments” mechanism for initial verb predictions.Lang Cogn Neurosci 31, 577–596 (2016)

work page 2016
[52]

& Roehm, D

Freunberger, D. & Roehm, D. Semantic prediction in language comprehension: evidence from brain potentials. Lang Cogn Neurosci 31, 1193–1205 (2016)

work page 2016
[53]

Zirnstein, M., van Hell, J. G. & Kroll, J. F. Cognitive control ability mediates prediction costs in monolinguals and bilinguals. Cognition 176, 87–106 (2018)

work page 2018
[54]

R., Steen, A

Ng, S., Payne, B. R., Steen, A. A., Stine-Morrow, E. A. & Federmeier, K. D. Use of contextual information and prediction by struggling adult readers: Evidence from reading times and event-related potentials. Sci Stud Read 21, 359–375 (2017)

work page 2017
[55]

J., Rommers, J., Jacobs, C

Hubbard, R. J., Rommers, J., Jacobs, C. L. & Federmeier, K. D. Downstream behavioral and electrophysiological consequences of word prediction on recognition memory. Front Hum Neurosci 13, 291 (2019)

work page 2019
[56]

& Kuperberg, G

Brothers, T., Morgan, E., Yacovone, A. & Kuperberg, G. Multiple predictions during language comprehension: Friends, foes, or indifferent companions? Cognition 241, 105602 (2023)

work page 2023
[57]

R., E., A

Kuperberg, G. R., E., A. & T., B. The P600 does not reflect error correction: A confidence accumulation account of late posterior and frontal positivities. Manuscript under review (under review)

work page
[58]

Zhuang, Y ., Yan, Y ., DeLong, L. A. & Yeh, M. K. Do developer perceptions have borders? Comparing C code responses across continents. Softw Qual J 32, 431–457 (2024). 13/15

work page 2024
[59]

& Hanenberg, S

Siegmund, J., K¨astner, C., Liebig, J., Apel, S. & Hanenberg, S. Measuring and modeling programming experience. Empir Softw Eng 19, 1299–1334 (2014)

work page 2014
[60]

Oldfield, R. C. The assessment and analysis of handedness: The Edinburgh inventory. Neuropsychologia 9, 97–113 (1971)

work page 1971
[61]

& Hooge, I

Hessels, R., Niehorster, D., Kemner, C. & Hooge, I. ”Noise-robust fixation detection in eye movement data: Identification by two-means clustering (I2MC)”. Behav Res Methods 49, 1802–1823 (2017)

work page 2017
[62]

& Crepaldi, D

Carr, J., Pescuma, V ., Furlan, M., Ktori, M. & Crepaldi, D. Algorithms for the automated correction of vertical drift in eye-tracking data. Behav Res Methods 54, 287–310 (2022)

work page 2022
[63]

Landis, J. R. & Koch, G. G. The measurement of observer agreement for categorical data. Biometrics 33, 159–174 (1977)

work page 1977
[64]

Report of the committee on methods of clinical examination in electroencephalography

Jasper, H. Report of the committee on methods of clinical examination in electroencephalography. Electroencephalogr Clin Neurophysiol 10, 370–375 (1958)

work page 1958
[65]

& Valenza, G

Candia-Rivera, D. & Valenza, G. Cluster permutation analysis for EEG series based on non-parametric Wilcoxon–Mann– Whitney statistical tests. SoftwareX 19, 101170 (2022)

work page 2022
[66]

M., Urbach, T

Groppe, D. M., Urbach, T. P. & Kutas, M. Mass univariate analysis of event-related brain potentials/fields II: Simulation studies. Psychophysiology 48, 1726–1737 (2011). Author contribution Annabelle Bergum, Anna-Maria Maurer, Norman Peitek, Regine Bader, Axel Mecklinger, Janet Siegmund, and Sven Apel developed the concept and the general research idea. A...

work page 2011

[1] [1]

& DeLine, R

LaToza, T., Venolia, G. & DeLine, R. ”Maintaining mental models: A study of developer work habits”. In Osterweil, L. J., Rombach, H. D. & Soffa, M. L. (eds.) Proc Int Conf Softw Eng, 492–501 (ACM, 2006)

work page 2006

[2] [2]

What programmers really do: An observational study

Tiarks, R. What programmers really do: An observational study. Softwaretechnik-Trends 31, 36–37 (2011)

work page 2011

[3] [3]

& Lanza, M

Minelli, R., Mocci, A. & Lanza, M. ”I know what you did last summer - an investigation of how developers spend their time”. In Lucia, A. D., Bird, C. & Oliveto, R. (eds.) Proceedings of the 2015 IEEE 23rd International Conference on Program Comprehension ICPC, 25–35 (IEEE, 2015)

work page 2015

[4] [4]

Meyer, A. N. et al. Detecting developers’ task switches and types. IEEE Transactions on Software Engineering 48, 225–240 (2022)

work page 2022

[5] [5]

N., Barr, E

Meyer, A. N., Barr, E. T., Bird, C. & Zimmermann, T. Today was a good day: The daily life of software developers. IEEE Transactions on Software Engineering 47, 863–880 (2021)

work page 2021

[6] [6]

N., Barton, L

Meyer, A. N., Barton, L. E., Murphy, G. C., Zimmermann, T. & Fritz, T. The work life of developers: Activities, switches and perceived productivity. IEEE Transactions on Software Engineering 43, 1178–1193 (2017)

work page 2017

[7] [7]

Gopstein, D. et al. ”Understanding misunderstandings in source code”. In Bodden, E., Sch ¨afer, W., van Deursen, A. & Zisman, A. (eds.) Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering ESEC/FSE, 129–139 (ACM, 2017)

work page 2017

[8] [8]

H., Frankl, P

Gopstein, D., Zhou, H. H., Frankl, P. & Cappos, J. ”Prevalence of confusing code in software projects”. In Zaidman, A., Kamei, Y . & Hill, E. (eds.)Proceedings of the 15th International Conference on Mining Software Repositories MSR(2018)

work page 2018

[9] [9]

Medeiros, F. et al. An investigation of misunderstanding code patterns in C open-source software projects. Empir Softw Eng 24, 1693–1726 (2019)

work page 2019

[10] [10]

& Viana, W

Pinheiro, O., Rocha, L. & Viana, W. ”How they relate and leave: Understanding atoms of confusion in open-source Java projects”. In Moonen, L., Newman, C. D. & Gorla, A. (eds.) 2023 IEEE 23rd International Working Conference on Source Code Analysis and Manipulation SCAM, 119–130 (IEEE, 2023)

work page 2023

[11] [11]

& Cappos, J

Gopstein, D., Fayard, A.-L., Apel, S. & Cappos, J. ”Thinking aloud about confusing code: A qualitative investigation of program comprehension and atoms of confusion”. In Devanbu, P., Cohen, M. B. & Zimmermann, T. (eds.) 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering ESEC/FSE, 605—-616 (Asso...

work page 2020

[12] [12]

de Oliveira, B. et al. ”Atoms of confusion: The eyes do not lie”. In Cavalcante, E., Dantas, F. & Batista, T. (eds.) 34th Brazilian Symposium on Software Engineering SBES, 243–252 (ACM, 2020)

work page 2020

[13] [13]

& Aniche, M

Langhout, C. & Aniche, M. ”Atoms of confusion in Java”. In O’Conner, L. (ed.)29th IEEE/ACM International Conference on Program Comprehension ICPC, 25–35 (2021)

work page 2021

[14] [14]

The apple goto fail vulnerability: lessons learned

Wheeler, D. The apple goto fail vulnerability: lessons learned. https://dwheeler.com/essays/apple-goto-fail.html (2014). Accessed: 2025-01-20

work page 2014

[15] [15]

& Watson, T

Boyes, H., Norris, P., Bryant, I. & Watson, T. Trustworthy software: lessons from goto fail & heartbleed bugs. In 9th IET International Conference on System Safety and Cyber Security (2014) (IET, 2014)

work page 2014

[16] [16]

& Zhuang, Y

Yeh, M., Gopstein, D., Yan, Y . & Zhuang, Y . ”Detecting and comparing brain activity in short program comprehension using EEG”. In 2017 IEEE Frontiers in Education Conference FIE, 1–5 (IEEE, 2017)

work page 2017

[17] [17]

da Costa, J. A. S. et al. Seeing confusion through a new lens: On the impact of atoms of confusion on novices’ code comprehension. Empir Softw Eng 28, 81 (2023)

work page 2023

[18] [18]

K.-C., Yan, Y ., Zhuang, Y

Yeh, M. K.-C., Yan, Y ., Zhuang, Y . & DeLong, L. A. Identifying program confusion using electroencephalogram measurements. Behaviour @AND@ Information Technology 41, 2528–2545 (2022)

work page 2022

[19] [19]

Siegmund, J. et al. ”Understanding understanding source code with functional magnetic resonance imaging”. In Jalote, P., C., L. & van der Hoek, A. (eds.) Proc Int Conf Softw Eng, 378–389 (ACM, 2014)

work page 2014

[20] [20]

Peitek, N. et al. A look into programmers’ heads. IEEE Transactions on Software Engineering 46, 442–462 (2020)

work page 2020

[21] [21]

& Castelo-Branco, M

Dur˜aes, J., Madeira, H., Castelhano, J., Duarte, I. & Castelo-Branco, M. ”WAP: Understanding the brain at software debugging”. In Bradbury, J. (ed.) 27th IEEE International Symposium on Software Reliability Engineering ISSRE, 87–92 (IEEE, 2016)

work page 2016

[22] [22]

Castelhano, J. et al. The role of the insula in intuitive expert bug detection in computer code: An fMRI study. Brain Imaging Behav 13, 623–637 (2019)

work page 2019

[23] [23]

& Weimer, W

Floyd, B., Santander, T. & Weimer, W. ”Decoding the representation of code in the brain: An fMRI study of code review and expertise”. In Uchitel, S., Orso, A. & Robillard, M. P. (eds.) Proc Int Conf Softw Eng, 175–186 (IEEE, 2017)

work page 2017

[24] [24]

An introduction to the event-related potential technique (MIT press, 2014)

Luck, S. An introduction to the event-related potential technique (MIT press, 2014)

work page 2014

[25] [25]

Y ., Ledoux, K., Camblin, C

Swaab, T. Y ., Ledoux, K., Camblin, C. C. & Boudewyn, M. A.The Oxford handbook of event-related potential components, chap. ”Language-related ERP components” (Oxford University Press, 2011)

work page 2011

[26] [26]

Thornhill, D. E. & Petten, C. V . Lexical versus conceptual anticipation during sentence processing: Frontal positivity and N400 ERP components. Int J Psychophysiol 83, 382–392 (2012)

work page 2012

[27] [27]

R., Brothers, T

Kuperberg, G. R., Brothers, T. & Wlotko, E. W. A tale of two positivities and the N400: Distinct neural signatures are evoked by confirmed and violated predictions at different levels of representation. J Cogn Neurosci 32, 12–35 (2020)

work page 2020

[28] [28]

& Hillyard, S

Kutas, M. & Hillyard, S. A. Reading senseless sentences: Brain potentials reflect semantic incongruity. Science 207, 203–205 (1980)

work page 1980

[29] [29]

& Steinhauer, K

Kasparian, K. & Steinhauer, K. Confusing similar words: ERP correlates of lexical-semantic processing in first language attrition and late second language acquisition. Neuropsychologia 93, 200–217 (2016)

work page 2016

[30] [30]

& Honrubia-Serrano, M

N´u˜nez-Pe˜na, M. & Honrubia-Serrano, M. P600 related to rule violation in an arithmetic task. Brain Res Cogn Brain Res 18, 130–141 (2004)

work page 2004

[31] [31]

& Holcomb, P

Osterhout, L. & Holcomb, P. J. Event-related brain potentials elicited by syntactic anomaly. J Mem Lang 31, 785–806 (1992)

work page 1992

[32] [32]

D., Gunter, T

Friederici, A. D., Gunter, T. C., Hahne, A. & Mauth, K. The relative timing of syntactic and semantic processes in sentence comprehension. Neuroreport 15, 165–169 (2004)

work page 2004

[33] [33]

& Kuperberg, G

Paczynski, M. & Kuperberg, G. R. Multiple influences of semantic memory on sentence processing: Distinct effects of semantic relatedness on violations of real-world event/state knowledge and animacy selection restrictions. J Mem Lang 67, 426–448 (2012)

work page 2012

[34] [34]

W., Warnke, L

Brothers, T., Wlotko, E. W., Warnke, L. & Kuperberg, G. R. Going the extra mile: Effects of discourse context on two late positivities during language comprehension. Neurobiol Lang (Camb) 1, 135–160 (2020)

work page 2020

[35] [35]

Kuperberg, G. R. Separate streams or probabilistic inference? What the N400 can tell us about the comprehension of events. Lang Cogn Neurosci 31, 602–616 (2016). 12/15

work page 2016

[36] [36]

& Luka, B

Van Petten, C. & Luka, B. J. Prediction during language comprehension: Benefits, costs, and ERP components. Int J Psychophysiol 83, 176–190 (2012)

work page 2012

[37] [37]

Kuperberg, G. R. et al. Multimodal neuroimaging evidence for looser lexico-semantic networks in schizophrenia: Evidence from masked indirect semantic priming. Neuropsychologia 124, 337–349 (2019)

work page 2019

[38] [38]

& Apel, S

Peitek, N., Siegmund, J. & Apel, S. ”What drives the reading order of programmers? An eye tracking study”, 342–353 (ACM, 2020)

work page 2020

[39] [39]

& Maletic, J

Aljehane, S., Sharif, B. & Maletic, J. ”Determining differences in reading behavior between experts and novices by investigating eye movement on source code constructs during a bug fixing task”. In Bulling, A. et al. (eds.) Proc Eye Track Res Appl Symp, 1–6 (ACM, 2021)

work page 2021

[40] [40]

Busjahn, T. et al. ”Eye movements in code reading: Relaxing the linear order”. In Lucia, A. D., Bird, C. & Oliveto, R. (eds.) Proceedings of the 2015 IEEE 23rd International Conference on Program Comprehension ICPC, 255–265 (IEEE, 2015)

work page 2015

[41] [41]

Peitek, N. et al. ”Correlates of programmer efficacy and their link to experience: A combined EEG and eye-tracking study”. In Roychoudhury, A., Cadar, C. & Kim, M. (eds.) Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 120–131 (2022)

work page 2022

[42] [42]

& Prat, C

Kuo, C.-H. & Prat, C. Computer programmers show distinct, expertise-dependent brain responses to violations in form and meaning when reading code. Sci Rep 14 (2024)

work page 2024

[43] [43]

& Grainger, J

Wen, Y ., Mirault, J. & Grainger, J. Fast syntax in the brain: Electrophysiological evidence from the rapid parallel visual presentation paradigm (RPVP). J Exp Psychol Learn Mem Cogn 47, 99 (2021)

work page 2021

[44] [44]

Hutzler, F. et al. Welcome to the real world: Validating fixation-related brain potentials for ecologically valid settings. Brain Res 1172, 124–129 (2007)

work page 2007

[45] [45]

Federmeier, K. D. Thinking ahead: The role and roots of prediction in language comprehension. Psychophysiology 44, 491–505 (2007)

work page 2007

[46] [46]

& Mecklinger, A

H¨oltje, G. & Mecklinger, A. Benefits and costs of predictive processing: How sentential constraint and word expectedness affect memory formation. Brain Res 1788, 147942 (2022)

work page 2022

[47] [47]

Hubbard, R. J. & Federmeier, K. D. The impact of linguistic prediction violations on downstream recognition memory and sentence recall. J Cogn Neurosci 36, 1–23 (2024)

work page 2024

[48] [48]

Friederici, A. D. Towards a neural basis of auditory sentence processing. Trends Cogn Sci 6, 78–84 (2002)

work page 2002

[49] [49]

In the company of other words: Electrophysiological evidence for single-word and sentence context effects

Kutas, M. In the company of other words: Electrophysiological evidence for single-word and sentence context effects. Lang Cogn Process 8, 533–572 (1993)

work page 1993

[50] [50]

& Meltzer-Asscher, A

Ness, T. & Meltzer-Asscher, A. Lexical inhibition due to failed prediction: Behavioral evidence and ERP correlates. J Exp Psychol Learn Mem Cogn 44, 1269 (2018)

work page 2018

[51] [51]

bag-of-arguments

Chow, W.-Y ., Smith, C., Lau, E. & Phillips, C. A “bag-of-arguments” mechanism for initial verb predictions.Lang Cogn Neurosci 31, 577–596 (2016)

work page 2016

[52] [52]

& Roehm, D

Freunberger, D. & Roehm, D. Semantic prediction in language comprehension: evidence from brain potentials. Lang Cogn Neurosci 31, 1193–1205 (2016)

work page 2016

[53] [53]

Zirnstein, M., van Hell, J. G. & Kroll, J. F. Cognitive control ability mediates prediction costs in monolinguals and bilinguals. Cognition 176, 87–106 (2018)

work page 2018

[54] [54]

R., Steen, A

Ng, S., Payne, B. R., Steen, A. A., Stine-Morrow, E. A. & Federmeier, K. D. Use of contextual information and prediction by struggling adult readers: Evidence from reading times and event-related potentials. Sci Stud Read 21, 359–375 (2017)

work page 2017

[55] [55]

J., Rommers, J., Jacobs, C

Hubbard, R. J., Rommers, J., Jacobs, C. L. & Federmeier, K. D. Downstream behavioral and electrophysiological consequences of word prediction on recognition memory. Front Hum Neurosci 13, 291 (2019)

work page 2019

[56] [56]

& Kuperberg, G

Brothers, T., Morgan, E., Yacovone, A. & Kuperberg, G. Multiple predictions during language comprehension: Friends, foes, or indifferent companions? Cognition 241, 105602 (2023)

work page 2023

[57] [57]

R., E., A

Kuperberg, G. R., E., A. & T., B. The P600 does not reflect error correction: A confidence accumulation account of late posterior and frontal positivities. Manuscript under review (under review)

work page

[58] [58]

Zhuang, Y ., Yan, Y ., DeLong, L. A. & Yeh, M. K. Do developer perceptions have borders? Comparing C code responses across continents. Softw Qual J 32, 431–457 (2024). 13/15

work page 2024

[59] [59]

& Hanenberg, S

Siegmund, J., K¨astner, C., Liebig, J., Apel, S. & Hanenberg, S. Measuring and modeling programming experience. Empir Softw Eng 19, 1299–1334 (2014)

work page 2014

[60] [60]

Oldfield, R. C. The assessment and analysis of handedness: The Edinburgh inventory. Neuropsychologia 9, 97–113 (1971)

work page 1971

[61] [61]

& Hooge, I

Hessels, R., Niehorster, D., Kemner, C. & Hooge, I. ”Noise-robust fixation detection in eye movement data: Identification by two-means clustering (I2MC)”. Behav Res Methods 49, 1802–1823 (2017)

work page 2017

[62] [62]

& Crepaldi, D

Carr, J., Pescuma, V ., Furlan, M., Ktori, M. & Crepaldi, D. Algorithms for the automated correction of vertical drift in eye-tracking data. Behav Res Methods 54, 287–310 (2022)

work page 2022

[63] [63]

Landis, J. R. & Koch, G. G. The measurement of observer agreement for categorical data. Biometrics 33, 159–174 (1977)

work page 1977

[64] [64]

Report of the committee on methods of clinical examination in electroencephalography

Jasper, H. Report of the committee on methods of clinical examination in electroencephalography. Electroencephalogr Clin Neurophysiol 10, 370–375 (1958)

work page 1958

[65] [65]

& Valenza, G

Candia-Rivera, D. & Valenza, G. Cluster permutation analysis for EEG series based on non-parametric Wilcoxon–Mann– Whitney statistical tests. SoftwareX 19, 101170 (2022)

work page 2022

[66] [66]

M., Urbach, T

Groppe, D. M., Urbach, T. P. & Kutas, M. Mass univariate analysis of event-related brain potentials/fields II: Simulation studies. Psychophysiology 48, 1726–1737 (2011). Author contribution Annabelle Bergum, Anna-Maria Maurer, Norman Peitek, Regine Bader, Axel Mecklinger, Janet Siegmund, and Sven Apel developed the concept and the general research idea. A...

work page 2011