Investigating Compilation Errors of Students Learning Haskell

Boldizs\'ar N\'emeth (E\"otv\"os Lor\'and University); Erina Makihara (Nara Institute of Science; Eunjong Choi (Nara Institute of Science; Hajimu Iida (Nara Institute of Science; Technology)

arxiv: 1906.11450 · v1 · pith:4ZRFB6F7new · submitted 2019-06-27 · 💻 cs.PL

Investigating Compilation Errors of Students Learning Haskell

Boldizs\'ar N\'emeth (E\"otv\"os Lor\'and University) , Eunjong Choi (Nara Institute of Science , Technology) , Erina Makihara (Nara Institute of Science , Hajimu Iida (Nara Institute of Science This is my paper

Pith reviewed 2026-05-25 14:16 UTC · model grok-4.3

classification 💻 cs.PL

keywords Haskellcompilation errorsnovice programmersfunctional programmingstudent submissionslearning difficulties

0 comments

The pith

Analyzing student code samples uncovers root causes of Haskell compilation errors beyond surface syntax.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper examines compilation errors made by students new to Haskell after learning imperative programming. Earlier studies have reported error frequencies or examined limited error types without probing the underlying issues. The authors focus on reviewing actual student submissions to identify the conceptual problems driving those errors. This approach aims to suggest concrete ways to improve teaching methods and tools for functional programming.

Core claim

By analyzing samples of their submissions, the study uncovers the root problems with the student solutions, providing insights into how to improve learning efficiency for Haskell.

What carries the argument

Examination of student submission samples to trace compiler errors back to fundamental difficulties in functional programming concepts.

If this is right

Teaching approaches can be adjusted to target the specific conceptual gaps identified in the samples.
Feedback tools can be built to address the recurring root problems instead of only fixing syntax.
The transition from imperative to functional programming can be supported with targeted interventions.
Overall learning efficiency for Haskell and similar languages can increase through these insights.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The sample-analysis method could be applied to student work in other functional languages to find parallel difficulties.
The identified patterns might guide the design of automated hint systems that explain concepts rather than just errors.
Curriculum designers could use the findings to decide when and how to introduce functional ideas in early courses.

Load-bearing premise

That the chosen student submission samples represent typical novice difficulties and that error patterns directly indicate root learning problems rather than assignment constraints or syntax surface issues.

What would settle it

A study of a larger or differently sourced collection of student submissions that identifies substantially different root causes for the same compilation errors would undermine the central claim.

Figures

Figures reproduced from arXiv: 1906.11450 by Boldizs\'ar N\'emeth (E\"otv\"os Lor\'and University), Erina Makihara (Nara Institute of Science, Eunjong Choi (Nara Institute of Science, Hajimu Iida (Nara Institute of Science, Technology).

**Figure 1.** Figure 1: Collection of data for the study Student º Site 3 Server ú BEAD 1 Teacher  Logs p Results submission final submission results compile & run compile & run results logging analysis 3.2 Steps of the investigation To summarize our analysis of the extracted dataset, we present it in four steps: 1. We analyse how the success ratio of the student submissions changes in the course of the semester. 2. We assign co… view at source ↗

**Figure 2.** Figure 2: Results of student solutions 2 3 4 5 6 7 8 9 10 0% 20% 40% 60% 80% 100% Weeks of study Correct Not tested Wrong results Runtime error Compile error [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 3.** Figure 3: Student errors falsely related to syntactic probl [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Categories of students errors based on the error me [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 5.** Figure 5: Correction times of errors in different categorie [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗

**Figure 6.** Figure 6: Response times of errors in different categories [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗

read the original abstract

While functional programming is an efficient way to express complex software, functional programming languages have a steep learning curve. Haskell can be challenging to learn for students who were only introduced to imperative programming. It is important to look for methods and tools that may reduce the difficulty of learning functional programming. Finding methods to help students requires understanding the errors that students make while learning Haskell. There are several previous studies revealing data about Haskell compiler errors, but they do not focus on the analysis of the compiler errors or they only study a certain kind of compiler errors. This study investigates compilation errors of novice Haskell students and make suggestions on how their learning efficiency can be improved. Unlike previous studies we focus on uncovering the root problems with the student solutions by analysing samples of their submissions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The abstract promises root-cause analysis of Haskell student errors via submission samples but supplies no method, sample details, or results, so the claim stays untested.

read the letter

The paper's core move is to say that prior Haskell error studies either skip deep analysis or stick to narrow error types, and that looking at actual submission samples will reveal the root learning problems instead. That distinction is the main thing it offers over the cited work. It correctly flags that functional programming is tough for students coming from imperative backgrounds and that better understanding of errors could help teaching tools. The motivation is straightforward and the literature nod is appropriate for an education-focused piece. Beyond that, the abstract is silent on sample size, selection criteria, analysis steps, or any quantitative or qualitative findings. The inference that error patterns expose conceptual roots rather than surface syntax, assignment wording, or GHC quirks is left unsupported. No interviews, think-alouds, or other checks are mentioned to separate those possibilities. That assumption carries the whole argument, and without evidence it does not hold up. The work is aimed at CS-education researchers who build tools or curricula for functional programming, but only the full manuscript with actual data would give them anything usable. A reader looking for solid empirical grounding on novice difficulties would find the current version too thin to act on. I would not bring it to a reading group or cite it on this basis. It does not yet merit peer review because the central claim rests on unshown evidence rather than delivered analysis.

Referee Report

2 major / 0 minor

Summary. The paper investigates compilation errors made by novice Haskell students. It claims that, unlike prior work which either reports error data without deep analysis or focuses on limited error types, this study uncovers root problems in student solutions through analysis of submission samples and offers suggestions for improving learning efficiency.

Significance. If the empirical analysis validly maps observed compiler-error distributions to underlying conceptual difficulties (rather than surface syntax, assignment wording, or message quirks), the work could inform targeted interventions for functional programming education. The focus on root causes is a potentially useful shift from purely quantitative error counts, but the manuscript provides no methodological details or validation to support this mapping.

major comments (2)

[Abstract] Abstract: the central claim that 'analysing samples of their submissions' uncovers root problems (as opposed to surface syntax or assignment constraints) is presented without any description of sample size, selection criteria, analysis method, quantitative results, or independent validation such as interviews or conceptual probes.
[Abstract] Abstract / introduction: the inference that error patterns are diagnostic of root learning problems rests on an untested assumption; no evidence is supplied that the chosen error categories distinguish conceptual issues from GHC message artifacts or task-specific syntax slips.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the major comments below and will revise the manuscript to strengthen the abstract and introduction as appropriate.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim that 'analysing samples of their submissions' uncovers root problems (as opposed to surface syntax or assignment constraints) is presented without any description of sample size, selection criteria, analysis method, quantitative results, or independent validation such as interviews or conceptual probes.

Authors: We agree the abstract is too terse on these points. The full manuscript reports analysis of over 1,800 submissions collected across two course offerings, with selection criteria based on error frequency thresholds, a mixed quantitative-qualitative method (error categorization followed by manual inspection of representative samples), and quantitative results in the form of error distributions. No interviews or conceptual probes were performed. We will expand the abstract to summarize sample size, criteria, method, and results while noting the lack of independent validation as a limitation. revision: yes
Referee: [Abstract] Abstract / introduction: the inference that error patterns are diagnostic of root learning problems rests on an untested assumption; no evidence is supplied that the chosen error categories distinguish conceptual issues from GHC message artifacts or task-specific syntax slips.

Authors: The results section supplies this evidence via case-by-case examination of submissions, showing how particular error categories (e.g., type mismatches involving polymorphic functions) align with conceptual gaps rather than surface syntax or GHC phrasing alone. We acknowledge the abstract and introduction do not foreground these distinctions sufficiently. We will revise both sections to reference the supporting examples from the analysis. The study does not include separate validation (e.g., interviews), which we will note as a limitation. revision: partial

Circularity Check

0 steps flagged

No circularity: empirical study of external student submissions

full rationale

The paper is an empirical analysis of compilation errors from novice Haskell student submissions. It reports observed error frequencies and categories drawn directly from the collected data, without any equations, fitted parameters, predictions that reduce to inputs, or load-bearing self-citations that would make the central claim equivalent to its own assumptions by construction. The derivation chain consists of data collection, categorization, and descriptive reporting; none of the enumerated circularity patterns apply. The study is self-contained against its external benchmark (the student submission corpus) and receives the default non-circularity finding.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The claim rests on the domain assumption that error patterns in sampled submissions reveal root learning difficulties. No free parameters or invented entities.

axioms (1)

domain assumption Student code submissions reflect their learning challenges in Haskell.
Invoked when the study treats observed compilation errors as indicators of root problems.

pith-pipeline@v0.9.0 · 5690 in / 947 out tokens · 23912 ms · 2026-05-25T14:16:02.281066+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

14 extracted references · 14 canonical work pages

[1]

& Kieburtz, D

Hudak, P., Peyton Jones, S., Wadler, P., Boutel, B., Fairbairn, J., Fasel, J., ... & Kieburtz, D. (1992). Report on the programming language Haskell: a non-strict, purely functional language version 1.2. ACM SigPlan notices, 27(5), 1-164. doi:10.1145/130697.130699

work page doi:10.1145/130697.130699 1992
[2]

(2014, November)

Ray, B., Posnett, D., Filkov, V., Devanbu, P. (2014, November). A large scale study of programming languages and code quality in github. In Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering (pp. 155-165). ACM. doi:10.1145/2635868.2635922

work page doi:10.1145/2635868.2635922 2014
[3]

Functional Baby Talk: Analysis of Code Fragments from Novice Haskell Programmers

Singer, J., & Archibald, B. Functional Baby Talk: Analysis of Code Fragments from Novice Haskell Programmers. In Proceedings TFPIE 2017. EPTCS 270, 2018, pp. 37-51. doi:10.4204/EPTCS.270.3

work page doi:10.4204/eptcs.270.3 2017
[4]

(2003, August)

Heeren, B., Leijen, D., & van IJzendoorn, A. (2003, August). Helium, for learning Haskell. In Proceedings of the 2003 ACM SIGPLAN workshop on Haskell (pp. 62-71). ACM. doi:10.1145/871895.871902

work page doi:10.1145/871895.871902 2003
[5]

Gerdes, A., Heeren, B., Jeuring, J., & van Binsbergen, L. T. (2017). Ask-Elle: an adaptable programming tutor for Haskell giving automated feedback. International Journal of Artificial Intelligence in Education, 27(1), 65-100. doi:10.1007/s40593-015-0080-x

work page doi:10.1007/s40593-015-0080-x 2017
[6]

S., Homer, J., & Gee, R

Pettit, R. S., Homer, J., & Gee, R. (2017, March). Do Enhanced Compiler Error Messages Help Students?: Results Inconclusive. In Proceedings of the 2017 ACM SIGCSE Technical Symposium on Computer Science Education (pp. 465-470). ACM. doi:10.1145/3017680.3017768

work page doi:10.1145/3017680.3017768 2017
[7]

(2016, November)

Barik, T. (2016, November). How should static analysis tools explain anomalies to developers?. In Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering (pp. 1118-1120). ACM. doi:10.1145/2950290.2983968

work page doi:10.1145/2950290.2983968 2016
[8]

(2017, May)

Barik, T., Smith, J., Lubick, K., Holmes, E., Feng, J., Murphy-Hill, E., & Parnin, C. (2017, May). Do developers read compiler error messages?. In Proceedings of the 39th International Conference on Software Engineering (pp. 575-585). IEEE Press. doi:10.1109/ICSE.2017.59

work page doi:10.1109/icse.2017.59 2017
[9]

E\"otv\"os Lor\'and University

Functional Programming: Introduction to Haskell (BSc.). E\"otv\"os Lor\'and University. \ ://lambda.inf.elte.hu/Index_en.xml#introduction-to-haskell-bsc \ accessed: (2019, April). Permalink: https://perma.cc/L423-CFZG

work page 2019
[10]

P., Hall, C., Hammond, K., Partain, W., & Wadler, P

Jones, S. P., Hall, C., Hammond, K., Partain, W., & Wadler, P. (1993, July). The Glasgow Haskell compiler: a technical overview. In Proc. UK Joint Framework for Information Technology (JFIT) Technical Conference (Vol. 93)

work page 1993
[11]

, " * write output.state after.block = add.period write

ENTRY address author booktitle chapter edition editor eid howpublished institution journal key month note number organization pages publisher school series title type url volume year label INTEGERS output.state before.all mid.sentence after.sentence after.block FUNCTION init.state.consts #0 'before.all := #1 'mid.sentence := #2 'after.sentence := #3 'afte...

work page
[12]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in capitalize ":" * " " *...

work page
[13]

write newline

" write newline "" before.all 'output.state := FUNCTION fin.entry write newline FUNCTION new.block output.state before.all = 'skip after.block 'output.state := if FUNCTION stupid.colon after.authors 'output.state := FUNCTION insert.comma output.state before.all = 'skip between.elements 'output.state := if FUNCTION new.sentence output.state after.block = '...

work page
[14]

write newline

" write newline "" before.all 'output.state := FUNCTION fin.entry write newline FUNCTION new.block output.state before.all = 'skip after.block 'output.state := if FUNCTION stupid.colon after.authors 'output.state := FUNCTION insert.comma output.state before.all = 'skip between.elements 'output.state := if FUNCTION new.sentence output.state after.block = '...

work page

[1] [1]

& Kieburtz, D

Hudak, P., Peyton Jones, S., Wadler, P., Boutel, B., Fairbairn, J., Fasel, J., ... & Kieburtz, D. (1992). Report on the programming language Haskell: a non-strict, purely functional language version 1.2. ACM SigPlan notices, 27(5), 1-164. doi:10.1145/130697.130699

work page doi:10.1145/130697.130699 1992

[2] [2]

(2014, November)

Ray, B., Posnett, D., Filkov, V., Devanbu, P. (2014, November). A large scale study of programming languages and code quality in github. In Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering (pp. 155-165). ACM. doi:10.1145/2635868.2635922

work page doi:10.1145/2635868.2635922 2014

[3] [3]

Functional Baby Talk: Analysis of Code Fragments from Novice Haskell Programmers

Singer, J., & Archibald, B. Functional Baby Talk: Analysis of Code Fragments from Novice Haskell Programmers. In Proceedings TFPIE 2017. EPTCS 270, 2018, pp. 37-51. doi:10.4204/EPTCS.270.3

work page doi:10.4204/eptcs.270.3 2017

[4] [4]

(2003, August)

Heeren, B., Leijen, D., & van IJzendoorn, A. (2003, August). Helium, for learning Haskell. In Proceedings of the 2003 ACM SIGPLAN workshop on Haskell (pp. 62-71). ACM. doi:10.1145/871895.871902

work page doi:10.1145/871895.871902 2003

[5] [5]

Gerdes, A., Heeren, B., Jeuring, J., & van Binsbergen, L. T. (2017). Ask-Elle: an adaptable programming tutor for Haskell giving automated feedback. International Journal of Artificial Intelligence in Education, 27(1), 65-100. doi:10.1007/s40593-015-0080-x

work page doi:10.1007/s40593-015-0080-x 2017

[6] [6]

S., Homer, J., & Gee, R

Pettit, R. S., Homer, J., & Gee, R. (2017, March). Do Enhanced Compiler Error Messages Help Students?: Results Inconclusive. In Proceedings of the 2017 ACM SIGCSE Technical Symposium on Computer Science Education (pp. 465-470). ACM. doi:10.1145/3017680.3017768

work page doi:10.1145/3017680.3017768 2017

[7] [7]

(2016, November)

Barik, T. (2016, November). How should static analysis tools explain anomalies to developers?. In Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering (pp. 1118-1120). ACM. doi:10.1145/2950290.2983968

work page doi:10.1145/2950290.2983968 2016

[8] [8]

(2017, May)

Barik, T., Smith, J., Lubick, K., Holmes, E., Feng, J., Murphy-Hill, E., & Parnin, C. (2017, May). Do developers read compiler error messages?. In Proceedings of the 39th International Conference on Software Engineering (pp. 575-585). IEEE Press. doi:10.1109/ICSE.2017.59

work page doi:10.1109/icse.2017.59 2017

[9] [9]

E\"otv\"os Lor\'and University

Functional Programming: Introduction to Haskell (BSc.). E\"otv\"os Lor\'and University. \ ://lambda.inf.elte.hu/Index_en.xml#introduction-to-haskell-bsc \ accessed: (2019, April). Permalink: https://perma.cc/L423-CFZG

work page 2019

[10] [10]

P., Hall, C., Hammond, K., Partain, W., & Wadler, P

Jones, S. P., Hall, C., Hammond, K., Partain, W., & Wadler, P. (1993, July). The Glasgow Haskell compiler: a technical overview. In Proc. UK Joint Framework for Information Technology (JFIT) Technical Conference (Vol. 93)

work page 1993

[11] [11]

, " * write output.state after.block = add.period write

ENTRY address author booktitle chapter edition editor eid howpublished institution journal key month note number organization pages publisher school series title type url volume year label INTEGERS output.state before.all mid.sentence after.sentence after.block FUNCTION init.state.consts #0 'before.all := #1 'mid.sentence := #2 'after.sentence := #3 'afte...

work page

[12] [12]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in capitalize ":" * " " *...

work page

[13] [13]

write newline

" write newline "" before.all 'output.state := FUNCTION fin.entry write newline FUNCTION new.block output.state before.all = 'skip after.block 'output.state := if FUNCTION stupid.colon after.authors 'output.state := FUNCTION insert.comma output.state before.all = 'skip between.elements 'output.state := if FUNCTION new.sentence output.state after.block = '...

work page

[14] [14]

write newline

" write newline "" before.all 'output.state := FUNCTION fin.entry write newline FUNCTION new.block output.state before.all = 'skip after.block 'output.state := if FUNCTION stupid.colon after.authors 'output.state := FUNCTION insert.comma output.state before.all = 'skip between.elements 'output.state := if FUNCTION new.sentence output.state after.block = '...

work page