Automated Correction for Syntax Errors in Programming Assignments using Recurrent Neural Networks

Rishabh Singh; Sahil Bhatia

arxiv: 1603.06129 · v1 · pith:OKADTFBWnew · submitted 2016-03-19 · 💻 cs.PL · cs.AI· cs.LG· cs.SE

Automated Correction for Syntax Errors in Programming Assignments using Recurrent Neural Networks

Sahil Bhatia , Rishabh Singh This is my paper

classification 💻 cs.PL cs.AIcs.LGcs.SE

keywords errorssyntaxstudentsubmissionsfeedbackprogrammingtokenmodel

0 comments

read the original abstract

We present a method for automatically generating repair feedback for syntax errors for introductory programming problems. Syntax errors constitute one of the largest classes of errors (34%) in our dataset of student submissions obtained from a MOOC course on edX. The previous techniques for generating automated feed- back on programming assignments have focused on functional correctness and style considerations of student programs. These techniques analyze the program AST of the program and then perform some dynamic and symbolic analyses to compute repair feedback. Unfortunately, it is not possible to generate ASTs for student pro- grams with syntax errors and therefore the previous feedback techniques are not applicable in repairing syntax errors. We present a technique for providing feedback on syntax errors that uses Recurrent neural networks (RNNs) to model syntactically valid token sequences. Our approach is inspired from the recent work on learning language models from Big Code (large code corpus). For a given programming assignment, we first learn an RNN to model all valid token sequences using the set of syntactically correct student submissions. Then, for a student submission with syntax errors, we query the learnt RNN model with the prefix to- ken sequence to predict token sequences that can fix the error by either replacing or inserting the predicted token sequence at the error location. We evaluate our technique on over 14, 000 student submissions with syntax errors. Our technique can completely re- pair 31.69% (4501/14203) of submissions with syntax errors and in addition partially correct 6.39% (908/14203) of the submissions.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Automatic Repair and Type Binding of Undeclared Variables using Neural Networks
cs.SE 2019-07 unverdicted novelty 4.0

Neural network trained on AST structural details repairs undeclared variable errors and infers types, reporting 81% success on location/identification and 80% on types for 1059 programs in the prutor dataset.