Learning to Represent Programs with Graphs

Mahmoud Khademi; Marc Brockschmidt; Miltiadis Allamanis

arxiv: 1711.00740 · v3 · pith:JPDZE3F4new · submitted 2017-11-01 · 💻 cs.LG · cs.AI· cs.PL· cs.SE

Learning to Represent Programs with Graphs

Miltiadis Allamanis , Marc Brockschmidt , Mahmoud Khademi This is my paper

classification 💻 cs.LG cs.AIcs.PLcs.SE

keywords codegraphslearningmethodsprogramvariablevarmisuseconsidered

0 comments

read the original abstract

Learning tasks on source code (i.e., formal languages) have been considered recently, but most work has tried to transfer natural language methods and does not capitalize on the unique opportunities offered by code's known syntax. For example, long-range dependencies induced by using the same variable or function in distant locations are often not considered. We propose to use graphs to represent both the syntactic and semantic structure of code and use graph-based deep learning methods to learn to reason over program structures. In this work, we present how to construct graphs from source code and how to scale Gated Graph Neural Networks training to such large graphs. We evaluate our method on two tasks: VarNaming, in which a network attempts to predict the name of a variable given its usage, and VarMisuse, in which the network learns to reason about selecting the correct variable that should be used at a given program location. Our comparison to methods that use less structured program representations shows the advantages of modeling known structure, and suggests that our models learn to infer meaningful names and to solve the VarMisuse task in many cases. Additionally, our testing showed that VarMisuse identifies a number of bugs in mature open-source projects.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
cs.CL 2023-10 unverdicted novelty 8.0

SWE-bench reveals that even top language models like Claude 2 resolve only 1.96% of 2,294 real-world GitHub issues, highlighting a gap in practical coding capabilities.
Exploring Code Analysis: Zero-Shot Insights on Syntax and Semantics with LLMs
cs.SE 2023-05 unverdicted novelty 7.0

LLMs achieve strong results on syntax parsing tasks but show limited and variable performance on dynamic reasoning, with a clear performance hierarchy across model scales.
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation
cs.SE 2021-02 unverdicted novelty 6.0

CodeXGLUE supplies a standardized collection of 10 code-related tasks, 14 datasets, an evaluation platform, and BERT-, GPT-, and encoder-decoder-style baselines.
MileStone: A Multi-Objective Compiler Phase Ordering Framework for Graph-based IR-Level Optimization
cs.PL 2026-05 unverdicted novelty 5.0

MileStone models compiler phase ordering as a multi-objective optimization problem using graph representations, GNN predictions, and RL agents to find Pareto-optimal pass sequences under user constraints.
PLMGH: What Matters in PLM-GNN Hybrids for Code Classification and Vulnerability Detection
cs.SE 2026-04 unverdicted novelty 5.0

Controlled experiments show PLM-GNN hybrids improve code tasks over GNN-only baselines, with PLM source having larger impact than GNN backbone.
CodePori: Large-Scale System for Autonomous Software Development Using Multi-Agent Technology
cs.SE 2024-02 unverdicted novelty 4.0

CodePori is a multi-agent LLM system for code generation whose participant evaluation identifies practical challenges like memory limits and hallucinations missed by binary benchmarks.