
arxiv: 1608.02715 · v1 · pith:675ZYTJ3new · submitted 2016-08-09 · 💻 cs.SE · stat.ML

A deep language model for software code

classification 💻 cs.SE stat.ML
keywords code · language · software · model · deep · learning-based · long · memory

Existing language models for software code, such as n-grams, often fail to capture long contexts in which dependent code elements are scattered far apart. In this paper, we propose a novel approach to building a language model for software code that addresses this issue. Our language model, partly inspired by human memory, is built upon the deep learning-based Long Short-Term Memory (LSTM) architecture, which is capable of learning the long-term dependencies that occur frequently in software code. Results from our intrinsic evaluation on a corpus of Java projects demonstrate the effectiveness of our language model. This work contributes to realizing our vision for DeepSoft, an end-to-end, generic deep learning-based framework for modeling software and its development process.
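To make the long-context limitation concrete, the following is a minimal sketch, not taken from the paper, of the fixed window an n-gram model conditions on. The helper `ngram_context` and the tokenized Java fragment are hypothetical and chosen only for illustration: a resource is opened and later closed, and the trigram context at the `close` token no longer contains the `open` token it depends on.

```python
def ngram_context(tokens, index, n):
    """Return the n-1 preceding tokens an n-gram model sees when predicting tokens[index]."""
    start = max(0, index - (n - 1))
    return tokens[start:index]


# Hypothetical, whitespace-tokenized Java fragment (illustrative only).
tokens = (
    "BufferedReader r = open ( path ) ; "
    "String line = r . readLine ( ) ; "
    "process ( line ) ; "
    "r . close ( ) ;"
).split()

open_index = tokens.index("open")
close_index = tokens.index("close")

# A trigram model (n=3) conditions on only the two preceding tokens.
context = ngram_context(tokens, close_index, n=3)
distance = close_index - open_index

print(context)               # ['r', '.']
print("open" in context)     # False
print(distance)              # 21
```

In this toy fragment the window would have to grow to n = 22 before the `open` token enters the context, which is the kind of gap a recurrent model with memory is intended to bridge.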
