Neural Program Synthesis with Priority Queue Training

arxiv: 1801.03526 · v2 · pith:XKCDJQ6Wnew · submitted 2018-01-10 · 💻 cs.AI

Neural Program Synthesis with Priority Queue Training

Daniel A. Abolafia , Mohammad Norouzi , Jonathan Shen , Rui Zhao , Quoc V. Le This is my paper

classification 💻 cs.AI

keywords programspriorityqueuealgorithmprogrambaselinescalledfunction

0 comments p. Extension

pith:XKCDJQ6W Add to your LaTeX paper

What is a Pith Number?

\usepackage{pith}
\pithnumber{XKCDJQ6W}

Prints a linked pith:XKCDJQ6W badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

We consider the task of program synthesis in the presence of a reward function over the output of programs, where the goal is to find programs with maximal rewards. We employ an iterative optimization scheme, where we train an RNN on a dataset of K best programs from a priority queue of the generated programs so far. Then, we synthesize new programs and add them to the priority queue by sampling from the RNN. We benchmark our algorithm, called priority queue training (or PQT), against genetic algorithm and reinforcement learning baselines on a simple but expressive Turing complete programming language called BF. Our experimental results show that our simple PQT algorithm significantly outperforms the baselines. By adding a program length penalty to the reward function, we are able to synthesize short, human readable programs.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Automated Modeling Method for Pathloss Model Discovery
cs.LG 2025-05 unverdicted novelty 5.0

Automated methods based on Deep Symbolic Regression and Kolmogorov-Arnold Networks discover compact, interpretable path loss models that achieve high accuracy and reduce prediction errors by up to 75% compared to trad...