NLI autonomously discovers a vocabulary of primitive operations and interprets variable-length programs via a neural executor, allowing end-to-end training and gradient-based test-time adaptation that outperforms prior methods on combinatorial generalization tasks.
arXiv preprint arXiv:2106.05126 , year=
2 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.LG 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
NICO-TSP learns a 2-opt local search policy for TSP using edge tokens, imitation learning on short trajectories, and critic-free group RL on longer rollouts, yielding more step-efficient improvement and better out-of-distribution generalization than prior neural and heuristic baselines.
citing papers explorer
-
Gradient-Based Program Synthesis with Neurally Interpreted Languages
NLI autonomously discovers a vocabulary of primitive operations and interprets variable-length programs via a neural executor, allowing end-to-end training and gradient-based test-time adaptation that outperforms prior methods on combinatorial generalization tasks.
-
A First Guess is Rarely the Final Answer: Learning to Search in the Traveling Salesperson Problem
NICO-TSP learns a 2-opt local search policy for TSP using edge tokens, imitation learning on short trajectories, and critic-free group RL on longer rollouts, yielding more step-efficient improvement and better out-of-distribution generalization than prior neural and heuristic baselines.