pith. sign in

arxiv: 2509.26476 · v2 · pith:RWP6ZAGInew · submitted 2025-09-30 · 💻 cs.CL · cs.AI· cs.LG· cs.PF· cs.SE

Regression Language Models for Code

classification 💻 cs.CL cs.AIcs.LGcs.PFcs.SE
keywords codelanguagesregressionacrossaveragelanguagemodelnetworks
0
0 comments X
read the original abstract

We study code-to-metric regression: predicting numeric outcomes of code executions, a challenging task due to the open-ended nature of programming languages. While prior methods have resorted to heavy and domain-specific feature engineering, we show that a single unified Regression Language Model (RLM) using a frozen LLM encoder can simultaneously predict directly from text, (i) the memory footprint of code across multiple high-level languages such as Python and C++, (ii) the latency of Triton GPU kernels, and (iii) the accuracy and speed of trained neural networks represented in ONNX. In particular, a relatively small 300M parameter RLM based on T5Gemma, obtains $>$0.9 Spearman-rank on competitive programming submissions from APPS, and a single unified model achieves $>$0.5 average Spearman-rank across 17 separate languages from CodeNet. Furthermore, the RLM can obtain the highest average Kendall-Tau of 0.46 on five classic NAS design spaces previously dominated by graph neural networks, and simultaneously predict architecture latencies on numerous hardware platforms.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Distribution-Aware Reward: Reinforcement Learning over Predictive Distributions for LLM Regression

    cs.LG 2026-05 unverdicted novelty 7.0

    Distribution-Aware Reward optimizes LLM regression by treating rollouts as empirical predictive distributions and rewarding marginal improvements in CRPS quality rather than point accuracy alone.

  2. LASER: Language Model Regression for Semi-Structured Workflow Resource and Runtime Estimation

    cs.LG 2025-12 unverdicted novelty 7.0

    LASER uses fine-tuned LLMs with scientific-notation output encoding and constrained decoding to regress resource and runtime targets from semi-structured workflow configurations, outperforming tabular baselines and ex...