Search-based LLMs for code optimization

Shuzheng Gao, Cuiyun Gao, Wenchao Gu, Michael R · 2024 · arXiv 2408.12159

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

On the Road to Personalized Code Intelligence: Portraiting and Assisting Developers Based on Their In-IDE Behaviors

cs.SE · 2026-05-28 · unverdicted · novelty 7.0

VirtualME is a new infrastructure that continuously extracts and interprets in-IDE developer behaviors to build personalized personas, delivering 33.8% better performance on repository-level knowledge Q&A than generic baselines.

AdverMCTS: Combating Pseudo-Correctness in Code Generation via Adversarial Monte Carlo Tree Search

cs.SE · 2026-04-12 · unverdicted · novelty 7.0

AdverMCTS frames code generation as a minimax game where an attacker evolves tests to expose flaws in solver-generated code, yielding more robust outputs than static-test baselines.

AutoTrainess: Teaching Language Models to Improve Language Models Autonomously

cs.CL · 2026-06-30 · unverdicted · novelty 6.0

AutoTrainess exposes training operations via agent-computer interfaces and outperforms CLI-only baselines on PostTrainBench with scores of 26.94 vs 23.21 for GPT-5.4 and similar gains on other models.

citing papers explorer

Showing 3 of 3 citing papers.

On the Road to Personalized Code Intelligence: Portraiting and Assisting Developers Based on Their In-IDE Behaviors cs.SE · 2026-05-28 · unverdicted · none · ref 23
VirtualME is a new infrastructure that continuously extracts and interprets in-IDE developer behaviors to build personalized personas, delivering 33.8% better performance on repository-level knowledge Q&A than generic baselines.
AdverMCTS: Combating Pseudo-Correctness in Code Generation via Adversarial Monte Carlo Tree Search cs.SE · 2026-04-12 · unverdicted · none · ref 13
AdverMCTS frames code generation as a minimax game where an attacker evolves tests to expose flaws in solver-generated code, yielding more robust outputs than static-test baselines.
AutoTrainess: Teaching Language Models to Improve Language Models Autonomously cs.CL · 2026-06-30 · unverdicted · none · ref 16
AutoTrainess exposes training operations via agent-computer interfaces and outperforms CLI-only baselines on PostTrainBench with scores of 26.94 vs 23.21 for GPT-5.4 and similar gains on other models.

Search-based LLMs for code optimization

fields

years

verdicts

representative citing papers

citing papers explorer