EMO-STA evolves a shared program archive across task families then adapts candidates to targets, outperforming matched-compute single-task evolution in most of eight families while reducing overfitting on low-data tasks like ARC.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
years
2026 3representative citing papers
OPT-BENCH and OPT-Agent evaluate LLM self-optimization in large search spaces, showing stronger models improve via feedback but stay constrained by base capacity and below human performance.
Denoising Recursion Models train multi-step noise reversal in looped transformers and outperform the prior Tiny Recursion Model on ARC-AGI.
citing papers explorer
-
Evolutionary Multi-Task Optimization for LLM-Guided Program Discovery
EMO-STA evolves a shared program archive across task families then adapts candidates to targets, outperforming matched-compute single-task evolution in most of eight families while reducing overfitting on low-data tasks like ARC.
-
OPT-BENCH: Evaluating the Iterative Self-Optimization of LLM Agents in Large-Scale Search Spaces
OPT-BENCH and OPT-Agent evaluate LLM self-optimization in large search spaces, showing stronger models improve via feedback but stay constrained by base capacity and below human performance.
-
One Step Forward and K Steps Back: Better Reasoning with Denoising Recursion Models
Denoising Recursion Models train multi-step noise reversal in looped transformers and outperform the prior Tiny Recursion Model on ARC-AGI.