Title resolution pending

Write training code undercode/and train a candidate model

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Agent^2 RL-Bench: Can LLM Agents Engineer Agentic RL Post-Training?

cs.AI · 2026-04-12 · unverdicted · novelty 7.0 · 2 refs

Agent² RL-Bench shows LLM agents can occasionally engineer online RL post-training pipelines that boost performance (e.g., ALFWorld from 4.85 to 93.28) but stable success remains rare under fixed budgets.

citing papers explorer

Showing 1 of 1 citing paper.

Agent^2 RL-Bench: Can LLM Agents Engineer Agentic RL Post-Training? cs.AI · 2026-04-12 · unverdicted · none · ref 3 · 2 links
Agent² RL-Bench shows LLM agents can occasionally engineer online RL post-training pipelines that boost performance (e.g., ALFWorld from 4.85 to 93.28) but stable success remains rare under fixed budgets.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer