Title resolution pending

Avoids blind retries (repeating failed calls without change)

1 Pith paper cites this work. Polarity classification is still indexing.


Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries the DOI and OpenAlex lookups on its next pass.

representative citing papers

Aligning Agents via Planning: A Benchmark for Trajectory-Level Reward Modeling

cs.AI · 2026-04-09 · unverdicted · novelty 7.0 · 2 refs

Plan-RewardBench is a trajectory-level preference benchmark that evaluates how well reward models distinguish preferred agent trajectories from hard distractors across safety refusal, tool handling, complex planning, and error recovery tasks.

citing papers explorer

Showing 1 of 1 citing paper.

Aligning Agents via Planning: A Benchmark for Trajectory-Level Reward Modeling

cs.AI · 2026-04-09 · unverdicted · none · ref 14 · 2 links
