Preference-based trajectory evaluation reduces tied comparisons from roughly 75% to 35% across agentic benchmarks by using temporal preferences over progress and return profiles.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.LG 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
Causal localization via attribution and patching identifies a temporal preference subgraph in mid-to-upper layers of Qwen3-4B-Instruct-2507, with time-horizon geometry in the residual stream and initial evidence for steering-vector control.
citing papers explorer
-
Offline Preference-Based Trajectory Evaluation
Preference-based trajectory evaluation reduces tied comparisons from roughly 75% to 35% across agentic benchmarks by using temporal preferences over progress and return profiles.
-
Temporal Preference Concepts and their Functions in a Large Language Model
Causal localization via attribution and patching identifies a temporal preference subgraph in mid-to-upper layers of Qwen3-4B-Instruct-2507, with time-horizon geometry in the residual stream and initial evidence for steering-vector control.