Title resolution pending

Venkatesh Mishra, Amir Saeidi, Satyam Raj, Mutsumi Nakamura, Jayanth Srinivasa, Gaowen Liu, Ali Payani, Chitta Baral · 2025 · arXiv 2508.20931

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Self-Evolution for Multi-Turn Tool-Calling Agents via Divergence-Point Preference Learning

cs.LG · 2026-06-22 · unverdicted · novelty 4.0

ToolGraph plus DPO on divergence-point preferences lifts weighted average reward on 375 tau2-bench tasks from 0.304 to 0.355.

citing papers explorer

Showing 1 of 1 citing paper.

Self-Evolution for Multi-Turn Tool-Calling Agents via Divergence-Point Preference Learning cs.LG · 2026-06-22 · unverdicted · none · ref 7
ToolGraph plus DPO on divergence-point preferences lifts weighted average reward on 375 tau2-bench tasks from 0.304 to 0.355.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer