arXiv preprint arXiv:2603.01940 , year=

Jinpeng Chen, Cheng Gong, Hanbo Li, Ziru Liu, Zichen Tian, Xinyu Fu, Shi Wu, Chenyang Zhang, Wu Zhang, Suiyun Zhang, Dandan Tu, Rui Liu · 2026 · arXiv 2603.01940

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

ISE: An Execution-Grounded Recipe for Multi-Turn OS-Agent Trajectories

cs.CL · 2026-06-09 · conditional · novelty 7.0

ISE creates 23,132 execution-grounded multi-turn OS agent trajectories via intent simulation and live execution, improving agent performance on ClawEval from 19.3 to 37.7 pass@1 with Qwen3-8B.

WRIT: Write-Read Intensive Trajectory Synthesis for Multi-Turn User-Facing Agents

cs.CL · 2026-06-01 · unverdicted · novelty 6.0

WRIT is a synthesis pipeline that generates write-read intensive trajectories along axes of write-decision count and per-decision evidence burden, enabling a 4B model to outperform GPT-5.1 on τ²-bench with reduced inference tokens.

citing papers explorer

Showing 2 of 2 citing papers after filters.

ISE: An Execution-Grounded Recipe for Multi-Turn OS-Agent Trajectories cs.CL · 2026-06-09 · conditional · none · ref 45
ISE creates 23,132 execution-grounded multi-turn OS agent trajectories via intent simulation and live execution, improving agent performance on ClawEval from 19.3 to 37.7 pass@1 with Qwen3-8B.
WRIT: Write-Read Intensive Trajectory Synthesis for Multi-Turn User-Facing Agents cs.CL · 2026-06-01 · unverdicted · none · ref 3
WRIT is a synthesis pipeline that generates write-read intensive trajectories along axes of write-decision count and per-decision evidence burden, enabling a 4B model to outperform GPT-5.1 on τ²-bench with reduced inference tokens.

arXiv preprint arXiv:2603.01940 , year=

fields

years

verdicts

representative citing papers

citing papers explorer