InAnnual Meeting of the Association for Computational Linguistics (ACL)

AppWorld: A Controllable World of Apps, People for Benchmarking Interactive Coding Agents

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Beyond Next-Token Prediction: An RLVR Proof of Concept for Tool-Use Agents on Atlassian Workflows

cs.AI · 2026-07-01 · unverdicted · novelty 4.0

RLVR training on five synthetic Atlassian API environments raises average tool-use reward for Qwen models from 0.35-0.92 to 0.95-1.00 on four non-degenerate scenarios.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Beyond Next-Token Prediction: An RLVR Proof of Concept for Tool-Use Agents on Atlassian Workflows cs.AI · 2026-07-01 · unverdicted · none · ref 14
RLVR training on five synthetic Atlassian API environments raises average tool-use reward for Qwen models from 0.35-0.92 to 0.95-1.00 on four non-degenerate scenarios.

InAnnual Meeting of the Association for Computational Linguistics (ACL)

fields

years

verdicts

representative citing papers

citing papers explorer