EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Bingguang Hao, Zengzhuang Xu, Yuntao Wen, Xinyi Xu, Yang Liu, Tong Zhao, Maolin Wang, Long Chen, Dong Wang, Yicheng Chen, et al. · 2026 · arXiv 2601.01498

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

SEAL: Synergistic Co-Evolution of Agents and Learning Environments

cs.CL · 2026-05-23 · unverdicted · novelty 6.0

SEAL co-evolves LLM agents and environments via shared turn-level failure diagnoses, yielding +8.25 to +26.25 point gains on tool-use tasks with only 400 samples.

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

cs.CL · 2026-05-18 · unverdicted · novelty 6.0

EnvFactory automates synthesis of executable tool environments and natural multi-turn trajectories from authentic sources to enable efficient Agentic RL, delivering up to +15% gains on BFCLv3 with only 85 environments.

Hybrid Inspection and Task-Based Access Control in Zero-Trust Agentic AI

cs.AI · 2026-05-04 · unverdicted · novelty 5.0

A hybrid deterministic-plus-semantic interception layer for continuous task-based authorization of multi-turn LLM agent tool invocations, with new multi-turn datasets and initial experiments.

citing papers explorer

Showing 3 of 3 citing papers.

SEAL: Synergistic Co-Evolution of Agents and Learning Environments

cs.CL · 2026-05-23 · unverdicted · none · ref 29

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

cs.CL · 2026-05-18 · unverdicted · none · ref 1

Hybrid Inspection and Task-Based Access Control in Zero-Trust Agentic AI

cs.AI · 2026-05-04 · unverdicted · none · ref 22
