Termigen: High-fidelity environment and robust trajectory synthesis for terminal agents

Kaijie Zhu, Yuzhou Nie, Yijiang Li, Yiming Huang, Jialian Wu, Jiang Liu, Ximeng Sun, Zhenfei Yin, Lun Wang, Zicheng Liu, et al. · 2026 · arXiv 2602.07274

5 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

Terminal-World: Scaling Terminal-Agent Environments via Agent Skills

cs.CL · 2026-05-20 · unverdicted · novelty 7.0

Terminal-World is a skill-based synthesis pipeline that generates 5,723 training environments and produces Terminal-World-32B, which outperforms baselines on Terminal-Bench 2.0 using only 1.2% of the data.

LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents

cs.CL · 2026-05-28 · unverdicted · novelty 6.0

LiteCoder-Terminal-Gen creates synthetic terminal datasets that, after SFT and DMPO on Qwen models, yield 29.06%, 18.54%, and 34.00% pass@1 on Terminal-Bench 1.0, 2.0, and Pro, respectively.

Claw-Anything: Benchmarking Always-On Personal Assistants with Broader Access to User's Digital World

cs.AI · 2026-05-25 · unverdicted · novelty 6.0

The Claw-Anything benchmark tests LLM agents on proactive assistance in complex simulated user digital environments with long histories, interdependent services, and noise; GPT-5.5 scores 34.5% pass@1.

OpenComputer: Verifiable Software Worlds for Computer-Use Agents

cs.AI · 2026-05-19 · unverdicted · novelty 6.0

OpenComputer introduces a verifier-grounded framework with state verifiers, self-evolving layers, task synthesis, and auditable evaluation, covering 33 desktop apps and 1,000 tasks, to support computer-use AI agents.

Toward Scalable Terminal Task Synthesis via Skill Graphs

cs.AI · 2026-04-28 · unverdicted · novelty 6.0

SkillSynth uses a scenario-mediated skill graph to sample workflow paths and generate executable terminal tasks, enabling controlled diversity in training trajectories for agents.

citing papers explorer


Terminal-World: Scaling Terminal-Agent Environments via Agent Skills · cs.CL · 2026-05-20 · unverdicted · none · ref 17
LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents · cs.CL · 2026-05-28 · unverdicted · none · ref 13
Claw-Anything: Benchmarking Always-On Personal Assistants with Broader Access to User's Digital World · cs.AI · 2026-05-25 · unverdicted · none · ref 36
OpenComputer: Verifiable Software Worlds for Computer-Use Agents · cs.AI · 2026-05-19 · unverdicted · none · ref 22
Toward Scalable Terminal Task Synthesis via Skill Graphs · cs.AI · 2026-04-28 · unverdicted · none · ref 16
