Automated skill discovery for language agents through exploration and iterative feedback

Automated Skill Discovery for Language Agents through Exploration · 2025 · arXiv 2506.04287

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Co-Evolving Skill Generation and Policy Optimization

cs.CL · 2026-06-07 · unverdicted · novelty 7.0

The framework estimates the context-dependent marginal utility of each candidate skill from the reward gap between matched base and skill-augmented rollouts, then uses that estimate to filter skills while co-training the policy as the skill generator.

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

cs.LG · 2026-04-08 · unverdicted · novelty 7.0

This survey introduces the Generate-Filter-Control-Replay (GFCR) taxonomy to structure rollout pipelines for RL-based post-training of reasoning LLMs.

SKILL-DISCO: Distilling and Compiling Agent Traces into Reusable Procedural Skills

cs.AI · 2026-06-25 · unverdicted · novelty 5.0

SkillDisCo distills reusable PFSM subgraphs from successful agent traces and compiles them into callable procedural skills, improving success rates and reducing turns on ALFWorld and WebArena.

SkillHone: A Harness for Continual Agent Skill Evolution Through Persistent Decision History

cs.LG · 2026-06-07 · unverdicted · novelty 5.0

SkillHone introduces a harness that maintains persistent decision histories to support continual evolution of language-model agent skills, reporting 15.8-point gains on GAIA over a commercial deep-research agent.

citing papers explorer

Showing 4 of 4 citing papers.

Co-Evolving Skill Generation and Policy Optimization · cs.CL · 2026-06-07 · unverdicted · none · ref 25

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning · cs.LG · 2026-04-08 · unverdicted · none · ref 153

SKILL-DISCO: Distilling and Compiling Agent Traces into Reusable Procedural Skills · cs.AI · 2026-06-25 · unverdicted · none · ref 25

SkillHone: A Harness for Continual Agent Skill Evolution Through Persistent Decision History · cs.LG · 2026-06-07 · unverdicted · none · ref 3
