Gradient coupling: The hidden barrier to generalization in agentic reinforcement learning

Jingyu Liu, Xiaopeng Wu, Jingquan Peng, Kehan Chen, Chuan Yu, Lizhong Ding, Yong Liu · 2025 · arXiv 2509.23870

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

HIPIF: Hierarchical Planning and Information Folding for Long-Horizon LLM Agent Learning

cs.AI · 2026-06-09 · unverdicted · novelty 4.0

HIPIF trains LLM agents end-to-end using subgoal-based hierarchical planning and information folding of completed histories, plus hierarchical reflection and process rewards, to handle long-horizon tasks without auxiliary models or expert trajectories.

citing papers explorer

HIPIF: Hierarchical Planning and Information Folding for Long-Horizon LLM Agent Learning

cs.AI · 2026-06-09 · unverdicted · none · ref 12

