Proposes PARPO for reward decoupling with user anchors and PSGM for preference-aligned skill memory in personalized agentic RL, reporting outperformance on ETAPP benchmarks.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
years
2026 3verdicts
UNVERDICTED 3representative citing papers
Hedwig is a coding agent that dynamically adjusts its autonomy by learning behavioral guidelines from developer decisions and feedback over time.
AgentClick is a localhost npm server and skill-based plugin that connects terminal AI agents to a structured web UI for human review of plans, code execution, memory, and errors.
citing papers explorer
-
From Correctness to Preference: A Framework for Personalized Agentic Reinforcement Learning
Proposes PARPO for reward decoupling with user anchors and PSGM for preference-aligned skill memory in personalized agentic RL, reporting outperformance on ETAPP benchmarks.
-
Hedwig: Dynamic Autonomy for Coding Agents Under Local Oversight
Hedwig is a coding agent that dynamically adjusts its autonomy by learning behavioral guidelines from developer decisions and feedback over time.
-
AgentClick: A Skill-Based Human-in-the-Loop Review Layer for Terminal AI Agents
AgentClick is a localhost npm server and skill-based plugin that connects terminal AI agents to a structured web UI for human review of plans, code execution, memory, and errors.