hub Mixed citations

arXiv preprint arXiv:2503.23383 , year=

Mixed citation behavior. Most common role is background (67%).

19 Pith papers citing it
Background 67% of classified citations

citation-role summary

background 7 · baseline 1 · other 1

years

2026: 12 · 2025: 7

representative citing papers

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

cs.CL · 2026-05-27 · unverdicted · novelty 6.0 · 2 refs

AXPO addresses the Thinking-Acting Gap in agentic RL training by targeted resampling of tool calls in all-wrong subgroups, delivering +1.8pp gains over GRPO on nine multimodal benchmarks with an 8B model beating a 32B baseline on Pass@4.

Harnessing LLM Agents with Skill Programs

cs.AI · 2026-05-18 · conditional · novelty 6.0

HASP upgrades textual skills into executable Program Functions that intervene in LLM agent loops at inference, post-training, or self-evolution, delivering 25% gains over ReAct and 30.4% over Search-R1 on reasoning benchmarks.
