pith. sign in

hub

React: Synergizing reasoning and acting in language models

11 Pith papers cite this work. Polarity classification is still indexing.

11 Pith papers citing it

hub tools

citation-role summary

background 2 baseline 1

citation-polarity summary

years

2026 10 2025 1

clear filters

representative citing papers

Verifiable Process Rewards for Agentic Reasoning

cs.AI · 2026-05-11 · unverdicted · novelty 6.0

VPR converts symbolic, constraint, or posterior oracles into dense turn-level rewards for RL, improving credit assignment in agentic reasoning and transferring to general benchmarks.

EvoMAS: Learning Execution-Time Workflows for Multi-Agent Systems

cs.AI · 2026-05-09 · unverdicted · novelty 6.0

EvoMAS trains a workflow adapter with policy gradients to dynamically instantiate stage-specific multi-agent workflows from a fixed agent pool, using explicit task-state construction and terminal success signals, and outperforms static baselines on GAIA, HLE, and DeepResearcher.

citing papers explorer

Showing 10 of 10 citing papers after filters.