Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

6 Pith papers cite this work. Polarity classification is still indexing.

Citing papers by year: 2026 (6)

representative citing papers

Joint Optimization of Multi-agent Memory System

cs.MA · 2026-03-13 · unverdicted · novelty 6.0

CoMAM jointly optimizes agents in multi-agent LLM memory systems via end-to-end RL and adaptive credit assignment to improve collaboration and performance.
