pith. sign in

hub Canonical reference

Multi-agent evolve: Llm self-improve through co-evolution.arXiv preprint arXiv:2510.23595

Canonical reference. 75% of citing Pith papers cite this work as background.

10 Pith papers citing it
Background 75% of classified citations

hub tools

citation-role summary

background 6 baseline 1 method 1

citation-polarity summary

years

2026 10

representative citing papers

AIPO: Learning to Reason from Active Interaction

cs.CL · 2026-05-08 · unverdicted · novelty 6.0 · 2 refs

AIPO adds active multi-agent consultation (Verify, Knowledge, Reasoning agents) plus custom importance sampling to RLVR training so LLMs expand their reasoning boundary and then operate without the agents.

citing papers explorer

Showing 10 of 10 citing papers.