pith. sign in

Canonical reference

Laminar: A scalable asyn- chronous rl post-training framework

Canonical reference. 100% of citing Pith papers cite this work as background.

13 Pith papers citing it
Background 100% of classified citations

citation-role summary

background 5

citation-polarity summary

years

2026 12 2025 1

verdicts

UNVERDICTED 13

roles

background 5

polarities

background 5

clear filters

representative citing papers

Libra: Efficient Resource Management for Agentic RL Post-Training

cs.LG · 2026-06-02 · unverdicted · novelty 4.0

Libra optimizes GPU allocation across rollout and training in agentic RL via an elastic hybrid pool and C-MLFQ scheduler based on tool-return causal signals, claiming up to 3.0x throughput and 2.5x faster reward convergence on 48 A800 GPUs.

citing papers explorer

Showing 12 of 12 citing papers after filters.