KartikNagpal, DayiDong, Jean-BaptisteBouvier, andNegarMehr

Nagpal, K · 2025 · arXiv 2502.16863

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Reflect-R1: Evidence-Driven Reflection for Self-Correction in Long Video Understanding

cs.CV · 2026-06-26 · unverdicted · novelty 7.0 · 2 refs

Reflect-R1 introduces the first evidence-driven self-correction framework for long video understanding using a three-stage pipeline, stage-decoupled RL via SD-GRPO, and a 120K dataset to achieve SOTA on VideoMME and LongVideoBench.

Clarus: Coordinating Autonomous Research Agents toward Web-Scale Scientific Collaboration

cs.AI · 2026-06-29 · unverdicted · novelty 5.0

Clarus is a four-layer collaboration infrastructure with a project-agent-resource model that reformulates research as an open, traceable, multi-participant process.

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

cs.CL · 2026-04-10 · unverdicted · novelty 5.0

A survey of credit assignment techniques in LLM reinforcement learning that distinguishes maturing methods for reasoning from new approaches needed for agentic settings and provides supporting resources.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Clarus: Coordinating Autonomous Research Agents toward Web-Scale Scientific Collaboration cs.AI · 2026-06-29 · unverdicted · none · ref 16
Clarus is a four-layer collaboration infrastructure with a project-agent-resource model that reformulates research as an open, traceable, multi-participant process.

KartikNagpal, DayiDong, Jean-BaptisteBouvier, andNegarMehr

fields

years

verdicts

representative citing papers

citing papers explorer