
Canonical reference

Unifying group-relative and self-distillation policy optimization via sample routing

Canonical reference. 100% of classified citations from Pith papers cite this work as background.

11 Pith papers citing it
Background 100% of classified citations


citation-role summary

background 6

citation-polarity summary

background 6

years

2026 11

verdicts

unverdicted 11

representative citing papers

VISD: Enhancing Video Reasoning via Structured Self-Distillation

cs.CV · 2026-05-07 · unverdicted · novelty 5.0 · 4 refs

VISD proposes structured self-distillation with a multi-dimensional judge model and direction-magnitude decoupling to improve token-level credit assignment and convergence speed in VideoLLM reasoning training.
