Gaussian-Weighted Difference Sampling (GWDS) We consider a probabilistic sampling strategy based on a Gaussian weighting over IoU differences

13 Video-OPD: Efficient Post-Training of MLLMs for Temporal Video Grounding via On-Policy Distillation B · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation

cs.CV · 2026-02-03 · unverdicted · novelty 7.0

Video-OPD uses on-policy distillation from a frontier teacher to turn sparse episode rewards into dense step-wise signals for more efficient post-training of MLLMs on temporal video grounding.

citing papers explorer

Showing 1 of 1 citing paper.

Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation cs.CV · 2026-02-03 · unverdicted · none · ref 22
Video-OPD uses on-policy distillation from a frontier teacher to turn sparse episode rewards into dense step-wise signals for more efficient post-training of MLLMs on temporal video grounding.

Gaussian-Weighted Difference Sampling (GWDS) We consider a probabilistic sampling strategy based on a Gaussian weighting over IoU differences

fields

years

verdicts

representative citing papers

citing papers explorer