Datasets and recipes for video temporal grounding via reinforcement learning

Chen, R · 2025

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation

cs.CV · 2026-02-03 · unverdicted · novelty 7.0

Video-OPD uses on-policy distillation from a frontier teacher to turn sparse episode rewards into dense step-wise signals for more efficient post-training of MLLMs on temporal video grounding.

citing papers explorer

Showing 1 of 1 citing paper.

Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation

cs.CV · 2026-02-03 · unverdicted · none · ref 2

Video-OPD uses on-policy distillation from a frontier teacher to turn sparse episode rewards into dense step-wise signals for more efficient post-training of MLLMs on temporal video grounding.
