VideoLLM-online: Online Video Large Language Model for Streaming Video

Joya Chen, Zhaoyang Lv, Shiwei Wu, Kevin Qinghong Lin, Chenan Song, Difei Gao, Jia-Wei Liu, Ziteng Gao, Dongxing Mao, Mike Zheng Shou · 2024

2 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

dataset 1

citation-polarity summary

use dataset 1

representative citing papers

DiscussLLM: Teaching Large Language Models When to Speak

cs.CL · 2025-08-25 · unverdicted · novelty 5.0

DiscussLLM introduces a two-stage synthetic data pipeline to annotate multi-turn discussions with five intervention types and trains LLMs to time contributions via a silent token or proactive responses.

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

cs.CV · 2025-01-22 · unverdicted · novelty 4.0

VideoLLaMA3 uses a vision-centric training paradigm and token-reduction design to reach competitive results on image and video benchmarks.

citing papers explorer

Showing 2 of 2 citing papers.

DiscussLLM: Teaching Large Language Models When to Speak
cs.CL · 2025-08-25 · unverdicted · none · ref 22
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding
cs.CV · 2025-01-22 · unverdicted · none · ref 105
