Proact-VL: A Proactive VideoLLM for Real-Time AI Companions

Haodong Li; Hao Liao; Jianxun Lian; Qi Ran; Tao Jin; Wang Lin; Weicai Yan; Xing Xie; Yuhong Dai

arxiv: 2603.03447 · v3 · pith:Q4YHJ2EFnew · submitted 2026-03-03 · 💻 cs.CV

Proact-VL: A Proactive VideoLLM for Real-Time AI Companions

Weicai Yan , Yuhong Dai , Qi Ran , Haodong Li , Wang Lin , Tao Jin , Xing Xie , Hao Liao

show 1 more author

Jianxun Lian

This is my paper

classification 💻 cs.CV

keywords real-timecompanionsinteractiveproact-vlproactivegaminghuman-likequality

0 comments

read the original abstract

Proactive and real-time interactive experiences are essential for human-like AI companions, yet face three key challenges: (1) achieving low-latency inference under continuous streaming inputs, (2) autonomously deciding when to respond, and (3) controlling both quality and quantity of generated content to meet real-time constraints. In this work, we instantiate AI companions through two gaming scenarios, commentator and guide, selected for their suitability for automatic evaluation. We introduce the Live Gaming Benchmark, a large-scale dataset with three representative scenarios: solo commentary, co-commentary, and user guidance, and present Proact-VL, a general framework that shapes multimodal language models into proactive, real-time interactive agents capable of human-like environment perception and interaction. Extensive experiments show Proact-VL achieves superior response latency and quality while maintaining strong video understanding capabilities, demonstrating its practicality for real-time interactive applications.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Towards Generalist Game Players: An Investigation of Foundation Models in the Game Multiverse
cs.CV 2026-05 unverdicted novelty 5.0

The paper organizes research on generalist game AI into Dataset, Model, Harness, and Benchmark pillars and charts a five-level progression from single-game mastery to agents that create and live inside game multiverses.
Towards Generalist Game Players: An Investigation of Foundation Models in the Game Multiverse
cs.CV 2026-05 unverdicted novelty 3.0

This work traces four eras of generalist game players across dataset, model, harness, and benchmark pillars and charts a five-level roadmap ending in agents that create and evolve within game multiverses.