Multi-agent VLM frameworks outperform single VLMs for automated coding of on-screen collaborative learning behaviors using the ICAP framework.
Multi-modal agent tuning: Building a vlm-driven agent for efficient tool usage.arXiv preprint arXiv:2412.15606,
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
citation-role summary
background 1
citation-polarity summary
fields
cs.AI 2roles
background 1polarities
support 1representative citing papers
citing papers explorer
-
Single-agent vs. Multi-agents for Automated Video Analysis of On-Screen Collaborative Learning Behaviors
Multi-agent VLM frameworks outperform single VLMs for automated coding of on-screen collaborative learning behaviors using the ICAP framework.
- Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents