MOTOR-Bench supplies a real-world video dataset for structured mental state understanding in learning settings, while MOTOR-MAS improves zero-shot prediction of behavior, cognition, and emotion labels over single models and other multi-agent systems.
Affectgpt: A new dataset, model, and benchmark for emotion understanding with multimodal large language models
5 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 5verdicts
UNVERDICTED 5roles
baseline 1polarities
baseline 1representative citing papers
EmoTrans is a new video benchmark with four progressive tasks that measures how well current multimodal LLMs handle dynamic emotion transitions rather than static recognition.
Introduces UXBench benchmark for MLLM UI UX reasoning and UI-UX model achieving 0.7963 accuracy via RL enhancements on Qwen3-VL base.
DeceptionX is an MLLM framework that performs explainable deception detection through structured chain-of-thought reasoning on audiovisual cues, trained via a three-stage pipeline on the new DeceptChain dataset and a DARE redundancy elimination strategy.
EmoS is a new high-fidelity benchmark for fine-grained streaming emotional understanding that produces measurable gains when used to fine-tune multimodal large language models.
citing papers explorer
-
Reasoning for Mobile User Experience with Multimodal LLMs: Task, Benchmark, and Approach
Introduces UXBench benchmark for MLLM UI UX reasoning and UI-UX model achieving 0.7963 accuracy via RL enhancements on Qwen3-VL base.