Affectgpt: A new dataset, model, and benchmark for emotion understanding with multimodal large language models

Zheng Lian, Haoyu Chen, Lan Chen, Haiyang Sun, Licai Sun, Yong Ren, Zebang Cheng, Bin Liu, Rui Liu, Xiaojiang Peng, et al · 2025 · arXiv 2501.16566

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

read on arXiv browse 5 citing papers

citation-role summary

baseline 1

citation-polarity summary

baseline 1

representative citing papers

MOTOR-Bench: A Real-world Dataset and Multi-agent Framework for Zero-shot Human Mental State Understanding

cs.CV · 2026-05-10 · unverdicted · novelty 7.0

MOTOR-Bench supplies a real-world video dataset for structured mental state understanding in learning settings, while MOTOR-MAS improves zero-shot prediction of behavior, cognition, and emotion labels over single models and other multi-agent systems.

EmoTrans: A Benchmark for Understanding, Reasoning, and Predicting Emotion Transitions in Multimodal LLMs

cs.CV · 2026-04-25 · unverdicted · novelty 7.0

EmoTrans is a new video benchmark with four progressive tasks that measures how well current multimodal LLMs handle dynamic emotion transitions rather than static recognition.

Reasoning for Mobile User Experience with Multimodal LLMs: Task, Benchmark, and Approach

cs.AI · 2026-06-11 · unverdicted · novelty 6.0

Introduces UXBench benchmark for MLLM UI UX reasoning and UI-UX model achieving 0.7963 accuracy via RL enhancements on Qwen3-VL base.

DeceptionX: Explainable Deception Detection with Multimodal Large Language Models

cs.CV · 2026-06-09 · unverdicted · novelty 5.0

DeceptionX is an MLLM framework that performs explainable deception detection through structured chain-of-thought reasoning on audiovisual cues, trained via a three-stage pipeline on the new DeceptChain dataset and a DARE redundancy elimination strategy.

EmoS: A High-Fidelity Multimodal Benchmark for Fine-grained Streaming Emotional Understanding

cs.CL · 2026-05-09 · unverdicted · novelty 5.0

EmoS is a new high-fidelity benchmark for fine-grained streaming emotional understanding that produces measurable gains when used to fine-tune multimodal large language models.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Reasoning for Mobile User Experience with Multimodal LLMs: Task, Benchmark, and Approach cs.AI · 2026-06-11 · unverdicted · none · ref 15
Introduces UXBench benchmark for MLLM UI UX reasoning and UI-UX model achieving 0.7963 accuracy via RL enhancements on Qwen3-VL base.

Affectgpt: A new dataset, model, and benchmark for emotion understanding with multimodal large language models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer