Table 15 gives the full 11-model × 10-topic Composite matrix, allowing inspection of whether the topic-difficulty ordering is preserved at the per-model level

reports topic-level pooled results showing Romantic Relationships as the mostdifficult topic, modest topic effects overall ( 𝜂2 ≤ 0 · 2026

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

AttuneBench: A Conversation-Based Benchmark for LLM Emotional Intelligence

cs.AI · 2026-05-20 · unverdicted · novelty 6.0

AttuneBench introduces a multi-turn conversation benchmark using participant annotations to evaluate LLM emotional intelligence, finding that model performance on emotion recognition, behavior classification, preference prediction, and response quality are largely independent.

citing papers explorer

Showing 1 of 1 citing paper.

AttuneBench: A Conversation-Based Benchmark for LLM Emotional Intelligence cs.AI · 2026-05-20 · unverdicted · none · ref 12
AttuneBench introduces a multi-turn conversation benchmark using participant annotations to evaluate LLM emotional intelligence, finding that model performance on emotion recognition, behavior classification, preference prediction, and response quality are largely independent.

Table 15 gives the full 11-model × 10-topic Composite matrix, allowing inspection of whether the topic-difficulty ordering is preserved at the per-model level

fields

years

verdicts

representative citing papers

citing papers explorer