JobBench is a new benchmark with 130 occupational tasks where the best of 36 tested AI models achieves only 45.9% success.
Prolific
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 4verdicts
UNVERDICTED 4roles
background 1polarities
support 1representative citing papers
Sympatheia introduces a continuous affect-conditioned speech dialogue model and the Sympatheia-18k synthetic dataset, showing improved emotional appropriateness over baselines when speech cues are limited.
User study shows participants select simplification techniques for responsive line charts according to dataset characteristics rather than screen size, with interaction complexity not uniformly increasing engagement.
An online experiment finds that showing users an overview of an AI's values reduces reliance on AI suggestions during writing tasks.
citing papers explorer
-
JobBench: Aligning Agent Work With Human Will
JobBench is a new benchmark with 130 occupational tasks where the best of 36 tested AI models achieves only 45.9% success.
-
Sympatheia: Emotionally Adaptive Voice Assistant with Continuous Affect Conditioning
Sympatheia introduces a continuous affect-conditioned speech dialogue model and the Sympatheia-18k synthetic dataset, showing improved emotional appropriateness over baselines when speech cues are limited.
-
Beyond One-Size-Fits-All: User Strategies for Simplification Technique and Level Selection in Responsive Line Charts
User study shows participants select simplification techniques for responsive line charts according to dataset characteristics rather than screen size, with interaction complexity not uniformly increasing engagement.
-
Framing an AI with Values Reduces AI Reliance in AI-supported Writing Tasks
An online experiment finds that showing users an overview of an AI's values reduces reliance on AI suggestions during writing tasks.