URL https://aclanthology.org/2024.ac l-long.847/

Association for Computational Linguistics · 2024 · DOI 10.18653/v1/2024.acl-long.847

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

open at publisher browse 4 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling

cs.AI · 2026-05-25 · unverdicted · novelty 7.0

OmniToM is a new benchmark for Theory of Mind in LLMs that evaluates explicit belief extraction and seven-dimensional labeling from 895 stories, revealing an actor-specific belief-tracking bottleneck.

When Should Models Change Their Minds? Contextual Belief Management in Large Language Models

cs.AI · 2026-05-28 · unverdicted · novelty 6.0

Introduces BeliefTrack benchmark diagnosing three CBM failures in LLMs and shows RL with belief-state rewards cuts failure rates by 70.9% while representation steering cuts them by 46.1%.

Moral Susceptibility and Robustness under Persona Role-Play in Large Language Models

cs.CL · 2025-11-11 · unverdicted · novelty 6.0

LLM moral robustness under persona role-play is largely determined by model family with Claude models most consistent, while susceptibility shows little family dependence.

DialToM: A Theory of Mind Benchmark for Forecasting State-Driven Dialogue Trajectories

cs.CL · 2026-04-22

citing papers explorer

Showing 3 of 3 citing papers after filters.

OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling cs.AI · 2026-05-25 · unverdicted · none · ref 5
OmniToM is a new benchmark for Theory of Mind in LLMs that evaluates explicit belief extraction and seven-dimensional labeling from 895 stories, revealing an actor-specific belief-tracking bottleneck.
When Should Models Change Their Minds? Contextual Belief Management in Large Language Models cs.AI · 2026-05-28 · unverdicted · none · ref 4
Introduces BeliefTrack benchmark diagnosing three CBM failures in LLMs and shows RL with belief-state rewards cuts failure rates by 70.9% while representation steering cuts them by 46.1%.
Moral Susceptibility and Robustness under Persona Role-Play in Large Language Models cs.CL · 2025-11-11 · unverdicted · none · ref 10
LLM moral robustness under persona role-play is largely determined by model family with Claude models most consistent, while susceptibility shows little family dependence.

URL https://aclanthology.org/2024.ac l-long.847/

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer