Tomchallenges: A principle-guided dataset and diverse evaluation tasks for exploring theory of mind.arXiv preprint arXiv:2305.15068, 2023

Ziqiao Ma, Jaron Sansom, Run Peng, Joyce Chai · 2023 · arXiv 2305.15068

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

OSCToM: RL-Guided Adversarial Generation for High-Order Theory of Mind

cs.AI · 2026-05-19 · unverdicted · novelty 5.0

OSCToM uses RL-guided generation with an extended DSL and surrogate models to create nested belief conflict tasks, raising FANToM accuracy from 0.2% to 76% while being 6x more efficient.

citing papers explorer

Showing 1 of 1 citing paper.

OSCToM: RL-Guided Adversarial Generation for High-Order Theory of Mind cs.AI · 2026-05-19 · unverdicted · none · ref 14
OSCToM uses RL-guided generation with an extended DSL and surrogate models to create nested belief conflict tasks, raising FANToM accuracy from 0.2% to 76% while being 6x more efficient.

Tomchallenges: A principle-guided dataset and diverse evaluation tasks for exploring theory of mind.arXiv preprint arXiv:2305.15068, 2023

fields

years

verdicts

representative citing papers

citing papers explorer