EgoPlan-Bench2: A benchmark for multimodal large language model planning in real-world scenarios.International Journal of Com- puter Vision, 134(5):222, 2026

Lu Qiu, Yi Chen, Yuying Ge, Yixiao Ge, Ying Shan, Xihui Liu · 2026 · DOI 10.1007/s11263-026-02826-y

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

Token Predictors Are Not Planners: Building Physically Grounded Causal Reasoners

cs.AI · 2026-06-01 · unverdicted · novelty 6.0

Introduces a new diagnostic benchmark and million-scale reasoning corpus showing that training on explicit causal traces improves next-state prediction in embodied planning, with reported gains from data scaling.

citing papers explorer

Showing 1 of 1 citing paper.

Token Predictors Are Not Planners: Building Physically Grounded Causal Reasoners cs.AI · 2026-06-01 · unverdicted · none · ref 20
Introduces a new diagnostic benchmark and million-scale reasoning corpus showing that training on explicit causal traces improves next-state prediction in embodied planning, with reported gains from data scaling.

EgoPlan-Bench2: A benchmark for multimodal large language model planning in real-world scenarios.International Journal of Com- puter Vision, 134(5):222, 2026

fields

years

verdicts

representative citing papers

citing papers explorer