PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models cs.AI · 2026-05-20