No Plan, Yet Human: A Reactive Robotics Model Predicts Human Planning Failures on a Clinical Task
Pith reviewed 2026-05-20 17:37 UTC · model grok-4.3
The pith
A reactive robotics model reproduces the pattern of human planning failures on the Tower of London test better than a planning baseline when planning capacity is reduced.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Without any lookahead planning or knowledge of human cognition, AICON reproduces the fine-grained human difficulty ordering across 24 problems better than structural task parameters and generalizes to held-out problems in a leave-two-out evaluation. Crucially, AICON outperforms a planning baseline for groups with reduced planning capacity while the planning baseline better captures healthy controls. This dissociation was predicted by the original AICON paper, which noted that the model's failure modes resemble those of Parkinson's patients who struggle with goal hierarchies but not move counts. This suggests that as planning capacity is reduced, human behavior shifts toward the reactive mode
What carries the argument
AICON, a reactive gradient-descent framework that solves sequential manipulation tasks through continuous real-time cost minimization without explicit search or lookahead.
If this is right
- The reactive model can predict the specific problems and error types that arise for clinical groups on planning assessments.
- Human performance on sequential tasks transitions from planning-based to reactive as capacity decreases.
- The same model abstraction accounts for both robotic control and human behavior on planning tests.
- Leave-two-out generalization shows the model identifies general sources of task difficulty rather than fitting only the training set.
Where Pith is reading between the lines
- Similar reactive models might predict performance drops in other sequential decisions under fatigue, stress, or impairment.
- Interventions that support planning capacity could be evaluated by whether they increase the relative fit of planning models over reactive ones.
- The pattern may appear in additional biological systems where reactive control serves as a default under resource limits.
Load-bearing premise
The assumption that superior fit by the reactive model to reduced-capacity groups means humans are actually using reactive behavior rather than some other form of impaired planning.
What would settle it
Collect error patterns from a new group of participants with reduced planning capacity on additional Tower of London problems and check whether those patterns align more closely with the reactive model's predictions than with the planning baseline's predictions.
Figures
read the original abstract
Understanding why some sequential planning problems are harder than others requires models that go beyond average performance. They should capture the specific pattern of which problems are hard, and ideally fail in the same way people do when planning capacity is reduced. We apply AICON, a reactive gradient-descent framework developed for robotic manipulation, to the Tower of London test, a cognitive test used to assess planning in Parkinson's disease, mild cognitive impairment, and stroke. Without any lookahead planning or knowledge of human cognition, AICON reproduces the fine-grained human difficulty ordering across 24 problems better than structural task parameters and generalizes to held-out problems in a leave-two-out evaluation. Crucially, AICON outperforms a planning baseline for groups with reduced planning capacity while the planning baseline better captures healthy controls. This dissociation was predicted by the original AICON paper, which noted that the model's failure modes resemble those of Parkinson's patients who struggle with goal hierarchies but not move counts. This suggests that as planning capacity is reduced, human behavior shifts toward the reactive mode AICON models. The finding extends a broader pattern: AICON, originally built for robotics, now captures aspects of biological behavior across perception, eye movements, and sequential planning, suggesting its core abstraction reflects something real about how biological systems are organized.
Editorial analysis
A structured set of objections, weighed in public.
Circularity Check
Dissociation interpretation anchored in original AICON paper's resemblance note to Parkinson's
specific steps
-
self citation load bearing
[Abstract]
"Crucially, AICON outperforms a planning baseline for groups with reduced planning capacity while the planning baseline better captures healthy controls. This dissociation was predicted by the original AICON paper, which noted that the model's failure modes resemble those of Parkinson's patients who struggle with goal hierarchies but not move counts. This suggests that as planning capacity is reduced, human behavior shifts toward the reactive mode AICON models."
The suggestion that the observed dissociation demonstrates a human shift to the reactive mode is justified by citing the prior AICON paper's resemblance observation rather than by direct, quantitative alignment of AICON-generated error types (e.g., subgoal commitment failures) against the human error distributions collected in the reduced-capacity cohorts.
full rationale
The manuscript supplies independent empirical content via its application of AICON to the Tower of London task, superior reproduction of fine-grained difficulty ordering across 24 problems, and leave-two-out generalization. However, the central interpretive step—that the performance dissociation between AICON and the planning baseline for reduced-capacity groups indicates a shift toward reactive control—rests on the original AICON paper's qualitative note about resemblance to Parkinson's goal-hierarchy deficits rather than new quantitative matching of specific error patterns in the current data. This produces moderate self-citation dependence on the interpretive claim while leaving the raw performance results grounded in fresh experiments.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption AICON operates without lookahead planning or knowledge of human cognition
- domain assumption Tower of London performance differences reflect planning capacity variations across clinical groups
Reference graph
Works this paper leans on
-
[1]
Albrecht, R., Ragni, M.: Spatial planning: An ACT-R model for the Tower of Londontask.In:InternationalConferenceonSpatialCognition.pp.222–236(2014)
work page 2014
-
[2]
Behavioral and Brain Sciences20(4), 723–742 (1997)
Ballard, D.H., Hayhoe, M.M., Pook, P.K., Rao, R.P.: Deictic codes for the embod- iment of cognition. Behavioral and Brain Sciences20(4), 723–742 (1997)
work page 1997
-
[3]
Neuropharmacology37(4-5), 407– 419 (1998)
Balleine, B.W., Dickinson, A.: Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology37(4-5), 407– 419 (1998)
work page 1998
-
[4]
bioRxiv, 2024.06.20.599814 (2024)
Battaje, A., Godinez, A., Hanning, N.M., Rolfs, M., Brock, O.: An information processing pattern from robotics predicts properties of the human visual system. bioRxiv, 2024.06.20.599814 (2024)
work page 2024
-
[5]
Cognitive Brain Research20(3), 462–472 (2004) 12 M
Kaller, C.P., Unterrainer, J.M., Rahm, B., Halsband, U.: The impact of problem structure on planning: Insights from the Tower of London task. Cognitive Brain Research20(3), 462–472 (2004) 12 M. Migacev, V. Mengers, et al
work page 2004
-
[6]
Psychological Assessment24(1), 46 (2012)
Kaller,C.P.,Unterrainer,J.M.,Stahl,C.:AssessingplanningabilitywiththeTower of London task: psychometric properties of a structurally balanced problem set. Psychological Assessment24(1), 46 (2012)
work page 2012
-
[7]
Philosophy of Science78(4), 601–627 (2011)
Kaplan, D.M., Craver, C.F.: The explanatory force of dynamical and mathematical models in neuroscience: A mechanistic perspective. Philosophy of Science78(4), 601–627 (2011)
work page 2011
-
[8]
Biometrika30(1-2), 81–93 (1938)
Kendall, M.G.: A new measure of rank correlation. Biometrika30(1-2), 81–93 (1938)
work page 1938
-
[9]
Kirsh,D.,Maglio,P.:Ondistinguishingepistemicfrompragmaticaction.Cognitive Science18(4), 513–549 (1994)
work page 1994
-
[10]
Neuropsychologia75, 646–655 (2015)
Köstering, L., Schmidt, C.S., Egger, K., Amtage, F., Peter, J., Klöppel, S., Beume, L.A., Hoeren, M., Weiller, C., Kaller, C.P.: Assessment of planning performance in clinical samples: Reliability and validity of the Tower of London task (TOL-F). Neuropsychologia75, 646–655 (2015)
work page 2015
-
[11]
Archives of Clinical Neuropsychology31(7), 738–753 (2016)
Köstering, L., Schmidt, C.S., Weiller, C., Kaller, C.P.: Analyses of rule breaks and errors during planning in computerized tower tasks: Insights from neurological patients. Archives of Clinical Neuropsychology31(7), 738–753 (2016)
work page 2016
-
[12]
Journal of Neuroscience43(7), 1074–1088 (2023)
Levenstein, D., Alvarez, V.A., Amarasingham, A., Azab, H., Chen, Z.S., Gerkin, R.C., Hasenstaub, A., Iyer, R., Jolivet, R.B., Marzen, S., et al.: On the role of theory and modeling in neuroscience. Journal of Neuroscience43(7), 1074–1088 (2023)
work page 2023
-
[13]
The International Journal of Robotics Research 41(8), 741–777 (2022)
Martín-Martín, R., Brock, O.: Coupled recursive estimation for online interactive perception of articulated objects. The International Journal of Robotics Research 41(8), 741–777 (2022)
work page 2022
-
[14]
Mattar, M.G., Lengyel, M.: Planning in the brain. Neuron110(6), 914–934 (2022)
work page 2022
-
[15]
McKinlay, A., Kaller, C., Grace, R., Dalrymple-Alford, J., Anderson, T., Fink, J., Roger, D.: Planning in parkinson’s disease: A matter of problem structure? Neuropsychologia46(1), 384–389 (2008)
work page 2008
-
[16]
In: IEEE International Conference on Robotics and Automation
Mengers, V., Brock, O.: No plan but everything under control: Robustly solving se- quential tasks with dynamically composed gradient descent. In: IEEE International Conference on Robotics and Automation. pp. 90–96 (2025)
work page 2025
-
[17]
Scientific Reports14, 27314 (2024)
Mengers, V., Raoufi, M., Brock, O., Hamann, H., Romanczuk, P.: Leveraging un- certainty in collective opinion dynamics with heterogeneity. Scientific Reports14, 27314 (2024)
work page 2024
-
[18]
Journal of Vision25(2), 6 (2025)
Mengers, V., Roth, N., Brock, O., Obermayer, K., Rolfs, M.: A robotics-inspired scanpath model reveals the importance of uncertainty and semantic object cues for gaze guidance in dynamic scenes. Journal of Vision25(2), 6 (2025)
work page 2025
-
[19]
Philosophical Transactions of the Royal Society of London
Shallice, T.: Specific impairments of planning. Philosophical Transactions of the Royal Society of London. B, Biological Sciences298(1089), 199–209 (1982)
work page 1982
-
[20]
Artificial Intelligence125(1-2), 119–153 (2001)
Slaney, J., Thiébaux, S.: Blocks World revisited. Artificial Intelligence125(1-2), 119–153 (2001)
work page 2001
-
[21]
Quarterly Journal of Experimental Psychology 64(3), 485–503 (2011)
Waldron, S.M., Patrick, J., Duggan, G.B.: The influence of goal-state access cost on planning during problem solving. Quarterly Journal of Experimental Psychology 64(3), 485–503 (2011)
work page 2011
-
[22]
In: Proceedings of the Annual Meeting of the Cognitive Science Society
Zhang, C., Lipovetzky, N., Kemp, C.: Comparing AI planning algorithms with humans on the Tower of London task. In: Proceedings of the Annual Meeting of the Cognitive Science Society. vol. 45 (2023)
work page 2023
-
[23]
In: Proceedings of the Annual Meet- ing of the Cognitive Science Society
Zhang, C., Liu, Y., Kulic, D., Carreno-Medrano, P., Burke, M.: Modeling human sequential decision-making in the Tower of London: Incorporating individual differ- ences and timing-based replanning inference. In: Proceedings of the Annual Meet- ing of the Cognitive Science Society. vol. 47 (2025)
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.