A Motivational Architecture for Conversational AGI
Pith reviewed 2026-06-28 06:14 UTC · model grok-4.3
The pith
A ten-stage pipeline recasts motivational homeostasis for conversational agents using seven dialogue drives.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper claims that conversational agents can operate under a motivational architecture by regulating seven dialogue-native drives—competence, uncertainty reduction, affiliation, affinity, legitimacy, nurturing, and aesthetic coherence—rather than bodily deficits. This architecture uses a ten-stage motivational processing pipeline that separates cognitive modulation from situational appraisal, a dual decision strategy that blends urgency-driven fast responses with deliberative multi-goal optimization, and a functional distinction between pre-action feelings and post-action emotions. The same structure is applied to two example agents and indicated for use in social robotics and broader AGI
What carries the argument
The ten-stage motivational processing pipeline, which separates cognitive modulation from situational appraisal while supporting regulation of the seven dialogue drives.
If this is right
- The dual decision strategy produces both rapid replies when urgency is high and balanced choices across multiple drives when time allows.
- Pre-action feelings guide immediate selection of speech acts while post-action emotions update the agent's internal state after the act occurs.
- The same drive set and pipeline apply across companion-style and research-style conversational agents.
- The architecture supplies a direct route for extending motivational control to social robotics without redesigning the core stages.
Where Pith is reading between the lines
- A similar drive-recasting exercise could be attempted for agents whose primary loop is visual or physical rather than linguistic.
- Implementation followed by measurement of conversation length and coherence would test whether the seven drives suffice in practice.
- The separation of modulation from appraisal stages suggests the pipeline could be mapped onto existing modular execution systems without major rewriting.
- If the drives transfer well, one could experiment with subsets or additions to match narrower task domains.
Load-bearing premise
The seven dialogue-native drives form a sufficient and transferable basis for regulating agent behavior in linguistic interactions without additional mechanisms.
What would settle it
Build an agent on the proposed pipeline and drives, then run extended conversations that require motivational factors outside the seven listed; consistent failure to produce appropriate speech or actions in those cases would falsify sufficiency.
read the original abstract
Motivational architectures in cognitive AI have largely been designed for physical agents regulating bodily needs. Conversational agents operate in a different regime: their sensorimotor loop is linguistic, their environment is a user's evolving mental state, and their consequential actions are speech acts, tool invocations, and strategic silences. This paper proposes a conversational reinterpretation of the OpenPsi motivational lineage, coupled to MetaMo's higher-level motivational scaffold, for agents built on a modular execution substrate. Homeostasis is recast in dialogue-native terms: the agent regulates competence, uncertainty reduction, affiliation, affinity, legitimacy, nurturing, and aesthetic coherence rather than bodily deficits. We propose three contributions: a ten-stage motivational processing pipeline that architecturally separates cognitive modulation from situational appraisal; a dual decision strategy blending urgency-driven fast response with deliberative multi-goal optimization; and an architecturally useful distinction between pre-action feelings and post-action emotions as functionally different forms of affect. We specialize the framework to two example agents -- CompanionAgent and ResearchAgent -- and sketch its extension to social robotics and domain-generic human-level AGI.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a motivational architecture for conversational AGI by reinterpreting the OpenPsi lineage and MetaMo scaffold for agents whose primary actions are linguistic. Homeostasis is redefined around seven dialogue-native drives (competence, uncertainty reduction, affiliation, affinity, legitimacy, nurturing, aesthetic coherence). Three contributions are outlined: a ten-stage motivational processing pipeline separating cognitive modulation from appraisal, a dual decision strategy (urgency-driven fast response plus deliberative multi-goal optimization), and a distinction between pre-action feelings and post-action emotions. The framework is specialized to CompanionAgent and ResearchAgent examples with suggested extensions to social robotics and domain-generic AGI.
Significance. If implemented and shown to produce coherent regulation, the architecture could supply a reusable scaffold for motivational control in non-embodied agents, extending cognitive-AI techniques beyond physical homeostasis. The explicit separation of pipeline stages and the pre-/post-action affect distinction would be useful design primitives for modular execution substrates.
major comments (3)
- [Abstract / Contributions] Abstract and contributions section: the claim that the seven listed drives form a sufficient and transferable basis for regulating linguistic agents is presented without any drive-to-action mapping, state-transition rules, or regulatory-adequacy argument. This assertion is load-bearing for the completeness of the proposed system yet remains an untested stipulation.
- [Ten-stage motivational processing pipeline] Ten-stage pipeline description: the pipeline is introduced at the level of stage names and high-level separation of modulation from appraisal, but supplies neither pseudocode, data-flow diagrams, nor example state transitions. Without these, it is impossible to verify that the claimed architectural separation is achieved or that the dual decision strategy integrates with the stages.
- [Example agents] Specialization to example agents: the CompanionAgent and ResearchAgent sketches assert that the drives plus pipeline suffice for coherent behavior, yet contain no concrete illustration of how any drive influences a speech act, tool call, or strategic silence. This leaves the sufficiency claim without even illustrative support.
minor comments (2)
- [Drives definition] Notation for the seven drives is introduced without an explicit table or equation defining their activation thresholds or interaction weights.
- [Related work] The manuscript reuses terminology from OpenPsi and MetaMo; a short comparison table would clarify which components are carried over versus newly adapted for dialogue.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments, which help clarify how the architectural proposal can be strengthened. We respond to each major comment below and indicate planned revisions.
read point-by-point responses
-
Referee: [Abstract / Contributions] Abstract and contributions section: the claim that the seven listed drives form a sufficient and transferable basis for regulating linguistic agents is presented without any drive-to-action mapping, state-transition rules, or regulatory-adequacy argument. This assertion is load-bearing for the completeness of the proposed system yet remains an untested stipulation.
Authors: The seven drives are motivated by prior work in dialogue psychology and the OpenPsi/MetaMo lineage, with their applicability argued via the two example agents. We agree that an explicit regulatory-adequacy argument and initial mappings would make the claim more robust. We will add a short subsection in the contributions and a table of drive-to-action mappings in the revised manuscript. revision: yes
-
Referee: [Ten-stage motivational processing pipeline] Ten-stage pipeline description: the pipeline is introduced at the level of stage names and high-level separation of modulation from appraisal, but supplies neither pseudocode, data-flow diagrams, nor example state transitions. Without these, it is impossible to verify that the claimed architectural separation is achieved or that the dual decision strategy integrates with the stages.
Authors: The manuscript presents the pipeline at the level of architectural principles rather than implementation details. To allow verification of the separation and integration, we will add a data-flow diagram and high-level pseudocode for the ten stages plus the dual decision strategy in a new figure and appendix. revision: yes
-
Referee: [Example agents] Specialization to example agents: the CompanionAgent and ResearchAgent sketches assert that the drives plus pipeline suffice for coherent behavior, yet contain no concrete illustration of how any drive influences a speech act, tool call, or strategic silence. This leaves the sufficiency claim without even illustrative support.
Authors: The current sketches are intentionally high-level to illustrate specialization. We will expand the CompanionAgent section with one worked example tracing how a single drive (uncertainty reduction) modulates a short dialogue sequence including speech acts and a strategic silence. revision: yes
Circularity Check
No significant circularity detected in the architectural proposal
full rationale
The paper is a conceptual proposal for reinterpreting prior motivational frameworks (OpenPsi, MetaMo) in conversational terms, positing seven drives and a ten-stage pipeline as contributions. No mathematical derivation chain, equations, fitted parameters, or predictions are claimed that reduce to inputs by construction. The sufficiency of the drives is presented as an assumption for the architecture rather than a derived result. Self-citations to the authors' lineage are present but do not serve as load-bearing justification for a theorem or uniqueness claim; the work remains a self-contained architectural sketch without circular reduction.
Axiom & Free-Parameter Ledger
free parameters (1)
- Selection of seven dialogue-native drives
axioms (1)
- domain assumption Conversational agents operate in a linguistic sensorimotor loop whose environment is the user's evolving mental state and whose actions are speech acts, tool invocations, and strategic silences.
invented entities (2)
-
Ten-stage motivational processing pipeline
no independent evidence
-
Distinction between pre-action feelings and post-action emotions
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Artificial General Intelligence: 18th International Conference, AGI 2025, Proceedings, Part I , editor =
Rui Lian and Ben Goertzel , title =. Artificial General Intelligence: 18th International Conference, AGI 2025, Proceedings, Part I , editor =. 2026 , doi =
2025
-
[2]
Ben Goertzel and Cassio Pennachin and Nil Geisweiller , title =
-
[3]
Joscha Bach , title =
-
[4]
Ben Goertzel and others , title =
-
[5]
Ben Goertzel and Anna Mikeda , title =
-
[6]
Formal Theory of Creativity, Fun, and Intrinsic Motivation (1990--2010) , journal =
J. Formal Theory of Creativity, Fun, and Intrinsic Motivation (1990--2010) , journal =
1990
-
[7]
O'Brien and Carrie J
Joon Sung Park and Joseph C. O'Brien and Carrie J. Cai and Meredith Ringel Morris and Percy Liang and Michael S. Bernstein , title =. Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology , publisher =
-
[8]
Wang and Y
Y. Wang and Y. Chen and F. Zhong and L. Ma and Y. Wang , title =. International Conference on Learning Representations (ICLR 2025) , year =
2025
-
[9]
Liu and S
X.B. Liu and S. Fang and W. Shi and C.-S. Wu and T. Igarashi and X.A. Chen , title =. Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems , publisher =
2025
-
[10]
Cai and Ben Goertzel and C
Z. Cai and Ben Goertzel and C. Zhou and Y. Zhang and M. Jiang and G. Yu , title =. Cognitive Systems Research , volume =
-
[11]
Artificial General Intelligence (AGI 2019) , series =
Joscha Bach and Marcelo Coutinho and Leonhard Lichtinger , title =. Artificial General Intelligence (AGI 2019) , series =
2019
-
[12]
Rao and M.P
A.S. Rao and M.P. Georgeff , title =. Proceedings of the 2nd International Conference on Principles of Knowledge Representation and Reasoning , pages =
-
[13]
Cognitive Computation , volume =
Ron Sun , title =. Cognitive Computation , volume =
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.