Agent Manufacturing: Foundation-Model Agents as First-Class Industrial Entities

Yilei Zhang

arxiv: 2605.24823 · v1 · pith:6LEHIB2Unew · submitted 2026-05-24 · 💻 cs.AI

Agent Manufacturing: Foundation-Model Agents as First-Class Industrial Entities

Yilei Zhang This is my paper

Pith reviewed 2026-06-30 11:53 UTC · model grok-4.3

classification 💻 cs.AI

keywords agent manufacturingfoundation-model agentscoordinative cognitionmanufacturing paradigmssmart manufacturingmulti-agent systemsautonomous agentsindustry 5.0

0 comments

The pith

Manufacturing enters a fifth paradigm when foundation-model agents become the principal mechanism for coordinating production through open-ended reasoning and negotiation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper claims manufacturing has moved through four paradigms that automated physical and routine tasks while leaving coordinative cognition—interpretation of goals, allocation of resources, diagnosis of issues, negotiation, and governance—with humans. It argues foundation-model agents are now redistributing this layer by interpreting open-ended goals, planning across long horizons, calling tools and machines, and negotiating with other agents and people. This creates a distinct new category called Agent Manufacturing, defined strictly as systems where such agent reasoning is the main coordination method. The definition is narrower than prior ideas of cognitive manufacturing or Industry 5.0 and separates the new systems from earlier multi-agent setups that worked only inside fixed protocols. A reader would care because the claim points to a reorganization where factories could run their planning and decision layers through autonomous agents rather than human engineers and managers.

Core claim

Manufacturing is undergoing a fifth transition in which foundation-model-based autonomous agents primarily redistribute the coordinative cognition of production—interpretive, allocative, diagnostic, negotiative, and governance work—rather than the physical or routine-cognitive layers below it. A manufacturing system qualifies as Agent Manufacturing when its principal coordination mechanism is reasoning performed by foundation-model agents that can interpret open-ended goals, plan over long horizons, invoke tools and machines, and negotiate with other agents and humans. This definition is narrower and more falsifiable than existing literature on cognitive manufacturing or Industry 5.0 and dis

What carries the argument

The operational definition of Agent Manufacturing as systems whose principal coordination mechanism is reasoning by foundation-model agents that interpret open-ended goals, plan long-term, invoke tools, and negotiate.

If this is right

Factory design will center on agent reasoning layers rather than human planners for allocation and governance.
Negotiation protocols will expand from closed machine-to-machine exchanges to open agent-to-agent and agent-to-human interactions.
Existing multi-agent manufacturing systems limited to fixed protocols will be reclassified outside the new paradigm.
Coordination failures will be diagnosed and corrected by agent reasoning instead of human operational managers.
Open-ended goal interpretation will replace rigid production schedules as the starting point for manufacturing runs.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Workforce roles may shift from direct coordination to oversight of agent performance and exception handling.
Supply-chain and cross-factory negotiations could become fully agent-mediated, reducing human involvement in contracting.
Pilot tests in controlled production lines would quickly reveal whether current foundation models meet the reliability threshold for the definition.
The same agent-coordination pattern could apply to non-manufacturing domains such as logistics networks or energy grids.

Load-bearing premise

Foundation-model agents can already reliably carry out the full range of coordinative cognition tasks at industrial scale in a way that creates a distinct paradigm rather than a small extension of existing smart-manufacturing systems.

What would settle it

A real factory pilot in which foundation-model agents receive open-ended production goals, run coordination for weeks without human planners, and are measured on success in goal interpretation, long-horizon planning, tool invocation, and negotiation outcomes.

read the original abstract

Manufacturing has passed through four widely recognized paradigms - mechanization, electrification, programmable automation, and Smart Manufacturing - each defined by the kind of work it shifted from humans to machines. In every case, one layer of industrial work remained fundamentally human: the coordinative cognition of production, comprising the interpretive, allocative, diagnostic, negotiative, and governance work exercised by engineers, planners, and operational managers. We argue that a fifth transition is now underway in which this layer, rather than the physical or routine-cognitive layers below it, is what foundation-model-based autonomous agents primarily redistribute. We name this paradigm Agent Manufacturing and define it operationally: a manufacturing system is an instance of Agent Manufacturing when its principal coordination mechanism is reasoning performed by foundation-model agents that can interpret open-ended goals, plan over long horizons, invoke tools and machines, and negotiate with other agents and humans. This is a narrower and more falsifiable definition than the existing literature on cognitive manufacturing or Industry 5.0 provides, and it distinguishes the paradigm sharply from classical multi-agent manufacturing systems, which were autonomous only within closed protocol spaces.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper coins 'Agent Manufacturing' as a new label for systems where foundation-model agents handle coordination, but offers only a definition with no evidence that the shift is real or that current models can do the work.

read the letter

The main takeaway is that this is a position paper defining Agent Manufacturing as any system whose top-level coordination comes from foundation-model agents that handle open goals, long planning, tool use, and negotiation. It claims this marks a fifth paradigm after smart manufacturing.

What stands out is the attempt to make the definition narrower and more falsifiable than the usual Industry 5.0 language. It also draws a clean line against older multi-agent manufacturing work that stayed inside fixed protocols. That distinction is useful for people who track how these systems are described.

The soft spot is the absence of any supporting material. The abstract asserts the transition is underway and that foundation models already perform the full set of coordinative tasks at industrial scale, yet there are no benchmarks, case studies, capability thresholds, or even worked examples. Without that, the definition cannot yet be applied to any actual system. The circularity risk the reader flagged is real: the classification depends on accepting the authors' framing of what counts as the principal mechanism.

This piece is aimed at readers who follow industrial AI and operations research and want a fresh label to organize their thinking. It does not contain results, derivations, or data that would move the technical conversation forward.

I would not send it for peer review in this form. It needs concrete evidence or at least falsifiable predictions before it merits referee time.

Referee Report

3 major / 1 minor

Summary. The paper proposes a fifth manufacturing paradigm, 'Agent Manufacturing,' defined operationally as any manufacturing system whose principal coordination mechanism consists of reasoning by foundation-model agents capable of interpreting open-ended goals, planning over long horizons, invoking tools/machines, and negotiating with other agents and humans. It argues this redistributes the coordinative cognition layer (interpretive, allocative, diagnostic, negotiative, governance) previously performed by humans, distinguishing the paradigm from mechanization, electrification, programmable automation, Smart Manufacturing, classical multi-agent systems, and the broader Industry 5.0 literature by emphasizing falsifiability and open-ended reasoning rather than closed protocols.

Significance. If the operational definition can be applied consistently to real systems and if foundation-model agents reach the required reliability thresholds, the framework could sharpen discussions of AI-driven coordination in manufacturing and supply a clearer boundary condition than existing cognitive-manufacturing or Industry 5.0 proposals. The manuscript's explicit attempt to supply a narrower, falsifiable criterion is a constructive contribution to paradigm classification, though its impact will depend on subsequent empirical mapping rather than on the definition alone.

major comments (3)

[Abstract] Abstract: The claim that 'a fifth transition is now underway' is presented as a factual assertion yet is unsupported by any cited systems, benchmarks, case studies, or capability thresholds demonstrating that current foundation-model agents already perform the full set of coordinative tasks (interpretive through governance) at industrial reliability levels. This assertion is load-bearing for the paper's central thesis that a distinct paradigm shift has begun.
[Abstract] Abstract: The operational definition requires agents that 'can interpret open-ended goals, plan over long horizons, invoke tools and machines, and negotiate...' to serve as the principal mechanism, but the manuscript supplies neither quantitative reliability standards nor references to existing deployments that meet these criteria, leaving the definition unable to classify any concrete system as Agent Manufacturing versus an incremental addition to Smart Manufacturing.
[Abstract] Abstract: The distinction from 'classical multi-agent manufacturing systems, which were autonomous only within closed protocol spaces' is drawn solely in terms of the authors' chosen framing of 'principal coordination mechanism' and open-ended versus closed reasoning; without an independent metric or worked example of how the classification would be applied to an actual production line, the boundary risks circularity.

minor comments (1)

[Abstract] The abstract refers to 'four widely recognized paradigms' without citing the canonical sources that establish this periodization; adding the relevant historical references would improve traceability.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive and detailed comments on our manuscript. The feedback correctly identifies areas where the abstract's claims require additional qualification and support to strengthen the presentation of the proposed paradigm. We respond point-by-point below and will incorporate revisions to address the concerns raised.

read point-by-point responses

Referee: [Abstract] Abstract: The claim that 'a fifth transition is now underway' is presented as a factual assertion yet is unsupported by any cited systems, benchmarks, case studies, or capability thresholds demonstrating that current foundation-model agents already perform the full set of coordinative tasks (interpretive through governance) at industrial reliability levels. This assertion is load-bearing for the paper's central thesis that a distinct paradigm shift has begun.

Authors: We agree that the abstract wording could be misread as asserting an established fact rather than an argued position. The manuscript frames the claim as 'We argue that a fifth transition is now underway,' reflecting our interpretation of emerging trends rather than a claim of completed industrial-scale deployment. To address this, we will revise the abstract to explicitly qualify the transition as based on the demonstrated trajectory of foundation-model capabilities and emerging pilot integrations, while adding a supporting citation to recent agent deployments in the revised version. revision: yes
Referee: [Abstract] Abstract: The operational definition requires agents that 'can interpret open-ended goals, plan over long horizons, invoke tools and machines, and negotiate...' to serve as the principal mechanism, but the manuscript supplies neither quantitative reliability standards nor references to existing deployments that meet these criteria, leaving the definition unable to classify any concrete system as Agent Manufacturing versus an incremental addition to Smart Manufacturing.

Authors: The definition is deliberately qualitative to remain durable as agent capabilities evolve, focusing on the nature of the coordination mechanism rather than fixed performance thresholds. We acknowledge that guidance on application would improve clarity. In the revision we will add a dedicated paragraph outlining criteria for identifying the 'principal coordination mechanism' in a given system, including how to distinguish open-ended reasoning from incremental automation, without introducing specific numerical reliability metrics at this conceptual stage. revision: yes
Referee: [Abstract] Abstract: The distinction from 'classical multi-agent manufacturing systems, which were autonomous only within closed protocol spaces' is drawn solely in terms of the authors' chosen framing of 'principal coordination mechanism' and open-ended versus closed reasoning; without an independent metric or worked example of how the classification would be applied to an actual production line, the boundary risks circularity.

Authors: The distinction rests on a substantive technical difference: traditional multi-agent systems operate within predefined protocols, whereas foundation-model agents enable open-ended goal interpretation and negotiation outside such constraints. To reduce any appearance of circularity, we will include a worked example in the revised manuscript illustrating the classification process applied to a concrete production scenario, specifying observable indicators for determining the principal mechanism independently of the paradigm label. revision: yes

Circularity Check

0 steps flagged

No circularity: definitional proposal with no self-referential reduction

full rationale

The paper advances an operational definition of a new paradigm (Agent Manufacturing) and argues that a transition is underway based on the capabilities of foundation-model agents. This is a conceptual framing exercise rather than a derivation chain containing equations, fitted parameters, or predictions that reduce to the inputs by construction. No self-citations, uniqueness theorems, or ansatzes appear in the provided text. The definition explicitly distinguishes itself from prior literature (cognitive manufacturing, Industry 5.0, classical multi-agent systems) without tautological collapse. The central claim remains an argument about capability redistribution, not a result forced by its own definitional inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The paper introduces the named paradigm and rests on the untested premise that foundation models can execute industrial coordinative cognition at scale.

axioms (1)

domain assumption Foundation-model agents can interpret open-ended goals, plan over long horizons, invoke tools, and negotiate in manufacturing contexts.
This capability is presupposed in the operational definition of the paradigm.

invented entities (1)

Agent Manufacturing no independent evidence
purpose: To label and demarcate a claimed fifth manufacturing paradigm
A new conceptual category whose independent existence is asserted rather than demonstrated.

pith-pipeline@v0.9.1-grok · 5715 in / 1344 out tokens · 29057 ms · 2026-06-30T11:53:02.768606+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

36 extracted references · 9 canonical work pages · 5 internal anchors

[1]

Power and Progress: Our Thousand-Year Struggle Over Technology and Prosperity

Daron Acemoglu and Simon Johnson. Power and Progress: Our Thousand-Year Struggle Over Technology and Prosperity. PublicAffairs, 2023

2023
[2]

Artificial intelligence, automation, and work

Daron Acemoglu and Pascual Restrepo. Artificial intelligence, automation, and work. Technical Report 24196, National Bureau of Economic Research, 2018

2018
[3]

Tasks, automation, and the rise in U.S

Daron Acemoglu and Pascual Restrepo. Tasks, automation, and the rise in U.S. wage inequality. Econometrica, 90 0 (5), 2022

2022
[4]

Automation and rent dissipation: Implications for wages, inequality, and productivity

Daron Acemoglu and Pascual Restrepo. Automation and rent dissipation: Implications for wages, inequality, and productivity. Technical Report 32536, National Bureau of Economic Research, 2024

2024
[5]

Do as I can, not as I say: Grounding language in robotic affordances

Michael Ahn et al. Do as I can, not as I say: Grounding language in robotic affordances. In Conference on Robot Learning (CoRL), 2022

2022
[6]

Sadik, Tommi Mikkonen, Muhammad Waseem, and Niko M\"akitalo

Muhammad Ashfaq, Ahmed R. Sadik, Tommi Mikkonen, Muhammad Waseem, and Niko M\"akitalo. LLM -enhanced holonic architecture for ad-hoc scalable SoS . arXiv:2501.07992, 2025

work page arXiv 2025
[7]

_0 : A vision-language-action flow model for general robot control

Kevin Black et al. _0 : A vision-language-action flow model for general robot control. Technical report, Physical Intelligence, 2024. Technical report. https://www.pi.website/blog/pi0

2024
[8]

RT-2 : Vision-language-action models transfer web knowledge to robotic control

Anthony Brohan et al. RT-2 : Vision-language-action models transfer web knowledge to robotic control. In Conference on Robot Learning (CoRL), 2023

2023
[9]

A survey on LLM -based multi-agent system: Recent advances and new frontiers in application

Shuaihang Chen et al. A survey on LLM -based multi-agent system: Recent advances and new frontiers in application. arXiv:2412.17481, 2024

work page arXiv 2024
[10]

Agentic AI and occupational displacement: A multi-regional task exposure analysis of emerging labor market disruption

Ravish Gupta and Saket Kumar. Agentic AI and occupational displacement: A multi-regional task exposure analysis of emerging labor market disruption. arXiv:2604.00186, 2026

work page arXiv 2026
[11]

Friedrich A. Hayek. The use of knowledge in society. American Economic Review, 35 0 (4), 1945

1945
[12]

Foundation-Model-Based Agents in Industrial Automation: Purposes, Capabilities, and Open Challenges

Vincent Henkel, Felix Gehlhoff, David Kube, Asaad Almutareb, Luis Cruz, Bernd Hellingrath, Philip Koch, Christoph Legat, Florian Mohr, Michael Oberle, Felix Ocker, Thorsten Schoeler, Mario Thron, Nico Andre T\"opfer, Lucas Vogt, and Yuchen Xia. Foundation-model-based agents in industrial automation: Purposes, capabilities, and open challenges. arXiv:2605....

work page internal anchor Pith review Pith/arXiv arXiv 2026
[13]

MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework

Sirui Hong et al. MetaGPT : Meta programming for multi-agent collaborative framework. In arXiv:2308.00352, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[14]

Language models as zero-shot planners: Extracting actionable knowledge for embodied agents

Wenlong Huang et al. Language models as zero-shot planners: Extracting actionable knowledge for embodied agents. In International Conference on Machine Learning (ICML), 2022

2022
[15]

Cognition in the Wild

Edwin Hutchins. Cognition in the Wild. MIT Press, 1995

1995
[16]

Recommendations for implementing the strategic initiative INDUSTRIE 4.0

Henning Kagermann, Wolfgang Wahlster, and Johannes Helbig. Recommendations for implementing the strategic initiative INDUSTRIE 4.0 . Technical report, Acatech, 2013

2013
[17]

OpenVLA: An Open-Source Vision-Language-Action Model

Moo Jin Kim et al. OpenVLA : An open-source vision-language-action model. arXiv:2406.09246, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[18]

Edward A. Lee. Cyber physical systems: Design challenges. In IEEE International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing (ISORC) , 2008

2008
[19]

Industry 5.0: Prospect and retrospect

Jiewu Leng et al. Industry 5.0: Prospect and retrospect. Journal of Manufacturing Systems, 65, 2022

2022
[20]

Transferring vision-language-action models to industry applications: Architectures, performance, and challenges

Shuai Li, Yizhe Chen, Dong Li, Sichao Liu, Dapeng Lan, Yu Liu, and Zhibo Pang. Transferring vision-language-action models to industry applications: Architectures, performance, and challenges. arXiv:2509.23121, 2025

work page arXiv 2025
[21]

Large language model-enabled multi-agent manufacturing systems

Jonghan Lim, Birgit Vogel-Heuser, and Ilya Kovalenko. Large language model-enabled multi-agent manufacturing systems. In 2024 IEEE 20th International Conference on Automation Science and Engineering (CASE), pages 3940--3946, 2024

2024
[22]

Cyber-physical production systems: Roots, expectations and R&D challenges

L\'aszl\'o Monostori. Cyber-physical production systems: Roots, expectations and R&D challenges. Procedia CIRP, 17, 2014

2014
[23]

Open X-Embodiment : Robotic learning datasets and RT-X models

Open X-Embodiment Collaboration . Open X-Embodiment : Robotic learning datasets and RT-X models. In IEEE International Conference on Robotics and Automation (ICRA) , 2024

2024
[24]

Generative agents: Interactive simulacra of human behavior

Joon Sung Park et al. Generative agents: Interactive simulacra of human behavior. In ACM Symposium on User Interface Software and Technology (UIST), 2023

2023
[25]

Industrial foundation model

Lei Ren, Haiteng Wang, Jiabao Dong, et al. Industrial foundation model. IEEE Transactions on Cybernetics, 55 0 (5): 0 2286--2301, 2025

2025
[26]

Weiming Shen, Qi Hao, Hyun Joong Yoon, and Douglas H. Norrie. Applications of agent-based systems in intelligent manufacturing. Advanced Engineering Informatics, 20 0 (4), 2006

2006
[27]

Industrial copilot deployments: Thyssenkrupp Marine Systems , Erlangen Electronics Factory , and PepsiCo , 2024--2026

Siemens AG . Industrial copilot deployments: Thyssenkrupp Marine Systems , Erlangen Electronics Factory , and PepsiCo , 2024--2026. Press releases and case studies, https://press.siemens.com

2024
[28]

Herbert A. Simon. Administrative Behavior. Macmillan, 1947

1947
[29]

Herbert A. Simon. A behavioral model of rational choice. Quarterly Journal of Economics, 69 0 (1), 1955

1955
[30]

Sumers, Shunyu Yao, Karthik Narasimhan, and Thomas L

Theodore R. Sumers, Shunyu Yao, Karthik Narasimhan, and Thomas L. Griffiths. Cognitive architectures for language agents. Transactions on Machine Learning Research (TMLR), 2023

2023
[31]

A survey on large language model based autonomous agents

Lei Wang et al. A survey on large language model based autonomous agents. Frontiers of Computer Science, 18 0 (6), 2024

2024
[32]

An Introduction to Multiagent Systems

Michael Wooldridge. An Introduction to Multiagent Systems. Wiley, 2nd edition, 2009

2009
[33]

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

Qingyun Wu et al. AutoGen : Enabling next-gen LLM applications via multi-agent conversation. arXiv:2308.08155, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[34]

The Rise and Potential of Large Language Model Based Agents: A Survey

Zhiheng Xi et al. The rise and potential of large language model based agents: A survey. arXiv:2309.07864, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[35]

When embodied AI meets Industry 5.0 : Human-centered smart manufacturing

Jiawei Xu, Quanyuan Sun, Qing-Long Han, and Yang Tang. When embodied AI meets Industry 5.0 : Human-centered smart manufacturing. IEEE/CAA Journal of Automatica Sinica, 12 0 (3): 0 485--501, 2025

2025
[36]

Industry 4.0 and Industry 5.0 --- inception, conception and perception

Xun Xu, Yuqian Lu, Birgit Vogel-Heuser, and Lihui Wang. Industry 4.0 and Industry 5.0 --- inception, conception and perception. Journal of Manufacturing Systems, 61, 2021

2021

[1] [1]

Power and Progress: Our Thousand-Year Struggle Over Technology and Prosperity

Daron Acemoglu and Simon Johnson. Power and Progress: Our Thousand-Year Struggle Over Technology and Prosperity. PublicAffairs, 2023

2023

[2] [2]

Artificial intelligence, automation, and work

Daron Acemoglu and Pascual Restrepo. Artificial intelligence, automation, and work. Technical Report 24196, National Bureau of Economic Research, 2018

2018

[3] [3]

Tasks, automation, and the rise in U.S

Daron Acemoglu and Pascual Restrepo. Tasks, automation, and the rise in U.S. wage inequality. Econometrica, 90 0 (5), 2022

2022

[4] [4]

Automation and rent dissipation: Implications for wages, inequality, and productivity

Daron Acemoglu and Pascual Restrepo. Automation and rent dissipation: Implications for wages, inequality, and productivity. Technical Report 32536, National Bureau of Economic Research, 2024

2024

[5] [5]

Do as I can, not as I say: Grounding language in robotic affordances

Michael Ahn et al. Do as I can, not as I say: Grounding language in robotic affordances. In Conference on Robot Learning (CoRL), 2022

2022

[6] [6]

Sadik, Tommi Mikkonen, Muhammad Waseem, and Niko M\"akitalo

Muhammad Ashfaq, Ahmed R. Sadik, Tommi Mikkonen, Muhammad Waseem, and Niko M\"akitalo. LLM -enhanced holonic architecture for ad-hoc scalable SoS . arXiv:2501.07992, 2025

work page arXiv 2025

[7] [7]

_0 : A vision-language-action flow model for general robot control

Kevin Black et al. _0 : A vision-language-action flow model for general robot control. Technical report, Physical Intelligence, 2024. Technical report. https://www.pi.website/blog/pi0

2024

[8] [8]

RT-2 : Vision-language-action models transfer web knowledge to robotic control

Anthony Brohan et al. RT-2 : Vision-language-action models transfer web knowledge to robotic control. In Conference on Robot Learning (CoRL), 2023

2023

[9] [9]

A survey on LLM -based multi-agent system: Recent advances and new frontiers in application

Shuaihang Chen et al. A survey on LLM -based multi-agent system: Recent advances and new frontiers in application. arXiv:2412.17481, 2024

work page arXiv 2024

[10] [10]

Agentic AI and occupational displacement: A multi-regional task exposure analysis of emerging labor market disruption

Ravish Gupta and Saket Kumar. Agentic AI and occupational displacement: A multi-regional task exposure analysis of emerging labor market disruption. arXiv:2604.00186, 2026

work page arXiv 2026

[11] [11]

Friedrich A. Hayek. The use of knowledge in society. American Economic Review, 35 0 (4), 1945

1945

[12] [12]

Foundation-Model-Based Agents in Industrial Automation: Purposes, Capabilities, and Open Challenges

Vincent Henkel, Felix Gehlhoff, David Kube, Asaad Almutareb, Luis Cruz, Bernd Hellingrath, Philip Koch, Christoph Legat, Florian Mohr, Michael Oberle, Felix Ocker, Thorsten Schoeler, Mario Thron, Nico Andre T\"opfer, Lucas Vogt, and Yuchen Xia. Foundation-model-based agents in industrial automation: Purposes, capabilities, and open challenges. arXiv:2605....

work page internal anchor Pith review Pith/arXiv arXiv 2026

[13] [13]

MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework

Sirui Hong et al. MetaGPT : Meta programming for multi-agent collaborative framework. In arXiv:2308.00352, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[14] [14]

Language models as zero-shot planners: Extracting actionable knowledge for embodied agents

Wenlong Huang et al. Language models as zero-shot planners: Extracting actionable knowledge for embodied agents. In International Conference on Machine Learning (ICML), 2022

2022

[15] [15]

Cognition in the Wild

Edwin Hutchins. Cognition in the Wild. MIT Press, 1995

1995

[16] [16]

Recommendations for implementing the strategic initiative INDUSTRIE 4.0

Henning Kagermann, Wolfgang Wahlster, and Johannes Helbig. Recommendations for implementing the strategic initiative INDUSTRIE 4.0 . Technical report, Acatech, 2013

2013

[17] [17]

OpenVLA: An Open-Source Vision-Language-Action Model

Moo Jin Kim et al. OpenVLA : An open-source vision-language-action model. arXiv:2406.09246, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[18] [18]

Edward A. Lee. Cyber physical systems: Design challenges. In IEEE International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing (ISORC) , 2008

2008

[19] [19]

Industry 5.0: Prospect and retrospect

Jiewu Leng et al. Industry 5.0: Prospect and retrospect. Journal of Manufacturing Systems, 65, 2022

2022

[20] [20]

Transferring vision-language-action models to industry applications: Architectures, performance, and challenges

Shuai Li, Yizhe Chen, Dong Li, Sichao Liu, Dapeng Lan, Yu Liu, and Zhibo Pang. Transferring vision-language-action models to industry applications: Architectures, performance, and challenges. arXiv:2509.23121, 2025

work page arXiv 2025

[21] [21]

Large language model-enabled multi-agent manufacturing systems

Jonghan Lim, Birgit Vogel-Heuser, and Ilya Kovalenko. Large language model-enabled multi-agent manufacturing systems. In 2024 IEEE 20th International Conference on Automation Science and Engineering (CASE), pages 3940--3946, 2024

2024

[22] [22]

Cyber-physical production systems: Roots, expectations and R&D challenges

L\'aszl\'o Monostori. Cyber-physical production systems: Roots, expectations and R&D challenges. Procedia CIRP, 17, 2014

2014

[23] [23]

Open X-Embodiment : Robotic learning datasets and RT-X models

Open X-Embodiment Collaboration . Open X-Embodiment : Robotic learning datasets and RT-X models. In IEEE International Conference on Robotics and Automation (ICRA) , 2024

2024

[24] [24]

Generative agents: Interactive simulacra of human behavior

Joon Sung Park et al. Generative agents: Interactive simulacra of human behavior. In ACM Symposium on User Interface Software and Technology (UIST), 2023

2023

[25] [25]

Industrial foundation model

Lei Ren, Haiteng Wang, Jiabao Dong, et al. Industrial foundation model. IEEE Transactions on Cybernetics, 55 0 (5): 0 2286--2301, 2025

2025

[26] [26]

Weiming Shen, Qi Hao, Hyun Joong Yoon, and Douglas H. Norrie. Applications of agent-based systems in intelligent manufacturing. Advanced Engineering Informatics, 20 0 (4), 2006

2006

[27] [27]

Industrial copilot deployments: Thyssenkrupp Marine Systems , Erlangen Electronics Factory , and PepsiCo , 2024--2026

Siemens AG . Industrial copilot deployments: Thyssenkrupp Marine Systems , Erlangen Electronics Factory , and PepsiCo , 2024--2026. Press releases and case studies, https://press.siemens.com

2024

[28] [28]

Herbert A. Simon. Administrative Behavior. Macmillan, 1947

1947

[29] [29]

Herbert A. Simon. A behavioral model of rational choice. Quarterly Journal of Economics, 69 0 (1), 1955

1955

[30] [30]

Sumers, Shunyu Yao, Karthik Narasimhan, and Thomas L

Theodore R. Sumers, Shunyu Yao, Karthik Narasimhan, and Thomas L. Griffiths. Cognitive architectures for language agents. Transactions on Machine Learning Research (TMLR), 2023

2023

[31] [31]

A survey on large language model based autonomous agents

Lei Wang et al. A survey on large language model based autonomous agents. Frontiers of Computer Science, 18 0 (6), 2024

2024

[32] [32]

An Introduction to Multiagent Systems

Michael Wooldridge. An Introduction to Multiagent Systems. Wiley, 2nd edition, 2009

2009

[33] [33]

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

Qingyun Wu et al. AutoGen : Enabling next-gen LLM applications via multi-agent conversation. arXiv:2308.08155, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[34] [34]

The Rise and Potential of Large Language Model Based Agents: A Survey

Zhiheng Xi et al. The rise and potential of large language model based agents: A survey. arXiv:2309.07864, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[35] [35]

When embodied AI meets Industry 5.0 : Human-centered smart manufacturing

Jiawei Xu, Quanyuan Sun, Qing-Long Han, and Yang Tang. When embodied AI meets Industry 5.0 : Human-centered smart manufacturing. IEEE/CAA Journal of Automatica Sinica, 12 0 (3): 0 485--501, 2025

2025

[36] [36]

Industry 4.0 and Industry 5.0 --- inception, conception and perception

Xun Xu, Yuqian Lu, Birgit Vogel-Heuser, and Lihui Wang. Industry 4.0 and Industry 5.0 --- inception, conception and perception. Journal of Manufacturing Systems, 61, 2021

2021