Large Language Models in Process Systems Engineering: Opportunities, Architectures, and Industrial Deployment Challenges
Pith reviewed 2026-06-27 08:53 UTC · model grok-4.3
The pith
LLMs show genuine promise for natural language tasks in process systems engineering but remain challenging for real-time execution and safety guarantees.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
LLMs show genuine promise for tasks involving natural language, including querying documentation, synthesizing unstructured knowledge, and enabling flexible human-machine interaction; applications requiring real-time execution, constraint satisfaction, or formal safety guarantees remain challenging.
What carries the argument
Systematic categorization of the literature into seven application areas with critical assessment of demonstrated capabilities versus aspirational claims.
If this is right
- LLMs can effectively support querying technical documentation in process design and engineering.
- Models can synthesize unstructured knowledge for molecular design and synthesis tasks.
- Flexible human-machine interaction becomes feasible in process modeling and simulation.
- Challenges limit use in optimization, scheduling, process control, and fault detection where constraints and safety are key.
- Open problems remain for integrating LLMs into industrial systems with real-time and safety requirements.
Where Pith is reading between the lines
- Industry might start with hybrid systems where LLMs handle language interfaces and traditional tools manage calculations.
- Research could focus on fine-tuning or combining LLMs with symbolic methods to address constraint issues.
- Deployment challenges suggest need for benchmarks specific to PSE safety standards.
- Similar patterns may appear in other engineering fields like mechanical or electrical systems engineering.
Load-bearing premise
The literature reviewed is representative of current capabilities and limitations in each of the seven categories; the selection and interpretation of papers accurately reflects demonstrated versus aspirational performance.
What would settle it
A demonstration of an LLM reliably performing real-time constraint satisfaction in a process control application without human oversight would challenge the assessment of remaining challenges.
Figures
read the original abstract
Large Language Models (LLMs) have rapidly emerged as tools of interest across engineering disciplines, and Process Systems Engineering (PSE) is no exception. This survey provides a systematic review of LLM applications in PSE, organizing the literature into seven categories: (1) process design and engineering, (2) molecular design and synthesis, (3) process modeling and simulation, (4) time-series forecasting, (5) optimization and scheduling, (6) process control, and (7) fault detection and diagnosis. For each category, we summarize the state of the art, identify common methodological approaches, and critically assess demonstrated capabilities versus aspirational claims. We find that LLMs show genuine promise for tasks involving natural language, including querying documentation, synthesizing unstructured knowledge, and enabling flexible human-machine interaction. However, applications requiring real-time execution, constraint satisfaction, or formal safety guarantees remain challenging. We conclude by identifying open problems and productive research directions for the PSE community.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript is a systematic literature survey of LLM applications in Process Systems Engineering. It organizes existing work into seven categories—(1) process design and engineering, (2) molecular design and synthesis, (3) process modeling and simulation, (4) time-series forecasting, (5) optimization and scheduling, (6) process control, and (7) fault detection and diagnosis—summarizing state-of-the-art approaches in each and explicitly contrasting demonstrated capabilities with aspirational claims. The central conclusion is that LLMs show genuine promise for natural-language tasks such as querying documentation, synthesizing unstructured knowledge, and flexible human-machine interaction, while applications requiring real-time execution, constraint satisfaction, or formal safety guarantees remain challenging. The paper closes by identifying open problems and productive research directions.
Significance. If the assessment holds, the survey provides a timely, structured overview of an emerging intersection between LLMs and PSE. Its explicit separation of demonstrated versus aspirational performance is a strength that can help the community avoid over-allocation of effort to currently infeasible uses. The seven-category organization and identification of open problems offer clear guidance for future work. As a survey paper, it contains no primary empirical results, theorems, or code, but the balanced critical assessment adds value beyond a simple reference compilation.
minor comments (2)
- The abstract claims a 'systematic review' but does not state the search databases, keywords, date range, or total number of papers included; adding these details would help readers gauge the survey's scope without altering the central claims.
- In the concluding section on open problems, the listed directions could be more explicitly mapped back to the seven categories to improve traceability for readers focused on a specific application area.
Simulated Author's Rebuttal
We thank the referee for their positive and constructive review. We are pleased that the assessment recognizes the value of our systematic categorization, the distinction between demonstrated and aspirational capabilities, and the identification of open problems. The recommendation to accept is appreciated.
Circularity Check
No significant circularity in literature survey
full rationale
This is a systematic review paper that organizes existing external literature into seven categories, summarizes approaches, and contrasts demonstrated versus aspirational performance. No original derivations, equations, fitted parameters, theorems, or quantitative predictions are advanced. All claims rest on cited external references rather than internal constructions or self-citation chains. The central conclusions are therefore self-contained against external benchmarks and receive the default non-circularity finding for survey-style work.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
AIChE Journal , volume=
Toward self-driving processes: A deep reinforcement learning approach to control , author=. AIChE Journal , volume=. 2019 , publisher=
2019
-
[2]
Advances in Neural Information Processing Systems , volume=
Attention is all you need , author=. Advances in Neural Information Processing Systems , volume=
-
[3]
Scaling Laws for Neural Language Models
Scaling laws for neural language models , author=. arXiv preprint arXiv:2001.08361 , year=
work page internal anchor Pith review Pith/arXiv arXiv 2001
-
[4]
Advances in Neural Information Processing Systems , volume=
Training language models to follow instructions with human feedback , author=. Advances in Neural Information Processing Systems , volume=
-
[5]
AIChE Journal , volume=
Toward automatic generation of control structures for process flow diagrams with large language models , author=. AIChE Journal , volume=. 2024 , publisher=
2024
-
[6]
Talking like piping and instrumentation diagrams (
Alimin, Achmad Anggawirya and Goldstein, Dominik P and Schulze Balhorn, Lukas and Schweidtmann, Artur M , booktitle=. Talking like piping and instrumentation diagrams (. 2025 , address=
2025
-
[7]
Computer Aided Chemical Engineering , volume=
Toward autocorrection of chemical process flowsheets using large language models , author=. Computer Aided Chemical Engineering , volume=. 2024 , publisher=
2024
-
[8]
Nature , volume=
Autonomous chemical research with large language models , author=. Nature , volume=. 2023 , publisher=
2023
-
[9]
Nature Machine Intelligence , volume=
Augmenting large language models with chemistry tools , author=. Nature Machine Intelligence , volume=. 2024 , publisher=
2024
-
[10]
Nature Machine Intelligence , volume=
Leveraging large language models for predictive chemistry , author=. Nature Machine Intelligence , volume=. 2024 , publisher=
2024
-
[11]
Chemical Science , volume=
A review of large language models and autonomous agents in chemistry , author=. Chemical Science , volume=. 2025 , publisher=
2025
-
[12]
Frontiers of Chemical Science and Engineering , volume=
The potential and challenges of large language model agent systems in chemical process simulation: from automated modeling to intelligent design , author=. Frontiers of Chemical Science and Engineering , volume=. 2025 , publisher=
2025
-
[13]
Vyas, Javal and Mercang. Autonomous control leveraging. arXiv preprint arXiv:2507.07115 , year=
-
[14]
Gill, Milapji Singh and Vyas, Javal and Markaj, Artan and Gehlhoff, Felix and Mercang. Leveraging. arXiv preprint arXiv:2505.02076 , year=
-
[15]
2025 IEEE 30th International Conference on Emerging Technologies and Factory Automation (ETFA) , pages=
Control industrial automation system with large language model agents , author=. 2025 IEEE 30th International Conference on Emerging Technologies and Factory Automation (ETFA) , pages=. 2025 , organization=
2025
-
[16]
Jin, Ming and Wang, Shiyu and Ma, Lintao and Chu, Zhixuan and Zhang, James Y and Shi, Xiaoming and Chen, Pin-Yu and Liang, Yuxuan and Li, Yuan-Fang and Pan, Shirui and Wen, Qingsong , booktitle=. Time-
-
[17]
Qaid, Hamzah A. A. M. and Zhang, Bo and Li, Dan and Ng, See-Kiong and Li, Wei , journal=
-
[18]
Nature Chemical Engineering , volume=
Generative artificial intelligence in chemical engineering , author=. Nature Chemical Engineering , volume=. 2024 , publisher=
2024
-
[19]
Digital Chemical Engineering , volume=
Digitization of chemical process flow diagrams using deep convolutional neural networks , author=. Digital Chemical Engineering , volume=. 2023 , publisher=
2023
-
[20]
2023 , publisher=
Vogel, Gabriel and Hirtreiter, Edwin and Schulze Balhorn, Lukas and Schweidtmann, Artur M , journal=. 2023 , publisher=
2023
-
[21]
A multiagent-driven robotic
Song, Tao and Luo, Man and Zhang, Xiaolong and others , journal=. A multiagent-driven robotic. 2025 , publisher=
2025
-
[22]
Nature , volume=
An autonomous laboratory for the accelerated synthesis of novel materials , author=. Nature , volume=. 2023 , publisher=
2023
-
[23]
Nature , volume=
Autonomous mobile robots for exploratory synthetic chemistry , author=. Nature , volume=. 2024 , publisher=
2024
-
[24]
Multimodal large language model-based fault detection and diagnosis in context of
Alsaif, Khalid M and Albeshri, Aiiad A and Khemakhem, Maher A and Eassa, Fathy E , journal=. Multimodal large language model-based fault detection and diagnosis in context of. 2024 , publisher=
2024
-
[25]
2024 , organization=
Koziolek, Heiko and Koziolek, Anne , booktitle=. 2024 , organization=
2024
-
[26]
2025 , address=
Goldstein, Dominik P and Schulze Balhorn, Lukas and Alimin, Achmad Anggawirya and Schweidtmann, Artur M , booktitle=. 2025 , address=
2025
-
[27]
Rule-based autocorrection of Piping and Instrumentation Diagrams (
Schulze Balhorn, Lukas and Seijsener, Niels and Dao, Kevin and Kim, Minji and Goldstein, Dominik P and Driessen, Ge H M and Schweidtmann, Artur M , booktitle=. Rule-based autocorrection of Piping and Instrumentation Diagrams (. 2025 , address=
2025
-
[28]
Robotics and Computer-Integrated Manufacturing , volume=
A survey on potentials, pathways and challenges of large language models in new-generation intelligent manufacturing , author=. Robotics and Computer-Integrated Manufacturing , volume=. 2025 , publisher=
2025
-
[29]
IEEE/CAA Journal of Automatica Sinica , volume=
Reinforcement learning in process industries: Review and perspective , author=. IEEE/CAA Journal of Automatica Sinica , volume=
-
[30]
Zeng, Tong and Badrinarayanan, Srivathsan and Ock, Janghoon and Lai, Cheng-Kai and Barati Farimani, Amir , journal=
-
[31]
Guo, Xingang and Keivan, Darioush and Syed, Usman and Qin, Lianhui and Zhang, Huan and Dullerud, Geir and Seiler, Peter and Hu, Bin , journal=
-
[32]
Nature Communications , volume=
An automatic end-to-end chemical synthesis development platform powered by large language models , author=. Nature Communications , volume=. 2024 , publisher=
2024
-
[33]
Computers & Operations Research , pages=
Discovering heuristics with Large Language Models (. Computers & Operations Research , pages=. 2025 , publisher=
2025
-
[34]
2025 , publisher=
Tao, Laifa and Liu, Haifei and Ning, Guoao and Cao, Wenyan and Huang, Bohao and Lu, Chen , journal=. 2025 , publisher=
2025
-
[35]
IEEE Transactions on Industrial Informatics , volume=
Joint knowledge graph and large language model for fault diagnosis and its application in aviation assembly , author=. IEEE Transactions on Industrial Informatics , volume=. 2024 , publisher=
2024
-
[36]
Advances in Neural Information Processing Systems , volume=
Large language models are zero-shot time series forecasters , author=. Advances in Neural Information Processing Systems , volume=
-
[37]
Transactions on Machine Learning Research , year=
Chronos: Learning the language of time series , author=. Transactions on Machine Learning Research , year=
-
[38]
Proceedings of the 41st International Conference on Machine Learning (ICML) , volume=
A decoder-only foundation model for time-series forecasting , author=. Proceedings of the 41st International Conference on Machine Learning (ICML) , volume=. 2024 , publisher=
2024
-
[39]
Rasul, Kashif and Ashok, Arjun and Williams, Andrew Robert and Ghonia, Hena and Bhagwatkar, Rishika and Khorasani, Arian and Bayazi, Mohammad Javad Darvishi and Adamopoulos, George and Riachi, Roland and Hassen, Nadhir and others , journal=. Lag-
-
[40]
Leveraging generative
Woo, TaeYong and Kim, SangYoun and Tariq, Shahzeb and Heo, SungKu and Yoo, ChangKyoo , journal=. Leveraging generative. 2025 , publisher=
2025
-
[41]
Processes , volume=
A comparative review of large language models in engineering with emphasis on chemical engineering applications , author=. Processes , volume=. 2025 , publisher=
2025
-
[42]
Proceedings of the 41st International Conference on Machine Learning (ICML) , volume=
Unified training of universal time series forecasting transformers , author=. Proceedings of the 41st International Conference on Machine Learning (ICML) , volume=. 2024 , publisher=
2024
-
[43]
Garza, Azul and Challu, Cristian and Mergenthaler-Canseco, Max , journal=
-
[44]
Liu, Xu and Liu, Juncheng and Woo, Gerald and Aksu, Taha and Liang, Yuxuan and Zimmermann, Roger and Liu, Chenghao and Savarese, Silvio and Xiong, Caiming and Sahoo, Doyen , journal=. Moirai-
-
[45]
Goswami, Mononito and Szafer, Konrad and Choudhry, Arjun and Cai, Yifu and Li, Shuo and Dubrawski, Artur , journal=
-
[46]
2025 , publisher=
Zhang, Qi and Xu, Chao and Li, Jie and Sun, Yicheng and Bao, Jinsong and Zhang, Dan , journal=. 2025 , publisher=
2025
-
[47]
arXiv preprint arXiv:2405.14755 , year=
Large language models can be zero-shot anomaly detectors for time series? , author=. arXiv preprint arXiv:2405.14755 , year=
-
[48]
Gu, Yile and Xiong, Yifan and Mace, Jonathan and Jiang, Yuting and Hu, Yigong and Kasikci, Baris and Cheng, Peng , journal=
-
[49]
2025 , publisher=
Maher, Gabriel , journal=. 2025 , publisher=
2025
-
[50]
Sha, Hao and Mu, Yao and Jiang, Yuxuan and Chen, Li and Xu, Chenfeng and Luo, Ping and Li, Shengbo Eben and Tomizuka, Masayoshi and Zhan, Wei and Ding, Mingyu , journal=
-
[51]
Long, Keke and Shi, Haotian and Liu, Jiaxi and Li, Xiaopeng , journal=
-
[52]
Fang, Yin and Liang, Xiaozhuan and Zhang, Ningyu and Liu, Kangwei and Huang, Rui and Chen, Zhuo and Fan, Xiaohui and Chen, Huajun , journal=. Mol-
-
[53]
Galactica: A Large Language Model for Science
Galactica: A large language model for science , author=. arXiv preprint arXiv:2211.09085 , year=
work page internal anchor Pith review Pith/arXiv arXiv
-
[54]
Proceedings of the 40th International Conference on Machine Learning (ICML) , volume=
Unifying molecular and textual representations via multi-task language modelling , author=. Proceedings of the 40th International Conference on Machine Learning (ICML) , volume=. 2023 , publisher=
2023
-
[55]
2024 , publisher=
Pei, Qizhi and Wu, Lijun and Gao, Kaiyuan and Liang, Xiaozhuan and Fang, Yin and Zhu, Jinhua and Xie, Shufang and Qin, Tao and Yan, Rui , booktitle=. 2024 , publisher=
2024
-
[56]
Proceedings of the 34th International Joint Conference on Artificial Intelligence (IJCAI) , year=
Enhancing chemical reaction and retrosynthesis prediction with large language model and dual-task learning , author=. Proceedings of the 34th International Joint Conference on Artificial Intelligence (IJCAI) , year=
-
[57]
arXiv preprint arXiv:2507.17448 , year=
Reasoning-driven retrosynthesis prediction with large language models via reinforcement learning , author=. arXiv preprint arXiv:2507.17448 , year=
-
[58]
Chinese Journal of Catalysis , volume=
From lab to fab: A large language model for chemical engineering , author=. Chinese Journal of Catalysis , volume=. 2025 , publisher=
2025
-
[59]
What can large language models do in chemistry?
Guo, Taicheng and Guo, Kehan and Nan, Bozhao and Liang, Zhenwen and Guo, Zhichun and Chawla, Nitesh V and Wiest, Olaf and Zhang, Xiangliang , booktitle=. What can large language models do in chemistry?
-
[60]
Large Language Model Agent for User-friendly Chemical Process Simulations
Large language model agent for user-friendly chemical process simulations , author=. arXiv preprint arXiv:2601.11650 , year=
work page internal anchor Pith review Pith/arXiv arXiv
-
[61]
ACS Central Science , volume=
Leveraging prompt engineering in large language models for accelerating chemical research , author=. ACS Central Science , volume=. 2025 , publisher=
2025
-
[62]
Nature , volume=
A mobile robotic chemist , author=. Nature , volume=. 2020 , publisher=
2020
-
[63]
Nature , volume=
Magnetic control of tokamak plasmas through deep reinforcement learning , author=. Nature , volume=. 2022 , publisher=
2022
-
[64]
International Conference on Learning Representations , year=
A time series is worth 64 words: Long-term forecasting with transformers , author=. International Conference on Learning Representations , year=
-
[65]
International Conference on Learning Representations , year=
iTransformer: Inverted transformers are effective for time series forecasting , author=. International Conference on Learning Representations , year=
-
[66]
Wu, Qingyun and Bansal, Gagan and Zhang, Jieyu and Wu, Yiran and Li, Beibin and Zhu, Erkang and Jiang, Li and Zhang, Xiaoyun and Zhang, Shaokun and Liu, Jiale and Awadallah, Ahmed Hassan and White, Ryen W and Burger, Doug and Wang, Chi , booktitle=
-
[67]
Towards building the federated
Zhang, Jianyi and Vahidian, Saeed and Kuo, Martin and Li, Chunyuan and Zhang, Ruiyi and Yu, Tong and Zhou, Yufan and Wang, Guoyin and Chen, Yiran , journal=. Towards building the federated
-
[68]
Concrete Problems in AI Safety
Amodei, Dario and Olah, Chris and Steinhardt, Jacob and Christiano, Paul and Schulman, John and Man. Concrete problems in. arXiv preprint arXiv:1606.06565 , year=
work page internal anchor Pith review Pith/arXiv arXiv
-
[69]
Control Engineering Practice , volume=
Machine learning for industrial sensing and control: A survey and practical perspective , author=. Control Engineering Practice , volume=. 2024 , publisher=
2024
-
[70]
Zhang, Di and Liu, Wei and Tan, Qian and Chen, Jingdan and Yan, Hang and Yan, Yuliang and Li, Jiatong and Huang, Weiran and Yue, Xiangyu and Zhou, Dongzhan and others , journal=
-
[71]
Computer Aided Chemical Engineering , volume=
Reinforcement learning--overview of recent progress and implications for process control , author=. Computer Aided Chemical Engineering , volume=. 2018 , publisher=
2018
-
[72]
Current Opinion in Chemical Engineering , volume=
Multi-agent systems for chemical engineering: A review and perspective , author=. Current Opinion in Chemical Engineering , volume=. 2026 , publisher=
2026
-
[73]
Nature Reviews Physics , volume=
Physics-informed machine learning , author=. Nature Reviews Physics , volume=. 2021 , publisher=
2021
-
[74]
On the Opportunities and Risks of Foundation Models
On the opportunities and risks of foundation models , author=. arXiv preprint arXiv:2108.07258 , year=
work page internal anchor Pith review Pith/arXiv arXiv
-
[75]
Advances in Neural Information Processing Systems , volume=
Chain-of-thought prompting elicits reasoning in large language models , author=. Advances in Neural Information Processing Systems , volume=
-
[76]
Retrieval-augmented generation for knowledge-intensive
Lewis, Patrick and Perez, Ethan and Piktus, Aleksandra and Petroni, Fabio and Karpukhin, Vladimir and Goyal, Naman and K. Retrieval-augmented generation for knowledge-intensive. Advances in Neural Information Processing Systems , volume=
-
[77]
A review of process fault detection and diagnosis: Part
Venkatasubramanian, Venkat and Rengaswamy, Raghunathan and Yin, Kewen and Kavuri, Surya N , journal=. A review of process fault detection and diagnosis: Part. 2003 , publisher=
2003
-
[78]
A review of process fault detection and diagnosis: Part
Venkatasubramanian, Venkat and Rengaswamy, Raghunathan and Kavuri, Surya N , journal=. A review of process fault detection and diagnosis: Part. 2003 , publisher=
2003
-
[79]
A review of process fault detection and diagnosis: Part
Venkatasubramanian, Venkat and Rengaswamy, Raghunathan and Kavuri, Surya N and Yin, Kewen , journal=. A review of process fault detection and diagnosis: Part. 2003 , publisher=
2003
-
[80]
Computers & Chemical Engineering , volume=
Advances in mathematical programming models for enterprise-wide optimization , author=. Computers & Chemical Engineering , volume=. 2012 , publisher=
2012
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.