pith. sign in

arxiv: 2602.03433 · v1 · pith:6CDR5A3Znew · submitted 2026-02-03 · 📡 eess.SY · cs.SY

When control meets large language models: From words to dynamics

Pith reviewed 2026-05-21 14:46 UTC · model grok-4.3

classification 📡 eess.SY cs.SY
keywords large language modelscontrol theoryprompt designsystem dynamicsalignmentinterpretabilitystate-space frameworkbidirectional continuum
0
0 comments X

The pith

Large language models and control theory form a bidirectional continuum from prompt design to system dynamics.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to establish that LLMs and control theory are connected in both directions, so that language models can help build and improve control systems while control ideas can be used to guide and stabilize LLM behavior. A reader would care because this framing could make AI tools more useful in engineering tasks like controller design and make the models themselves more reliable and understandable for real-world use. It maps how LLMs assist control work directly through design help and indirectly through research support. It then shows control methods improving LLMs via input changes, parameter edits, and activation tweaks. Finally it treats the models as state-space systems tied to external loops and flags open challenges for future work.

Core claim

The paper claims that the interconnection between LLMs and control theory is best understood as a bidirectional continuum running from prompt design to full system dynamics. LLMs advance control directly by assisting in controller design and synthesis and indirectly by augmenting research workflows. Control concepts in turn steer LLM trajectories away from undesired outputs, improving reachability and alignment through input optimization, parameter editing, and activation-level interventions. Deeper integration comes from viewing LLMs as dynamic systems in a state-space framework whose internal states connect to external control loops. The goal is to develop LLMs that are as interpretable, (

What carries the argument

The bidirectional continuum linking prompt design to system dynamics, in which prompts aid control synthesis while control methods optimize LLM inputs, parameters, and activations.

Load-bearing premise

That control-theoretic interventions such as input optimization and activation changes can steer LLM behavior without degrading core language performance or creating new instabilities.

What would settle it

A controlled test in which applying input optimization or activation interventions to a standard LLM produces no measurable gain in alignment metrics or causes a clear drop in language task accuracy would falsify the claimed benefits.

Figures

Figures reproduced from arXiv: 2602.03433 by Aleksei Tepljakov, Eduard Petlenkov, Juri Belikov, Komeil Nosrati.

Figure 1
Figure 1. Figure 1: Capabilities of LLMS. paradigm based on next-token prediction [8]. The latter combines both approaches, unifying NLP tasks under a text-to-text framework and enabling flexible sequence-to￾sequence modeling for both comprehension and genera￾tion [9]. Building upon these foundations, the GPT family developed by OpenAI pioneered large-scale self-supervised pre-training [10], enabling strong zero- and few-shot… view at source ↗
Figure 2
Figure 2. Figure 2: Conceptual map of three key intersections between control and LLMs. • What are challenges and future trends at this intersection? By addressing these questions, we aim to provide a struc￾tured understanding of this rapidly developing field and motivate future work at the convergence of dynamical mod￾eling, control theory, and LLMs. While we have already examined why this intersection matters and will conti… view at source ↗
Figure 3
Figure 3. Figure 3: Stanford Cart on an obstacle course with a young H. Moravec (1977) [PITH_FULL_IMAGE:figures/full_fig_p003_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: When control meets LLMs: A timeline. feedback-driven approaches (see [PITH_FULL_IMAGE:figures/full_fig_p003_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: The early RLHF pipeline. had unified insights from psychology, optimal control, and temporal-difference learning, establishing RL as a distinct discipline. Conceptually, RL is grounded in control-theoretic frameworks such as Markov decision processes (MDPs), paralleling optimal control, where an agent seeks to mini￾mize cumulative cost or maximize reward. A key milestone was Q-learning (Watkins, 1989 [53];… view at source ↗
Figure 6
Figure 6. Figure 6: LLM-assisted controller tuning (out-of-the-loop and in-the-loop) [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Step response of the plant in Listing 4 under the PID controller with gains obtained via offline LLM-based tuning. proposed analytically and then validated through simulation. Pre-trained LLMs are effective in this setting: given high￾level performance requirements (e.g., rise time 𝑇𝑟 , settling time 𝑇𝑠 , overshoot limits 𝑀𝑝 , etc.), they can quickly generate multiple candidate gain sets, reducing manual t… view at source ↗
Figure 8
Figure 8. Figure 8: , the framework systematically selects, integrates, and iteratively refines strategies by analyzing task objectives, environmental constraints, and system dynamics, retrieving relevant API definitions, input/output specifications, and integration requirements to construct executable plans. For example, in a robot navigation task, AuDeRe identifies suit￾able algorithms, configures parameters, and refines so… view at source ↗
Figure 10
Figure 10. Figure 10: An end-to-end LLM-based control architecture composed of two layers: functional agents and a code agent. range of application domains, including building energy management systems (EMS) [141], power systems [142, 143, 144, 145], robotics [146, 147], transportation [148], in￾dustrial automation [66], biology [149], cybersecurity [150], aerospace [151, 152], marine systems [153], and other emerging areas [1… view at source ↗
Figure 9
Figure 9. Figure 9: LLM-based workflow for generating and validating control invariants to derive new attack patterns. MPC and adaptive control laws. Through sequential collab￾oration and continuous refinement, the framework achieves a high degree of autonomy in controller design, ensuring sta￾ble closed-loop performance while detecting and correcting degradation during deployment. Empirical results indicate the LLM-driven ag… view at source ↗
Figure 11
Figure 11. Figure 11: Control theoretic perspective of LLMs. that 𝑓𝜃𝑒 (𝑥𝑒 ) = 𝑦𝑒 while maintaining its behavior on unrelated inputs: 𝑓𝜃𝑒 (𝑥) = { 𝑦𝑒 , 𝑖𝑓 𝑥 ∈ 𝑖(𝑥𝑒 , 𝑦𝑒 ) 𝑓𝜃 (𝑥), 𝑖𝑓 𝑥 ∈ 𝑜(𝑥𝑒 , 𝑦𝑒 ), where 𝑖(𝑥𝑒 , 𝑦𝑒 ) denotes the in-scope region, typically in￾cluding 𝑥𝑒 and semantically related inputs 𝑛(𝑥𝑒 , 𝑦𝑒 ), and 𝑜(𝑥𝑒 , 𝑦𝑒 ) denotes the out-of-scope region of unrelated inputs. A successful model edit is typically evaluated a… view at source ↗
Figure 12
Figure 12. Figure 12: compares the computation patterns of RNNs, Transformers, and SSMs. RNNs rely on nonlinear recur￾rent updates, enabling fast autoregressive outputs but with limited parallelism and slower training. Transformers com￾pute large matrix multiplications across query–key pairs in parallel, allowing efficient training but slow autoregressive inference. Discrete SSMs can operate in either recurrent or convolutiona… view at source ↗
Figure 13
Figure 13. Figure 13: Overview of S-SSM with hardware-aware state expansions. dimensions. The S-SSM is then discretized as 𝐴̄ → 𝑆 𝐴̄ = exp(𝑆 Δ𝐴), 𝐵̄ → 𝑆 𝐵̄ = (𝑆Δ𝐴) −1( exp(𝑆 Δ𝐴) − 𝐼 ) 𝑆 Δ𝑆 𝐵 , where 𝑆 𝐴̄ ∈ ℝ𝑀×𝐿×𝐷×𝑁 and 𝑆 𝐵̄ ∈ ℝ𝑀×𝐿×𝐷×𝑁 are the selective state-transition and input matrices, now explicit functions of the input 𝑥. Consequently, the discrete SSM becomes a linear time-varying (LTV) (i.e., content-aware) system 𝑦 = S… view at source ↗
Figure 14
Figure 14. Figure 14: Comparison of Mamba and Mamba-2 architectures. controllability explicit, simplifying controller design, state feedback, and pole-zero placement. For an LTI system with transfer function 𝐻(𝑠) = 𝑏𝑛−1𝑠 𝑛−1 + 𝑏𝑛−2𝑠 𝑛−2 + ⋯ + 𝑏1 𝑠 + 𝑏0 𝑠 𝑛 + 𝑎𝑛−1𝑠 𝑛−1 + ⋯ + 𝑎1 𝑠 + 𝑎0 , the state matrix 𝐴 in the CCF is given as 𝐴𝑐 = ⎡ ⎢ ⎢ ⎢ ⎢ ⎣ 0 1 0 ⋯ 0 0 0 0 1 ⋯ 0 0 ⋮ ⋮ ⋱ ⋮ ⋮ 0 0 0 ⋯ 0 1 −𝑎𝑛−1 −𝑎𝑛−2 −𝑎𝑛−3 ⋯ −𝑎1 −𝑎0 ⎤ ⎥ ⎥ ⎥ ⎥ … view at source ↗
Figure 15
Figure 15. Figure 15: The largest risks faced by the world. 4.1. Challenges 4.1.1. LLM for Control (Indirect) LLMs offer substantial potential to enhance control re￾search workflows by supporting literature synthesis, code scaffolding, data preprocessing, and structured report gen￾eration. However, current uses are mostly conceptual and illustrative, with little empirical evidence of their impact on research quality. Several m… view at source ↗
Figure 16
Figure 16. Figure 16: Challenges and research opportunities at LLM–control interface. with stability and controllability guarantees, along with real-time deployment studies, will be key for safe and practical operation [PITH_FULL_IMAGE:figures/full_fig_p022_16.png] view at source ↗
read the original abstract

While large language models (LLMs) are transforming engineering and technology through enhanced control capabilities and decision support, they are simultaneously evolving into complex dynamical systems whose behavior must be regulated. This duality highlights a reciprocal connection in which prompts support control system design while control theory helps shape prompts to achieve specific goals efficiently. In this study, we frame this emerging interconnection of LLM and control as a bidirectional continuum, from prompt design to system dynamics. First, we investigate how LLMs can advance the field of control in two distinct capacities: directly, by assisting in the design and synthesis of controllers, and indirectly, by augmenting research workflows. Second, we examine how control concepts help LLMs steer their trajectories away from undesired meanings, improving reachability and alignment via input optimization, parameter editing, and activation-level interventions. Third, we look into deeper integrations by treating LLMs as dynamic systems within a state-space framework, where their internal representations are closely linked to external control loops. Finally, we identify key challenges and outline future research directions to understand LLM behavior and develop interpretable and controllable LLMs that are as trustworthy and robust as their electromechanical counterparts, thereby ensuring they continue to support and safeguard society.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript is a perspective paper that frames the relationship between large language models (LLMs) and control theory as a bidirectional continuum, ranging from prompt design to dynamical system modeling. It argues that LLMs can advance control engineering both directly (via controller design assistance) and indirectly (via workflow augmentation), while control-theoretic tools can improve LLM reachability, alignment, and interpretability through input optimization, parameter editing, and activation interventions. The paper further proposes treating LLMs as state-space dynamical systems and concludes by outlining challenges and future directions for developing trustworthy, controllable LLMs.

Significance. If the proposed conceptual connections are pursued with concrete formalizations and experiments, the work could help bridge the control systems and machine learning communities, potentially yielding more interpretable and robust LLM-based systems. The perspective is timely given the growing use of LLMs in engineering applications, but its value rests on stimulating follow-on technical research rather than on any new results presented here.

major comments (2)
  1. The central framing in the abstract and the section examining control concepts for LLMs asserts that interventions such as parameter editing and activation-level changes can steer LLM trajectories to improve alignment without degrading core language capabilities; however, this assumption is presented without any supporting derivation, reference to existing stability analyses, or discussion of potential instabilities, which is load-bearing for the claim of enhanced reachability and interpretability.
  2. In the discussion of LLMs as dynamic systems within a state-space framework, the manuscript links internal representations to external control loops but provides no explicit state-space equations, observability/controllability conditions, or example mappings from token sequences to state vectors; this absence weakens the proposed deeper integration and leaves the dynamical-systems analogy at a high level.
minor comments (2)
  1. The abstract and introduction would benefit from a clearer delineation of which parts are literature synthesis versus original framing, to help readers distinguish the paper's contributions from prior work on LLM prompting and alignment.
  2. Several terms (e.g., 'reachability' and 'activation-level interventions') are used without initial definitions or references to standard control or LLM literature, which could reduce accessibility for readers from either community.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback and recommendation for minor revision. We address each major comment below with specific plans for strengthening the manuscript while preserving its perspective nature.

read point-by-point responses
  1. Referee: The central framing in the abstract and the section examining control concepts for LLMs asserts that interventions such as parameter editing and activation-level changes can steer LLM trajectories to improve alignment without degrading core language capabilities; however, this assumption is presented without any supporting derivation, reference to existing stability analyses, or discussion of potential instabilities, which is load-bearing for the claim of enhanced reachability and interpretability.

    Authors: We thank the referee for identifying this gap in grounding. As a perspective paper, the manuscript focuses on outlining bidirectional connections rather than new derivations. We agree that additional context is warranted. In the revision, we will add references to existing literature on activation steering (e.g., works on representation engineering and steering vectors) and stability analyses of fine-tuned or edited LLMs. We will also include a concise discussion of potential instabilities, such as unintended capability degradation or trajectory divergence, and note how control-theoretic regularization could help mitigate them. These changes will appear in the section on control concepts for LLMs. revision: yes

  2. Referee: In the discussion of LLMs as dynamic systems within a state-space framework, the manuscript links internal representations to external control loops but provides no explicit state-space equations, observability/controllability conditions, or example mappings from token sequences to state vectors; this absence weakens the proposed deeper integration and leaves the dynamical-systems analogy at a high level.

    Authors: We appreciate this suggestion for greater concreteness. The current treatment is intentionally high-level to emphasize the conceptual framework and stimulate future work. To address the comment, the revised manuscript will include an illustrative example: a simplified state-space mapping where token embeddings serve as inputs, hidden-layer activations as states, and next-token predictions as outputs, with a brief discussion of how prompt-based inputs could relate to controllability. We will explicitly state that full observability and controllability conditions remain open research questions. This addition will be placed in the state-space framework section. revision: partial

Circularity Check

0 steps flagged

No significant circularity

full rationale

The manuscript is a perspective paper that outlines conceptual connections between control theory and LLMs as a bidirectional continuum from prompt design to system dynamics. It advances no new theorems, equations, derivations, or empirical results. No load-bearing steps reduce by construction to self-definitions, fitted inputs renamed as predictions, or self-citation chains. The framing is presented explicitly as a perspective device rather than a proven equivalence, rendering the analysis self-contained with no circular reductions.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claims rest on the domain assumption that LLMs possess internal dynamics amenable to control interventions, without independent evidence or formalization supplied in the abstract.

axioms (1)
  • domain assumption Control theory concepts such as state-space representations and input optimization can be directly transferred to LLM internal states and output trajectories
    Invoked when examining how control helps steer LLMs and when treating LLMs as dynamic systems in a state-space framework.

pith-pipeline@v0.9.0 · 5751 in / 1179 out tokens · 72620 ms · 2026-05-21T14:46:49.730149+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

263 extracted references · 263 canonical work pages · 19 internal anchors

  1. [1]

    A survey on large language model benchmarks,

    S. Niet al., “A survey on large language model benchmarks,”arXiv preprint arXiv:2508.15361, 2025

  2. [2]

    Attention is all you need,

    A. Vaswaniet al., “Attention is all you need,”Adv. Neural Inf. Process. Syst., vol. 30, 2017

  3. [3]

    Large Language Models: A Survey

    S.Minaeeetal.,“Largelanguagemodels:Asurvey,”arXivpreprint arXiv:2402.06196, 2024

  4. [4]

    Chain-of-thought prompting elicits reasoning in large language models,

    J. Weiet al., “Chain-of-thought prompting elicits reasoning in large language models,”Adv. Neural Inf. Process. Syst., vol. 35, pp. 24824–24837, 2022

  5. [5]

    Augmented language models: A survey,

    G. Mialonet al., “Augmented language models: A survey,”Trans. Mach. Learn. Res., 2023

  6. [6]

    React: Synergizing reasoning and acting in language models,

    S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. R. Narasimhan, and Y. Cao, “React: Synergizing reasoning and acting in language models,” inProc. 11th Int. Conf. Learn. Representations (ICLR), 2023

  7. [7]

    BERT: Pre- trainingofdeepbidirectionaltransformersforlanguageunderstand- ing,

    J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, “BERT: Pre- trainingofdeepbidirectionaltransformersforlanguageunderstand- ing,” inProc. 2019 Conf. North American Chapter of the Associa- tionforComputationalLinguistics:HumanLanguageTechnologies, 2019, pp. 4171–4186

  8. [8]

    Im- proving language understanding by generative pre-training,

    A. Radford, K. Narasimhan, T. Salimans, and I. Sutskever, “Im- proving language understanding by generative pre-training,”Online manuscript, 2018

  9. [9]

    Exploring the limits of transfer learning with a unifiedtext-to-texttransformer,

    C. Raffelet al., “Exploring the limits of transfer learning with a unifiedtext-to-texttransformer,”J.Mach.Learn.Res.,vol.21,no.1, pp. 5485–5551, 2020

  10. [10]

    A brief overview of ChatGPT: The history, status quoandpotentialfuturedevelopment,

    T. Wuet al., “A brief overview of ChatGPT: The history, status quoandpotentialfuturedevelopment,”IEEE/CAAJ.Autom.Sinica, vol. 10, no. 5, pp. 1122–1136, 2023

  11. [11]

    LLaMA: Open and Efficient Foundation Language Models

    H.Touvronetal.,“LLAMA:Openandefficientfoundationlanguage models,”arXiv preprint arXiv:2302.13971, 2023

  12. [12]

    PaLM: Scaling language modeling with pathways,

    A. Chowdheryet al., “PaLM: Scaling language modeling with pathways,”J. Mach. Learn. Res., vol. 24, no. 240, pp. 1–113, 2023

  13. [13]

    Gemini: A Family of Highly Capable Multimodal Models

    R. Anilet al., “Gemini: A family of highly capable multimodal models,”arXiv preprint arXiv:2312.11805, 2023

  14. [14]

    DeepSeek:Paradigmshiftsandtechnicalevolution in large AI models,

    L.Xiongetal.,“DeepSeek:Paradigmshiftsandtechnicalevolution in large AI models,”IEEE/CAA J. Autom. Sinica, vol. 12, no. 5, pp. 841–858, 2025

  15. [15]

    Exploring DeepSeek: A survey on advances, ap- plications, challenges and future directions,

    Z. Denget al., “Exploring DeepSeek: A survey on advances, ap- plications, challenges and future directions,”IEEE/CAA J. Autom. Sinica, vol. 12, no. 5, pp. 872–893, 2025

  16. [16]

    LaMDA: Language Models for Dialog Applications

    R. Thoppilanet al., “LaMDA: Language models for dialog applica- tions,”arXiv preprint arXiv:2201.08239, 2022

  17. [17]

    Large Language Model Agent: A Survey on Methodology, Applications and Challenges

    J. Luoet al., “Large language model agent: A survey on methodol- ogy,applicationsandchallenges,”arXivpreprintarXiv:2503.21460, 2025

  18. [18]

    Large language models are human-level prompt engineers,

    Y. Zhouet al., “Large language models are human-level prompt engineers,” inProc. 11th Int. Conf. Learn. Representations (ICLR), 2023

  19. [19]

    Retrieval-Augmented Generation for Large Language Models: A Survey

    Y. Gaoet al., “Retrieval-augmented generation for large language models: A survey,”arXiv preprint arXiv:2312.10997, 2023

  20. [20]

    Hugging- GPT: Solving AI tasks with ChatGPT and its friends in Hugging Face,

    Y. Shen, K. Song, X. Tan, D. Li, W. Lu, and Y. Zhuang, “Hugging- GPT: Solving AI tasks with ChatGPT and its friends in Hugging Face,”Adv. Neural Inf. Process. Syst., vol. 36, pp. 38154–38180, 2023

  21. [21]

    ART: Automatic multi-step reasoning and tool-use for large language models

    B.Paranjapeetal.,“ART:Automaticmulti-stepreasoningandtool- use for large language models,”arXiv preprint arXiv:2303.09014, 2023

  22. [22]

    Toolformer:Languagemodelscanteachthemselves to use tools,

    T.Schicketal.,“Toolformer:Languagemodelscanteachthemselves to use tools,”Adv. Neural Inf. Process. Syst., vol. 36, pp. 68539– 68551, 2023

  23. [23]

    Gorilla: Large language model connected with massiveAPIs,

    S. G. Patilet al., “Gorilla: Large language model connected with massiveAPIs,”Adv.NeuralInf.Process.Syst.,vol.37,pp.126544– 126565, 2024

  24. [24]

    Asurveyonlargelanguagemodelbasedautonomous agents,

    L.Wangetal.,“Asurveyonlargelanguagemodelbasedautonomous agents,”Front. Comput. Sci., vol. 18, no. 6, p. 186345, 2024

  25. [25]

    The rise and potential of large language model based agents:Asurvey,

    Z. Xiet al., “The rise and potential of large language model based agents:Asurvey,”Sci.ChinaInf.Sci.,vol.68,no.2,p.121101,2025

  26. [26]

    Whensoftwaresecuritymeetslargelanguagemodels: A survey,

    X.Zhuetal.,“Whensoftwaresecuritymeetslargelanguagemodels: A survey,”IEEE/CAA J. Autom. Sinica, vol. 12, no. 2, pp. 317–334, 2025

  27. [27]

    UnveilingLLMmechanismsthroughneural odes and control theory,

    Y.ZhangandQ.Dong,“UnveilingLLMmechanismsthroughneural odes and control theory,”arXiv preprint arXiv:2406.16985, 2024

  28. [28]

    Neural ODE transformers: Analyzing internal dy- namics and adaptive fine-tuning,

    A. Tonget al., “Neural ODE transformers: Analyzing internal dy- namics and adaptive fine-tuning,” inProc. 13th Int. Conf. Learn. Representations (ICLR), 2025

  29. [29]

    Linear feedback control systems for iterative prompt optimization in large language models,

    R. R. Karn, “Linear feedback control systems for iterative prompt optimization in large language models,”arXiv preprint arXiv:2501.11979, 2025

  30. [30]

    Linearly controlled language genera- tion with performative guarantees,

    E. Cheng and C. A. Alonso, “Linearly controlled language genera- tion with performative guarantees,” inProc. NeurIPS Workshop on Foundation Model Interventions, 2024

  31. [31]

    Mamba: Linear-time sequence modeling with selective state spaces,

    A. Gu and T. Dao, “Mamba: Linear-time sequence modeling with selective state spaces,” inProc. 1st Conf. on Language Modeling, 2024

  32. [32]

    Safe RLHF: Safe reinforcement learning from human feedback,

    J. Daiet al., “Safe RLHF: Safe reinforcement learning from human feedback,” inProc. 12th Int. Conf. Learn. Representations (ICLR), 2024

  33. [33]

    PIDformer: Transformer meets control theory,

    T. M. Nguyen, C. A. Uribe, T. M. Nguyen, and R. Baraniuk, “PIDformer: Transformer meets control theory,” inProc. 41st Int. Conf. Machine Learning, 2024

  34. [34]

    and Thomson, M., 2023

    A.Bhargavaetal.,“What’sthemagicword?acontroltheoryofLLM prompting,”arXiv preprint arXiv:2310.04444, 2023

  35. [35]

    Towardsautonomous system: Flexible modular production system enhanced with large language model agents,

    Y.Xia,M.Shenoy,N.Jazdi,andM.Weyrich,“Towardsautonomous system: Flexible modular production system enhanced with large language model agents,” inProc. IEEE 28th Int. Conf. Emerging Technologies and Factory Automation (ETFA), 2023, pp. 1–8

  36. [36]

    AuDeRe: Automated strategy decision and realization in robot planning and control via LLMs,

    Y. Meng, F. Chen, Y. Chen, and C. Fan, “AuDeRe: Automated strategy decision and realization in robot planning and control via LLMs,”arXiv preprint arXiv:2504.03015, 2025

  37. [37]

    LLMs-guidedadaptivecompensator:Bringingadap- tivity to automatic control systems with large language models,

    Z.Zhouetal.,“LLMs-guidedadaptivecompensator:Bringingadap- tivity to automatic control systems with large language models,” arXiv preprint arXiv:2507.20509, 2025

  38. [38]

    Berlin,Germany: Springer, 2015

    D.A.Novikov,Cybernetics:FromPasttoFuture. Berlin,Germany: Springer, 2015. Page 23 of 28

  39. [39]

    Wiener,Cybernetics: Or Control and Communication in the Animal and the Machine

    N. Wiener,Cybernetics: Or Control and Communication in the Animal and the Machine. Paris and Cambridge, MA: Hermann & Cie and MIT Press, 1948

  40. [40]

    A logical calculus of the ideas im- manent in nervous activity,

    W. McCulloch and W. Pitts, “A logical calculus of the ideas im- manent in nervous activity,”Bull. Math. Biol., vol. 5, pp. 115–137, 1943

  41. [41]

    Thehistoryofcyberneticsand artificialintelligence:AviewfromSaintPetersburg,

    A.L.FradkovandA.I.Shepeljavyi,“Thehistoryofcyberneticsand artificialintelligence:AviewfromSaintPetersburg,”Cybern.Phys., vol. 11, pp. 253–263, 2022

  42. [42]

    W. R. Ashby,An Introduction to Cybernetics. London: Chapman & Hall Ltd., 1956

  43. [43]

    A proposal for the Dartmouth summer research project on artificial intelligence, August 31, 1955,

    J. McCarthy, M. L. Minsky, N. Rochester, and C. E. Shannon, “A proposal for the Dartmouth summer research project on artificial intelligence, August 31, 1955,”AI Mag., vol. 27, no. 4, pp. 12–14, 2006

  44. [44]

    The perceptron: A perceiving and recognizing au- tomaton (Project PARA),

    F. Rosenblatt, “The perceptron: A perceiving and recognizing au- tomaton (Project PARA),” Cornell Aeronautical Laboratory, Tech. Rep. 85-460-1, 1957

  45. [45]

    Some studies in machine learning using the game of checkers,

    A. Samuel, “Some studies in machine learning using the game of checkers,”IBM J. Res. Dev., vol. 3, no. 3, pp. 210–229, 1959

  46. [46]

    TheStanfordCartandtheCMUrover,

    H.P.Moravec,“TheStanfordCartandtheCMUrover,”Proc.IEEE, vol. 71, no. 7, pp. 872–884, 1983

  47. [47]

    Stanford, CA, USA: Stanford Univ

    ——,Obstacle Avoidance and Navigation in the Real World by a Seeing Robot Rover. Stanford, CA, USA: Stanford Univ. Press, 1980

  48. [48]

    Evolution of robotic arms,

    M. E. Moran, “Evolution of robotic arms,”J. Robot. Surg., vol. 1, no. 2, pp. 103–111, 2007

  49. [49]

    Steps toward artificial intelligence,

    M. Minsky, “Steps toward artificial intelligence,”Proc. IEEE, vol. 49, no. 1, pp. 8–30, 1961

  50. [50]

    NewYork:McGraw-Hill,1965

    N.J.Nilsson,LearningMachines. NewYork:McGraw-Hill,1965

  51. [51]

    A heuristic approach to reinforcement learning control systems,

    M. D. Waltz and K. S. Fu, “A heuristic approach to reinforcement learning control systems,”IEEE Trans. Autom. Control, vol. 10, pp. 390–398, 1965

  52. [52]

    Applications of artificial intelligence techniques to a spacecraft control problem,

    J. M. Mendel, “Applications of artificial intelligence techniques to a spacecraft control problem,” National Aeronautics and Space Administration, Tech. Rep. NASA CR-755, 1966

  53. [53]

    Learningfromdelayedrewards,

    C.J.C.H.Watkins,“Learningfromdelayedrewards,”Ph.D.disser- tation, Cambridge University, Cambridge, England, 1989

  54. [54]

    Q-learning,

    C.J.C.H.WatkinsandP.Dayan,“Q-learning,”Mach.Learn.,vol.8, pp. 279–292, 1992

  55. [55]

    I. N. Aizenberg, N. N. Aizenberg, and J. P. L. Vandewalle,Multi- Valued and Universal Binary Neurons: Theory, Learning and Ap- plications. Springer Science & Business Media, 2000

  56. [56]

    The history of artificial intelligence,

    T. Mucci, “The history of artificial intelligence,” IBM Think, [Online]. Available: https://www.ibm.com/think/topics/history-of- artificial-intelligence

  57. [57]

    Pre-trained language models and their applications,

    H. Wang, J. Li, H. Wu, E. Hovy, and Y. Sun, “Pre-trained language models and their applications,”Engineering, vol. 25, pp. 51–64, 2022

  58. [58]

    Training language models to follow instructions with human feedback,

    L. Ouyanget al., “Training language models to follow instructions with human feedback,” inProc. Adv. Neural Inf. Process. Syst., vol. 35, 2022, pp. 27730–27744

  59. [59]

    Reinforcement Learning from Human Feedback

    N.Lambert,“Reinforcementlearningfromhumanfeedback,”arXiv preprint arXiv:2504.12501, 2025

  60. [60]

    The crossroads of LLM and traffic control: A study on large language models in adaptive traffic signal control,

    M. Movahedi and J. Choi, “The crossroads of LLM and traffic control: A study on large language models in adaptive traffic signal control,”IEEE Trans. Intell. Transp. Syst., vol. 26, no. 2, pp. 1701– 1716, 2025

  61. [61]

    Largelanguagemodelsintransportation:Acomprehen- sivebibliometricanalysisofemergingtrends,challenges,andfuture research,

    M.Hassan,“Largelanguagemodelsintransportation:Acomprehen- sivebibliometricanalysisofemergingtrends,challenges,andfuture research,”IEEE Access, vol. 13, pp. 132547–132598, 2025

  62. [62]

    Leveraging LLMs and knowledgegraphstodesignsecureautomationsystems,

    A. M. Hosseini, W. Kastner, and T. Sauter, “Leveraging LLMs and knowledgegraphstodesignsecureautomationsystems,”IEEEOpen J. Ind. Electron. Soc., vol. 6, pp. 380–395, 2025

  63. [63]

    Xavier Suau, Pieter Delobelle, Katherine Metcalf, Armand Joulin, Nicholas Apostoloff, Luca Zappella, and Pau Rodr´ıguez

    S. Soatto, P. Tabuada, P. Chaudhari, and T. Y. Liu, “Taming AI bots:Controllabilityofneuralstatesinlargelanguagemodels,”arXiv preprint arXiv:2305.18449, 2023

  64. [64]

    LLM4PLC: Harnessing large language models for verifiable programming of PLCs in industrial control systems,

    M. Fakihet al., “LLM4PLC: Harnessing large language models for verifiable programming of PLCs in industrial control systems,” in Proc. 46th Int. Conf. on Software Engineering: Software Engineer- ing in Practice, 2024, pp. 192–203

  65. [65]

    and Hu, B., 2024

    D. Kevianet al., “Capabilities of large language models in control engineering: A benchmark study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra,”arXiv preprint arXiv:2404.03647, 2024

  66. [66]

    Control industrial automation system with large language model agents,

    Y. Xia, N. Jazdi, J. Zhang, C. Shah, and M. Weyrich, “Control industrial automation system with large language model agents,” in Proc. IEEE 30th Int. Conf. on Emerging Technologies and Factory Automation (ETFA), 2025, pp. 1–8

  67. [67]

    Automated literature research and review-generation method based on large language models,

    S. Wuet al., “Automated literature research and review-generation method based on large language models,”Natl. Sci. Rev., vol. 12, no. 6, p. nwaf169, 2025

  68. [68]

    A survey on largelanguagemodelsforcodegeneration,

    J. Jiang, F. Wang, J. Shen, S. Kim, and S. Kim, “A survey on largelanguagemodelsforcodegeneration,”ACMTrans.Softw.Eng. Methodol., 2025

  69. [69]

    Code Llama: Open Foundation Models for Code

    B. Roziereet al., “Code Llama: Open foundation models for code,” arXiv preprint arXiv:2308.12950, 2023

  70. [70]

    Leveraging LLMs for legacy code modernization: challenges and opportunities for LLM- generated documentation,

    C. Diggset al., “Leveraging LLMs for legacy code modernization: Challenges and opportunities for LLM-generated documentation,” arXiv preprint arXiv:2411.14971, 2025

  71. [71]

    Large language models as data preprocessors,

    H. Zhang, Y. Dong, C. Xiao, and M. Oyamada, “Large language models as data preprocessors,”arXiv preprint arXiv:2308.16361, 2023

  72. [72]

    Data informativity: A new perspective on data-driven analysis and control,

    H. J. V. Waarde, J. Eising, H. L. Trentelman, and M. K. Camlibel, “Data informativity: A new perspective on data-driven analysis and control,”IEEE Trans. Autom. Control, vol. 65, no. 11, pp. 4753– 4768, 2020

  73. [73]

    LLM-agent-controller: A universal multi-agent large language model system as a control engineer,

    R. Zahedifar, S. A. Mirghasemi, M. S. Baghshah, and A. Taheri, “LLM-agent-controller: A universal multi-agent large language model system as a control engineer,”ACM Trans. Softw. Eng. Methodol., 2025

  74. [74]

    Pydantic,

    S. Colvinet al., “Pydantic,”GitHub repository, 2025, [Online]. Available: https://github.com/pydantic/pydantic

  75. [75]

    GPT-researcher,

    A. Elovic, “GPT-researcher,”GitHub repository, 2023, [Online]. Available: https://github.com/assafelovic/gpt-researcher

  76. [76]

    TheimpactofgenerativeAItoolsonresearchers andresearch:Implicationsforacademiainhighereducation,

    A.M.Al-Zahrani,“TheimpactofgenerativeAItoolsonresearchers andresearch:Implicationsforacademiainhighereducation,”Innov. Educ. Teach. Int., vol. 61, no. 5, pp. 1029–1043, 2024

  77. [77]

    An evaluation of general-purpose AI chatbots: a com- prehensive comparative analysis,

    O. Chalyi, “An evaluation of general-purpose AI chatbots: a com- prehensive comparative analysis,”InfoSci. Trends, vol. 1, no. 1, pp. 52–66, 2024

  78. [78]

    Library databases and chatbots,

    E. Lombard, “Library databases and chatbots,”Internet Ref. Serv. Q., vol. 28, no. 4, pp. 463–471, 2024

  79. [79]

    Large language models are zero-shot reasoners,

    T. Kojimaet al., “Large language models are zero-shot reasoners,” Adv. Neural Inf. Process. Syst., vol. 35, pp. 22199–22213, 2022

  80. [80]

    Pre-trainedlargelanguage models for industrial control,

    L.Song,C.Zhang,L.Zhao,andJ.Bian,“Pre-trainedlargelanguage models for industrial control,”arXiv preprint arXiv:2308.03028, 2023

Showing first 80 references.