When control meets large language models: From words to dynamics

Aleksei Tepljakov; Eduard Petlenkov; Juri Belikov; Komeil Nosrati

arxiv: 2602.03433 · v1 · pith:6CDR5A3Znew · submitted 2026-02-03 · 📡 eess.SY · cs.SY

When control meets large language models: From words to dynamics

Komeil Nosrati , Aleksei Tepljakov , Juri Belikov , Eduard Petlenkov This is my paper

Pith reviewed 2026-05-21 14:46 UTC · model grok-4.3

classification 📡 eess.SY cs.SY

keywords large language modelscontrol theoryprompt designsystem dynamicsalignmentinterpretabilitystate-space frameworkbidirectional continuum

0 comments

The pith

Large language models and control theory form a bidirectional continuum from prompt design to system dynamics.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to establish that LLMs and control theory are connected in both directions, so that language models can help build and improve control systems while control ideas can be used to guide and stabilize LLM behavior. A reader would care because this framing could make AI tools more useful in engineering tasks like controller design and make the models themselves more reliable and understandable for real-world use. It maps how LLMs assist control work directly through design help and indirectly through research support. It then shows control methods improving LLMs via input changes, parameter edits, and activation tweaks. Finally it treats the models as state-space systems tied to external loops and flags open challenges for future work.

Core claim

The paper claims that the interconnection between LLMs and control theory is best understood as a bidirectional continuum running from prompt design to full system dynamics. LLMs advance control directly by assisting in controller design and synthesis and indirectly by augmenting research workflows. Control concepts in turn steer LLM trajectories away from undesired outputs, improving reachability and alignment through input optimization, parameter editing, and activation-level interventions. Deeper integration comes from viewing LLMs as dynamic systems in a state-space framework whose internal states connect to external control loops. The goal is to develop LLMs that are as interpretable, (

What carries the argument

The bidirectional continuum linking prompt design to system dynamics, in which prompts aid control synthesis while control methods optimize LLM inputs, parameters, and activations.

Load-bearing premise

That control-theoretic interventions such as input optimization and activation changes can steer LLM behavior without degrading core language performance or creating new instabilities.

What would settle it

A controlled test in which applying input optimization or activation interventions to a standard LLM produces no measurable gain in alignment metrics or causes a clear drop in language task accuracy would falsify the claimed benefits.

Figures

Figures reproduced from arXiv: 2602.03433 by Aleksei Tepljakov, Eduard Petlenkov, Juri Belikov, Komeil Nosrati.

**Figure 1.** Figure 1: Capabilities of LLMS. paradigm based on next-token prediction [8]. The latter combines both approaches, unifying NLP tasks under a text-to-text framework and enabling flexible sequence-tosequence modeling for both comprehension and generation [9]. Building upon these foundations, the GPT family developed by OpenAI pioneered large-scale self-supervised pre-training [10], enabling strong zero- and few-shot… view at source ↗

**Figure 2.** Figure 2: Conceptual map of three key intersections between control and LLMs. • What are challenges and future trends at this intersection? By addressing these questions, we aim to provide a structured understanding of this rapidly developing field and motivate future work at the convergence of dynamical modeling, control theory, and LLMs. While we have already examined why this intersection matters and will conti… view at source ↗

**Figure 3.** Figure 3: Stanford Cart on an obstacle course with a young H. Moravec (1977) [PITH_FULL_IMAGE:figures/full_fig_p003_3.png] view at source ↗

**Figure 4.** Figure 4: When control meets LLMs: A timeline. feedback-driven approaches (see [PITH_FULL_IMAGE:figures/full_fig_p003_4.png] view at source ↗

**Figure 5.** Figure 5: The early RLHF pipeline. had unified insights from psychology, optimal control, and temporal-difference learning, establishing RL as a distinct discipline. Conceptually, RL is grounded in control-theoretic frameworks such as Markov decision processes (MDPs), paralleling optimal control, where an agent seeks to minimize cumulative cost or maximize reward. A key milestone was Q-learning (Watkins, 1989 [53];… view at source ↗

**Figure 6.** Figure 6: LLM-assisted controller tuning (out-of-the-loop and in-the-loop) [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗

**Figure 7.** Figure 7: Step response of the plant in Listing 4 under the PID controller with gains obtained via offline LLM-based tuning. proposed analytically and then validated through simulation. Pre-trained LLMs are effective in this setting: given highlevel performance requirements (e.g., rise time 𝑇𝑟 , settling time 𝑇𝑠 , overshoot limits 𝑀𝑝 , etc.), they can quickly generate multiple candidate gain sets, reducing manual t… view at source ↗

**Figure 8.** Figure 8: , the framework systematically selects, integrates, and iteratively refines strategies by analyzing task objectives, environmental constraints, and system dynamics, retrieving relevant API definitions, input/output specifications, and integration requirements to construct executable plans. For example, in a robot navigation task, AuDeRe identifies suitable algorithms, configures parameters, and refines so… view at source ↗

**Figure 10.** Figure 10: An end-to-end LLM-based control architecture composed of two layers: functional agents and a code agent. range of application domains, including building energy management systems (EMS) [141], power systems [142, 143, 144, 145], robotics [146, 147], transportation [148], industrial automation [66], biology [149], cybersecurity [150], aerospace [151, 152], marine systems [153], and other emerging areas [1… view at source ↗

**Figure 9.** Figure 9: LLM-based workflow for generating and validating control invariants to derive new attack patterns. MPC and adaptive control laws. Through sequential collaboration and continuous refinement, the framework achieves a high degree of autonomy in controller design, ensuring stable closed-loop performance while detecting and correcting degradation during deployment. Empirical results indicate the LLM-driven ag… view at source ↗

**Figure 11.** Figure 11: Control theoretic perspective of LLMs. that 𝑓𝜃𝑒 (𝑥𝑒 ) = 𝑦𝑒 while maintaining its behavior on unrelated inputs: 𝑓𝜃𝑒 (𝑥) = { 𝑦𝑒 , 𝑖𝑓 𝑥 ∈ 𝑖(𝑥𝑒 , 𝑦𝑒 ) 𝑓𝜃 (𝑥), 𝑖𝑓 𝑥 ∈ 𝑜(𝑥𝑒 , 𝑦𝑒 ), where 𝑖(𝑥𝑒 , 𝑦𝑒 ) denotes the in-scope region, typically including 𝑥𝑒 and semantically related inputs 𝑛(𝑥𝑒 , 𝑦𝑒 ), and 𝑜(𝑥𝑒 , 𝑦𝑒 ) denotes the out-of-scope region of unrelated inputs. A successful model edit is typically evaluated a… view at source ↗

**Figure 12.** Figure 12: compares the computation patterns of RNNs, Transformers, and SSMs. RNNs rely on nonlinear recurrent updates, enabling fast autoregressive outputs but with limited parallelism and slower training. Transformers compute large matrix multiplications across query–key pairs in parallel, allowing efficient training but slow autoregressive inference. Discrete SSMs can operate in either recurrent or convolutiona… view at source ↗

**Figure 13.** Figure 13: Overview of S-SSM with hardware-aware state expansions. dimensions. The S-SSM is then discretized as 𝐴̄ → 𝑆 𝐴̄ = exp(𝑆 Δ𝐴), 𝐵̄ → 𝑆 𝐵̄ = (𝑆Δ𝐴) −1( exp(𝑆 Δ𝐴) − 𝐼 ) 𝑆 Δ𝑆 𝐵 , where 𝑆 𝐴̄ ∈ ℝ𝑀×𝐿×𝐷×𝑁 and 𝑆 𝐵̄ ∈ ℝ𝑀×𝐿×𝐷×𝑁 are the selective state-transition and input matrices, now explicit functions of the input 𝑥. Consequently, the discrete SSM becomes a linear time-varying (LTV) (i.e., content-aware) system 𝑦 = S… view at source ↗

**Figure 14.** Figure 14: Comparison of Mamba and Mamba-2 architectures. controllability explicit, simplifying controller design, state feedback, and pole-zero placement. For an LTI system with transfer function 𝐻(𝑠) = 𝑏𝑛−1𝑠 𝑛−1 + 𝑏𝑛−2𝑠 𝑛−2 + ⋯ + 𝑏1 𝑠 + 𝑏0 𝑠 𝑛 + 𝑎𝑛−1𝑠 𝑛−1 + ⋯ + 𝑎1 𝑠 + 𝑎0 , the state matrix 𝐴 in the CCF is given as 𝐴𝑐 = ⎡ ⎢ ⎢ ⎢ ⎢ ⎣ 0 1 0 ⋯ 0 0 0 0 1 ⋯ 0 0 ⋮ ⋮ ⋱ ⋮ ⋮ 0 0 0 ⋯ 0 1 −𝑎𝑛−1 −𝑎𝑛−2 −𝑎𝑛−3 ⋯ −𝑎1 −𝑎0 ⎤ ⎥ ⎥ ⎥ ⎥ … view at source ↗

**Figure 15.** Figure 15: The largest risks faced by the world. 4.1. Challenges 4.1.1. LLM for Control (Indirect) LLMs offer substantial potential to enhance control research workflows by supporting literature synthesis, code scaffolding, data preprocessing, and structured report generation. However, current uses are mostly conceptual and illustrative, with little empirical evidence of their impact on research quality. Several m… view at source ↗

**Figure 16.** Figure 16: Challenges and research opportunities at LLM–control interface. with stability and controllability guarantees, along with real-time deployment studies, will be key for safe and practical operation [PITH_FULL_IMAGE:figures/full_fig_p022_16.png] view at source ↗

read the original abstract

While large language models (LLMs) are transforming engineering and technology through enhanced control capabilities and decision support, they are simultaneously evolving into complex dynamical systems whose behavior must be regulated. This duality highlights a reciprocal connection in which prompts support control system design while control theory helps shape prompts to achieve specific goals efficiently. In this study, we frame this emerging interconnection of LLM and control as a bidirectional continuum, from prompt design to system dynamics. First, we investigate how LLMs can advance the field of control in two distinct capacities: directly, by assisting in the design and synthesis of controllers, and indirectly, by augmenting research workflows. Second, we examine how control concepts help LLMs steer their trajectories away from undesired meanings, improving reachability and alignment via input optimization, parameter editing, and activation-level interventions. Third, we look into deeper integrations by treating LLMs as dynamic systems within a state-space framework, where their internal representations are closely linked to external control loops. Finally, we identify key challenges and outline future research directions to understand LLM behavior and develop interpretable and controllable LLMs that are as trustworthy and robust as their electromechanical counterparts, thereby ensuring they continue to support and safeguard society.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This is a perspective piece that sketches a bidirectional framing between control theory and LLMs but adds no new methods, derivations, or evidence.

read the letter

Colleague, the main point here is that the paper frames LLMs and control as a two-way street—prompts and LLMs aiding controller design on one side, control concepts like input optimization and activation interventions guiding LLM outputs on the other—while also floating a state-space view of LLM internals. It stays at the level of an outline and does not claim or deliver any original technical result. What it does reasonably well is collect scattered ideas from both communities into one document and point to places where they might meet, such as using LLMs for indirect workflow help or treating reachability and alignment as control problems. The structure is clear and the literature pointers are relevant enough to give a newcomer a starting map. The soft spots are straightforward and central. Everything rests on untested assumptions about transfer: that control-style interventions on prompts, parameters, or activations will steer LLM trajectories without harming core capabilities or creating new instabilities. No concrete example, small case study, or even a toy calculation is given to show how this would work in practice. The challenges section lists issues like interpretability and robustness but offers no prioritized next steps or partial solutions. Because the manuscript is explicitly a perspective rather than a research report, these gaps are not fatal, but they do limit how much a reader can take away beyond the high-level suggestion. This kind of paper is mainly useful for people already working near the AI-systems boundary who want a quick synthesis and a list of open questions. Someone looking for a method to implement or a result to cite will find little to use directly. The thinking is coherent on its own terms and engages honestly with the two literatures, so it clears the bar for a serious referee. I would send it out for review at a venue that accepts perspective or position pieces, with the expectation that reviewers would ask for more illustrative examples or a tighter focus on one direction of the proposed continuum.

Referee Report

2 major / 2 minor

Summary. The manuscript is a perspective paper that frames the relationship between large language models (LLMs) and control theory as a bidirectional continuum, ranging from prompt design to dynamical system modeling. It argues that LLMs can advance control engineering both directly (via controller design assistance) and indirectly (via workflow augmentation), while control-theoretic tools can improve LLM reachability, alignment, and interpretability through input optimization, parameter editing, and activation interventions. The paper further proposes treating LLMs as state-space dynamical systems and concludes by outlining challenges and future directions for developing trustworthy, controllable LLMs.

Significance. If the proposed conceptual connections are pursued with concrete formalizations and experiments, the work could help bridge the control systems and machine learning communities, potentially yielding more interpretable and robust LLM-based systems. The perspective is timely given the growing use of LLMs in engineering applications, but its value rests on stimulating follow-on technical research rather than on any new results presented here.

major comments (2)

The central framing in the abstract and the section examining control concepts for LLMs asserts that interventions such as parameter editing and activation-level changes can steer LLM trajectories to improve alignment without degrading core language capabilities; however, this assumption is presented without any supporting derivation, reference to existing stability analyses, or discussion of potential instabilities, which is load-bearing for the claim of enhanced reachability and interpretability.
In the discussion of LLMs as dynamic systems within a state-space framework, the manuscript links internal representations to external control loops but provides no explicit state-space equations, observability/controllability conditions, or example mappings from token sequences to state vectors; this absence weakens the proposed deeper integration and leaves the dynamical-systems analogy at a high level.

minor comments (2)

The abstract and introduction would benefit from a clearer delineation of which parts are literature synthesis versus original framing, to help readers distinguish the paper's contributions from prior work on LLM prompting and alignment.
Several terms (e.g., 'reachability' and 'activation-level interventions') are used without initial definitions or references to standard control or LLM literature, which could reduce accessibility for readers from either community.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback and recommendation for minor revision. We address each major comment below with specific plans for strengthening the manuscript while preserving its perspective nature.

read point-by-point responses

Referee: The central framing in the abstract and the section examining control concepts for LLMs asserts that interventions such as parameter editing and activation-level changes can steer LLM trajectories to improve alignment without degrading core language capabilities; however, this assumption is presented without any supporting derivation, reference to existing stability analyses, or discussion of potential instabilities, which is load-bearing for the claim of enhanced reachability and interpretability.

Authors: We thank the referee for identifying this gap in grounding. As a perspective paper, the manuscript focuses on outlining bidirectional connections rather than new derivations. We agree that additional context is warranted. In the revision, we will add references to existing literature on activation steering (e.g., works on representation engineering and steering vectors) and stability analyses of fine-tuned or edited LLMs. We will also include a concise discussion of potential instabilities, such as unintended capability degradation or trajectory divergence, and note how control-theoretic regularization could help mitigate them. These changes will appear in the section on control concepts for LLMs. revision: yes
Referee: In the discussion of LLMs as dynamic systems within a state-space framework, the manuscript links internal representations to external control loops but provides no explicit state-space equations, observability/controllability conditions, or example mappings from token sequences to state vectors; this absence weakens the proposed deeper integration and leaves the dynamical-systems analogy at a high level.

Authors: We appreciate this suggestion for greater concreteness. The current treatment is intentionally high-level to emphasize the conceptual framework and stimulate future work. To address the comment, the revised manuscript will include an illustrative example: a simplified state-space mapping where token embeddings serve as inputs, hidden-layer activations as states, and next-token predictions as outputs, with a brief discussion of how prompt-based inputs could relate to controllability. We will explicitly state that full observability and controllability conditions remain open research questions. This addition will be placed in the state-space framework section. revision: partial

Circularity Check

0 steps flagged

No significant circularity

full rationale

The manuscript is a perspective paper that outlines conceptual connections between control theory and LLMs as a bidirectional continuum from prompt design to system dynamics. It advances no new theorems, equations, derivations, or empirical results. No load-bearing steps reduce by construction to self-definitions, fitted inputs renamed as predictions, or self-citation chains. The framing is presented explicitly as a perspective device rather than a proven equivalence, rendering the analysis self-contained with no circular reductions.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claims rest on the domain assumption that LLMs possess internal dynamics amenable to control interventions, without independent evidence or formalization supplied in the abstract.

axioms (1)

domain assumption Control theory concepts such as state-space representations and input optimization can be directly transferred to LLM internal states and output trajectories
Invoked when examining how control helps steer LLMs and when treating LLMs as dynamic systems in a state-space framework.

pith-pipeline@v0.9.0 · 5751 in / 1179 out tokens · 72620 ms · 2026-05-21T14:46:49.730149+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

treating LLMs as dynamic systems within a state-space framework, where their internal representations are closely linked to external control loops
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

control concepts help LLMs steer their trajectories away from undesired meanings, improving reachability and alignment via input optimization, parameter editing, and activation-level interventions

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

263 extracted references · 263 canonical work pages · 19 internal anchors

[1]

A survey on large language model benchmarks,

S. Niet al., “A survey on large language model benchmarks,”arXiv preprint arXiv:2508.15361, 2025

work page arXiv 2025
[2]

Attention is all you need,

A. Vaswaniet al., “Attention is all you need,”Adv. Neural Inf. Process. Syst., vol. 30, 2017

work page 2017
[3]

Large Language Models: A Survey

S.Minaeeetal.,“Largelanguagemodels:Asurvey,”arXivpreprint arXiv:2402.06196, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[4]

Chain-of-thought prompting elicits reasoning in large language models,

J. Weiet al., “Chain-of-thought prompting elicits reasoning in large language models,”Adv. Neural Inf. Process. Syst., vol. 35, pp. 24824–24837, 2022

work page 2022
[5]

Augmented language models: A survey,

G. Mialonet al., “Augmented language models: A survey,”Trans. Mach. Learn. Res., 2023

work page 2023
[6]

React: Synergizing reasoning and acting in language models,

S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. R. Narasimhan, and Y. Cao, “React: Synergizing reasoning and acting in language models,” inProc. 11th Int. Conf. Learn. Representations (ICLR), 2023

work page 2023
[7]

BERT: Pre- trainingofdeepbidirectionaltransformersforlanguageunderstand- ing,

J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, “BERT: Pre- trainingofdeepbidirectionaltransformersforlanguageunderstand- ing,” inProc. 2019 Conf. North American Chapter of the Associa- tionforComputationalLinguistics:HumanLanguageTechnologies, 2019, pp. 4171–4186

work page 2019
[8]

Im- proving language understanding by generative pre-training,

A. Radford, K. Narasimhan, T. Salimans, and I. Sutskever, “Im- proving language understanding by generative pre-training,”Online manuscript, 2018

work page 2018
[9]

Exploring the limits of transfer learning with a unifiedtext-to-texttransformer,

C. Raffelet al., “Exploring the limits of transfer learning with a unifiedtext-to-texttransformer,”J.Mach.Learn.Res.,vol.21,no.1, pp. 5485–5551, 2020

work page 2020
[10]

A brief overview of ChatGPT: The history, status quoandpotentialfuturedevelopment,

T. Wuet al., “A brief overview of ChatGPT: The history, status quoandpotentialfuturedevelopment,”IEEE/CAAJ.Autom.Sinica, vol. 10, no. 5, pp. 1122–1136, 2023

work page 2023
[11]

LLaMA: Open and Efficient Foundation Language Models

H.Touvronetal.,“LLAMA:Openandefficientfoundationlanguage models,”arXiv preprint arXiv:2302.13971, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[12]

PaLM: Scaling language modeling with pathways,

A. Chowdheryet al., “PaLM: Scaling language modeling with pathways,”J. Mach. Learn. Res., vol. 24, no. 240, pp. 1–113, 2023

work page 2023
[13]

Gemini: A Family of Highly Capable Multimodal Models

R. Anilet al., “Gemini: A family of highly capable multimodal models,”arXiv preprint arXiv:2312.11805, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[14]

DeepSeek:Paradigmshiftsandtechnicalevolution in large AI models,

L.Xiongetal.,“DeepSeek:Paradigmshiftsandtechnicalevolution in large AI models,”IEEE/CAA J. Autom. Sinica, vol. 12, no. 5, pp. 841–858, 2025

work page 2025
[15]

Exploring DeepSeek: A survey on advances, ap- plications, challenges and future directions,

Z. Denget al., “Exploring DeepSeek: A survey on advances, ap- plications, challenges and future directions,”IEEE/CAA J. Autom. Sinica, vol. 12, no. 5, pp. 872–893, 2025

work page 2025
[16]

LaMDA: Language Models for Dialog Applications

R. Thoppilanet al., “LaMDA: Language models for dialog applica- tions,”arXiv preprint arXiv:2201.08239, 2022

work page internal anchor Pith review Pith/arXiv arXiv 2022
[17]

Large Language Model Agent: A Survey on Methodology, Applications and Challenges

J. Luoet al., “Large language model agent: A survey on methodol- ogy,applicationsandchallenges,”arXivpreprintarXiv:2503.21460, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025
[18]

Large language models are human-level prompt engineers,

Y. Zhouet al., “Large language models are human-level prompt engineers,” inProc. 11th Int. Conf. Learn. Representations (ICLR), 2023

work page 2023
[19]

Retrieval-Augmented Generation for Large Language Models: A Survey

Y. Gaoet al., “Retrieval-augmented generation for large language models: A survey,”arXiv preprint arXiv:2312.10997, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[20]

Hugging- GPT: Solving AI tasks with ChatGPT and its friends in Hugging Face,

Y. Shen, K. Song, X. Tan, D. Li, W. Lu, and Y. Zhuang, “Hugging- GPT: Solving AI tasks with ChatGPT and its friends in Hugging Face,”Adv. Neural Inf. Process. Syst., vol. 36, pp. 38154–38180, 2023

work page 2023
[21]

ART: Automatic multi-step reasoning and tool-use for large language models

B.Paranjapeetal.,“ART:Automaticmulti-stepreasoningandtool- use for large language models,”arXiv preprint arXiv:2303.09014, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[22]

Toolformer:Languagemodelscanteachthemselves to use tools,

T.Schicketal.,“Toolformer:Languagemodelscanteachthemselves to use tools,”Adv. Neural Inf. Process. Syst., vol. 36, pp. 68539– 68551, 2023

work page 2023
[23]

Gorilla: Large language model connected with massiveAPIs,

S. G. Patilet al., “Gorilla: Large language model connected with massiveAPIs,”Adv.NeuralInf.Process.Syst.,vol.37,pp.126544– 126565, 2024

work page 2024
[24]

Asurveyonlargelanguagemodelbasedautonomous agents,

L.Wangetal.,“Asurveyonlargelanguagemodelbasedautonomous agents,”Front. Comput. Sci., vol. 18, no. 6, p. 186345, 2024

work page 2024
[25]

The rise and potential of large language model based agents:Asurvey,

Z. Xiet al., “The rise and potential of large language model based agents:Asurvey,”Sci.ChinaInf.Sci.,vol.68,no.2,p.121101,2025

work page 2025
[26]

Whensoftwaresecuritymeetslargelanguagemodels: A survey,

X.Zhuetal.,“Whensoftwaresecuritymeetslargelanguagemodels: A survey,”IEEE/CAA J. Autom. Sinica, vol. 12, no. 2, pp. 317–334, 2025

work page 2025
[27]

UnveilingLLMmechanismsthroughneural odes and control theory,

Y.ZhangandQ.Dong,“UnveilingLLMmechanismsthroughneural odes and control theory,”arXiv preprint arXiv:2406.16985, 2024

work page arXiv 2024
[28]

Neural ODE transformers: Analyzing internal dy- namics and adaptive fine-tuning,

A. Tonget al., “Neural ODE transformers: Analyzing internal dy- namics and adaptive fine-tuning,” inProc. 13th Int. Conf. Learn. Representations (ICLR), 2025

work page 2025
[29]

Linear feedback control systems for iterative prompt optimization in large language models,

R. R. Karn, “Linear feedback control systems for iterative prompt optimization in large language models,”arXiv preprint arXiv:2501.11979, 2025

work page arXiv 2025
[30]

Linearly controlled language genera- tion with performative guarantees,

E. Cheng and C. A. Alonso, “Linearly controlled language genera- tion with performative guarantees,” inProc. NeurIPS Workshop on Foundation Model Interventions, 2024

work page 2024
[31]

Mamba: Linear-time sequence modeling with selective state spaces,

A. Gu and T. Dao, “Mamba: Linear-time sequence modeling with selective state spaces,” inProc. 1st Conf. on Language Modeling, 2024

work page 2024
[32]

Safe RLHF: Safe reinforcement learning from human feedback,

J. Daiet al., “Safe RLHF: Safe reinforcement learning from human feedback,” inProc. 12th Int. Conf. Learn. Representations (ICLR), 2024

work page 2024
[33]

PIDformer: Transformer meets control theory,

T. M. Nguyen, C. A. Uribe, T. M. Nguyen, and R. Baraniuk, “PIDformer: Transformer meets control theory,” inProc. 41st Int. Conf. Machine Learning, 2024

work page 2024
[34]

and Thomson, M., 2023

A.Bhargavaetal.,“What’sthemagicword?acontroltheoryofLLM prompting,”arXiv preprint arXiv:2310.04444, 2023

work page arXiv 2023
[35]

Towardsautonomous system: Flexible modular production system enhanced with large language model agents,

Y.Xia,M.Shenoy,N.Jazdi,andM.Weyrich,“Towardsautonomous system: Flexible modular production system enhanced with large language model agents,” inProc. IEEE 28th Int. Conf. Emerging Technologies and Factory Automation (ETFA), 2023, pp. 1–8

work page 2023
[36]

AuDeRe: Automated strategy decision and realization in robot planning and control via LLMs,

Y. Meng, F. Chen, Y. Chen, and C. Fan, “AuDeRe: Automated strategy decision and realization in robot planning and control via LLMs,”arXiv preprint arXiv:2504.03015, 2025

work page arXiv 2025
[37]

LLMs-guidedadaptivecompensator:Bringingadap- tivity to automatic control systems with large language models,

Z.Zhouetal.,“LLMs-guidedadaptivecompensator:Bringingadap- tivity to automatic control systems with large language models,” arXiv preprint arXiv:2507.20509, 2025

work page arXiv 2025
[38]

Berlin,Germany: Springer, 2015

D.A.Novikov,Cybernetics:FromPasttoFuture. Berlin,Germany: Springer, 2015. Page 23 of 28

work page 2015
[39]

Wiener,Cybernetics: Or Control and Communication in the Animal and the Machine

N. Wiener,Cybernetics: Or Control and Communication in the Animal and the Machine. Paris and Cambridge, MA: Hermann & Cie and MIT Press, 1948

work page 1948
[40]

A logical calculus of the ideas im- manent in nervous activity,

W. McCulloch and W. Pitts, “A logical calculus of the ideas im- manent in nervous activity,”Bull. Math. Biol., vol. 5, pp. 115–137, 1943

work page 1943
[41]

Thehistoryofcyberneticsand artificialintelligence:AviewfromSaintPetersburg,

A.L.FradkovandA.I.Shepeljavyi,“Thehistoryofcyberneticsand artificialintelligence:AviewfromSaintPetersburg,”Cybern.Phys., vol. 11, pp. 253–263, 2022

work page 2022
[42]

W. R. Ashby,An Introduction to Cybernetics. London: Chapman & Hall Ltd., 1956

work page 1956
[43]

A proposal for the Dartmouth summer research project on artificial intelligence, August 31, 1955,

J. McCarthy, M. L. Minsky, N. Rochester, and C. E. Shannon, “A proposal for the Dartmouth summer research project on artificial intelligence, August 31, 1955,”AI Mag., vol. 27, no. 4, pp. 12–14, 2006

work page 1955
[44]

The perceptron: A perceiving and recognizing au- tomaton (Project PARA),

F. Rosenblatt, “The perceptron: A perceiving and recognizing au- tomaton (Project PARA),” Cornell Aeronautical Laboratory, Tech. Rep. 85-460-1, 1957

work page 1957
[45]

Some studies in machine learning using the game of checkers,

A. Samuel, “Some studies in machine learning using the game of checkers,”IBM J. Res. Dev., vol. 3, no. 3, pp. 210–229, 1959

work page 1959
[46]

TheStanfordCartandtheCMUrover,

H.P.Moravec,“TheStanfordCartandtheCMUrover,”Proc.IEEE, vol. 71, no. 7, pp. 872–884, 1983

work page 1983
[47]

Stanford, CA, USA: Stanford Univ

——,Obstacle Avoidance and Navigation in the Real World by a Seeing Robot Rover. Stanford, CA, USA: Stanford Univ. Press, 1980

work page 1980
[48]

Evolution of robotic arms,

M. E. Moran, “Evolution of robotic arms,”J. Robot. Surg., vol. 1, no. 2, pp. 103–111, 2007

work page 2007
[49]

Steps toward artificial intelligence,

M. Minsky, “Steps toward artificial intelligence,”Proc. IEEE, vol. 49, no. 1, pp. 8–30, 1961

work page 1961
[50]

NewYork:McGraw-Hill,1965

N.J.Nilsson,LearningMachines. NewYork:McGraw-Hill,1965

work page 1965
[51]

A heuristic approach to reinforcement learning control systems,

M. D. Waltz and K. S. Fu, “A heuristic approach to reinforcement learning control systems,”IEEE Trans. Autom. Control, vol. 10, pp. 390–398, 1965

work page 1965
[52]

Applications of artificial intelligence techniques to a spacecraft control problem,

J. M. Mendel, “Applications of artificial intelligence techniques to a spacecraft control problem,” National Aeronautics and Space Administration, Tech. Rep. NASA CR-755, 1966

work page 1966
[53]

Learningfromdelayedrewards,

C.J.C.H.Watkins,“Learningfromdelayedrewards,”Ph.D.disser- tation, Cambridge University, Cambridge, England, 1989

work page 1989
[54]

Q-learning,

C.J.C.H.WatkinsandP.Dayan,“Q-learning,”Mach.Learn.,vol.8, pp. 279–292, 1992

work page 1992
[55]

I. N. Aizenberg, N. N. Aizenberg, and J. P. L. Vandewalle,Multi- Valued and Universal Binary Neurons: Theory, Learning and Ap- plications. Springer Science & Business Media, 2000

work page 2000
[56]

The history of artificial intelligence,

T. Mucci, “The history of artificial intelligence,” IBM Think, [Online]. Available: https://www.ibm.com/think/topics/history-of- artificial-intelligence

work page
[57]

Pre-trained language models and their applications,

H. Wang, J. Li, H. Wu, E. Hovy, and Y. Sun, “Pre-trained language models and their applications,”Engineering, vol. 25, pp. 51–64, 2022

work page 2022
[58]

Training language models to follow instructions with human feedback,

L. Ouyanget al., “Training language models to follow instructions with human feedback,” inProc. Adv. Neural Inf. Process. Syst., vol. 35, 2022, pp. 27730–27744

work page 2022
[59]

Reinforcement Learning from Human Feedback

N.Lambert,“Reinforcementlearningfromhumanfeedback,”arXiv preprint arXiv:2504.12501, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025
[60]

The crossroads of LLM and traffic control: A study on large language models in adaptive traffic signal control,

M. Movahedi and J. Choi, “The crossroads of LLM and traffic control: A study on large language models in adaptive traffic signal control,”IEEE Trans. Intell. Transp. Syst., vol. 26, no. 2, pp. 1701– 1716, 2025

work page 2025
[61]

Largelanguagemodelsintransportation:Acomprehen- sivebibliometricanalysisofemergingtrends,challenges,andfuture research,

M.Hassan,“Largelanguagemodelsintransportation:Acomprehen- sivebibliometricanalysisofemergingtrends,challenges,andfuture research,”IEEE Access, vol. 13, pp. 132547–132598, 2025

work page 2025
[62]

Leveraging LLMs and knowledgegraphstodesignsecureautomationsystems,

A. M. Hosseini, W. Kastner, and T. Sauter, “Leveraging LLMs and knowledgegraphstodesignsecureautomationsystems,”IEEEOpen J. Ind. Electron. Soc., vol. 6, pp. 380–395, 2025

work page 2025
[63]

Xavier Suau, Pieter Delobelle, Katherine Metcalf, Armand Joulin, Nicholas Apostoloff, Luca Zappella, and Pau Rodr´ıguez

S. Soatto, P. Tabuada, P. Chaudhari, and T. Y. Liu, “Taming AI bots:Controllabilityofneuralstatesinlargelanguagemodels,”arXiv preprint arXiv:2305.18449, 2023

work page arXiv 2023
[64]

LLM4PLC: Harnessing large language models for verifiable programming of PLCs in industrial control systems,

M. Fakihet al., “LLM4PLC: Harnessing large language models for verifiable programming of PLCs in industrial control systems,” in Proc. 46th Int. Conf. on Software Engineering: Software Engineer- ing in Practice, 2024, pp. 192–203

work page 2024
[65]

and Hu, B., 2024

D. Kevianet al., “Capabilities of large language models in control engineering: A benchmark study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra,”arXiv preprint arXiv:2404.03647, 2024

work page arXiv 2024
[66]

Control industrial automation system with large language model agents,

Y. Xia, N. Jazdi, J. Zhang, C. Shah, and M. Weyrich, “Control industrial automation system with large language model agents,” in Proc. IEEE 30th Int. Conf. on Emerging Technologies and Factory Automation (ETFA), 2025, pp. 1–8

work page 2025
[67]

Automated literature research and review-generation method based on large language models,

S. Wuet al., “Automated literature research and review-generation method based on large language models,”Natl. Sci. Rev., vol. 12, no. 6, p. nwaf169, 2025

work page 2025
[68]

A survey on largelanguagemodelsforcodegeneration,

J. Jiang, F. Wang, J. Shen, S. Kim, and S. Kim, “A survey on largelanguagemodelsforcodegeneration,”ACMTrans.Softw.Eng. Methodol., 2025

work page 2025
[69]

Code Llama: Open Foundation Models for Code

B. Roziereet al., “Code Llama: Open foundation models for code,” arXiv preprint arXiv:2308.12950, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[70]

Leveraging LLMs for legacy code modernization: challenges and opportunities for LLM- generated documentation,

C. Diggset al., “Leveraging LLMs for legacy code modernization: Challenges and opportunities for LLM-generated documentation,” arXiv preprint arXiv:2411.14971, 2025

work page arXiv 2025
[71]

Large language models as data preprocessors,

H. Zhang, Y. Dong, C. Xiao, and M. Oyamada, “Large language models as data preprocessors,”arXiv preprint arXiv:2308.16361, 2023

work page arXiv 2023
[72]

Data informativity: A new perspective on data-driven analysis and control,

H. J. V. Waarde, J. Eising, H. L. Trentelman, and M. K. Camlibel, “Data informativity: A new perspective on data-driven analysis and control,”IEEE Trans. Autom. Control, vol. 65, no. 11, pp. 4753– 4768, 2020

work page 2020
[73]

LLM-agent-controller: A universal multi-agent large language model system as a control engineer,

R. Zahedifar, S. A. Mirghasemi, M. S. Baghshah, and A. Taheri, “LLM-agent-controller: A universal multi-agent large language model system as a control engineer,”ACM Trans. Softw. Eng. Methodol., 2025

work page 2025
[74]

Pydantic,

S. Colvinet al., “Pydantic,”GitHub repository, 2025, [Online]. Available: https://github.com/pydantic/pydantic

work page 2025
[75]

GPT-researcher,

A. Elovic, “GPT-researcher,”GitHub repository, 2023, [Online]. Available: https://github.com/assafelovic/gpt-researcher

work page 2023
[76]

TheimpactofgenerativeAItoolsonresearchers andresearch:Implicationsforacademiainhighereducation,

A.M.Al-Zahrani,“TheimpactofgenerativeAItoolsonresearchers andresearch:Implicationsforacademiainhighereducation,”Innov. Educ. Teach. Int., vol. 61, no. 5, pp. 1029–1043, 2024

work page 2024
[77]

An evaluation of general-purpose AI chatbots: a com- prehensive comparative analysis,

O. Chalyi, “An evaluation of general-purpose AI chatbots: a com- prehensive comparative analysis,”InfoSci. Trends, vol. 1, no. 1, pp. 52–66, 2024

work page 2024
[78]

Library databases and chatbots,

E. Lombard, “Library databases and chatbots,”Internet Ref. Serv. Q., vol. 28, no. 4, pp. 463–471, 2024

work page 2024
[79]

Large language models are zero-shot reasoners,

T. Kojimaet al., “Large language models are zero-shot reasoners,” Adv. Neural Inf. Process. Syst., vol. 35, pp. 22199–22213, 2022

work page 2022
[80]

Pre-trainedlargelanguage models for industrial control,

L.Song,C.Zhang,L.Zhao,andJ.Bian,“Pre-trainedlargelanguage models for industrial control,”arXiv preprint arXiv:2308.03028, 2023

work page arXiv 2023

Showing first 80 references.

[1] [1]

A survey on large language model benchmarks,

S. Niet al., “A survey on large language model benchmarks,”arXiv preprint arXiv:2508.15361, 2025

work page arXiv 2025

[2] [2]

Attention is all you need,

A. Vaswaniet al., “Attention is all you need,”Adv. Neural Inf. Process. Syst., vol. 30, 2017

work page 2017

[3] [3]

Large Language Models: A Survey

S.Minaeeetal.,“Largelanguagemodels:Asurvey,”arXivpreprint arXiv:2402.06196, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[4] [4]

Chain-of-thought prompting elicits reasoning in large language models,

J. Weiet al., “Chain-of-thought prompting elicits reasoning in large language models,”Adv. Neural Inf. Process. Syst., vol. 35, pp. 24824–24837, 2022

work page 2022

[5] [5]

Augmented language models: A survey,

G. Mialonet al., “Augmented language models: A survey,”Trans. Mach. Learn. Res., 2023

work page 2023

[6] [6]

React: Synergizing reasoning and acting in language models,

S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. R. Narasimhan, and Y. Cao, “React: Synergizing reasoning and acting in language models,” inProc. 11th Int. Conf. Learn. Representations (ICLR), 2023

work page 2023

[7] [7]

BERT: Pre- trainingofdeepbidirectionaltransformersforlanguageunderstand- ing,

J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, “BERT: Pre- trainingofdeepbidirectionaltransformersforlanguageunderstand- ing,” inProc. 2019 Conf. North American Chapter of the Associa- tionforComputationalLinguistics:HumanLanguageTechnologies, 2019, pp. 4171–4186

work page 2019

[8] [8]

Im- proving language understanding by generative pre-training,

A. Radford, K. Narasimhan, T. Salimans, and I. Sutskever, “Im- proving language understanding by generative pre-training,”Online manuscript, 2018

work page 2018

[9] [9]

Exploring the limits of transfer learning with a unifiedtext-to-texttransformer,

C. Raffelet al., “Exploring the limits of transfer learning with a unifiedtext-to-texttransformer,”J.Mach.Learn.Res.,vol.21,no.1, pp. 5485–5551, 2020

work page 2020

[10] [10]

A brief overview of ChatGPT: The history, status quoandpotentialfuturedevelopment,

T. Wuet al., “A brief overview of ChatGPT: The history, status quoandpotentialfuturedevelopment,”IEEE/CAAJ.Autom.Sinica, vol. 10, no. 5, pp. 1122–1136, 2023

work page 2023

[11] [11]

LLaMA: Open and Efficient Foundation Language Models

H.Touvronetal.,“LLAMA:Openandefficientfoundationlanguage models,”arXiv preprint arXiv:2302.13971, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[12] [12]

PaLM: Scaling language modeling with pathways,

A. Chowdheryet al., “PaLM: Scaling language modeling with pathways,”J. Mach. Learn. Res., vol. 24, no. 240, pp. 1–113, 2023

work page 2023

[13] [13]

Gemini: A Family of Highly Capable Multimodal Models

R. Anilet al., “Gemini: A family of highly capable multimodal models,”arXiv preprint arXiv:2312.11805, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[14] [14]

DeepSeek:Paradigmshiftsandtechnicalevolution in large AI models,

L.Xiongetal.,“DeepSeek:Paradigmshiftsandtechnicalevolution in large AI models,”IEEE/CAA J. Autom. Sinica, vol. 12, no. 5, pp. 841–858, 2025

work page 2025

[15] [15]

Exploring DeepSeek: A survey on advances, ap- plications, challenges and future directions,

Z. Denget al., “Exploring DeepSeek: A survey on advances, ap- plications, challenges and future directions,”IEEE/CAA J. Autom. Sinica, vol. 12, no. 5, pp. 872–893, 2025

work page 2025

[16] [16]

LaMDA: Language Models for Dialog Applications

R. Thoppilanet al., “LaMDA: Language models for dialog applica- tions,”arXiv preprint arXiv:2201.08239, 2022

work page internal anchor Pith review Pith/arXiv arXiv 2022

[17] [17]

Large Language Model Agent: A Survey on Methodology, Applications and Challenges

J. Luoet al., “Large language model agent: A survey on methodol- ogy,applicationsandchallenges,”arXivpreprintarXiv:2503.21460, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025

[18] [18]

Large language models are human-level prompt engineers,

Y. Zhouet al., “Large language models are human-level prompt engineers,” inProc. 11th Int. Conf. Learn. Representations (ICLR), 2023

work page 2023

[19] [19]

Retrieval-Augmented Generation for Large Language Models: A Survey

Y. Gaoet al., “Retrieval-augmented generation for large language models: A survey,”arXiv preprint arXiv:2312.10997, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[20] [20]

Hugging- GPT: Solving AI tasks with ChatGPT and its friends in Hugging Face,

Y. Shen, K. Song, X. Tan, D. Li, W. Lu, and Y. Zhuang, “Hugging- GPT: Solving AI tasks with ChatGPT and its friends in Hugging Face,”Adv. Neural Inf. Process. Syst., vol. 36, pp. 38154–38180, 2023

work page 2023

[21] [21]

ART: Automatic multi-step reasoning and tool-use for large language models

B.Paranjapeetal.,“ART:Automaticmulti-stepreasoningandtool- use for large language models,”arXiv preprint arXiv:2303.09014, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[22] [22]

Toolformer:Languagemodelscanteachthemselves to use tools,

T.Schicketal.,“Toolformer:Languagemodelscanteachthemselves to use tools,”Adv. Neural Inf. Process. Syst., vol. 36, pp. 68539– 68551, 2023

work page 2023

[23] [23]

Gorilla: Large language model connected with massiveAPIs,

S. G. Patilet al., “Gorilla: Large language model connected with massiveAPIs,”Adv.NeuralInf.Process.Syst.,vol.37,pp.126544– 126565, 2024

work page 2024

[24] [24]

Asurveyonlargelanguagemodelbasedautonomous agents,

L.Wangetal.,“Asurveyonlargelanguagemodelbasedautonomous agents,”Front. Comput. Sci., vol. 18, no. 6, p. 186345, 2024

work page 2024

[25] [25]

The rise and potential of large language model based agents:Asurvey,

Z. Xiet al., “The rise and potential of large language model based agents:Asurvey,”Sci.ChinaInf.Sci.,vol.68,no.2,p.121101,2025

work page 2025

[26] [26]

Whensoftwaresecuritymeetslargelanguagemodels: A survey,

X.Zhuetal.,“Whensoftwaresecuritymeetslargelanguagemodels: A survey,”IEEE/CAA J. Autom. Sinica, vol. 12, no. 2, pp. 317–334, 2025

work page 2025

[27] [27]

UnveilingLLMmechanismsthroughneural odes and control theory,

Y.ZhangandQ.Dong,“UnveilingLLMmechanismsthroughneural odes and control theory,”arXiv preprint arXiv:2406.16985, 2024

work page arXiv 2024

[28] [28]

Neural ODE transformers: Analyzing internal dy- namics and adaptive fine-tuning,

A. Tonget al., “Neural ODE transformers: Analyzing internal dy- namics and adaptive fine-tuning,” inProc. 13th Int. Conf. Learn. Representations (ICLR), 2025

work page 2025

[29] [29]

Linear feedback control systems for iterative prompt optimization in large language models,

R. R. Karn, “Linear feedback control systems for iterative prompt optimization in large language models,”arXiv preprint arXiv:2501.11979, 2025

work page arXiv 2025

[30] [30]

Linearly controlled language genera- tion with performative guarantees,

E. Cheng and C. A. Alonso, “Linearly controlled language genera- tion with performative guarantees,” inProc. NeurIPS Workshop on Foundation Model Interventions, 2024

work page 2024

[31] [31]

Mamba: Linear-time sequence modeling with selective state spaces,

A. Gu and T. Dao, “Mamba: Linear-time sequence modeling with selective state spaces,” inProc. 1st Conf. on Language Modeling, 2024

work page 2024

[32] [32]

Safe RLHF: Safe reinforcement learning from human feedback,

J. Daiet al., “Safe RLHF: Safe reinforcement learning from human feedback,” inProc. 12th Int. Conf. Learn. Representations (ICLR), 2024

work page 2024

[33] [33]

PIDformer: Transformer meets control theory,

T. M. Nguyen, C. A. Uribe, T. M. Nguyen, and R. Baraniuk, “PIDformer: Transformer meets control theory,” inProc. 41st Int. Conf. Machine Learning, 2024

work page 2024

[34] [34]

and Thomson, M., 2023

A.Bhargavaetal.,“What’sthemagicword?acontroltheoryofLLM prompting,”arXiv preprint arXiv:2310.04444, 2023

work page arXiv 2023

[35] [35]

Towardsautonomous system: Flexible modular production system enhanced with large language model agents,

Y.Xia,M.Shenoy,N.Jazdi,andM.Weyrich,“Towardsautonomous system: Flexible modular production system enhanced with large language model agents,” inProc. IEEE 28th Int. Conf. Emerging Technologies and Factory Automation (ETFA), 2023, pp. 1–8

work page 2023

[36] [36]

AuDeRe: Automated strategy decision and realization in robot planning and control via LLMs,

Y. Meng, F. Chen, Y. Chen, and C. Fan, “AuDeRe: Automated strategy decision and realization in robot planning and control via LLMs,”arXiv preprint arXiv:2504.03015, 2025

work page arXiv 2025

[37] [37]

LLMs-guidedadaptivecompensator:Bringingadap- tivity to automatic control systems with large language models,

Z.Zhouetal.,“LLMs-guidedadaptivecompensator:Bringingadap- tivity to automatic control systems with large language models,” arXiv preprint arXiv:2507.20509, 2025

work page arXiv 2025

[38] [38]

Berlin,Germany: Springer, 2015

D.A.Novikov,Cybernetics:FromPasttoFuture. Berlin,Germany: Springer, 2015. Page 23 of 28

work page 2015

[39] [39]

Wiener,Cybernetics: Or Control and Communication in the Animal and the Machine

N. Wiener,Cybernetics: Or Control and Communication in the Animal and the Machine. Paris and Cambridge, MA: Hermann & Cie and MIT Press, 1948

work page 1948

[40] [40]

A logical calculus of the ideas im- manent in nervous activity,

W. McCulloch and W. Pitts, “A logical calculus of the ideas im- manent in nervous activity,”Bull. Math. Biol., vol. 5, pp. 115–137, 1943

work page 1943

[41] [41]

Thehistoryofcyberneticsand artificialintelligence:AviewfromSaintPetersburg,

A.L.FradkovandA.I.Shepeljavyi,“Thehistoryofcyberneticsand artificialintelligence:AviewfromSaintPetersburg,”Cybern.Phys., vol. 11, pp. 253–263, 2022

work page 2022

[42] [42]

W. R. Ashby,An Introduction to Cybernetics. London: Chapman & Hall Ltd., 1956

work page 1956

[43] [43]

A proposal for the Dartmouth summer research project on artificial intelligence, August 31, 1955,

J. McCarthy, M. L. Minsky, N. Rochester, and C. E. Shannon, “A proposal for the Dartmouth summer research project on artificial intelligence, August 31, 1955,”AI Mag., vol. 27, no. 4, pp. 12–14, 2006

work page 1955

[44] [44]

The perceptron: A perceiving and recognizing au- tomaton (Project PARA),

F. Rosenblatt, “The perceptron: A perceiving and recognizing au- tomaton (Project PARA),” Cornell Aeronautical Laboratory, Tech. Rep. 85-460-1, 1957

work page 1957

[45] [45]

Some studies in machine learning using the game of checkers,

A. Samuel, “Some studies in machine learning using the game of checkers,”IBM J. Res. Dev., vol. 3, no. 3, pp. 210–229, 1959

work page 1959

[46] [46]

TheStanfordCartandtheCMUrover,

H.P.Moravec,“TheStanfordCartandtheCMUrover,”Proc.IEEE, vol. 71, no. 7, pp. 872–884, 1983

work page 1983

[47] [47]

Stanford, CA, USA: Stanford Univ

——,Obstacle Avoidance and Navigation in the Real World by a Seeing Robot Rover. Stanford, CA, USA: Stanford Univ. Press, 1980

work page 1980

[48] [48]

Evolution of robotic arms,

M. E. Moran, “Evolution of robotic arms,”J. Robot. Surg., vol. 1, no. 2, pp. 103–111, 2007

work page 2007

[49] [49]

Steps toward artificial intelligence,

M. Minsky, “Steps toward artificial intelligence,”Proc. IEEE, vol. 49, no. 1, pp. 8–30, 1961

work page 1961

[50] [50]

NewYork:McGraw-Hill,1965

N.J.Nilsson,LearningMachines. NewYork:McGraw-Hill,1965

work page 1965

[51] [51]

A heuristic approach to reinforcement learning control systems,

M. D. Waltz and K. S. Fu, “A heuristic approach to reinforcement learning control systems,”IEEE Trans. Autom. Control, vol. 10, pp. 390–398, 1965

work page 1965

[52] [52]

Applications of artificial intelligence techniques to a spacecraft control problem,

J. M. Mendel, “Applications of artificial intelligence techniques to a spacecraft control problem,” National Aeronautics and Space Administration, Tech. Rep. NASA CR-755, 1966

work page 1966

[53] [53]

Learningfromdelayedrewards,

C.J.C.H.Watkins,“Learningfromdelayedrewards,”Ph.D.disser- tation, Cambridge University, Cambridge, England, 1989

work page 1989

[54] [54]

Q-learning,

C.J.C.H.WatkinsandP.Dayan,“Q-learning,”Mach.Learn.,vol.8, pp. 279–292, 1992

work page 1992

[55] [55]

I. N. Aizenberg, N. N. Aizenberg, and J. P. L. Vandewalle,Multi- Valued and Universal Binary Neurons: Theory, Learning and Ap- plications. Springer Science & Business Media, 2000

work page 2000

[56] [56]

The history of artificial intelligence,

T. Mucci, “The history of artificial intelligence,” IBM Think, [Online]. Available: https://www.ibm.com/think/topics/history-of- artificial-intelligence

work page

[57] [57]

Pre-trained language models and their applications,

H. Wang, J. Li, H. Wu, E. Hovy, and Y. Sun, “Pre-trained language models and their applications,”Engineering, vol. 25, pp. 51–64, 2022

work page 2022

[58] [58]

Training language models to follow instructions with human feedback,

L. Ouyanget al., “Training language models to follow instructions with human feedback,” inProc. Adv. Neural Inf. Process. Syst., vol. 35, 2022, pp. 27730–27744

work page 2022

[59] [59]

Reinforcement Learning from Human Feedback

N.Lambert,“Reinforcementlearningfromhumanfeedback,”arXiv preprint arXiv:2504.12501, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025

[60] [60]

The crossroads of LLM and traffic control: A study on large language models in adaptive traffic signal control,

M. Movahedi and J. Choi, “The crossroads of LLM and traffic control: A study on large language models in adaptive traffic signal control,”IEEE Trans. Intell. Transp. Syst., vol. 26, no. 2, pp. 1701– 1716, 2025

work page 2025

[61] [61]

Largelanguagemodelsintransportation:Acomprehen- sivebibliometricanalysisofemergingtrends,challenges,andfuture research,

M.Hassan,“Largelanguagemodelsintransportation:Acomprehen- sivebibliometricanalysisofemergingtrends,challenges,andfuture research,”IEEE Access, vol. 13, pp. 132547–132598, 2025

work page 2025

[62] [62]

Leveraging LLMs and knowledgegraphstodesignsecureautomationsystems,

A. M. Hosseini, W. Kastner, and T. Sauter, “Leveraging LLMs and knowledgegraphstodesignsecureautomationsystems,”IEEEOpen J. Ind. Electron. Soc., vol. 6, pp. 380–395, 2025

work page 2025

[63] [63]

Xavier Suau, Pieter Delobelle, Katherine Metcalf, Armand Joulin, Nicholas Apostoloff, Luca Zappella, and Pau Rodr´ıguez

S. Soatto, P. Tabuada, P. Chaudhari, and T. Y. Liu, “Taming AI bots:Controllabilityofneuralstatesinlargelanguagemodels,”arXiv preprint arXiv:2305.18449, 2023

work page arXiv 2023

[64] [64]

LLM4PLC: Harnessing large language models for verifiable programming of PLCs in industrial control systems,

M. Fakihet al., “LLM4PLC: Harnessing large language models for verifiable programming of PLCs in industrial control systems,” in Proc. 46th Int. Conf. on Software Engineering: Software Engineer- ing in Practice, 2024, pp. 192–203

work page 2024

[65] [65]

and Hu, B., 2024

D. Kevianet al., “Capabilities of large language models in control engineering: A benchmark study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra,”arXiv preprint arXiv:2404.03647, 2024

work page arXiv 2024

[66] [66]

Control industrial automation system with large language model agents,

Y. Xia, N. Jazdi, J. Zhang, C. Shah, and M. Weyrich, “Control industrial automation system with large language model agents,” in Proc. IEEE 30th Int. Conf. on Emerging Technologies and Factory Automation (ETFA), 2025, pp. 1–8

work page 2025

[67] [67]

Automated literature research and review-generation method based on large language models,

S. Wuet al., “Automated literature research and review-generation method based on large language models,”Natl. Sci. Rev., vol. 12, no. 6, p. nwaf169, 2025

work page 2025

[68] [68]

A survey on largelanguagemodelsforcodegeneration,

J. Jiang, F. Wang, J. Shen, S. Kim, and S. Kim, “A survey on largelanguagemodelsforcodegeneration,”ACMTrans.Softw.Eng. Methodol., 2025

work page 2025

[69] [69]

Code Llama: Open Foundation Models for Code

B. Roziereet al., “Code Llama: Open foundation models for code,” arXiv preprint arXiv:2308.12950, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[70] [70]

Leveraging LLMs for legacy code modernization: challenges and opportunities for LLM- generated documentation,

C. Diggset al., “Leveraging LLMs for legacy code modernization: Challenges and opportunities for LLM-generated documentation,” arXiv preprint arXiv:2411.14971, 2025

work page arXiv 2025

[71] [71]

Large language models as data preprocessors,

H. Zhang, Y. Dong, C. Xiao, and M. Oyamada, “Large language models as data preprocessors,”arXiv preprint arXiv:2308.16361, 2023

work page arXiv 2023

[72] [72]

Data informativity: A new perspective on data-driven analysis and control,

H. J. V. Waarde, J. Eising, H. L. Trentelman, and M. K. Camlibel, “Data informativity: A new perspective on data-driven analysis and control,”IEEE Trans. Autom. Control, vol. 65, no. 11, pp. 4753– 4768, 2020

work page 2020

[73] [73]

LLM-agent-controller: A universal multi-agent large language model system as a control engineer,

R. Zahedifar, S. A. Mirghasemi, M. S. Baghshah, and A. Taheri, “LLM-agent-controller: A universal multi-agent large language model system as a control engineer,”ACM Trans. Softw. Eng. Methodol., 2025

work page 2025

[74] [74]

Pydantic,

S. Colvinet al., “Pydantic,”GitHub repository, 2025, [Online]. Available: https://github.com/pydantic/pydantic

work page 2025

[75] [75]

GPT-researcher,

A. Elovic, “GPT-researcher,”GitHub repository, 2023, [Online]. Available: https://github.com/assafelovic/gpt-researcher

work page 2023

[76] [76]

TheimpactofgenerativeAItoolsonresearchers andresearch:Implicationsforacademiainhighereducation,

A.M.Al-Zahrani,“TheimpactofgenerativeAItoolsonresearchers andresearch:Implicationsforacademiainhighereducation,”Innov. Educ. Teach. Int., vol. 61, no. 5, pp. 1029–1043, 2024

work page 2024

[77] [77]

An evaluation of general-purpose AI chatbots: a com- prehensive comparative analysis,

O. Chalyi, “An evaluation of general-purpose AI chatbots: a com- prehensive comparative analysis,”InfoSci. Trends, vol. 1, no. 1, pp. 52–66, 2024

work page 2024

[78] [78]

Library databases and chatbots,

E. Lombard, “Library databases and chatbots,”Internet Ref. Serv. Q., vol. 28, no. 4, pp. 463–471, 2024

work page 2024

[79] [79]

Large language models are zero-shot reasoners,

T. Kojimaet al., “Large language models are zero-shot reasoners,” Adv. Neural Inf. Process. Syst., vol. 35, pp. 22199–22213, 2022

work page 2022

[80] [80]

Pre-trainedlargelanguage models for industrial control,

L.Song,C.Zhang,L.Zhao,andJ.Bian,“Pre-trainedlargelanguage models for industrial control,”arXiv preprint arXiv:2308.03028, 2023

work page arXiv 2023