Online Adaptive Probabilistic Safety Certificate with Language Guidance

Hikaru Hoshino; Xiyu Deng; Yorie Nakahira; Zhuoyuan Wang

arxiv: 2511.12431 · v2 · submitted 2025-11-16 · 📡 eess.SY · cs.SY

Online Adaptive Probabilistic Safety Certificate with Language Guidance

Zhuoyuan Wang , Xiyu Deng , Hikaru Hoshino , Yorie Nakahira This is my paper

Pith reviewed 2026-05-17 22:40 UTC · model grok-4.3

classification 📡 eess.SY cs.SY

keywords probabilistic safety certificateslanguage guidancestochastic systemsadaptive safetylong-term safetyBayesian estimationautonomous lane-keepingprobabilistic invariance

0 comments

The pith

A language-guided framework uses probabilistic invariance to turn myopic checks into long-term safety guarantees for uncertain stochastic systems.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes an adaptive probabilistic safety certificate framework that accepts natural-language user inputs and combines them with Bayesian estimates of the environment. It applies probabilistic invariance to derive simple myopic safety conditions that still deliver long-term guarantees despite stochastic dynamics and uncertainty. The method explicitly incorporates user preferences and risk tolerance so that the same underlying system can be personalized without redesigning the controller. Numerical simulations of autonomous lane-keeping under extreme road conditions illustrate how the certificates adapt online while maintaining safety.

Core claim

The framework integrates natural-language inputs from users and Bayesian estimators of the environment into adaptive safety certificates that explicitly account for user preferences, system dynamics, and quantified uncertainties. Probabilistic invariance is used to obtain myopic safety conditions that carry long-term safety guarantees for stochastic systems.

What carries the argument

Probabilistic invariance, a generalization of forward invariance to a probability space, applied to language-guided Bayesian estimates to produce adaptive safety certificates.

If this is right

Safety certificates can be updated online as language instructions or environmental estimates change without recomputing the entire controller.
Diverse user preferences translate directly into different safety margins while preserving the same long-term probabilistic guarantees.
The myopic conditions reduce computational burden compared with full-horizon optimization yet still enforce safety over time.
The approach extends to other stochastic control tasks where both uncertainty and human guidance must be handled simultaneously.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the Bayesian estimates remain accurate under distribution shift, the framework could reduce unnecessary conservatism in safety-critical applications.
Real-world deployment would require verifying that language parsing and preference encoding do not introduce new failure modes outside the modeled uncertainty.
The same structure might be tested on multi-agent systems where each agent receives separate language guidance.

Load-bearing premise

That probabilistic invariance applied to language-guided Bayesian estimates yields myopic conditions that actually deliver the claimed long-term safety guarantees beyond the reported simulations.

What would settle it

A long-horizon simulation or hardware trial in which the closed-loop trajectory violates a safety threshold even though every myopic condition derived from the current language input and Bayesian estimate is satisfied.

Figures

Figures reproduced from arXiv: 2511.12431 by Hikaru Hoshino, Xiyu Deng, Yorie Nakahira, Zhuoyuan Wang.

**Figure 2.** Figure 2: Safety probability vs. MPC horizon Tmpc. Distance (m) 0 20 40 60 80 Distance (m) -20 0 20 40 60 Proposed AMPC CDBF [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗

**Figure 5.** Figure 5: Safety vs. efficiency trade-offs with MPC-based adaptive controls. We evaluated the closed-loop performance of all three methods under varying road conditions with unknown ground truth friction coefficients: icy (µ ∈ [0.3, 0.4]), wet (µ ∈ [0.5, 0.6]), and dry (µ ∈ [0.8, 0.9]), and we tested with different estimator priors, measurement noise levels, and maximum lane-error tolerances, to conduct a compreh… view at source ↗

**Figure 6.** Figure 6: Qualitative results across three rounds of human instructions and feedback. [PITH_FULL_IMAGE:figures/full_fig_p010_6.png] view at source ↗

**Figure 7.** Figure 7: Safety vs. efficiency trade-offs with MPC-based adaptive safe control variants. [PITH_FULL_IMAGE:figures/full_fig_p022_7.png] view at source ↗

**Figure 8.** Figure 8: Per step online computation time vs. MPC horizon [PITH_FULL_IMAGE:figures/full_fig_p022_8.png] view at source ↗

**Figure 9.** Figure 9: Sequential update of the posterior distribution of [PITH_FULL_IMAGE:figures/full_fig_p026_9.png] view at source ↗

**Figure 10.** Figure 10: Vehicle trajectories. Dry Wet Icy Road condition 0 5 10 15 20 25 30 35 Longitudinal speed (km/h) Proposed safe controller Non-adaptive safe controller Nominal controller [PITH_FULL_IMAGE:figures/full_fig_p026_10.png] view at source ↗

read the original abstract

Achieving long-term safety in uncertain/extreme environments while accounting for human preferences remains a fundamental challenge for autonomous systems. Existing methods often trade off long-term guarantees for fast real-time control and cannot adapt to variability in human preferences or risk tolerance. To address these limitations, we propose a language-guided adaptive probabilistic safety certificate (PSC) framework that guarantees long-term safety for stochastic systems under environmental uncertainty while accommodating diverse human preferences. The proposed framework integrates natural-language inputs from users and Bayesian estimators of the environment into adaptive safety certificates that explicitly account for user preferences, system dynamics, and quantified uncertainties. Our key technical innovation leverages probabilistic invariance--a generalization of forward invariance to a probability space--to obtain myopic safety conditions with long-term safety guarantees. We validate the framework through numerical simulations of autonomous lane-keeping with human-in-the-loop guidance under uncertain and extreme road conditions, demonstrating enhanced safety-performance trade-offs, adaptability to changing environments, and personalization to different user preferences. Code is available at https://github.com/hoshino06/adaptive_lane_keeping.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper combines language inputs with Bayesian adaptation and probabilistic invariance for online safety certificates in uncertain control, but the long-term guarantees under shifting estimates need explicit checking.

read the letter

The main point is that this framework folds natural language user preferences into Bayesian environment estimates and then applies probabilistic invariance to generate adaptive safety conditions for stochastic systems. The lane-keeping simulations with human guidance under uncertain roads show the method adjusting to different preferences and conditions while keeping decent performance and releasing code helps others test it directly.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes a language-guided adaptive probabilistic safety certificate (PSC) framework for stochastic systems. It integrates natural-language user inputs with Bayesian estimators of environmental uncertainty to generate adaptive safety certificates that incorporate user preferences. The central technical contribution applies probabilistic invariance to derive myopic safety conditions claimed to deliver long-term safety guarantees. The framework is validated through numerical simulations of autonomous lane-keeping under uncertain road conditions with human-in-the-loop guidance.

Significance. If the long-term probabilistic guarantees survive online adaptation of the certificate via language-driven Bayesian updates, the work would provide a useful bridge between formal safety methods and human preference accommodation in autonomous systems. The open-source code supports reproducibility and could facilitate follow-on research in safe control under uncertainty.

major comments (2)

[Section 3] Section 3 (Framework): The derivation of long-term safety from myopic conditions via probabilistic invariance assumes a fixed probability measure, yet the online Bayesian updates driven by sequential natural-language inputs render the posterior time-varying. The manuscript does not show how the invariance property is preserved across steps when the quantified uncertainty and estimate change.
[Section 5] Section 5 (Numerical Simulations): The lane-keeping results report improved safety-performance trade-offs and adaptability, but provide no quantitative bound or analysis on the probability of invariance violation under repeated online adaptation of the certificate. This leaves the central long-term guarantee claim without direct support beyond empirical observation.

minor comments (2)

[Introduction] The notation distinguishing the adaptive safety certificate from the underlying probabilistic invariance set could be introduced earlier and used consistently to improve readability.
[Section 5] Figure captions for the simulation results should explicitly state the number of Monte Carlo runs and the exact definition of the plotted safety metric.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and insightful comments. We address each major comment below with clarifications and indicate planned revisions to strengthen the presentation of the long-term safety guarantees.

read point-by-point responses

Referee: [Section 3] Section 3 (Framework): The derivation of long-term safety from myopic conditions via probabilistic invariance assumes a fixed probability measure, yet the online Bayesian updates driven by sequential natural-language inputs render the posterior time-varying. The manuscript does not show how the invariance property is preserved across steps when the quantified uncertainty and estimate change.

Authors: We appreciate this precise observation regarding the time-varying posterior. The probabilistic invariance is applied conditionally with respect to the current posterior at each time step; the myopic safety condition is recomputed using the updated posterior obtained from the language-driven Bayesian estimator. This ensures the one-step invariance holds under the measure at that instant. To make the chaining across time-varying measures explicit, we will insert a new proposition in Section 3 that proves, by induction, that repeated application of the myopic condition under consistent Bayesian updates preserves the long-term probabilistic safety guarantee. The proof relies on the tower property of conditional expectation and the fact that each update is a valid posterior. revision: yes
Referee: [Section 5] Section 5 (Numerical Simulations): The lane-keeping results report improved safety-performance trade-offs and adaptability, but provide no quantitative bound or analysis on the probability of invariance violation under repeated online adaptation of the certificate. This leaves the central long-term guarantee claim without direct support beyond empirical observation.

Authors: The referee is correct that the current simulations offer only empirical support. While the theoretical result guarantees safety under the maintained assumptions, obtaining a closed-form quantitative bound on the violation probability would require additional analysis of the convergence rate of the language-conditioned posterior, which lies outside the scope of the present contribution. In the revision we will add a new subsection to Section 5 containing Monte Carlo experiments over extended time horizons that report empirical violation frequencies, together with a remark that explicitly links these observations to the conditions of the theoretical guarantee. revision: partial

Circularity Check

0 steps flagged

No significant circularity; probabilistic invariance used as external tool

full rationale

The paper's derivation chain invokes probabilistic invariance as a generalization of forward invariance to obtain myopic conditions that deliver long-term guarantees. No equations or definitions in the abstract or framework description reduce the safety certificate or invariance property to a fitted parameter, self-referential definition, or prior self-citation that is load-bearing. The integration of language-guided Bayesian updates is presented as an application of the external invariance concept rather than a redefinition of it. The central claim therefore remains independent of its inputs by construction, consistent with self-contained use of a standard mathematical tool.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The framework rests on the assumption that probabilistic invariance can be leveraged to convert myopic conditions into long-term guarantees when combined with language and Bayesian estimates; no free parameters or new entities are explicitly introduced in the abstract.

axioms (1)

domain assumption Probabilistic invariance generalizes forward invariance to probability space and yields myopic safety conditions with long-term guarantees.
Stated as the key technical innovation that enables the framework.

pith-pipeline@v0.9.0 · 5486 in / 1111 out tokens · 40183 ms · 2026-05-17T22:40:47.644820+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

AUk_Hk+1 Ψπ_Hk(Xk) ≥ −γ(Ψπ_Hk(Xk)−(1−ϵ)) with γ strictly concave/linear and increasing, γ(q)≤q.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

22 extracted references · 22 canonical work pages

[1]

Llm-enhanced symbolic control for safety-critical applications.arXiv preprint arXiv:2505.11077,

Amir Bayat, Alessandro Abate, Necmiye Ozay, and Raphael M Jungers. Llm-enhanced symbolic control for safety-critical applications.arXiv preprint arXiv:2505.11077,

work page arXiv
[2]

Neural Ordinary Differential Equations for Nonlinear System Identification

Max H. Cohen and Calin Belta. High order robust adaptive control barrier functions and expo- nentially stabilizing adaptive control lyapunov functions. In2022 American Control Conference (ACC), pages 2233–2238. IEEE. ISBN 978-1-66545-196-3. doi: 10.23919/ACC53348.2022. 9867633. URLhttps://ieeexplore.ieee.org/document/9867633/. Ryan K Cosner, Andrew W Sing...

work page doi:10.23919/acc53348.2022 2022
[3]

Context-aware llm-based safe control against latent risks.arXiv preprint arXiv:2403.11863,

Xiyu Deng, Quan Khanh Luu, Anh Van Ho, and Yorie Nakahira. Context-aware llm-based safe control against latent risks.arXiv preprint arXiv:2403.11863,

work page arXiv
[4]

David D Fan, Jennifer Nguyen, Rohan Thakker, Nikhilesh Alatur, Ali-akbar Agha-mohammadi, and Evangelos A Theodorou

doi: 10.1177/0954407014556115. David D Fan, Jennifer Nguyen, Rohan Thakker, Nikhilesh Alatur, Ali-akbar Agha-mohammadi, and Evangelos A Theodorou. Bayesian learning-based adaptive control for safety critical systems. In2020 IEEE international conference on robotics and automation (ICRA), pages 4093–4099. IEEE,

work page doi:10.1177/0954407014556115
[5]

SafeLLM-controlledrobotswithformalguarantees via reachability analysis,

Ahmad Hafez, Alireza Naderi Akhormeh, Amr Hegazy, and Amr Alanwar. Safe llm-controlled robots with formal guarantees via reachability analysis.arXiv preprint arXiv:2503.03911,

work page arXiv
[6]

Yiwen Huang, Wei Liang, and Yan Chen

doi: 10.23919/ ACC60939.2024.10644621. Yiwen Huang, Wei Liang, and Yan Chen. Estimation and analysis of vehicle lateral stability region. In2017 American Control Conference (ACC), pages 4303–4308,

work page arXiv 2024
[7]

2017.7963617

doi: 10.23919/ACC. 2017.7963617. Yiwen Huang, Sze Zheng Yong, and Yan Chen. Stability control of autonomous ground vehicles using control-dependent barrier functions.IEEE Transactions on Intelligent Vehicles, 6(4):699– 710,

work page doi:10.23919/acc 2017
[8]

Motion planning for autonomous driving: The state of the art and future perspectives,

doi: 10.1109/TIV .2021.3058064. Mrdjan Jankovic. Robust control barrier functions for constrained stabilization of nonlinear systems. Automatica, 96:359–367,

work page doi:10.1109/tiv 2021
[9]

Safety aware task planning via large language models in robotics.arXiv preprint arXiv:2503.15707,

Azal Ahmad Khan, Michael Andrev, Muhammad Ali Murtaza, Sergio Aguilera, Rui Zhang, Jie Ding, Seth Hutchinson, and Ali Anwar. Safety aware task planning via large language models in robotics.arXiv preprint arXiv:2503.15707,

work page arXiv
[10]

doi: https://doi.org/10.1016/j.isatra.2022.09.038

ISSN 0019-0578. doi: https://doi.org/10.1016/j.isatra.2022.09.038. Brett T Lopez, Jean-Jacques E Slotine, and Jonathan P How. Robust adaptive control barrier func- tions: An adaptive and data-driven approach to safety.IEEE Control Systems Letters, 5(3): 1031–1036,

work page doi:10.1016/j.isatra.2022.09.038 2022
[11]

Lee, Matthew Tan, Yuke Zhu, and Jeannette Bohg

doi: 10.1109/ICRA48506.2021.9561894. Gabriel Maher. Llmpc: Large language model predictive control.arXiv preprint arXiv:2501.02486,

work page doi:10.1109/icra48506.2021.9561894 2021
[12]

Rajesh Rajamani.Vehicle Dynamics and Control

doi: 10.1109/TAC.2007.902736. Rajesh Rajamani.Vehicle Dynamics and Control. Springer, 2nd edition,

work page doi:10.1109/tac.2007.902736 2007
[13]

Sourav Sanyal and Kaushik Roy

doi: 10.1115/1.2192835. Sourav Sanyal and Kaushik Roy. Asma: An adaptive safety margin algorithm for vision-language drone navigation via scene-aware control barrier functions.IEEE Robotics and Automation Let- ters,

work page doi:10.1115/1.2192835
[14]

2nd Modeling, Estimation and Control Conference MECC

doi: https://doi.org/10.1016/j.ifacol.2022.11.232. 2nd Modeling, Estimation and Control Conference MECC

work page doi:10.1016/j.ifacol.2022.11.232 2022
[15]

Generalizable physics-informed learning for stochastic safety-critical systems.arXiv preprint arXiv:2407.08868,

Zhuoyuan Wang, Albert Chern, and Yorie Nakahira. Generalizable physics-informed learning for stochastic safety-critical systems.arXiv preprint arXiv:2407.08868,

work page arXiv
[16]

A survey on multi-turn interaction capabilities of large language models.arXiv preprint arXiv:2501.09959, 2025

Chen Zhang, Xinyi Dai, Yaxiong Wu, Qu Yang, Yasheng Wang, Ruiming Tang, and Yong Liu. A survey on multi-turn interaction capabilities of large language models.arXiv preprint arXiv:2501.09959,

work page arXiv
[17]

Ψπ Hk+1(Xk+1)−Ψ π Hk+1(Xk) ∆t Xk, Uk # (46) = lim ∆t→0 E

From the definition ofS UkΨπ Hk+1(Xk)in (41) we get lim ∆t→0 S UkΨπ Hk+1(Xk)(44) = lim ∆t→0 E[Ψπ Hk+1(Xk+1)|Xk, Uk]−Ψ π Hk+1(Xk) ∆t (45) = lim ∆t→0 E " Ψπ Hk+1(Xk+1)−Ψ π Hk+1(Xk) ∆t Xk, Uk # (46) = lim ∆t→0 E " E " Ψπ Hk+1(Xk+1)−Ψ π Hk+1(Xk) ∆t Xk, Uk,Ξ k+1 # Xk, Uk # (47) =E " lim ∆t→0 E " Ψπ Hk+1(Xk+1)−Ψ π Hk+1(Xk) ∆t Xk, Uk,Ξ k+1 # Xk, Uk # ,(48) where...

work page 1973
[18]

Remark 6Evaluation of the discrete time generatorA Uk Hk+1 Ψπ Hk(Xk)in(7)requires values of the long-term safety probabilityΨ π Hk(Xk)in(4). In practice, one could run online parallel Monte Carlo simulations to empirically estimate such values, or leverage offline model-based and data- driven methods such as the ones in Chern et al. (2021); Wang et al. (2...

work page 2021
[19]

While time stepkhas not reachedT end, we run the proposed adaptive safe control method

In line 1, we initialize the horizon of the problemT end, the discrete time step∆t, the initial stateX 0, the nominal control policyπ, the long-term safety horizonT, the risk tolerance ϵ, and the reference controllerN. While time stepkhas not reachedT end, we run the proposed adaptive safe control method. Specifically, we first obtain the available inform...

work page 2023
[20]

20 ADAPTIVEPSCWITHLANGUAGEGUIDANCE E.1. General Setup For the state space in (13), when the vehicle is traveling on a road with a non-zero curvature, the curvature is viewed as a disturbance on the heading errorψdescribed by ˙ψ=r−v xρ(s),(74) whereρ(s)denotes the radius of curvature as a function ofs. The road-tire friction coefficientµis an unknown fixed...

work page 2012
[21]

Note that for the estimator (15) Theorem 2 holds without assuming Gaussian distributions for the parameter estimates

are proposed for online estimation of the friction coefficient, in all experiments we chose to use a Bayesian estimator. Note that for the estimator (15) Theorem 2 holds without assuming Gaussian distributions for the parameter estimates. For all three MPC-based adaptive safe control methods considered, we define the MPC cost function as JMPC = TmpcX k=1 ...

work page 2021
[22]

That was too fast, please slow down

The empiricalSafetyreported in the results are calculated through the ratio of time period where the lateral lane deviation is less than3m 22 ADAPTIVEPSCWITHLANGUAGEGUIDANCE Aggressive User Input Conservative User Input Dry and Unsure User Input I want to drive aggressively and push the limits. That was too fast, please slow down. The road seems dry, but ...

work page 2017

[1] [1]

Llm-enhanced symbolic control for safety-critical applications.arXiv preprint arXiv:2505.11077,

Amir Bayat, Alessandro Abate, Necmiye Ozay, and Raphael M Jungers. Llm-enhanced symbolic control for safety-critical applications.arXiv preprint arXiv:2505.11077,

work page arXiv

[2] [2]

Neural Ordinary Differential Equations for Nonlinear System Identification

Max H. Cohen and Calin Belta. High order robust adaptive control barrier functions and expo- nentially stabilizing adaptive control lyapunov functions. In2022 American Control Conference (ACC), pages 2233–2238. IEEE. ISBN 978-1-66545-196-3. doi: 10.23919/ACC53348.2022. 9867633. URLhttps://ieeexplore.ieee.org/document/9867633/. Ryan K Cosner, Andrew W Sing...

work page doi:10.23919/acc53348.2022 2022

[3] [3]

Context-aware llm-based safe control against latent risks.arXiv preprint arXiv:2403.11863,

Xiyu Deng, Quan Khanh Luu, Anh Van Ho, and Yorie Nakahira. Context-aware llm-based safe control against latent risks.arXiv preprint arXiv:2403.11863,

work page arXiv

[4] [4]

David D Fan, Jennifer Nguyen, Rohan Thakker, Nikhilesh Alatur, Ali-akbar Agha-mohammadi, and Evangelos A Theodorou

doi: 10.1177/0954407014556115. David D Fan, Jennifer Nguyen, Rohan Thakker, Nikhilesh Alatur, Ali-akbar Agha-mohammadi, and Evangelos A Theodorou. Bayesian learning-based adaptive control for safety critical systems. In2020 IEEE international conference on robotics and automation (ICRA), pages 4093–4099. IEEE,

work page doi:10.1177/0954407014556115

[5] [5]

SafeLLM-controlledrobotswithformalguarantees via reachability analysis,

Ahmad Hafez, Alireza Naderi Akhormeh, Amr Hegazy, and Amr Alanwar. Safe llm-controlled robots with formal guarantees via reachability analysis.arXiv preprint arXiv:2503.03911,

work page arXiv

[6] [6]

Yiwen Huang, Wei Liang, and Yan Chen

doi: 10.23919/ ACC60939.2024.10644621. Yiwen Huang, Wei Liang, and Yan Chen. Estimation and analysis of vehicle lateral stability region. In2017 American Control Conference (ACC), pages 4303–4308,

work page arXiv 2024

[7] [7]

2017.7963617

doi: 10.23919/ACC. 2017.7963617. Yiwen Huang, Sze Zheng Yong, and Yan Chen. Stability control of autonomous ground vehicles using control-dependent barrier functions.IEEE Transactions on Intelligent Vehicles, 6(4):699– 710,

work page doi:10.23919/acc 2017

[8] [8]

Motion planning for autonomous driving: The state of the art and future perspectives,

doi: 10.1109/TIV .2021.3058064. Mrdjan Jankovic. Robust control barrier functions for constrained stabilization of nonlinear systems. Automatica, 96:359–367,

work page doi:10.1109/tiv 2021

[9] [9]

Safety aware task planning via large language models in robotics.arXiv preprint arXiv:2503.15707,

Azal Ahmad Khan, Michael Andrev, Muhammad Ali Murtaza, Sergio Aguilera, Rui Zhang, Jie Ding, Seth Hutchinson, and Ali Anwar. Safety aware task planning via large language models in robotics.arXiv preprint arXiv:2503.15707,

work page arXiv

[10] [10]

doi: https://doi.org/10.1016/j.isatra.2022.09.038

ISSN 0019-0578. doi: https://doi.org/10.1016/j.isatra.2022.09.038. Brett T Lopez, Jean-Jacques E Slotine, and Jonathan P How. Robust adaptive control barrier func- tions: An adaptive and data-driven approach to safety.IEEE Control Systems Letters, 5(3): 1031–1036,

work page doi:10.1016/j.isatra.2022.09.038 2022

[11] [11]

Lee, Matthew Tan, Yuke Zhu, and Jeannette Bohg

doi: 10.1109/ICRA48506.2021.9561894. Gabriel Maher. Llmpc: Large language model predictive control.arXiv preprint arXiv:2501.02486,

work page doi:10.1109/icra48506.2021.9561894 2021

[12] [12]

Rajesh Rajamani.Vehicle Dynamics and Control

doi: 10.1109/TAC.2007.902736. Rajesh Rajamani.Vehicle Dynamics and Control. Springer, 2nd edition,

work page doi:10.1109/tac.2007.902736 2007

[13] [13]

Sourav Sanyal and Kaushik Roy

doi: 10.1115/1.2192835. Sourav Sanyal and Kaushik Roy. Asma: An adaptive safety margin algorithm for vision-language drone navigation via scene-aware control barrier functions.IEEE Robotics and Automation Let- ters,

work page doi:10.1115/1.2192835

[14] [14]

2nd Modeling, Estimation and Control Conference MECC

doi: https://doi.org/10.1016/j.ifacol.2022.11.232. 2nd Modeling, Estimation and Control Conference MECC

work page doi:10.1016/j.ifacol.2022.11.232 2022

[15] [15]

Generalizable physics-informed learning for stochastic safety-critical systems.arXiv preprint arXiv:2407.08868,

Zhuoyuan Wang, Albert Chern, and Yorie Nakahira. Generalizable physics-informed learning for stochastic safety-critical systems.arXiv preprint arXiv:2407.08868,

work page arXiv

[16] [16]

A survey on multi-turn interaction capabilities of large language models.arXiv preprint arXiv:2501.09959, 2025

Chen Zhang, Xinyi Dai, Yaxiong Wu, Qu Yang, Yasheng Wang, Ruiming Tang, and Yong Liu. A survey on multi-turn interaction capabilities of large language models.arXiv preprint arXiv:2501.09959,

work page arXiv

[17] [17]

Ψπ Hk+1(Xk+1)−Ψ π Hk+1(Xk) ∆t Xk, Uk # (46) = lim ∆t→0 E

From the definition ofS UkΨπ Hk+1(Xk)in (41) we get lim ∆t→0 S UkΨπ Hk+1(Xk)(44) = lim ∆t→0 E[Ψπ Hk+1(Xk+1)|Xk, Uk]−Ψ π Hk+1(Xk) ∆t (45) = lim ∆t→0 E " Ψπ Hk+1(Xk+1)−Ψ π Hk+1(Xk) ∆t Xk, Uk # (46) = lim ∆t→0 E " E " Ψπ Hk+1(Xk+1)−Ψ π Hk+1(Xk) ∆t Xk, Uk,Ξ k+1 # Xk, Uk # (47) =E " lim ∆t→0 E " Ψπ Hk+1(Xk+1)−Ψ π Hk+1(Xk) ∆t Xk, Uk,Ξ k+1 # Xk, Uk # ,(48) where...

work page 1973

[18] [18]

Remark 6Evaluation of the discrete time generatorA Uk Hk+1 Ψπ Hk(Xk)in(7)requires values of the long-term safety probabilityΨ π Hk(Xk)in(4). In practice, one could run online parallel Monte Carlo simulations to empirically estimate such values, or leverage offline model-based and data- driven methods such as the ones in Chern et al. (2021); Wang et al. (2...

work page 2021

[19] [19]

While time stepkhas not reachedT end, we run the proposed adaptive safe control method

In line 1, we initialize the horizon of the problemT end, the discrete time step∆t, the initial stateX 0, the nominal control policyπ, the long-term safety horizonT, the risk tolerance ϵ, and the reference controllerN. While time stepkhas not reachedT end, we run the proposed adaptive safe control method. Specifically, we first obtain the available inform...

work page 2023

[20] [20]

20 ADAPTIVEPSCWITHLANGUAGEGUIDANCE E.1. General Setup For the state space in (13), when the vehicle is traveling on a road with a non-zero curvature, the curvature is viewed as a disturbance on the heading errorψdescribed by ˙ψ=r−v xρ(s),(74) whereρ(s)denotes the radius of curvature as a function ofs. The road-tire friction coefficientµis an unknown fixed...

work page 2012

[21] [21]

Note that for the estimator (15) Theorem 2 holds without assuming Gaussian distributions for the parameter estimates

are proposed for online estimation of the friction coefficient, in all experiments we chose to use a Bayesian estimator. Note that for the estimator (15) Theorem 2 holds without assuming Gaussian distributions for the parameter estimates. For all three MPC-based adaptive safe control methods considered, we define the MPC cost function as JMPC = TmpcX k=1 ...

work page 2021

[22] [22]

That was too fast, please slow down

The empiricalSafetyreported in the results are calculated through the ratio of time period where the lateral lane deviation is less than3m 22 ADAPTIVEPSCWITHLANGUAGEGUIDANCE Aggressive User Input Conservative User Input Dry and Unsure User Input I want to drive aggressively and push the limits. That was too fast, please slow down. The road seems dry, but ...

work page 2017