RIPA: Sensory-Vector Prompt Injection Attacks on LLM-Controlled ROS 2 Robots

Nima Dorzhiev

arxiv: 2606.28649 · v1 · pith:VJID7AVCnew · submitted 2026-06-26 · 💻 cs.CR · cs.AI· cs.RO

RIPA: Sensory-Vector Prompt Injection Attacks on LLM-Controlled ROS 2 Robots

Nima Dorzhiev This is my paper

Pith reviewed 2026-06-30 09:26 UTC · model grok-4.3

classification 💻 cs.CR cs.AIcs.RO

keywords prompt injectionLLM-controlled robotsROS 2sensory attacksmodel robustnessadversarial inputsrobotic security

0 comments

The pith

Sensory inputs enable prompt injection attacks on LLM-controlled robots, with success varying by model rather than size.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that prompt injections can reach LLM robot controllers through visual, audio, and LiDAR sensory data in ROS 2 systems. Tests on five models from roughly 4B to 284B parameters show attack success rates that do not rise steadily with scale, as some smaller models block direct overrides while the smallest model matches the largest in susceptibility. A hybrid semantic firewall stops known patterns but allows roughly 10 percent bypass on obfuscated versions. This points to physical sensor channels as a distinct route for overriding embodied AI behavior.

Core claim

The central claim is that sensory-vector prompt injection attacks produce model-specific vulnerability profiles in LLM-controlled ROS 2 robots rather than monotonic scaling with parameter count. Llama-3.3-70B-Instruct-Turbo reaches 100 percent ASR on all variants while Llama-3-8B-Instruct-Lite and Qwen 2.5-7B-Instruct-Turbo reach 0 percent on direct-override injections, and the approximately 4B Gemma-3n-E4B matches the 70B profile. Three channels are defined, with LiDAR context poisoning achieving 100 percent ASR on DeepSeek-V4-Flash, and a proposed hybrid firewall shows a 10.2 percent bypass rate on 19 obfuscation payloads.

What carries the argument

The three sensory injection channels that embed prompts into visual OCR output, audio STT transcripts, and LiDAR obstacle context at the LLM system-prompt level.

If this is right

Direct text overrides are unnecessary when sensory data can deliver effective injections.
Model choice for robotic control must be evaluated against specific injection profiles rather than parameter count.
Hybrid semantic firewalls require additional layers to address obfuscated sensory payloads.
LiDAR context poisoning forms a high-success route for altering robot environment state representations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Similar sensory-channel attacks could extend to other multimodal LLM systems that ingest physical sensor streams.
Robot architectures may need separate sensor validation modules before data reaches the LLM prompt.
Security evaluations for embodied LLMs should incorporate simulated physical sensor inputs beyond text-only tests.

Load-bearing premise

The 100 independent runs per variant and the direct embedding of sensory data into the LLM system prompt capture real attack feasibility without unstated preprocessing or filtering in the ROS 2 pipeline.

What would settle it

Repeated trials on the LiDAR channel for DeepSeek-V4-Flash that produce zero successful injections despite insertion of fabricated obstacle data would falsify the 100 percent ASR result.

Figures

Figures reproduced from arXiv: 2606.28649 by Nima Dorzhiev.

**Figure 1.** Figure 1: RIPA system architecture. Three adversaries inject content through distinct channels: visual (Channel 1, OCR via [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗

**Figure 2.** Figure 2: Hybrid semantic firewall architecture. Stage 1 (rule-based filter) [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: Attack success rate (ASR) by injection variant (A1: direct override, [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 4.** Figure 4: LiDAR sensor context-poisoning variants (Channel 3). Each robot was modeled with eight angular sectors; green denotes a clear reading (3.5 m), and [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

read the original abstract

We present RIPA, the first systematic multi-channel empirical study of prompt injection attacks delivered through the sensory pipeline of a ROS 2-based LLM-controlled robotic system. Across 100 independent runs per injection variant on five LLMs spanning four model families and parameter scales from approximately 4B to approximately 284B (DeepSeek-V4-Flash, Llama-3-8B-Instruct-Lite, Llama-3.3-70B-Instruct-Turbo, Qwen 2.5-7B-Instruct-Turbo, Gemma-3n-E4B), we identify model-specific vulnerability profiles that do not follow a monotonic scaling trend: Llama-3.3-70B-Instruct-Turbo exhibits 100% attack success rate (ASR) across all injection variants, while Llama-3-8B-Instruct-Lite and Qwen 2.5-7B-Instruct-Turbo resist direct-override injection (0% ASR), and the smallest model evaluated (Gemma-3n-E4B, approximately 4B) matches the 70B model's vulnerability profile, indicating that robustness is model-specific rather than scale-dependent. We propose a hybrid semantic firewall that achieves 0% ASR against known injection patterns with no false positives on a preliminary benign set (0/20 commands) but exhibits a 10.2% trial-weighted bypass rate (58/570 trials; N equals 30 per payload across 19 obfuscation payloads) against adversarially obfuscated attacks, exposing a critical gap between rule-based and semantic defense layers. We further introduce three sensory injection channels: visual (Channel 1, via OCR), audio (Channel 2, via Whisper STT), and LiDAR sensor context poisoning (Channel 3). We show that Channel 3, which injects fabricated obstacle data into the robot environment-state representation at the LLM system-prompt level, achieves 100% ASR across all variants on DeepSeek-V4-Flash. We also contribute a firewall bypass taxonomy spanning 19 obfuscation payloads across five categories. All code, data, and results are publicly available.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper's main finding is that sensory prompt injections on ROS 2 LLM robots produce model-specific attack success rates that do not scale with size, plus a semantic firewall with a 10% bypass rate on obfuscated payloads.

read the letter

The central result is that Llama-3.3-70B and the 4B Gemma model both hit 100% ASR across injection variants while two 7-8B models drop to 0% on direct overrides, and LiDAR context poisoning works at 100% on DeepSeek. They also test a hybrid firewall that stops known patterns but allows 10.2% bypass on 19 obfuscated payloads.

The work is new in running the first multi-channel empirical test (OCR, Whisper STT, LiDAR) on actual ROS 2 robot state fed into LLMs, with 100 runs per variant and public code plus data. That setup lets others reproduce the non-monotonic vulnerability pattern and the firewall numbers.

The soft spot is the integration step. The abstract says sensory data goes in at the system-prompt level but does not confirm whether the template or token handling is identical across models. If any per-model formatting or truncation happens before the prompt is built, the ASR differences could be pipeline artifacts rather than model behavior. The stress-test concern lands here because the abstract leaves that unspecified.

This is for people working on embodied LLM security who need concrete attack numbers and a starting firewall to test against. It is worth sending to peer review because the empirical design is straightforward, the artifacts are released, and the scaling claim is falsifiable even if the methods section needs tightening on the prompt assembly details.

Referee Report

2 major / 2 minor

Summary. The paper introduces RIPA, the first systematic multi-channel empirical study of prompt injection attacks on LLM-controlled ROS 2 robots delivered via sensory pipelines (visual OCR, audio via Whisper STT, and LiDAR sensor context poisoning). Across 100 independent runs per injection variant on five LLMs spanning ~4B to ~284B parameters, it reports model-specific ASR profiles that do not follow monotonic scaling: 100% ASR for Llama-3.3-70B-Instruct-Turbo and Gemma-3n-E4B across variants, versus 0% ASR for Llama-3-8B-Instruct-Lite and Qwen 2.5-7B-Instruct-Turbo on direct-override; Channel 3 achieves 100% ASR on DeepSeek-V4-Flash. It proposes a hybrid semantic firewall with 0% ASR on known patterns (0/20 benign) but 10.2% bypass rate (58/570 trials) on 19 obfuscated payloads, and contributes a firewall bypass taxonomy. All code, data, and results are publicly available.

Significance. If the empirical measurements hold, the work is significant for showing that LLM robustness to sensory prompt injection in robotic systems is model-specific rather than scale-dependent, with direct implications for model selection in embodied AI. The multi-channel attack vectors and the firewall bypass taxonomy provide actionable insights into defense gaps. The public release of code, data, and results is a clear strength, supporting reproducibility and allowing independent verification of the reported ASR values and run independence.

major comments (2)

[Abstract] Abstract: The central claim that 'robustness is model-specific rather than scale-dependent' is based on the reported ASR differences (100% for Llama-3.3-70B-Instruct-Turbo and Gemma-3n-E4B vs. 0% for Llama-3-8B-Instruct-Lite and Qwen 2.5-7B-Instruct-Turbo on direct-override). However, sensory data integration is described only as occurring 'at the LLM system-prompt level' without specifying whether a single fixed template is used for all models or whether any model-dependent preprocessing, tokenization, truncation, or filtering is applied in the ROS 2 pipeline before prompt assembly. This detail is load-bearing for attributing the non-monotonic profile to intrinsic model behavior rather than pipeline artifacts.
[Abstract] Abstract (firewall evaluation): The hybrid semantic firewall is stated to achieve 0% ASR against known injection patterns with no false positives on a preliminary benign set (0/20 commands). The small benign-set size and lack of detail on how the 570 obfuscated trials were constructed (N=30 per payload across 19 payloads) make it difficult to assess whether the 10.2% bypass rate (58/570) generalizes or whether the 0% false-positive claim is robust.

minor comments (2)

[Abstract] Abstract: 'N equals 30' should be formatted consistently as N=30; similarly, parameter counts are given as 'approximately 4B' and 'approximately 284B' without exact figures or citations to model cards.
[Abstract] Abstract: The statement that 'All code, data, and results are publicly available' is a strength but would benefit from an explicit repository URL or DOI in the abstract for immediate accessibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their detailed comments on the abstract and evaluation methodology. We address each major comment below and will revise the manuscript to incorporate clarifications where appropriate.

read point-by-point responses

Referee: [Abstract] Abstract: The central claim that 'robustness is model-specific rather than scale-dependent' is based on the reported ASR differences (100% for Llama-3.3-70B-Instruct-Turbo and Gemma-3n-E4B vs. 0% for Llama-3-8B-Instruct-Lite and Qwen 2.5-7B-Instruct-Turbo on direct-override). However, sensory data integration is described only as occurring 'at the LLM system-prompt level' without specifying whether a single fixed template is used for all models or whether any model-dependent preprocessing, tokenization, truncation, or filtering is applied in the ROS 2 pipeline before prompt assembly. This detail is load-bearing for attributing the non-monotonic profile to intrinsic model behavior rather than pipeline artifacts.

Authors: The ROS 2 pipeline implements a single fixed system-prompt template for sensory data integration across all five evaluated models, with no model-dependent preprocessing, tokenization, truncation, or filtering steps. This uniform assembly process was designed to isolate differences in model behavior. We will revise the methods section (and abstract if space permits) to explicitly document the fixed template and confirm the absence of model-specific pipeline variations. revision: yes
Referee: [Abstract] Abstract (firewall evaluation): The hybrid semantic firewall is stated to achieve 0% ASR against known injection patterns with no false positives on a preliminary benign set (0/20 commands). The small benign-set size and lack of detail on how the 570 obfuscated trials were constructed (N=30 per payload across 19 payloads) make it difficult to assess whether the 10.2% bypass rate (58/570) generalizes or whether the 0% false-positive claim is robust.

Authors: The 570 trials comprise 19 obfuscation payloads (grouped into the five-category taxonomy presented in the paper), each run for 30 independent trials. The benign set of 20 commands is labeled preliminary in the manuscript. We will expand the firewall evaluation section to detail the payload construction process and taxonomy categories, and we will explicitly note the preliminary nature of the benign-set results as a limitation. No changes to the reported rates are required. revision: yes

Circularity Check

0 steps flagged

No circularity: purely empirical measurements with no derivations or self-referential reductions

full rationale

The paper reports an empirical study of prompt injection attacks on LLM-controlled robots via ROS 2 sensory channels. It measures attack success rates across models and injection variants through direct experimentation (100 runs per variant), proposes a firewall based on observed results, and contributes a taxonomy from the same experiments. No equations, fitted parameters, predictions derived from inputs, or load-bearing self-citations appear in the provided text or abstract. The central claims (model-specific vulnerability profiles, firewall performance) rest on raw experimental outcomes rather than any reduction to prior results by construction. This matches the default expectation for non-circular empirical work.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

This is an empirical security evaluation relying on standard robotics and LLM components with no mathematical derivations, free parameters, or new postulated entities.

pith-pipeline@v0.9.1-grok · 5932 in / 1224 out tokens · 35839 ms · 2026-06-30T09:26:04.165390+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

16 extracted references · 13 canonical work pages · 5 internal anchors

[1]

Ignore Previous Prompt: Attack Techniques For Language Models

F. Perez and I. Ribeiro, “Ignore previous prompt: Attack techniques for language models,” inNeurIPS ML Safety Workshop, 2022. [Online]. Available: https://doi.org/10.48550/arXiv.2211.09527 10

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2211.09527 2022
[3]

Jailbreaking LLM-controlled robots,

A. Robey, Z. Ravichandran, V . Kumar, H. Hassani, and G. J. Pappas, “Jailbreaking LLM-controlled robots,” arXiv preprint, 2024. [Online]. Available: https://doi.org/10.48550/arXiv.2410.13691

work page doi:10.48550/arxiv.2410.13691 2024
[4]

BadRobot: Jailbreaking Embodied LLM Agents in the Physical World

H. Zhang, C. Zhu, X. Wang, Z. Zhou,et al., “BadRobot: Jailbreaking embodied LLMs in the physical world,” inProc. ICLR, 2025. [Online]. Available: https://doi.org/10.48550/arXiv.2407.20242

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2407.20242 2025
[5]

Prompt injection attack against LLM-integrated mobile robotic systems,

W. Zhang, X. Kong, C. Dewitt, T. Braunl, and J. B. Hong, “Prompt injection attack against LLM-integrated mobile robotic systems,” inProc. IEEE 35th Int. Symp. Softw. Reliability Eng. Workshops (ISSREW), 2024. [Online]. Available: https://doi.org/10.1109/ISSREW63542.2024.00103

work page doi:10.1109/issrew63542.2024.00103 2024
[8]

From prompt to physical action: Structured backdoor attacks on LLM-mediated robotic control systems,

M. Xie and J. Wei-Kocsis, “From prompt to physical action: Structured backdoor attacks on LLM-mediated robotic control systems,” arXiv preprint, 2026. [Online]. Available: https://doi.org/10.48550/arXiv.2604. 03890

work page doi:10.48550/arxiv.2604 2026
[9]

AgentDojo: A dynamic environment for evaluating prompt injection attacks and defenses for LLM agents,

E. Debenedetti, J. Zhang, M. Balunovic, L. Beurer-Kellner, M. Fischer, and M. Vechev, “AgentDojo: A dynamic environment for evaluating prompt injection attacks and defenses for LLM agents,” inProc. NeurIPS (Datasets and Benchmarks Track), 2024. [Online]. Available: https://doi. org/10.52202/079017-2636

work page doi:10.52202/079017-2636 2024
[10]

Training language models to follow instructions with human feedback,

L. Ouyang, J. Wu, X. Jiang, D. Almeida,et al., “Training language models to follow instructions with human feedback,” inProc. NeurIPS,
[11]

Available: https://doi.org/10.52202/068431-2011

[Online]. Available: https://doi.org/10.52202/068431-2011

work page doi:10.52202/068431-2011 2011
[12]

Robot Operating System 2: Design, architecture, and uses in the wild,

S. Macenski, T. Foote, B. Gerkey, C. Lalancette, and W. Woodall, “Robot Operating System 2: Design, architecture, and uses in the wild,”Science Robotics, vol. 7, no. 66, p. eabm6074, 2022. [Online]. Available: https: //doi.org/10.1126/scirobotics.abm6074

work page doi:10.1126/scirobotics.abm6074 2022
[13]

Robust Speech Recognition via Large-Scale Weak Supervision

A. Radford, J. W. Kim, T. Xu, G. Brockman, C. McLeavey, and I. Sutskever, “Robust speech recognition via large-scale weak supervision,” inProc. ICML, 2023. [Online]. Available: https://doi.org/10.48550/arXiv. 2212.04356

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv 2023
[14]

Towards robust and secure embodied AI: A survey on vulnerabilities and attacks,

W. Xing, M. Li, M. Li, and M. Han, “Towards robust and secure embodied AI: A survey on vulnerabilities and attacks,”ACM Computing Surveys,
[15]

Available: https://doi.org/10.1145/3806048

[Online]. Available: https://doi.org/10.1145/3806048

work page doi:10.1145/3806048
[16]

Propagating Unsafe Actions in LLM Controlled Multi-Robot Collaboration via Single Robot Compromise

Z. Huang, Z. Liu, M. Luo, W. Wu, and Z. Cai, “Propagating unsafe actions in LLM controlled multi-robot collaboration via single robot compromise,” arXiv preprint, 2026. [Online]. Available: https://doi.org/ 10.48550/arXiv.2605.15641

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2605.15641 2026
[17]

SafeEmbodAI: A safety framework for mobile robots in embodied AI systems,

W. Zhang, X. Kong, T. Braunl, and J. B. Hong, “SafeEmbodAI: A safety framework for mobile robots in embodied AI systems,” arXiv preprint,
[18]

Available: https://doi.org/10.48550/arXiv.2409.01630

[Online]. Available: https://doi.org/10.48550/arXiv.2409.01630

work page doi:10.48550/arxiv.2409.01630
[19]

Prompt Injection Attack to Tool Selection in LLM Agents

J. Shi, Z. Yuan, G. Tie, P. Zhou, N. Z. Gong, and L. Sun, “Prompt injection attack to tool selection in LLM agents,” inProc. NDSS, 2026. [Online]. Available: https://doi.org/10.48550/arXiv.2504.19793 11 Appendix A Technology Stack Component Version / Details OS Ubuntu 24.04 (WSL2 on Windows 11) ROS ROS 2 Jazzy Simulator Gazebo Harmonic Robot TurtleBot3 Wa...

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2504.19793 2026

[1] [1]

Ignore Previous Prompt: Attack Techniques For Language Models

F. Perez and I. Ribeiro, “Ignore previous prompt: Attack techniques for language models,” inNeurIPS ML Safety Workshop, 2022. [Online]. Available: https://doi.org/10.48550/arXiv.2211.09527 10

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2211.09527 2022

[2] [3]

Jailbreaking LLM-controlled robots,

A. Robey, Z. Ravichandran, V . Kumar, H. Hassani, and G. J. Pappas, “Jailbreaking LLM-controlled robots,” arXiv preprint, 2024. [Online]. Available: https://doi.org/10.48550/arXiv.2410.13691

work page doi:10.48550/arxiv.2410.13691 2024

[3] [4]

BadRobot: Jailbreaking Embodied LLM Agents in the Physical World

H. Zhang, C. Zhu, X. Wang, Z. Zhou,et al., “BadRobot: Jailbreaking embodied LLMs in the physical world,” inProc. ICLR, 2025. [Online]. Available: https://doi.org/10.48550/arXiv.2407.20242

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2407.20242 2025

[4] [5]

Prompt injection attack against LLM-integrated mobile robotic systems,

W. Zhang, X. Kong, C. Dewitt, T. Braunl, and J. B. Hong, “Prompt injection attack against LLM-integrated mobile robotic systems,” inProc. IEEE 35th Int. Symp. Softw. Reliability Eng. Workshops (ISSREW), 2024. [Online]. Available: https://doi.org/10.1109/ISSREW63542.2024.00103

work page doi:10.1109/issrew63542.2024.00103 2024

[5] [8]

From prompt to physical action: Structured backdoor attacks on LLM-mediated robotic control systems,

M. Xie and J. Wei-Kocsis, “From prompt to physical action: Structured backdoor attacks on LLM-mediated robotic control systems,” arXiv preprint, 2026. [Online]. Available: https://doi.org/10.48550/arXiv.2604. 03890

work page doi:10.48550/arxiv.2604 2026

[6] [9]

AgentDojo: A dynamic environment for evaluating prompt injection attacks and defenses for LLM agents,

E. Debenedetti, J. Zhang, M. Balunovic, L. Beurer-Kellner, M. Fischer, and M. Vechev, “AgentDojo: A dynamic environment for evaluating prompt injection attacks and defenses for LLM agents,” inProc. NeurIPS (Datasets and Benchmarks Track), 2024. [Online]. Available: https://doi. org/10.52202/079017-2636

work page doi:10.52202/079017-2636 2024

[7] [10]

Training language models to follow instructions with human feedback,

L. Ouyang, J. Wu, X. Jiang, D. Almeida,et al., “Training language models to follow instructions with human feedback,” inProc. NeurIPS,

[8] [11]

Available: https://doi.org/10.52202/068431-2011

[Online]. Available: https://doi.org/10.52202/068431-2011

work page doi:10.52202/068431-2011 2011

[9] [12]

Robot Operating System 2: Design, architecture, and uses in the wild,

S. Macenski, T. Foote, B. Gerkey, C. Lalancette, and W. Woodall, “Robot Operating System 2: Design, architecture, and uses in the wild,”Science Robotics, vol. 7, no. 66, p. eabm6074, 2022. [Online]. Available: https: //doi.org/10.1126/scirobotics.abm6074

work page doi:10.1126/scirobotics.abm6074 2022

[10] [13]

Robust Speech Recognition via Large-Scale Weak Supervision

A. Radford, J. W. Kim, T. Xu, G. Brockman, C. McLeavey, and I. Sutskever, “Robust speech recognition via large-scale weak supervision,” inProc. ICML, 2023. [Online]. Available: https://doi.org/10.48550/arXiv. 2212.04356

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv 2023

[11] [14]

Towards robust and secure embodied AI: A survey on vulnerabilities and attacks,

W. Xing, M. Li, M. Li, and M. Han, “Towards robust and secure embodied AI: A survey on vulnerabilities and attacks,”ACM Computing Surveys,

[12] [15]

Available: https://doi.org/10.1145/3806048

[Online]. Available: https://doi.org/10.1145/3806048

work page doi:10.1145/3806048

[13] [16]

Propagating Unsafe Actions in LLM Controlled Multi-Robot Collaboration via Single Robot Compromise

Z. Huang, Z. Liu, M. Luo, W. Wu, and Z. Cai, “Propagating unsafe actions in LLM controlled multi-robot collaboration via single robot compromise,” arXiv preprint, 2026. [Online]. Available: https://doi.org/ 10.48550/arXiv.2605.15641

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2605.15641 2026

[14] [17]

SafeEmbodAI: A safety framework for mobile robots in embodied AI systems,

W. Zhang, X. Kong, T. Braunl, and J. B. Hong, “SafeEmbodAI: A safety framework for mobile robots in embodied AI systems,” arXiv preprint,

[15] [18]

Available: https://doi.org/10.48550/arXiv.2409.01630

[Online]. Available: https://doi.org/10.48550/arXiv.2409.01630

work page doi:10.48550/arxiv.2409.01630

[16] [19]

Prompt Injection Attack to Tool Selection in LLM Agents

J. Shi, Z. Yuan, G. Tie, P. Zhou, N. Z. Gong, and L. Sun, “Prompt injection attack to tool selection in LLM agents,” inProc. NDSS, 2026. [Online]. Available: https://doi.org/10.48550/arXiv.2504.19793 11 Appendix A Technology Stack Component Version / Details OS Ubuntu 24.04 (WSL2 on Windows 11) ROS ROS 2 Jazzy Simulator Gazebo Harmonic Robot TurtleBot3 Wa...

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2504.19793 2026