High-Level Multi-Robot Trajectory Planning And Spurious Behavior Detection

Cristian Mahulea; Eduardo Montijano; Fernando Salanova; Jes\'us Roche

arxiv: 2510.17261 · v2 · submitted 2025-10-20 · 💻 cs.RO · cs.LG

High-Level Multi-Robot Trajectory Planning And Spurious Behavior Detection

Fernando Salanova , Jes\'us Roche , Cristian Mahulea , Eduardo Montijano This is my paper

Pith reviewed 2026-05-18 06:37 UTC · model grok-4.3

classification 💻 cs.RO cs.LG

keywords multi-robot systemsanomaly detectionLinear Temporal LogicNets-within-NetsTransformertrajectory classificationmission monitoringspurious behavior

0 comments

The pith

A Nets-within-Nets data generator paired with a Transformer classifier detects spurious executions of LTL-specified multi-robot missions.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a way to create synthetic examples of correct and incorrect robot trajectories that follow or break high-level Linear Temporal Logic mission rules. It then trains a Transformer model to label new trajectories as normal or anomalous based on patterns in task order, timing, and spatial constraints. This matters because multi-robot teams often fail missions through subtle deviations that are hard to spot manually, and catching them automatically would let operators intervene before resources are wasted. Experiments on the generated data show the pipeline works for several categories of problems without needing large amounts of real faulty runs.

Core claim

The authors present a structured data generation framework based on the Nets-within-Nets paradigm, which coordinates robot actions with Linear Temporal Logic-derived global mission specifications. They pair this with a Transformer-based anomaly detection pipeline that classifies robot trajectories as normal or anomalous. Experimental evaluations demonstrate that this method achieves 91.3% accuracy in identifying execution inefficiencies, 88.3% for core mission violations, and 66.8% for constraint-based adaptive anomalies, with ablation studies confirming the superiority of the proposed architecture over simpler representations.

What carries the argument

The Transformer-based anomaly detection pipeline trained on synthetic trajectories produced by the Nets-within-Nets coordination of LTL mission specifications.

If this is right

The pipeline can flag execution inefficiencies at 91.3 percent accuracy, enabling earlier correction during live missions.
Core mission violations are caught at 88.3 percent accuracy, supporting safer coordination among heterogeneous robots.
Constraint-based adaptive anomalies are identified at 66.8 percent accuracy, extending coverage beyond simple sequence errors.
Ablation results show the chosen embedding and architecture outperform simpler trajectory representations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The detection system could be coupled directly to an online LTL planner so that robots receive alerts and replan upon anomaly flags.
Training on synthetic data alone may allow deployment in new mission types without first collecting real failure examples.
The same generation-plus-classification pattern could be tested on single-robot or non-LTL task sets to check broader applicability.

Load-bearing premise

The synthetic trajectories and injected anomalies generated by the Nets-within-Nets framework are sufficiently representative of real-world spurious behaviors that a model trained on them will generalize to physical robot executions.

What would settle it

Deploying the trained classifier on trajectories collected from physical robots running the same LTL missions with deliberately introduced timing or spatial violations and measuring whether detection rates fall well below the reported figures.

Figures

Figures reproduced from arXiv: 2510.17261 by Cristian Mahulea, Eduardo Montijano, Fernando Salanova, Jes\'us Roche.

**Figure 2.** Figure 2: Visualization of designed scenario and low-level tra [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: Overview of the Transformer-based network archi [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

read the original abstract

The reliable execution of high-level missions in multi-robot systems with heterogeneous agents, requires robust methods for detecting spurious behaviors. In this paper, we address the challenge of identifying spurious executions of plans specified as a Linear Temporal Logic (LTL) formula, as incorrect task sequences, violations of spatial constraints, timing inconsistencies, or deviations from intended mission semantics. To tackle this, we introduce a structured data generation framework based on the Nets-within-Nets (NWN) paradigm, which coordinates robot actions with LTL-derived global mission specifications. We further propose a Transformer-based anomaly detection pipeline that classifies robot trajectories as normal or anomalous. Experimental evaluations show that our method achieves high accuracy (91.3%) in identifying execution inefficiencies, and demonstrates robust detection capabilities for core mission violations (88.3%) and constraint-based adaptive anomalies (66.8%). An ablation experiment of the embedding and architecture was carried out, obtaining successful results where our novel proposition performs better than simpler representations.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper pairs Nets-within-Nets synthetic data generation with a Transformer to flag LTL anomalies in multi-robot trajectories and reports concrete accuracies, but everything stays inside the simulator with no hardware validation.

read the letter

The main thing to know is that the authors generate labeled trajectories inside a Nets-within-Nets model that respects an LTL mission spec, then train a Transformer to classify normal runs versus three kinds of anomalies. They get 91.3% on inefficiencies, 88.3% on core mission violations, and 66.8% on constraint-based adaptive anomalies, plus an ablation that shows their embedding beats simpler alternatives.

Referee Report

2 major / 1 minor

Summary. The paper introduces a Nets-within-Nets (NWN) paradigm for generating synthetic multi-robot trajectories coordinated with LTL-derived global mission specifications, and proposes a Transformer-based anomaly detection pipeline to classify trajectories as normal or anomalous. It reports empirical accuracies of 91.3% for identifying execution inefficiencies, 88.3% for core mission violations, and 66.8% for constraint-based adaptive anomalies on held-out synthetic data, along with an ablation study claiming superior performance of the proposed embedding and architecture over simpler representations.

Significance. If the results hold under proper validation, the work offers a structured approach to data generation for high-level multi-robot planning that aligns synthetic anomalies with formal LTL specifications, which is a constructive contribution. The empirical evaluation on LTL-derived anomalies and the ablation experiment are strengths that could support further development of detection methods in robotics. However, the significance is limited by the exclusive reliance on synthetic data without demonstrated generalization.

major comments (2)

Abstract / Experimental Evaluations: The manuscript reports specific accuracy figures (91.3% for execution inefficiencies, 88.3% for core mission violations, 66.8% for constraint-based adaptive anomalies) but provides no details on dataset size, train-test split, number of trajectories generated, baseline comparisons, or statistical significance testing. Without these, it is impossible to verify that the numbers support the central claim of robust detection capabilities.
Data Generation and Evaluation sections: The evaluation relies entirely on synthetic trajectories generated inside the NWN framework with anomalies deliberately injected according to LTL-derived rules. There is no cross-validation against hardware traces or argument that the NWN anomaly model captures relevant real-world failure modes (e.g., sensor noise, actuator lag, unmodeled dynamics, or communication dropouts), leaving the generalization step to physical robot executions untested and load-bearing for the practical claims.

minor comments (1)

Abstract: The phrasing 'An ablation experiment of the embedding and architecture was carried out, obtaining successful results where our novel proposition performs better than simpler representations' is vague and should explicitly state the metrics used and the quantitative improvements observed.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the constructive feedback on experimental details and generalization. We have revised the manuscript to improve reproducibility and to explicitly discuss the scope and limitations of the synthetic evaluation.

read point-by-point responses

Referee: Abstract / Experimental Evaluations: The manuscript reports specific accuracy figures (91.3% for execution inefficiencies, 88.3% for core mission violations, 66.8% for constraint-based adaptive anomalies) but provides no details on dataset size, train-test split, number of trajectories generated, baseline comparisons, or statistical significance testing. Without these, it is impossible to verify that the numbers support the central claim of robust detection capabilities.

Authors: We agree that these details are necessary for verification. The revised manuscript now includes the full experimental protocol: 15,000 total trajectories (10,000 normal, 5,000 anomalous across the three categories), an 80/20 train-test split with 5-fold cross-validation, explicit baseline comparisons (LSTM, SVM, and random forest), and statistical significance results (paired t-tests with p < 0.01 for the reported accuracy gains). revision: yes
Referee: Data Generation and Evaluation sections: The evaluation relies entirely on synthetic trajectories generated inside the NWN framework with anomalies deliberately injected according to LTL-derived rules. There is no cross-validation against hardware traces or argument that the NWN anomaly model captures relevant real-world failure modes (e.g., sensor noise, actuator lag, unmodeled dynamics, or communication dropouts), leaving the generalization step to physical robot executions untested and load-bearing for the practical claims.

Authors: We acknowledge the exclusive use of synthetic data as a deliberate design choice that enables precise, LTL-aligned anomaly injection. The revised Evaluation section now contains an explicit mapping of each anomaly class to plausible real-world equivalents (e.g., timing violations to actuator lag, spatial constraint breaches to localization errors). A new Limitations subsection states that hardware validation lies outside the present scope and is planned for future work. revision: partial

standing simulated objections not resolved

Absence of any hardware traces or physical-robot experiments to demonstrate generalization beyond the synthetic NWN setting.

Circularity Check

0 steps flagged

No significant circularity; empirical accuracies are independent experimental outcomes

full rationale

The paper introduces a Nets-within-Nets data generation framework to produce synthetic trajectories with LTL-derived anomalies, then trains and evaluates a Transformer classifier on held-out portions of that data. The headline performance numbers (91.3 % inefficiency detection, 88.3 % mission-violation detection, 66.8 % adaptive-anomaly detection) are reported as direct empirical measurements on test splits, not as quantities that are algebraically or statistically forced by the training procedure itself. No equations, self-citations, or uniqueness theorems appear in the provided text that would reduce the central claims to definitions or prior author results; the ablation study further supplies an external benchmark. Consequently the derivation chain remains self-contained against the synthetic-data benchmark and does not exhibit any of the enumerated circularity patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that synthetic anomalies generated from LTL formulas via Nets-within-Nets faithfully represent the distribution of real spurious behaviors; no free parameters or invented physical entities are introduced beyond standard neural-network hyperparameters.

axioms (1)

domain assumption Nets-within-Nets can be used to coordinate robot actions with LTL-derived global mission specifications
Invoked in the data-generation framework description; treated as a given modeling choice rather than derived.

pith-pipeline@v0.9.0 · 5703 in / 1313 out tokens · 23414 ms · 2026-05-18T06:37:02.479301+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean J_uniquely_calibrated_via_higher_derivative unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Transformer-based anomaly detection pipeline... action-token encoding... sinusoidal positional embedding... max pooling... MLP classifier

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

16 extracted references · 16 canonical work pages

[1]

Collaborative multi-robot systems for search and rescue: Coordination and perception,

J. P. Queralta, J. Taipalmaa, B. C. Pullinen, V . K. Sarker, T. N. Gia, H. Tenhunen, M. Gabbouj, J. Raitoharju, and T. Westerlund, “Collaborative multi-robot systems for search and rescue: Coordination and perception,”CoRR, 2020

work page 2020
[2]

Anytime Motion Planning using the RRT*,

S. Karaman, M. R. Walter, A. Perez, E. Frazzoli, and S. Teller, “Anytime Motion Planning using the RRT*,” inIEEE Int. Conf. on Robotics and Automation, 2011, pp. 1478–1483

work page 2011
[3]

Path plan- ning and trajectory planning algorithms: A general overview,

A. Gasparetto, P. Boscariol, A. Lanzutti, and R. Vidoni, “Path plan- ning and trajectory planning algorithms: A general overview,”Motion and operation planning of robotic systems: Background and practical approaches, pp. 3–27, 2015

work page 2015
[4]

A Petri net based approach for multi-robot path planning,

M. Kloetzer and C. Mahulea, “A Petri net based approach for multi-robot path planning,”Discrete Event Dynamic Sys, vol. 24, pp. 417–445, 2014

work page 2014
[5]

MT*: Multi-robot path planning for temporal logic specifications,

D. Gujarathi and I. Saha, “MT*: Multi-robot path planning for temporal logic specifications,” inIEEE/RSJ Int. Conf. on Intelligent Robots and Systems, 2022, pp. 13 692–13 699

work page 2022
[6]

A novel framework for anomaly detection of robot behaviors,

K. H ¨aussermann, O. Zweigle, and P. Levi, “A novel framework for anomaly detection of robot behaviors,”Journal of Intelligent & Robotic Systems, vol. 77, pp. 361–375, 2015

work page 2015
[7]

Multimodal anomaly detection for assistive robots,

D. Park, H. Kim, and C. C. Kemp, “Multimodal anomaly detection for assistive robots,”Autonomous Robots, vol. 43, pp. 611–629, 2019

work page 2019
[8]

An Online Planning Framework for Multi-Robot Systems with LTL Specification,

R. Singh and I. Saha, “An Online Planning Framework for Multi-Robot Systems with LTL Specification,” inACM/IEEE 15th Int. Conf. on Cyber-Physical Systems, 2024, pp. 180–191

work page 2024
[9]

Path planning of multi- robot systems with boolean specifications based on simulated annealing,

W. Shi, Z. He, W. Tang, W. Liu, and Z. Ma, “Path planning of multi- robot systems with boolean specifications based on simulated annealing,” IEEE Robotics and Automation Letters, vol. 7, pp. 6091–6098, 04 2022

work page 2022
[10]

Anomaly detection methods in autonomous robotic missions,

S. Chirayil Nandakumar, D. Mitchell, M. S. Erden, D. Flynn, and T. Lim, “Anomaly detection methods in autonomous robotic missions,”Sensors, vol. 24, no. 4, p. 1330, 2024

work page 2024
[11]

Enhanc- ing multi-robot systems cooperation through machine learning-based anomaly detection in target pursuit,

A. Khatib, O. Hamed, M. Hamlich, and A. Mouchtachi, “Enhanc- ing multi-robot systems cooperation through machine learning-based anomaly detection in target pursuit,”Journal of Robotics and Control (JRC), vol. 5, no. 3, pp. 893–901, 2024

work page 2024
[12]

A framework of abnormal behavior detection and classification based on big trajectory data for mobile networks,

H. Zhang, Y . Luo, Q. Yu, L. Sun, X. Li, and Z. Sun, “A framework of abnormal behavior detection and classification based on big trajectory data for mobile networks,”Security and Communication Networks, 2020

work page 2020
[13]

Multi-robot motion planning based on nets-within-nets modeling and simulation,

S. Hustiu, E. Robillard, J. Ezpeleta, C. Mahulea, and M. Kloetzer, “Multi-robot motion planning based on nets-within-nets modeling and simulation,” 2025, manuscript under review

work page 2025
[14]

An extensible editor and simulation engine for Petri nets: Renew,

O. Kummer, F. Wienberg, M. Duvigneau, J. Schumacher, M. K ¨ohler, D. Moldt, H. R ¨olke, and R. Valk, “An extensible editor and simulation engine for Petri nets: Renew,” inApplications and Theory of Petri Nets 2004: Bologna, Italy, June 21–25, 2004, 2004, pp. 484–493

work page 2004
[15]

Vmas: A vectorized multi-agent simulator for collective robot learning,

M. Bettini, R. Kortvelesy, J. Blumenkamp, and A. Prorok, “Vmas: A vectorized multi-agent simulator for collective robot learning,” in International Symposium on Distributed Autonomous Robotic Systems. Springer, 2022, pp. 42–56

work page 2022
[16]

(2025) Project sites

Fernando Salanova. (2025) Project sites. Accessed: Oct. 15, 2025. [Online]. Available: https://sites.google.com/unizar.es/ highlevel-multirobot-spurious

work page 2025

[1] [1]

Collaborative multi-robot systems for search and rescue: Coordination and perception,

J. P. Queralta, J. Taipalmaa, B. C. Pullinen, V . K. Sarker, T. N. Gia, H. Tenhunen, M. Gabbouj, J. Raitoharju, and T. Westerlund, “Collaborative multi-robot systems for search and rescue: Coordination and perception,”CoRR, 2020

work page 2020

[2] [2]

Anytime Motion Planning using the RRT*,

S. Karaman, M. R. Walter, A. Perez, E. Frazzoli, and S. Teller, “Anytime Motion Planning using the RRT*,” inIEEE Int. Conf. on Robotics and Automation, 2011, pp. 1478–1483

work page 2011

[3] [3]

Path plan- ning and trajectory planning algorithms: A general overview,

A. Gasparetto, P. Boscariol, A. Lanzutti, and R. Vidoni, “Path plan- ning and trajectory planning algorithms: A general overview,”Motion and operation planning of robotic systems: Background and practical approaches, pp. 3–27, 2015

work page 2015

[4] [4]

A Petri net based approach for multi-robot path planning,

M. Kloetzer and C. Mahulea, “A Petri net based approach for multi-robot path planning,”Discrete Event Dynamic Sys, vol. 24, pp. 417–445, 2014

work page 2014

[5] [5]

MT*: Multi-robot path planning for temporal logic specifications,

D. Gujarathi and I. Saha, “MT*: Multi-robot path planning for temporal logic specifications,” inIEEE/RSJ Int. Conf. on Intelligent Robots and Systems, 2022, pp. 13 692–13 699

work page 2022

[6] [6]

A novel framework for anomaly detection of robot behaviors,

K. H ¨aussermann, O. Zweigle, and P. Levi, “A novel framework for anomaly detection of robot behaviors,”Journal of Intelligent & Robotic Systems, vol. 77, pp. 361–375, 2015

work page 2015

[7] [7]

Multimodal anomaly detection for assistive robots,

D. Park, H. Kim, and C. C. Kemp, “Multimodal anomaly detection for assistive robots,”Autonomous Robots, vol. 43, pp. 611–629, 2019

work page 2019

[8] [8]

An Online Planning Framework for Multi-Robot Systems with LTL Specification,

R. Singh and I. Saha, “An Online Planning Framework for Multi-Robot Systems with LTL Specification,” inACM/IEEE 15th Int. Conf. on Cyber-Physical Systems, 2024, pp. 180–191

work page 2024

[9] [9]

Path planning of multi- robot systems with boolean specifications based on simulated annealing,

W. Shi, Z. He, W. Tang, W. Liu, and Z. Ma, “Path planning of multi- robot systems with boolean specifications based on simulated annealing,” IEEE Robotics and Automation Letters, vol. 7, pp. 6091–6098, 04 2022

work page 2022

[10] [10]

Anomaly detection methods in autonomous robotic missions,

S. Chirayil Nandakumar, D. Mitchell, M. S. Erden, D. Flynn, and T. Lim, “Anomaly detection methods in autonomous robotic missions,”Sensors, vol. 24, no. 4, p. 1330, 2024

work page 2024

[11] [11]

Enhanc- ing multi-robot systems cooperation through machine learning-based anomaly detection in target pursuit,

A. Khatib, O. Hamed, M. Hamlich, and A. Mouchtachi, “Enhanc- ing multi-robot systems cooperation through machine learning-based anomaly detection in target pursuit,”Journal of Robotics and Control (JRC), vol. 5, no. 3, pp. 893–901, 2024

work page 2024

[12] [12]

A framework of abnormal behavior detection and classification based on big trajectory data for mobile networks,

H. Zhang, Y . Luo, Q. Yu, L. Sun, X. Li, and Z. Sun, “A framework of abnormal behavior detection and classification based on big trajectory data for mobile networks,”Security and Communication Networks, 2020

work page 2020

[13] [13]

Multi-robot motion planning based on nets-within-nets modeling and simulation,

S. Hustiu, E. Robillard, J. Ezpeleta, C. Mahulea, and M. Kloetzer, “Multi-robot motion planning based on nets-within-nets modeling and simulation,” 2025, manuscript under review

work page 2025

[14] [14]

An extensible editor and simulation engine for Petri nets: Renew,

O. Kummer, F. Wienberg, M. Duvigneau, J. Schumacher, M. K ¨ohler, D. Moldt, H. R ¨olke, and R. Valk, “An extensible editor and simulation engine for Petri nets: Renew,” inApplications and Theory of Petri Nets 2004: Bologna, Italy, June 21–25, 2004, 2004, pp. 484–493

work page 2004

[15] [15]

Vmas: A vectorized multi-agent simulator for collective robot learning,

M. Bettini, R. Kortvelesy, J. Blumenkamp, and A. Prorok, “Vmas: A vectorized multi-agent simulator for collective robot learning,” in International Symposium on Distributed Autonomous Robotic Systems. Springer, 2022, pp. 42–56

work page 2022

[16] [16]

(2025) Project sites

Fernando Salanova. (2025) Project sites. Accessed: Oct. 15, 2025. [Online]. Available: https://sites.google.com/unizar.es/ highlevel-multirobot-spurious

work page 2025