Proposes tail-performance objectives and the TWLO-MDP reformulation solved via RL for multi-robot monitoring, with theoretical properties and experimental outperformance on a new benchmark.
Therefore, ˜πis an optimal strategy starting fromp 0, which proves that for allT > D G + J ∗ ϕmin , Π∗ T ∩Π p0 ̸=∅
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.RO 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Minimizing Worst-Case Weighted Latency for Multi-Robot Persistent Monitoring: Theory and RL-Based Solutions
Proposes tail-performance objectives and the TWLO-MDP reformulation solved via RL for multi-robot monitoring, with theoretical properties and experimental outperformance on a new benchmark.