Real-world learning control for autonomous exploration of a biomimetic robotic shark

2022

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

M$^{2}$GRPO: Mamba-based Multi-Agent Group Relative Policy Optimization for Biomimetic Underwater Robots Pursuit

cs.RO · 2026-04-21 · unverdicted · novelty 6.0

M²GRPO uses a Mamba-based policy and normalized group-relative advantages under CTDE to achieve higher pursuit success and capture efficiency than MAPPO and recurrent baselines in simulations and pool tests.

citing papers explorer

M$^{2}$GRPO: Mamba-based Multi-Agent Group Relative Policy Optimization for Biomimetic Underwater Robots Pursuit

cs.RO · 2026-04-21 · unverdicted · none · ref 30
