Adversarial Stress Testing of SPARK Humanoid Safety Filters
Pith reviewed 2026-05-20 09:24 UTC · model grok-4.3
The pith
Stress testing shows humanoid safety filters behave differently under crowding, noise, and delays.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By replicating the G1SportMode_D1_WG_SO_v1 benchmark case in MuJoCo and applying controlled stress tests, the authors show that safety filters for humanoids exhibit varying performance, with some methods tracking goals more closely and others reducing collision steps more effectively, and that their behavior changes under obstacle crowding, noisy distance estimates, and delayed obstacle information.
What carries the argument
The post-processing pipeline converting raw SPARK logs into goal-tracking, minimum-distance, and collision-step metrics under stress conditions.
Load-bearing premise
That the MuJoCo simulation and the selected metrics for goal-tracking, minimum-distance, and collision-steps sufficiently represent real-world humanoid safety performance and failure modes.
What would settle it
Testing the same safety filters on a physical humanoid robot in environments with crowded obstacles, added sensor noise, or communication delays and comparing the resulting collision rates and goal tracking to the simulation results.
Figures
read the original abstract
Humanoid robots are difficult to deploy safely because they have high-dimensional bodies, many collision constraints, and must operate near people and obstacles. Safety filters help by modifying a nominal control action when it may violate collision-avoidance constraints. Still, nominal benchmark scores do not fully show how these filters behave in harder environments. In this work, we study the robustness of SPARK humanoid safety filters through replication and stress testing. We replicate the SPARK benchmark case G1SportMode_D1_WG_SO_v1 in MuJoCo and evaluate RSSA, RSSS, SSA, CBF, PFM, and SMA under controlled random seeds. We also built a post-processing pipeline that converts raw SPARK logs into goal-tracking, minimum-distance, and collision-step metrics. Our results show that some methods track the goal more closely, while others reduce collision steps more effectively. The stress tests further indicate that safety behavior can change under obstacle crowding, noisy distance estimates, and delayed obstacle information. These findings suggest that humanoid autonomy should be evaluated beyond nominal performance, using metrics that expose failure modes before deployment.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper replicates the SPARK benchmark case G1SportMode_D1_WG_SO_v1 in MuJoCo and evaluates six safety filters (RSSA, RSSS, SSA, CBF, PFM, SMA) under nominal conditions and stress tests involving obstacle crowding, noisy distance estimates, and delayed obstacle information. A post-processing pipeline converts SPARK logs into goal-tracking, minimum-distance, and collision-step metrics; results indicate performance trade-offs across methods and sensitivity of safety behavior to the stress conditions.
Significance. If the MuJoCo results and chosen metrics are representative, the work usefully demonstrates that nominal benchmark scores are insufficient for humanoid safety filters and that stress testing can expose differential robustness among RSSA/RSSS/SSA/CBF/PFM/SMA. This aligns with the need for more rigorous evaluation before deployment near humans and obstacles.
major comments (2)
- [Abstract] Abstract and results section: comparative outcomes and metric shifts are reported without statistical details, error bars, number of runs, or full method specifications for the post-processing pipeline; this makes it impossible to determine whether observed changes under stress tests are statistically reliable or affected by post-hoc choices.
- [Simulation setup] Simulation setup and metrics section: the central claim that safety behavior changes under obstacle crowding, noisy distance estimates, and delayed information rests on the unvalidated assumption that MuJoCo faithfully reproduces humanoid dynamics, contact forces, and sensor effects, and that the post-processed goal-tracking/minimum-distance/collision-step metrics capture relevant real-world failure modes; without sensitivity analysis or real-robot validation, the stress-test findings risk being simulation-specific artifacts.
minor comments (2)
- [Methods] The description of the six safety filters would benefit from a concise comparison table listing their core mechanisms and any implementation differences from the original SPARK paper.
- [Stress tests] Clarify whether the random seeds for stress-test perturbations are fixed across all methods or independently sampled, as this affects reproducibility of the comparative results.
Simulated Author's Rebuttal
We thank the referee for their constructive feedback. We address each major comment below and indicate the revisions made to the manuscript.
read point-by-point responses
-
Referee: [Abstract] Abstract and results section: comparative outcomes and metric shifts are reported without statistical details, error bars, number of runs, or full method specifications for the post-processing pipeline; this makes it impossible to determine whether observed changes under stress tests are statistically reliable or affected by post-hoc choices.
Authors: We agree that additional statistical details are needed for interpretability. The original manuscript referenced controlled random seeds but did not report the exact number of runs or variability measures. In revision we now specify that all metrics are computed over 20 independent runs with distinct seeds, include error bars (standard deviation) on the figures, and provide a complete specification of the post-processing pipeline including every parameter and filtering step. These additions allow readers to evaluate the reliability of the reported shifts. revision: yes
-
Referee: [Simulation setup] Simulation setup and metrics section: the central claim that safety behavior changes under obstacle crowding, noisy distance estimates, and delayed information rests on the unvalidated assumption that MuJoCo faithfully reproduces humanoid dynamics, contact forces, and sensor effects, and that the post-processed goal-tracking/minimum-distance/collision-step metrics capture relevant real-world failure modes; without sensitivity analysis or real-robot validation, the stress-test findings risk being simulation-specific artifacts.
Authors: We acknowledge that MuJoCo is a simulator and does not perfectly replicate real-world contact dynamics or sensor behavior. Our work is framed as a controlled simulation study using a standard robotics physics engine, not as a direct claim of real-world transfer. To address the concern we have added a sensitivity analysis on simulation parameters (contact stiffness, friction, and noise models) and inserted an explicit limitations paragraph discussing the simulation-to-reality gap and the value of future hardware validation. Real-robot experiments remain outside the present scope. revision: partial
Circularity Check
Empirical replication and stress-testing study with no derivation chain
full rationale
The manuscript is an empirical replication and stress-test study performed in MuJoCo. It replicates the SPARK benchmark G1SportMode_D1_WG_SO_v1, evaluates RSSA/RSSS/SSA/CBF/PFM/SMA under controlled seeds, applies a post-processing pipeline to produce goal-tracking/minimum-distance/collision-step metrics, and reports observed changes under crowding/noise/delay perturbations. All claims rest on direct simulation outputs rather than any claimed derivation, fitted parameter renamed as prediction, or self-citation that reduces the central result to its own inputs. No equations or load-bearing self-referential steps appear; the work is therefore self-contained against the stated simulation benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption MuJoCo simulation accurately models humanoid robot dynamics, collisions, and sensor noise for the purposes of safety filter evaluation
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We replicate the SPARK benchmark case G1SportMode_D1_WG_SO_v1 in MuJoCo and evaluate RSSA, RSSS, SSA, CBF, PFM, and SMA under controlled random seeds. We also built a post-processing pipeline that converts raw SPARK logs into goal-tracking, minimum-distance, and collision-step metrics.
-
IndisputableMonolith/Foundation/AlexanderDuality.leanalexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The stress tests further indicate that safety behavior can change under obstacle crowding, noisy distance estimates, and delayed obstacle information.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Real-time obstacle avoidance for manipulators and mobile robots
O. Khatib, “Real-time obstacle avoidance for manipulators and mobile robots”,TheInternationalJournalofRoboticsRe- search, vol. 5, no. 1, pp. 90–98, Mar. 1986.doi: 10.1177/ 027836498600500106 [Online]. Available: https://journals. sagepub.com/doi/abs/10.1177/027836498600500106
-
[2]
Breach, a toolbox for verification and parameter synthesisofhybridsystems
A. Donzé, “Breach, a toolbox for verification and parameter synthesisofhybridsystems”,inComputerAidedVerification, T. Touili, B. Cook, and P. Jackson, Eds., Berlin, Heidelberg: SpringerBerlinHeidelberg,2010,pp.167–170,isbn:978-3-642- 14295-6
work page 2010
-
[3]
S-taliro:Atoolfortemporallogicfalsificationforhybridsys- tems
Y.Annpureddy,C.Liu,G.Fainekos,andS.Sankaranarayanan, “S-taliro:Atoolfortemporallogicfalsificationforhybridsys- tems”, inInternational Conference on Tools and Algorithms for the Construction and Analysis of Systems, Springer, 2011, pp.254–257
work page 2011
-
[4]
Reactivesliding-modealgo- rithmforcollisionavoidanceinroboticsystems
L.Gracia,F.Garelli,andA.Sala,“Reactivesliding-modealgo- rithmforcollisionavoidanceinroboticsystems”,IEEETrans- actionsonControlSystemsTechnology,vol.21,no.6,pp.2391– 2399,2013.doi:10.1109/TCST.2012.2231866
-
[5]
ControlinaSafeSet:AddressingSafetyinHuman-RobotInterac- tions,vol.Volume3,DynamicSystemsandControlConference, Oct.2014,V003T42A003.doi:10.1115/DSCC2014-6048eprint: https://asmedigitalcollection.asme.org/DSCC/proceedings- pdf/DSCC2014/46209/V003T42A003/4446881/v003t42a003- dscc2014-6048.pdf. [Online]. Available: https://doi.org/10. 1115/DSCC2014-6048 4–5 P...
-
[6]
1016/j.ifacol.2015.11.167 [Online]
Q.NguyenandK.Sreenath,“Safety-criticalcontrolfordynami- calbipedalwalkingwithprecisefootstepplacement**thiswork is partially supported through funding from the google fac- ulty award and nsf grant iis-1464337.”,IFAC-PapersOnLine, vol.48,no.27,pp.147–154,2015,AnalysisandDesignofHy- bridSystemsADHS,issn:2405-8963.doi:https://doi.org/10. 1016/j.ifacol.2015.1...
work page 2015
-
[7]
A.AgrawalandK.Sreenath,“Discretecontrolbarrierfunctions forsafety-criticalcontrolofdiscretesystemswithapplication tobipedalrobotnavigation”,Jul.2017.doi:10.15607/RSS.2017. XIII.073
-
[8]
M.ChenandC.J.Tomlin,“Hamilton-jacobireachability:Some recent theoretical advances and applications in unmanned airspace management”,Annu. Rev. Control. Robotics Auton. Syst., vol. 1, pp. 333–358, 2018. [Online]. Available: https:// api.semanticscholar.org/CorpusID:262693302
work page 2018
-
[9]
Simulation- basedadversarialtestgenerationforautonomousvehicleswith machinelearningcomponents
C.E.Tuncali,G.Fainekos,H.Ito,andJ.Kapinski,“Simulation- basedadversarialtestgenerationforautonomousvehicleswith machinelearningcomponents”,in2018IEEEIntelligentVehi- clesSymposium(IV),IEEE,2018,pp.1555–1562.doi:10.1109/ IVS.2018.8500421
-
[10]
Control barrier functions: Theory and ap- plications
A.D.Ames,S.Coogan,M.Egerstedt,G.Notomista,K.Sreenath, and P. Tabuada, “Control barrier functions: Theory and ap- plications”,in201918thEuropeanControlConference(ECC). IEEE,Jun.2019,pp.3420–3431,isbn:978-3-907144-00-8.doi: 10.23919/ECC.2019.8796030
-
[11]
Verifai: A toolkit for the formal design and analysisofartificialintelligence-basedsystems
T. Dreossi et al., “Verifai: A toolkit for the formal design and analysisofartificialintelligence-basedsystems”,inComputer AidedVerification,I.DilligandS.Tasiran,Eds.,Cham:Springer InternationalPublishing,2019,pp.432–442,isbn:978-3-030- 25540-4
work page 2019
-
[12]
Scenic:Alanguageforscenario specificationandscenegeneration
D.J.Fremont,T.Dreossi,S.Ghosh,X.Yue,A.L.Sangiovanni- Vincentelli,andS.A.Seshia,“Scenic:Alanguageforscenario specificationandscenegeneration”,inProceedingsofthe40th ACMSIGPLANConferenceonProgrammingLanguageDesign and Implementation, ser. PLDI 2019, Phoenix, AZ, USA: As- sociation for Computing Machinery, 2019, pp. 63–78,isbn: 9781450367127.doi:10.1145/3314...
-
[13]
M.Koren,S.Alsaif,R.Lee,andM.J.Kochenderfer,Adaptive stresstestingforautonomousvehicles,Feb.2019.[Online].Avail- able:https://arxiv.org/abs/1902.01909
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[14]
T.WeiandC.Liu,“Safecontrolalgorithmsusingenergyfunc- tions:Auniedframework,benchmark,andnewdirections”, in2019IEEE58thConferenceonDecisionandControl(CDC), Nice, France: IEEE Press, 2019, pp. 238–243.doi: 10.1109/ CDC40024.2019.9029720[Online].Available:https://doi.org/ 10.1109/CDC40024.2019.9029720
-
[15]
Safelearninginrobotics:Fromlearning-based controltosafereinforcementlearning
L.Brunkeetal.,“Safelearninginrobotics:Fromlearning-based controltosafereinforcementlearning”,AnnualReviewofCon- trol,Robotics,andAutonomousSystems, vol. 5, no. Volume 5, 2022,pp.411–444,2022,issn:2573-5144.doi:https://doi.org/ 10.1146/annurev-control-042920-020211[Online].Available: https://www.annualreviews.org/content/journals/10.1146/ annurev-control-0...
-
[16]
Hu- manoidself-collisionavoidanceusingwhole-bodycontrolwith controlbarrierfunctions
C. Khazoom, D. Gonzalez-Diaz, Y. Ding, and S. Kim, “Hu- manoidself-collisionavoidanceusingwhole-bodycontrolwith controlbarrierfunctions”,in2022IEEE-RAS21stInternational ConferenceonHumanoidRobots(Humanoids),2022,pp.558– 565.doi:10.1109/Humanoids53995.2022.10000235
-
[17]
Z.Yuanetal.,“Safe-control-gym:Aunifiedbenchmarksuite forsafelearning-basedcontrolandreinforcementlearningin robotics”,IEEERoboticsandAutomationLetters,vol.7,no.4, pp.11142–11149,2022.doi:10.1109/LRA.2022.3196132
-
[18]
C.Dawson,S.Gao,andC.Fan,“Safecontrolwithlearnedcer- tificates:Asurveyofneurallyapunov,barrier,andcontraction methods for robotics and control”,Trans.Rob., vol. 39, no. 3, pp.1749–1767,Jun.2023,issn:1552-3098.doi:10.1109/TRO. 2022.3232542[Online].Available:https://doi.org/10.1109/TRO. 2022.3232542
work page doi:10.1109/tro 2023
-
[19]
Safety-gymnasium:Aunifiedsafereinforcemeilearn- ingbenchmark
J.Jietal.,“Safety-gymnasium:Aunifiedsafereinforcemeilearn- ingbenchmark”,inProceedingsofthe37thInternationalCon- ferenceonNeuralInformationProcessingSystems,ser.NIPS’23, NewOrleans,LA,USA:CurranAssociatesInc.,2023
work page 2023
-
[20]
Safe whole-body task space con- trolforhumanoidrobots
V. Paredes and A. Hereid, “Safe whole-body task space con- trolforhumanoidrobots”,2024AmericanControlConference (ACC), pp. 949–956, 2023. [Online]. Available: https://api. semanticscholar.org/CorpusID:265213414
work page 2023
-
[21]
[Online].Available:https://arxiv.org/abs/2502.03132 5–5
Y.Sunetal.,Spark:Safeprotectiveandassistiverobotkit,2025. [Online].Available:https://arxiv.org/abs/2502.03132 5–5
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.