Testing of deep rein- forcement learning agents with surrogate models.ACM Transactions on Software Engineering and Methodology, 33(3):73:1–73:33

Matteo Biagiola, Paolo Tonella · 2024 · DOI 10.1145/3631970

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

Worst-Case Discovery and Runtime Protection for RL-Based Network Controllers

cs.NI · 2026-05-06 · unverdicted · novelty 7.0

ReGuard discovers network scenarios where RL controllers perform 43-64% worse than achievable and reduces those gaps by 79-85% with lightweight rule-based protection that preserves normal performance.

Failure-Based Testing for Deep Reinforcement Learning Agents

cs.SE · 2026-06-30 · unverdicted · novelty 6.0

Proposes Prior Random Testing (PRT) that leverages task difficulty to prioritize failure-prone test cases for DRL agents, achieving over 50% lower testing cost than random testing while preserving diversity on four benchmarks.

citing papers explorer

Showing 2 of 2 citing papers.

Worst-Case Discovery and Runtime Protection for RL-Based Network Controllers cs.NI · 2026-05-06 · unverdicted · none · ref 8
ReGuard discovers network scenarios where RL controllers perform 43-64% worse than achievable and reduces those gaps by 79-85% with lightweight rule-based protection that preserves normal performance.
Failure-Based Testing for Deep Reinforcement Learning Agents cs.SE · 2026-06-30 · unverdicted · none · ref 5
Proposes Prior Random Testing (PRT) that leverages task difficulty to prioritize failure-prone test cases for DRL agents, achieving over 50% lower testing cost than random testing while preserving diversity on four benchmarks.

Testing of deep rein- forcement learning agents with surrogate models.ACM Transactions on Software Engineering and Methodology, 33(3):73:1–73:33

fields

years

verdicts

representative citing papers

citing papers explorer