Beyond Runtime Enforcement: Shield Synthesis as Defensibility Analysis for Adversarial Networks

Achraf Hsain; Sultan Almuhammadi

arXiv: 2606.13621 · v1 · pith:QQP6KHQRnew · submitted 2026-06-11 · 💻 cs.AI · cs.CR · cs.GT · cs.LG · cs.MA

Pith reviewed 2026-06-27 06:37 UTC · model grok-4.3

classification 💻 cs.AI · cs.CR · cs.GT · cs.LG · cs.MA

keywords shield synthesis, defensibility analysis, adversarial networks, safety games, reinforcement learning, temporal logic, network defense, attractor computation

The pith

Shield synthesis serves as a design-time tool for producing defensibility verdicts on network topologies rather than runtime constraints on agents.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper argues that the standard automata machinery of shield synthesis—specification compilation, product construction, attractor computation, and winning-region extraction—functions more effectively as an analytical instrument for network defense questions than as a deployment mechanism. It applies this machinery through an asymmetric two-player safety game in which the defender specification marks the unsafe region while the attacker specification limits adversary moves during attractor computation. Solving the game produces a binary defensibility verdict together with a winning region and shield, which are then augmented by topology-level metrics and post-convergence outcomes from shield-constrained adversarial reinforcement learning. These elements combine into a defensibility fingerprint that separates formal safety margins from operational behavior under adaptive play. A what-if analysis demonstrates that formal defensibility and operational effectiveness track distinct aspects of security, so that architectural modifications can alter runtime results while leaving the formal verdict nearly unchanged.

Core claim

Shield synthesis applied to a constrained two-player safety game for network defense yields a defensibility verdict—a formal certificate that a topology-specification pair is or is not defensible—along with the associated winning region and shield; the verdict and derived attractor metrics, when combined with post-convergence reinforcement-learning behavior, form a defensibility fingerprint whose primary value lies in answering architectural questions about whether, where, and how a system can be defended.

What carries the argument

The asymmetric two-player safety game in which the defender specification defines the unsafe region of the game and the attacker specification restricts the adversary's legal actions during attractor computation.
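The asymmetry described here can be sketched on a small explicit-state game. The block below is a minimal illustration, not the paper's construction: it assumes a turn-based, deterministic game in which every state appears as a key of `moves`, and the names `solve_safety_game`, `attacker_allowed`, and the toy states and actions are all hypothetical. The defender specification enters only through the `unsafe` set, while the attacker specification enters only as a filter on attacker moves inside the attractor loop.

```python
from typing import Callable, Dict, Hashable, List, Set, Tuple

State = Hashable
Action = str
# moves[s] lists the (action, successor) pairs available in state s.
# Assumption: every state, including unsafe ones, appears as a key.
Moves = Dict[State, List[Tuple[Action, State]]]


def solve_safety_game(
    moves: Moves,
    attacker_states: Set[State],
    unsafe: Set[State],
    attacker_allowed: Callable[[State, Action], bool],
    initial: State,
):
    """Return (defensible, winning_region, shield) for a turn-based safety game."""
    attractor = set(unsafe)
    changed = True
    while changed:  # least fixed point: grow until nothing new is added
        changed = False
        for s, out in moves.items():
            if s in attractor:
                continue
            if s in attacker_states:
                # The attacker needs ONE legal move into the attractor.
                legal = [(a, t) for a, t in out if attacker_allowed(s, a)]
                pulled_in = any(t in attractor for _, t in legal)
            else:
                # The defender is lost only if EVERY move enters the attractor.
                # A defender state with no moves counts as lost (assumption).
                pulled_in = all(t in attractor for _, t in out)
            if pulled_in:
                attractor.add(s)
                changed = True
    winning = set(moves) - attractor
    # Shield: for each winning defender state, the moves that stay winning.
    shield = {
        s: [a for a, t in moves[s] if t in winning]
        for s in winning
        if s not in attacker_states
    }
    return initial in winning, winning, shield


# Toy game (hypothetical): one defender state, two attacker states, one unsafe sink.
moves = {
    "d0": [("patch", "a1"), ("wait", "a2")],
    "a1": [("destroy", "bad"), ("scan", "d0")],
    "a2": [("scan", "d0")],
    "bad": [("any", "bad")],
}
restricted = solve_safety_game(
    moves, {"a1", "a2"}, {"bad"}, lambda s, a: a != "destroy", "d0"
)
unrestricted = solve_safety_game(
    moves, {"a1", "a2"}, {"bad"}, lambda s, a: True, "d0"
)
```

In this toy game the verdict is the same in both runs, but forbidding `destroy` enlarges the winning region from two states to three and widens the shield at `d0`, which is the kind of structural difference the attacker specification is meant to expose.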

If this is right

Formal defensibility verdicts and operational effectiveness under adaptive reinforcement learning capture distinct aspects of security.
Small architectural changes can produce large shifts in operational outcomes while leaving formal safety margins nearly unchanged.
The winning region and shield supply structural information about where and how defense is possible within a given topology.
The combination of attractor-derived metrics and post-convergence RL traces forms a defensibility fingerprint usable for design-time comparison of topology-specification pairs.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The separation of formal and operational metrics suggests that security evaluation benefits from running both analyses rather than relying on either alone.
Designers could use the defensibility verdict to rank candidate topologies before committing to reinforcement-learning training.
The same game-construction technique might be reused to compare alternative specifications on a fixed network without retraining agents.

Load-bearing premise

The assumption that the asymmetric game construction and its combination with post-convergence RL behavior produce metrics that meaningfully capture real-world defensibility.

What would settle it

A controlled experiment in which the formal defensibility verdict and fingerprint metrics fail to predict actual attack success rates under adaptive adversaries in a simulated network would falsify the claim that these outputs capture meaningful defensibility.

Figures

Figures reproduced from arXiv: 2606.13621 by Achraf Hsain, Sultan Almuhammadi.

**Figure 2.** Attacker constraint automaton A_A for φ_A (Definition 8): at most two Destroy actions per engagement. D = Destroy, ¬D = any non-Destroy action, ⊤ = any input; q_i records i Destroys spent. A_A advances on every Destroy the attacker selects, including no-op Destroys, so attempted destruction is priced like successful destruction.

**Figure 3.** Reference topology. 5 nodes, 6 directed edges. Dashed red edge: VPN bypass path

**Figure 4.** Defensibility fingerprint for the baseline topology (Case 1). Six axes oriented so that …

**Figure 5.** Defensibility fingerprints for all five what-if cases. Topology perturbations (Cases 2 …

**Figure 6.** Winning state space decomposition across what-if cases. Green: winning region (safe …

**Figure 7.** Defender Dominance Ratio at post-convergence (L200) across what-if cases. Each …
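The attacker constraint in the Figure 2 caption (at most two Destroy actions, with q_i counting the Destroys spent) amounts to a small counter automaton. The sketch below is one way to read that caption, under stated assumptions: the function names and the `Scan` action are hypothetical, and because the caption does not say how a third Destroy is handled, this version simply treats it as an illegal move.

```python
DESTROY_BUDGET = 2  # from the caption: at most two Destroy actions per engagement


def step(q: int, action: str) -> int:
    """Advance the counter state q_i; every selected Destroy counts, even a no-op."""
    return q + 1 if action == "Destroy" else q


def allowed(q: int, action: str) -> bool:
    """Destroy is legal only while fewer than DESTROY_BUDGET Destroys are spent."""
    return action != "Destroy" or q < DESTROY_BUDGET


def legal_trace(actions) -> bool:
    """Check a whole attacker action sequence against the budget."""
    q = 0
    for a in actions:
        if not allowed(q, a):
            return False
        q = step(q, a)
    return True
```

A filter like `allowed` is what an attractor computation would consult when enumerating the attacker's moves, with the counter `q` carried as part of the game state.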

Abstract

Shielded reinforcement learning is typically presented as a runtime safety mechanism that compiles temporal-logic specifications into automata restricting an agent's actions. We argue this is the wrong product. The same automata-theoretic machinery -- specification compilation, product game construction, attractor computation, and winning-region extraction -- is better read as a design-time analytical instrument whose outputs are structural insights about a system rather than runtime constraints on a deployed agent. We instantiate this through a constrained two-player safety game for network defense. The two specifications are enforced asymmetrically: the defender specification defines the unsafe region of the game, whereas the attacker specification restricts the adversary's legal actions during attractor computation. Solving the game yields a defensibility verdict -- a formal certificate that a topology-specification pair is or is not defensible -- with the associated winning region and shield. Beyond the binary verdict, we derive topology-level metrics from the attractor structure and combine them with post-convergence behavior from shield-constrained adversarial multi-agent reinforcement learning. Together these form a defensibility fingerprint capturing both a network's formal safety properties and its operational behavior under adaptive play. A what-if analysis shows that formal defensibility and operational effectiveness capture distinct aspects of security: small architectural changes can produce large shifts in operational outcomes while leaving formal safety margins nearly unchanged. Shield synthesis is thus most valuable not as a deployment mechanism for safe agents, but as a framework for answering architectural questions about whether, where, and how a system can be defended. The defensibility verdict is the output, not the safe policy.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

The paper reframes shield synthesis as a design-time tool for defensibility verdicts and fingerprints in network games, but the abstract leaves the actual constructions and results uncheckable.

The main move is to treat the automata machinery of shield synthesis as a way to answer whether a network topology can be defended, not just to restrict an agent at runtime. They set up an asymmetric two-player safety game, enforce the defender spec as the unsafe region and the attacker spec as action restrictions during attractor computation, extract a binary defensibility verdict plus winning region, then combine attractor-derived metrics with post-convergence shield-constrained multi-agent RL behavior to produce a fingerprint. A what-if analysis is used to argue that formal margins and operational outcomes under adaptive play capture different things.

This application to network defense games and the explicit separation of formal versus operational outputs is new relative to the runtime-enforcement literature they cite. The cleanest part is the argument that the verdict itself is the useful product for architectural questions.

The soft spot is the lack of visible math or data. Without the game constructions, attractor details, or the actual what-if results, it is impossible to judge whether the asymmetric setup introduces artifacts or whether the RL metrics add information beyond the formal part. The stress-test note about unstated modeling choices in the network topologies and adversary models is a fair concern; if those choices drive the divergence, the fingerprints may not travel. The central conceptual distinction holds up on the abstract, but the load-bearing evidence is missing here.

This is for people working at the intersection of formal methods, RL safety, and network security. It shows clear engagement with the automata and game literature, so it deserves a serious referee to check the constructions and experiments.

Referee Report

2 major / 2 minor

Summary. The paper claims that shield synthesis from temporal-logic specifications should be reinterpreted as a design-time analytical framework for defensibility analysis in adversarial networks, rather than primarily a runtime enforcement mechanism. It constructs an asymmetric two-player safety game in which the defender specification defines the unsafe region while the attacker specification restricts adversary actions during attractor computation; solving the game yields a binary defensibility verdict, winning region, and shield. Attractor-derived topology metrics are then combined with post-convergence behavior from shield-constrained multi-agent RL to produce a 'defensibility fingerprint' that captures both formal and operational aspects. A what-if analysis is presented to show that formal safety margins and operational effectiveness under adaptive play are distinct, with small architectural changes affecting the latter more than the former.

Significance. If the what-if results and metric derivations hold under scrutiny, the work provides a constructive way to leverage automata-theoretic machinery for architectural questions in network defense, moving beyond deployment-focused uses of shields. The explicit separation of formal verdict from operational fingerprint, grounded in standard game constructions plus RL, is a positive contribution that could inform design choices in adversarial multi-agent settings. The emphasis on the verdict itself as the primary output rather than the policy is a clear conceptual reframing.

major comments (2)

[§4] Game Construction: The central claim that the asymmetric enforcement (defender unsafe region vs. attacker action restriction during attractor computation) produces a meaningful defensibility verdict rests on the adaptation of standard attractor algorithms. The manuscript must supply the precise modified attractor definition or pseudocode, because the standard fixed-point computation does not automatically accommodate action restrictions without additional choices that could affect the winning-region extraction.
[§6] What-if Analysis: The divergence between formal margins and operational outcomes is load-bearing for the claim that the two capture distinct aspects. The specific network parameterizations, RL convergence criteria, and adversary models used in the experiments must be stated explicitly; without them it remains possible that the observed distinction is an artifact of the chosen modeling decisions rather than a general property of the fingerprint.
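For orientation on the first major comment, one plausible shape of an action-restricted attractor is written out below. This is a sketch, not the manuscript's definition: it assumes a turn-based, deterministic game, and the symbols S_D, S_A, U, δ, s_0, Act_D, and the restricted attacker action set Act_A^{φ_A}(s) ⊆ Act_A(s) are notation introduced here for illustration.

```latex
% Assumed setting: states S = S_D \cup S_A, unsafe set U,
% deterministic transition function \delta, start state s_0.
\begin{align*}
\mathrm{Attr}^{0} &= U,\\
\mathrm{Attr}^{k+1} &= \mathrm{Attr}^{k}
  \cup \{\, s \in S_A \mid \exists a \in \mathrm{Act}_A^{\varphi_A}(s) :\
        \delta(s,a) \in \mathrm{Attr}^{k} \,\}\\
  &\qquad \cup \{\, s \in S_D \mid \forall a \in \mathrm{Act}_D(s) :\
        \delta(s,a) \in \mathrm{Attr}^{k} \,\},\\
W &= S \setminus \bigcup_{k \ge 0} \mathrm{Attr}^{k},
\qquad \text{defensible} \iff s_0 \in W .
\end{align*}
```

The standard attractor is recovered when Act_A^{φ_A}(s) = Act_A(s) for every attacker state. The choices the referee alludes to include how to treat attacker states whose restricted action set is empty and whether the game is turn-based or concurrent; neither can be settled from the material shown here.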

minor comments (2)

[Abstract] The term 'defensibility fingerprint' is used repeatedly but never given an explicit equation or boxed definition; adding one would improve readability.
[§3] Notation for the product game and winning regions should be cross-referenced to a standard reference (e.g., the exact attractor algorithm employed) to aid readers familiar with the literature.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments, which help strengthen the presentation of the game construction and experimental details. We address each major comment below and will revise the manuscript accordingly.

Referee: [§4] Game Construction: The central claim that the asymmetric enforcement (defender unsafe region vs. attacker action restriction during attractor computation) produces a meaningful defensibility verdict rests on the adaptation of standard attractor algorithms. The manuscript must supply the precise modified attractor definition or pseudocode, because the standard fixed-point computation does not automatically accommodate action restrictions without additional choices that could affect the winning-region extraction.

Authors: We agree that an explicit definition of the restricted attractor is required. Section 4 describes the asymmetry (defender unsafe region vs. attacker action restriction) but does not provide the modified fixed-point operator. In the revision we will insert the formal definition of the action-restricted attractor together with pseudocode for the fixed-point iteration, making clear that the restriction is applied uniformly at each predecessor step and does not introduce nondeterministic choices beyond the standard safety-game semantics.

revision: yes

Referee: [§6] What-if Analysis: The divergence between formal margins and operational outcomes is load-bearing for the claim that the two capture distinct aspects. The specific network parameterizations, RL convergence criteria, and adversary models used in the experiments must be stated explicitly; without them it remains possible that the observed distinction is an artifact of the chosen modeling decisions rather than a general property of the fingerprint.

Authors: We accept that the experimental setup must be fully specified. The current manuscript gives only high-level descriptions of the networks and RL procedure. In the revision we will add a dedicated subsection (or appendix) listing the exact topologies, edge weights, RL hyperparameters, convergence criteria (episode count, reward threshold, and variance tolerance), and adversary action spaces. This will allow independent verification that the reported separation between formal and operational metrics is not an artifact of particular modeling choices.

revision: yes
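The three convergence criteria named in the second response (episode count, reward threshold, variance tolerance) could be combined into a single check along the following lines. This is only a sketch of how such a criterion might be stated: the function name, the sliding window, and every default value are hypothetical and are not taken from the paper.

```python
from statistics import mean, pvariance


def has_converged(rewards, min_episodes=100, window=50,
                  reward_threshold=0.0, variance_tolerance=1.0):
    """True once enough episodes have run and the recent window is high and stable.

    All thresholds are placeholder values for illustration only.
    """
    if len(rewards) < max(min_episodes, window):
        return False  # episode-count criterion not yet met
    recent = rewards[-window:]
    return (mean(recent) >= reward_threshold
            and pvariance(recent) <= variance_tolerance)
```

Stating the criterion as an explicit predicate like this makes it possible to check whether "post-convergence" behavior was measured at comparable points across the what-if cases.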

Circularity Check

0 steps flagged

No significant circularity; standard constructions reframed without reduction to inputs

The paper reframes existing automata-theoretic machinery (specification compilation, product game construction, attractor computation, winning-region extraction) as a design-time analysis tool rather than a runtime enforcer. It constructs an asymmetric safety game where defender and attacker specifications are applied differently, derives topology metrics from the attractor, and combines them with post-convergence shield-constrained multi-agent RL behavior to form a defensibility fingerprint. The what-if analysis then shows divergence between formal margins and operational outcomes. No quoted equations or steps reduce the central claims to fitted parameters, self-definitions, or self-citation chains; the constructions rely on standard game theory and RL without the target verdict or fingerprint being presupposed in the inputs. The derivation is therefore self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on standard automata theory and game solving being applicable to network defense without new axioms, but the combination of formal verdicts with RL operational behavior assumes the two capture distinct aspects without further justification.

axioms (2)

standard math: Temporal logic specifications can be compiled into automata that correctly define unsafe regions and action restrictions in a two-player game. Invoked in the description of specification enforcement and game construction.
domain assumption: Attractor computation and winning-region extraction yield a meaningful defensibility verdict for network topologies. Core to producing the binary verdict and structural metrics.
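The first axiom relies on a product construction: the network's transition system is paired with the automaton compiled from the defender specification, and the unsafe region is read off the automaton component. A minimal sketch follows, assuming a deterministic safety automaton with a single rejecting state; `product_moves`, `aut_step`, and the toy network are hypothetical names and data used only for illustration.

```python
def product_moves(net_moves, aut_step, aut_init, net_init):
    """Build the reachable part of the product over (network_state, automaton_state)."""
    start = (net_init, aut_init)
    moves, frontier = {}, [start]
    while frontier:
        s, q = frontier.pop()
        if (s, q) in moves:
            continue  # already expanded
        out = []
        for action, s_next in net_moves[s]:
            q_next = aut_step(q, action, s_next)  # automaton reads the step taken
            out.append((action, (s_next, q_next)))
        moves[(s, q)] = out
        frontier.extend(t for _, t in out)
    return moves


# Toy defender specification (hypothetical): "the server is never down".
def aut_step(q, action, s_next):
    return "bad" if q == "bad" or s_next == "server_down" else "ok"


net_moves = {
    "idle": [("attack", "server_down"), ("wait", "idle")],
    "server_down": [("wait", "server_down")],
}
product = product_moves(net_moves, aut_step, "ok", "idle")
# The unsafe region is every product state whose automaton component is rejecting.
unsafe = {state for state in product if state[1] == "bad"}
```

The resulting `product` and `unsafe` are the inputs an attractor computation would take; the second axiom then concerns whether the verdict computed on that product says something meaningful about the real network.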

Reference graph

Works this paper leans on

32 extracted references · 16 canonical work pages

[1] M. Alshiekh, R. Bloem, R. Ehlers, B. Könighofer, S. Niekum, and U. Topcu. Safe reinforcement learning via shielding. In Proc. AAAI, pp. 2669–2678, 2018. doi:10.1609/aaai.v32i1.11797
[2] R. Bloem, B. Könighofer, R. Könighofer, and C. Wang. Shield synthesis: Runtime enforcement for reactive systems. In Proc. TACAS, LNCS vol. 9035, Springer, pp. 533–548, 2015. doi:10.1007/978-3-662-46681-0_51
[3] B. Könighofer, M. Alshiekh, R. Bloem, L. R. Humphrey, R. Könighofer, U. Topcu, and C. Wang. Shield synthesis. Formal Methods in System Design, 51(2):332–361, 2017. doi:10.1007/s10703-017-0276-9
[4] B. Könighofer, R. Bloem, N. Jansen, S. Junges, and S. Pranger. Shields for safe reinforcement learning. Communications of the ACM, 68(11):80–90, 2025. doi:10.1145/3715958
[5] N. Jansen, B. Könighofer, S. Junges, A. Serban, and R. Bloem. Safe reinforcement learning using probabilistic shields. In Proc. CONCUR, LIPIcs vol. 171, Schloss Dagstuhl–Leibniz-Zentrum für Informatik, article 3, pp. 3:1–3:16, 2020. doi:10.4230/LIPIcs.CONCUR.2020.3
[6] E. Hamel-De le Court, F. Belardinelli, and A. W. Goodall. Probabilistic shielding for safe reinforcement learning. In Proc. AAAI, 39(15):16091–16099, 2025. doi:10.1609/aaai.v39i15.33767
[7] S. Carr, N. Jansen, S. Junges, and U. Topcu. Safe reinforcement learning via shielding under partial observability. In Proc. AAAI, pp. 14748–14756, 2023. doi:10.1609/aaai.v37i12.26723
[8] D. Melcer, C. Amato, and S. Tripakis. Shield decomposition for safe reinforcement learning in general partially observable multi-agent environments. Reinforcement Learning Journal, 4:1965–1994, 2024
[9, 10] M. Tappler, S. Pranger, B. Könighofer, E. Muškardin, R. Bloem, and K. G. Larsen. Automata learning meets shielding. In Proc. ISoLA, LNCS vol. 13701, Springer, pp. 335–359. doi:10.1007/978-3-031-19849-6_20
[11] L. Brunke, M. Greeff, A. W. Hall, Z. Yuan, S. Zhou, J. Panerati, and A. P. Schoellig. Safe learning in robotics: From learning-based control to safe reinforcement learning. Annual Review of Control, Robotics, and Autonomous Systems, 5:411–444, 2022
[12] E. Grädel, W. Thomas, and T. Wilke, editors. Automata, Logics, and Infinite Games: A Guide to Current Research. LNCS vol. 2500, Springer, 2002. doi:10.1007/3-540-36387-4
[13] C. Baier and J.-P. Katoen. Principles of Model Checking. MIT Press, 2008
[14] M. L. Littman. Markov games as a framework for multi-agent reinforcement learning. In Proc. ICML, pp. 157–163, Morgan Kaufmann, 1994
[15] M. L. Littman and C. Szepesvári. A generalized reinforcement-learning model: Convergence and applications. In Proc. ICML, pp. 310–318, 1996
[16] I. ElSayed-Aly, S. Bharadwaj, C. Amato, R. Ehlers, U. Topcu, and L. Feng. Safe multi-agent reinforcement learning via shielding. In Proc. AAMAS, IFAAMAS, pp. 483–491, 2021
[17] W. Xiao, Y. Lyu, and J. M. Dolan. Model-based dynamic shielding for safe and efficient multi-agent reinforcement learning. arXiv:2304.06281, 2023
[18] K. Chatterjee, T. A. Henzinger, and B. Jobstmann. Environment assumptions for synthesis. In Proc. CONCUR, LNCS vol. 5201, Springer, pp. 147–161, 2008. doi:10.1007/978-3-540-85361-9_14
[19] N. Piterman, A. Pnueli, and Y. Sa’ar. Synthesis of Reactive(1) designs. In Proc. VMCAI, LNCS vol. 3855, Springer, pp. 364–380, 2006. doi:10.1007/11609773_24
[20] CAGE Challenge. CybORG: Cyber Operations Research Gym. GitHub repository, https://github.com/cage-challenge/CybORG, 2023
[21] M. Standen, M. Lucas, D. Bowman, T. J. Richer, J. Kim, and D. Marriott. CybORG: A gym for the development of autonomous cyber agents. arXiv:2108.09118, 2021
[22] M. Kiely, M. Ahiskali, E. Borde, B. Bowman, D. Bowman, D. Van Bruggen, K. C. Cowan, P. Dasgupta, E. Devendorf, B. Edwards, A. Fitts, S. Fugate, R. Gabrys, W. Gould, H. H. Huang, J. Jacobs, R. Kerr, I. J. King, L. Li, L. Martinez, C. Moir, C. Murphy, O. Naish, C. Owens, M. Purchase, A. Ridley, A. Taylor, S. Farmer, W. J. Valentine, and Y. Zhang. CAGE chall... doi:10.1002/aaai.70021, 2025
[23] P. K. Manadhata and J. M. Wing. Measuring a system’s attack surface. Technical Report CMU-CS-04-102, School of Computer Science, Carnegie Mellon University, 2004
[24] L. Wang, A. Singhal, and S. Jajodia. Measuring the overall security of network configurations using attack graphs. In Proc. DBSec, LNCS vol. 4602, Springer, pp. 98–112, 2007. doi:10.1007/978-3-540-73538-0_9
[25] J. Pamula, S. Jajodia, P. Ammann, and V. Swarup. A weakest-adversary security metric for network configuration security analysis. In Proc. QoP, ACM, pp. 31–38, 2006. doi:10.1145/1179494.1179502
[26] K. Zenitani. Attack graph analysis: An explanatory guide. Computers & Security, 126:103081, 2023. doi:10.1016/j.cose.2022.103081
[27] L. Wang, T. Islam, T. Long, A. Singhal, and S. Jajodia. An attack graph-based probabilistic security metric. In Data and Applications Security XXII, LNCS vol. 5094, Springer, pp. 283–296, 2008. doi:10.1007/978-3-540-70567-3_22
[28] D. Klaška, A. Kučera, V. Musil, and V. Řehák. Regstar: Efficient strategy synthesis for adversarial patrolling games. In Proc. UAI, PMLR vol. 161, pp. 471–481, 2021
[29] D. Shishika, Y. Guan, J. R. Marden, M. Dorothy, P. Tsiotras, and V. Kumar. Dynamic adversarial resource allocation: The dDAB game. arXiv:2304.02172, 2023
[30] The Batfish Open Source Project. Batfish: An open source network configuration analysis tool. https://www.batfish.org, 2023
[31] P. Kazemian, G. Varghese, and N. McKeown. Header space analysis: Static checking for networks. In Proc. NSDI, pp. 113–126, USENIX Association, 2012
[32] C. Kiekintveld, M. Jain, J. Tsai, J. Pita, F. Ordóñez, and M. Tambe. Computing optimal randomized resource allocations for massive security games. In Proc. AAMAS, IFAAMAS, pp. 689–696, 2009.

A Empirical Correlation Structure of the Defensibility Metrics

Section 5 introduces six conceptually distinct metrics. The empirical relationships among them, eval...
