Effect-Transparent Governance for AI Workflow Architectures: Semantic Preservation, Expressive Minimality, and Decidability Boundaries
Pith reviewed 2026-05-09 19:22 UTC · model grok-4.3
The pith
Effect-level governance can be imposed on AI workflow architectures without reducing their internal computational expressivity or changing permitted behaviors.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors define a governance operator G that mediates every effectful directive in Interaction Trees. They establish seven properties: governed Turing completeness, governed oracle expressivity, a decidability boundary separating total governance predicates from undecidable program properties, goal preservation on permitted paths, expressive minimality of the five primitive capabilities, strict subsumption of content-level filtering, and semantic transparency under which permitted executions are observationally equivalent to their ungoverned counterparts.
What carries the argument
The governance operator G on Interaction Trees, which intercepts and filters all effectful actions while leaving the internal tree structure and permitted computations unchanged.
If this is right
- AI workflows remain Turing complete and oracle-expressive once the governance operator is applied.
- Governance predicates stay decidable and closed under Boolean operations even while full semantic properties of the programs remain undecidable.
- Structural effect governance strictly contains and improves upon simpler content-level filtering approaches.
- Permitted executions under governance are observationally identical to ungoverned executions.
- The five primitive capabilities suffice to express any governed workflow without loss of minimality.
Where Pith is reading between the lines
- The same governance construction might be lifted to other effectful models such as process calculi or distributed actor systems.
- Implementations could wrap existing LLM agents with effect boundaries while preserving their original reasoning traces on allowed inputs.
- The decidability boundary implies that automatic safety monitors can be added without solving the general halting problem for the governed programs.
- If the model scales, governance layers could be composed without forcing designers to simplify internal computation logic.
Load-bearing premise
The Interaction Trees model together with the defined governance operator accurately captures every relevant effectful behavior of real AI workflow architectures, including oracle queries.
What would settle it
A concrete AI workflow execution in which the governed version produces a different final state or observable output than the ungoverned version on the same permitted input, beyond any governance-only events.
read the original abstract
We present a machine-checked formalization of structurally governed AI workflow architectures and prove that effect-level governance can be imposed without reducing internal computational expressivity. Using Interaction Trees in Rocq 8.19, we define a governance operator G that mediates all effectful directives, including memory access, external calls, and oracle (LLM) queries. Our development compiles with 0 admitted lemmas and consists of 36 modules, ~12,000 lines of Rocq, and 454 theorems. We establishseven properties: (P1) governed Turing completeness, (P2) governed oracle expressivity, (P3) a decidability boundary in which governance predicates are total and closed under Boolean composition while semantic program properties remain non-trivial and undecidable by governance, (P4) goal preservation for permitted executions, (P5) expressive minimality of primitive capabilities (compute, memory, reasoning, external call, observability), (P6) subsumption asymmetry showing structural governance strictly subsumes content-level filtering, and (P7) semantic transparency: on all executions where governance permits, the governed interpretation is observationally equivalent (modulo governance-only events) to the ungoverned interpretation. Together, these results show that governance and computational expressivity are orthogonal dimensions: governance constrains the effect boundary of programs while remaining semantically transparent to internal computation.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to provide a machine-checked formalization in Rocq of effect-transparent governance for AI workflow architectures using Interaction Trees. It defines a governance operator G that mediates all effectful directives and proves seven properties: governed Turing completeness (P1), governed oracle expressivity (P2), a decidability boundary (P3), goal preservation (P4), expressive minimality (P5), subsumption asymmetry over content filtering (P6), and semantic transparency (P7) where permitted governed executions are observationally equivalent to ungoverned ones. The overall thesis is that governance and computational expressivity are orthogonal.
Significance. If the formal results hold, this work is significant as it offers a rigorous, machine-verified demonstration that structural governance can be added to AI systems at the effect level without sacrificing expressivity or altering the semantics of permitted computations. The complete Rocq development with 454 theorems and no admitted lemmas provides strong evidence for the claims within the formal model, potentially informing the design of verifiable and governable AI architectures.
major comments (1)
- [Abstract (and the definition of the governance operator G)] The central claims P1, P2, and P7 rely on the Interaction Trees effect signature together with G exhaustively mediating all relevant effectful behaviors in AI workflows, including memory, external calls, and oracle/LLM queries. The manuscript asserts mediation of 'all effectful directives' but does not appear to provide an explicit argument or proof that the signature is complete and that no effects can occur outside G's mediation (e.g., untyped side effects or dynamically introduced effects). This is load-bearing for the semantic transparency and orthogonality results to transfer beyond the model.
Simulated Author's Rebuttal
We thank the referee for their careful and constructive review. The major comment identifies a point of exposition that we agree merits explicit clarification in the manuscript. We address it below and will revise accordingly.
read point-by-point responses
-
Referee: The central claims P1, P2, and P7 rely on the Interaction Trees effect signature together with G exhaustively mediating all relevant effectful behaviors in AI workflows, including memory, external calls, and oracle/LLM queries. The manuscript asserts mediation of 'all effectful directives' but does not appear to provide an explicit argument or proof that the signature is complete and that no effects can occur outside G's mediation (e.g., untyped side effects or dynamically introduced effects). This is load-bearing for the semantic transparency and orthogonality results to transfer beyond the model.
Authors: We agree that an explicit statement on signature completeness strengthens the presentation. In the Rocq development the effect signature is defined as a closed inductive datatype Effect whose constructors are exactly the primitive effects for memory access, external calls, oracle/LLM queries, and observability. The governance operator G is defined by exhaustive case analysis over every constructor of this datatype; the ITree type is parameterized by Effect, so every well-typed itree can emit only events from this signature. Untyped side effects and dynamic introduction of new effect types are outside the Interaction Trees framework and are not modeled. We will add a short subsection 'Effect Signature Exhaustiveness' (new Section 3.2) that states this design choice, quotes the Rocq definition, and notes that the orthogonality results hold inside this model. This revision makes the scope of the claims explicit without changing any theorems. revision: yes
Circularity Check
No circularity: machine-checked Rocq formalization derives properties from explicit definitions
full rationale
The paper defines the Interaction Trees model and governance operator G explicitly, then proves P1-P7 (including Turing completeness, oracle expressivity, and semantic transparency) via 454 theorems in 36 modules with zero admitted lemmas. No equations reduce outputs to inputs by construction, no parameters are fitted to data and relabeled as predictions, and no self-citations or prior author results are invoked as load-bearing uniqueness theorems. The derivation chain is self-contained within the stated formal semantics; the completeness of the effect signature is an explicit modeling assumption rather than a hidden reduction, so the proofs do not collapse to their own inputs.
Axiom & Free-Parameter Ledger
axioms (2)
- standard math Interaction Trees provide a compositional semantics for effectful computations including memory, external calls, and oracle queries.
- domain assumption All effectful directives in the workflow are mediated through the governance operator.
invented entities (1)
-
governance operator G
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Constitutional AI: Harmlessness from AI Feedback
Yuntao Bai, Saurav Kadavath, Sandipan Kundu, et al. Constitutional AI : Harmlessness from AI feedback, 2022. arXiv:2212.08073
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[2]
Programming with algebraic effects and handlers
Andrej Bauer and Matija Pretnar. Programming with algebraic effects and handlers. Journal of Logical and Algebraic Methods in Programming, 84 0 (1): 0 108--123, 2015. doi:10.1016/j.jlamp.2014.02.001
-
[3]
David Dalrymple et al. Towards guaranteed safe AI : A framework for ensuring robust and reliable AI systems, 2024. arXiv:2405.06624
-
[4]
The power of parameterization in coinductive proof
Chung-Kil Hur, Georg Neis, Derek Dreyer, and Viktor Vafeiadis. The power of parameterization in coinductive proof. In Proceedings of the ACM on Programming Languages (POPL), 2013. doi:10.1145/2429069.2429093
-
[5]
seL4: formal verification of an OS kernel,
Gerwin Klein, Kevin Elphinstone, Gernot Heiser, et al. seL4 : Formal verification of an OS kernel. In Proceedings of the ACM Symposium on Operating Systems Principles (SOSP), 2009. doi:10.1145/1629575.1629596
-
[6]
Formal verification of a realistic compiler,
Xavier Leroy. Formal verification of a realistic compiler. Communications of the ACM, 52 0 (7): 0 107--115, 2009. doi:10.1145/1538788.1538814
-
[7]
Algebraic semantics of governed execution: Monoidal categories, effect algebras, and coterminous boundaries, 2026 a
Alan Lawrence McCann. Algebraic semantics of governed execution: Monoidal categories, effect algebras, and coterminous boundaries, 2026 a
2026
-
[8]
Mechanized foundations of structural governance: Machine-checked proofs for governed intelligence, 2026 b
Alan Lawrence McCann. Mechanized foundations of structural governance: Machine-checked proofs for governed intelligence, 2026 b
2026
-
[9]
Cryptographic registry provenance: Structural defense against dependency confusion in AI package ecosystems, 2026 c
Alan Lawrence McCann. Cryptographic registry provenance: Structural defense against dependency confusion in AI package ecosystems, 2026 c . arXiv preprint (submitted)
2026
-
[10]
Certified purity for cognitive workflow executors: From static analysis to cryptographic attestation, 2026 d
Alan Lawrence McCann. Certified purity for cognitive workflow executors: From static analysis to cryptographic attestation, 2026 d
2026
-
[11]
The two boundaries: Why behavioral AI governance fails structurally, 2026 e
Alan Lawrence McCann. The two boundaries: Why behavioral AI governance fails structurally, 2026 e
2026
-
[12]
Marvin L. Minsky. Computation: Finite and Infinite Machines. Prentice-Hall, 1967
1967
-
[13]
Training language models to follow instructions with human feedback
Long Ouyang, Jeff Wu, Xu Jiang, et al. Training language models to follow instructions with human feedback. In NeurIPS, 2022
2022
-
[14]
O’Brien, Carrie Jun Cai, Meredith Ringel Morris, Percy Liang, and Michael S
Joon Sung Park, Joseph C. O'Brien, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, and Michael S. Bernstein. Generative agents: Interactive simulacra of human behavior. In Proceedings of UIST, 2023. doi:10.1145/3586183.3606763
-
[15]
Gordon D. Plotkin and Matija Pretnar. Handlers of algebraic effects. In Programming Languages and Systems (ESOP). Springer, 2009. doi:10.1007/978-3-642-00590-9_7
-
[16]
Theory of Recursive Functions and Effective Computability
Hartley Rogers. Theory of Recursive Functions and Effective Computability. McGraw-Hill, 1967
1967
-
[17]
Sanjit A. Seshia, Dorsa Sadigh, and S. Shankararaman Shankar. Towards verified artificial intelligence, 2016. arXiv:1606.08514
-
[18]
Sumers, Shunyu Yao, Karthik Narasimhan, and Thomas L
Theodore R. Sumers, Shunyu Yao, Karthik Narasimhan, and Thomas L. Griffiths. Cognitive architectures for language agents. Transactions on Machine Learning Research (TMLR), 2024
2024
-
[19]
Monads for functional programming
Philip Wadler. Monads for functional programming. In Advanced Functional Programming (AFP). Springer, 1995. doi:10.1007/3-540-59451-5_2
-
[20]
Li-yao Xia, Yannick Zakowski, Paul He, Chung-Kil Hur, Gregory Malecha, Benjamin C. Pierce, and Steve Zdancewic. Interaction trees: Representing recursive and impure programs in Coq . In Proceedings of the ACM on Programming Languages (POPL), volume 4, 2020. doi:10.1145/3371119
-
[21]
Modular, compositional, and executable formal semantics for LLVM IR
Yannick Zakowski, Calvin Beck, Irene Yoon, Ilia Zaichuk, Vadim Zaliva, and Steve Zdancewic. Modular, compositional, and executable formal semantics for LLVM IR . In Proceedings of the ACM on Programming Languages (ICFP), volume 5, 2021. doi:10.1145/3473572
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.