Harness as an Asset: Enforcing Determinism via the Convergent AI Agent Framework (CAAF)

Tianbao Zhang

arxiv: 2604.17025 · v3 · submitted 2026-04-18 · 💻 cs.AI · cs.LG

Harness as an Asset: Enforcing Determinism via the Convergent AI Agent Framework (CAAF)

Tianbao Zhang This is my paper

Pith reviewed 2026-05-10 06:36 UTC · model grok-4.3

classification 💻 cs.AI cs.LG

keywords AI agent frameworksdeterminism enforcementLLM controllabilitydomain invariantssafety-critical systemsorchestration layersenterprise AI assets

0 comments

The pith

Formalizing domain invariants as an executable Harness turns it into a reusable enterprise asset that enforces determinism on commodity AI models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Large language models create a controllability gap in safety-critical work because even infrequent undetected violations make systems undeployable. The paper presents the Convergent AI Agent Framework as a way to move agent workflows from open generation to closed-loop determinism through three pillars that cover different failure modes. The central move is to encode domain rules into a machine-readable Harness enforced by a deterministic interface, so that reliability no longer depends on the underlying model's scale or cost. This approach is claimed to make fully self-hosted, on-premises deployment practical in regulated sectors where cloud access is restricted.

Core claim

CAAF achieves closed-loop fail-safe determinism by combining recursive atomic decomposition with context firewalls, a Harness that encodes domain invariants into machine-readable registries enforced by the Unified Assertion Interface, and structured semantic gradients with state locking. The paper claims that these three pillars address complementary failure surfaces, that none suffices alone at commodity cost, and that the resulting Harness itself becomes a first-class asset whose value compounds as foundation models commoditize.

What carries the argument

The Harness as an Asset, which formalizes domain invariants into machine-readable registries enforced by a deterministic Unified Assertion Interface.

If this is right

The Harness compounds in value over time as foundation models become interchangeable commodities.
CAAF delivers the required reliability on commodity-tier models, removing the need for premium or proprietary models in regulated settings.
Self-hosted on-premises architectures become feasible for sectors that cannot rely on cloud APIs.
The three pillars must be used together; removing any one leaves an open failure surface at commodity cost.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Organizations in the same regulated domain could share and maintain common Harnesses, turning the asset into an industry-level resource.
Development effort may shift from scaling individual models to curating and verifying invariant registries.
The same Harness mechanism could be tested for consistency across entirely different model families without retraining.

Load-bearing premise

Encoding domain invariants into machine-readable registries through the Unified Assertion Interface will enforce determinism on commodity models without introducing new failure modes or requiring model-specific tuning.

What would settle it

A controlled run on a safety-critical workflow that still shows constraint violation rates high enough to block deployment even after the full Harness and interface are applied.

Figures

Figures reproduced from arXiv: 2604.17025 by Tianbao Zhang.

**Figure 2.** Figure 2: Context Firewall contrast. Left: A monolithic LLM receives all constraints in a shared context window, enabling safety–cost trade-off contamination (Context Rot). Right: CAAF’s RAD architecture isolates each Executor to its domain-specific context; cross-node conflicts are detected deterministically by the UAI at integration time. Metadata-Driven DAG Construction: The Orchestrator uses a two-step RAD proce… view at source ↗

**Figure 3.** Figure 3: Enterprise asset landscape before and after Harness as an Asset (HaaA). Before HaaA, [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Structured Semantic Gradient pipeline. The LLM Executor generates a candidate ar [PITH_FULL_IMAGE:figures/full_fig_p011_4.png] view at source ↗

**Figure 5.** Figure 5: L3 AD paradox zone visualization. Forward safety requires [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗

**Figure 6.** Figure 6: Failure mode distribution across all 7 experimental conditions ( [PITH_FULL_IMAGE:figures/full_fig_p016_6.png] view at source ↗

**Figure 7.** Figure 7: DAG topology for the L3 AD degradation scenario. Layer 1 (sensor processing) feeds into [PITH_FULL_IMAGE:figures/full_fig_p017_7.png] view at source ↗

**Figure 8.** Figure 8: Solution zone for the satisfiable AD variant (perception range relaxed from 30 m to 90 m), [PITH_FULL_IMAGE:figures/full_fig_p019_8.png] view at source ↗

**Figure 9.** Figure 9: Stochastic oscillation (naive reflection, [PITH_FULL_IMAGE:figures/full_fig_p022_9.png] view at source ↗

**Figure 10.** Figure 10: Pharmaceutical flow reactor paradox zone in the ( [PITH_FULL_IMAGE:figures/full_fig_p024_10.png] view at source ↗

**Figure 11.** Figure 11: Failure mode distribution across eight experimental conditions ( [PITH_FULL_IMAGE:figures/full_fig_p024_11.png] view at source ↗

**Figure 12.** Figure 12: Paradox detection rate across architectures and domains. All baselines without deter [PITH_FULL_IMAGE:figures/full_fig_p028_12.png] view at source ↗

**Figure 13.** Figure 13: Cost vs. reliability on the Pharma benchmark ( [PITH_FULL_IMAGE:figures/full_fig_p030_13.png] view at source ↗

read the original abstract

Large Language Models produce a controllability gap in safety-critical engineering: even low rates of undetected constraint violations render a system undeployable. Current orchestration paradigms suffer from sycophantic compliance, context attention decay, and stochastic oscillation during self-correction. We introduce the Convergent AI Agent Framework (CAAF), which transitions agentic workflows from open-loop generation to closed-loop fail-safe determinism via three pillars: (1) Recursive Atomic Decomposition with physical context firewalls; (2) Harness as an Asset, formalizing domain invariants into machine-readable registries enforced by a deterministic Unified Assertion Interface; and (3) Structured Semantic Gradients with State Locking for monotonic non-regression. This paper makes two core claims. First, an industrialization thesis: once domain invariants are formalized as an executable Harness, the Harness itself becomes a first-class enterprise asset that compounds in value as foundation models commoditize, and CAAF's ability to deliver its reliability on commodity-tier models makes fully self-hosted, on-premises deployment architecturally feasible for regulated sectors where cloud APIs are not an option. Second, an architectural claim supported by ablation: CAAF's three pillars address complementary failure surfaces and none alone closes the controllability gap at commodity cost. The paper contributes entirely at the orchestration and industrialization layer. Evidence across two complementary benchmarks, three-tier UAI ablations, multi-agent baselines, and a closed-source commodity family replicated by two independent open-weight families, is reported in the body.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

CAAF frames domain invariants as a reusable Harness asset for closed-loop determinism on commodity models, but the ablations and mechanisms aren't shown in enough detail to confirm the claims hold without extra costs.

read the letter

The main thing here is that the paper introduces CAAF to shift agent workflows from open-loop to closed-loop determinism using three pillars: recursive decomposition with firewalls, the Harness formalized via a Unified Assertion Interface, and semantic gradients with state locking. It positions the Harness as an enterprise asset that compounds value for on-premises use in regulated sectors. That framing is the clearest new angle, and it does a solid job naming real pain points like sycophantic compliance, attention decay, and stochastic self-correction failures. The claim that the pillars address complementary surfaces and that none suffices alone is a reasonable architectural hypothesis, and referencing ablations plus replication across open-weight and commodity families shows some intent to ground it in data. The industrialization thesis about self-hosted deployment also lines up with practical needs where cloud APIs are off-limits. The soft spots sit in the execution details. The abstract describes two benchmarks, three-tier UAI ablations, and multi-agent baselines, yet without the actual error rates, correction success metrics, or how the interface turns invariants into reliable machine-readable checks, it's hard to judge whether stochasticity is truly tamed or just masked by retries and context growth. The stress-test point about potential new failure modes or per-model tuning is worth checking against the full results, because if those costs appear, the complementary-pillars and low-cost claims weaken. This is aimed at engineers and researchers working on agent orchestration for safety-critical or regulated applications. A reader in that space could pull useful structure from the pillar breakdown even if they don't adopt the full system. It deserves peer review so the benchmarks and implementation specifics get proper scrutiny, though the evaluation section will likely need tightening to make the determinism guarantees convincing.

Referee Report

2 major / 2 minor

Summary. The paper introduces the Convergent AI Agent Framework (CAAF) to close the controllability gap in LLM agentic workflows by transitioning from open-loop generation to closed-loop fail-safe determinism. It proposes three pillars—Recursive Atomic Decomposition with physical context firewalls, Harness as an Asset that formalizes domain invariants into machine-readable registries enforced by a deterministic Unified Assertion Interface (UAI), and Structured Semantic Gradients with State Locking for monotonic non-regression—and makes two core claims: an industrialization thesis that the Harness becomes a first-class enterprise asset compounding in value on commodity models for on-premises regulated deployments, and an architectural claim (supported by ablation) that the pillars address complementary failure surfaces with none sufficient alone at commodity cost. Evidence is stated to come from two benchmarks, three-tier UAI ablations, multi-agent baselines, and replication across open-weight and closed-source model families.

Significance. If the unreported empirical results in the body substantiate the claims, the work would be significant for enabling reliable self-hosted AI in safety-critical and regulated sectors by treating orchestration invariants as durable assets rather than relying on model improvements alone. The explicit focus on complementary failure modes and replication across model families is a constructive strength at the industrialization layer.

major comments (2)

[Abstract and body (benchmarks and ablations sections)] Abstract and body (benchmarks and ablations sections): the architectural claim that the three pillars address complementary failure surfaces and that the UAI enforces determinism on commodity models without model-specific tuning is load-bearing for both theses, yet the manuscript references but does not present the actual ablation metrics, violation rates, success percentages, or statistical comparisons (e.g., single-pillar vs. full CAAF performance), preventing verification that the Harness reliably catches stochastic violations without introducing retry loops or context bloat.
[Pillar 2 description (Harness as an Asset and UAI)] Pillar 2 description (Harness as an Asset and UAI): the mechanism for converting domain invariants into executable machine-readable registries and achieving monotonic correction is described conceptually but lacks a formal specification, pseudocode, state diagram, or error-handling analysis, which is required to assess robustness against inherent LLM stochasticity and whether new failure modes arise.

minor comments (2)

The abstract states that 'evidence ... is reported in the body' but the provided manuscript text does not include any tables, figures, or quantified results, which affects readability even if the full experimental section exists elsewhere.
Notation for the three pillars is introduced without a summary table or diagram that cross-references failure surfaces addressed by each, which would improve clarity.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback, which identifies key areas where the empirical evidence and formal mechanisms require clearer presentation to support our claims. We address each major comment below and will revise the manuscript to incorporate the requested details.

read point-by-point responses

Referee: [Abstract and body (benchmarks and ablations sections)] Abstract and body (benchmarks and ablations sections): the architectural claim that the three pillars address complementary failure surfaces and that the UAI enforces determinism on commodity models without model-specific tuning is load-bearing for both theses, yet the manuscript references but does not present the actual ablation metrics, violation rates, success percentages, or statistical comparisons (e.g., single-pillar vs. full CAAF performance), preventing verification that the Harness reliably catches stochastic violations without introducing retry loops or context bloat.

Authors: We agree that while the manuscript states that ablation evidence is reported in the body, the specific numerical results (violation rates, success percentages, and statistical comparisons between single-pillar and full CAAF configurations) are not explicitly tabulated or analyzed in the main text. This limits independent verification of the complementary failure surfaces and the UAI's performance without excessive retries or context overhead. In the revised manuscript we will add a dedicated ablation table and accompanying analysis presenting these metrics across the three-tier UAI experiments, multi-agent baselines, and model families, including direct comparisons that substantiate the architectural claim. revision: yes
Referee: [Pillar 2 description (Harness as an Asset and UAI)] Pillar 2 description (Harness as an Asset and UAI): the mechanism for converting domain invariants into executable machine-readable registries and achieving monotonic correction is described conceptually but lacks a formal specification, pseudocode, state diagram, or error-handling analysis, which is required to assess robustness against inherent LLM stochasticity and whether new failure modes arise.

Authors: We concur that the current description of the Harness and UAI remains at a conceptual level and does not provide sufficient formal detail for evaluating robustness to LLM stochasticity. We will revise the Pillar 2 section to include a formal specification of the invariant-to-registry conversion process, pseudocode for the Unified Assertion Interface, a state diagram illustrating monotonic correction and state locking, and an error-handling analysis that demonstrates how the mechanism avoids introducing new failure modes while enforcing determinism. revision: yes

Circularity Check

0 steps flagged

No circularity: claims rest on external benchmarks rather than self-referential definitions

full rationale

The provided manuscript text contains no equations, fitted parameters, or derivation steps that reduce to their own inputs. The industrialization thesis and architectural claim are presented as conceptual assertions whose support is explicitly attributed to separate empirical evidence (two benchmarks, three-tier UAI ablations, multi-agent baselines, and replication across model families). No self-citations, uniqueness theorems, or ansatzes are invoked in a load-bearing way within the abstract or full-text excerpts. The absence of any quoted reduction (e.g., a 'prediction' that is definitionally identical to a fitted input) keeps the circularity score at zero.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 3 invented entities

The central claims rest on the unverified premise that LLMs exhibit a controllability gap and that the three proposed pillars are both necessary and sufficient; several new entities are introduced without external evidence.

axioms (2)

domain assumption Large language models produce a controllability gap that renders systems undeployable in safety-critical engineering even at low violation rates.
Stated as the opening premise of the abstract.
domain assumption Current orchestration paradigms suffer from sycophantic compliance, context attention decay, and stochastic oscillation.
Listed as the problems CAAF is designed to solve.

invented entities (3)

Convergent AI Agent Framework (CAAF) no independent evidence
purpose: Transition agentic workflows from open-loop generation to closed-loop fail-safe determinism.
New framework introduced in the abstract.
Harness no independent evidence
purpose: Formalize domain invariants into machine-readable registries enforced by a deterministic Unified Assertion Interface.
Core concept presented as a first-class enterprise asset.
Unified Assertion Interface (UAI) no independent evidence
purpose: Enforce the harness registries deterministically.
Mentioned as part of the harness pillar.

pith-pipeline@v0.9.0 · 5561 in / 1457 out tokens · 44625 ms · 2026-05-10T06:36:37.520189+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

4 extracted references · 4 canonical work pages · 2 internal anchors

[1]

Constitutional AI: Harmlessness from AI Feedback

URLhttps://arxiv.org/abs/2212.08073. 43 Clark Barrett, Christopher L. Conway, Morgan Deters, Liana Hadarean, Dejan Jovanovi´ c, Tim King, Andrew Reynolds, and Cesare Tinelli. CVC4. InComputer Aided Verification (CAV), volume 6806 ofLecture Notes in Computer Science, pages 171–177. Springer, 2011. doi: 10. 1007/978-3-642-22110-1 14. Birgitta B¨ ockeler. Ha...

work page internal anchor Pith review Pith/arXiv arXiv doi:10.1007/978-3-540-78800-3 2011
[2]

Submitted Dec 2025; accepted to ICLR 2026 Workshop VerifAI-2

URLhttps://arxiv.org/abs/2512.23738. Submitted Dec 2025; accepted to ICLR 2026 Workshop VerifAI-2. Agent-C: DSL for temporal properties→first-order logic→SMT solver interleaved with constrained generation; backtracks on non-compliant token sequences. Reports 100% conformance / 0% harm on retail and airline benchmarks. Omar Khattab, Arnav Singhvi, Paridhi ...

work page arXiv 2025
[3]

How Computer Systems Embody Values

URLhttps://arxiv.org/abs/2603.28052. Submitted Mar 2026. Searches over harness code (storage / retrieval / presentation logic) using an agentic proposer that accesses source code, scores, and execution traces through a filesystem. Argues existing optimizers under-perform because they over-aggressively compress feedback. Hsien-Jyh Liao. Enforcing monotonic...

work page doi:10.1109/2 2026
[4]

Octo: An Open-Source Generalist Robot Policy

URLhttps://github.com/guidance-ai/guidance. Open-source toolkit for constrained (token-level) LLM generation. Octo Model Team, Dibya Ghosh, Homer Walke, Karl Pertsch, Kevin Black, Oier Mees, et al. Octo: An open-source generalist robot policy.arXiv preprint arXiv:2405.12213, 2024. URL https://arxiv.org/abs/2405.12213. Reid Pryzant, Dan Iter, Jerry Li, Yin...

work page internal anchor Pith review arXiv 2024

[1] [1]

Constitutional AI: Harmlessness from AI Feedback

URLhttps://arxiv.org/abs/2212.08073. 43 Clark Barrett, Christopher L. Conway, Morgan Deters, Liana Hadarean, Dejan Jovanovi´ c, Tim King, Andrew Reynolds, and Cesare Tinelli. CVC4. InComputer Aided Verification (CAV), volume 6806 ofLecture Notes in Computer Science, pages 171–177. Springer, 2011. doi: 10. 1007/978-3-642-22110-1 14. Birgitta B¨ ockeler. Ha...

work page internal anchor Pith review Pith/arXiv arXiv doi:10.1007/978-3-540-78800-3 2011

[2] [2]

Submitted Dec 2025; accepted to ICLR 2026 Workshop VerifAI-2

URLhttps://arxiv.org/abs/2512.23738. Submitted Dec 2025; accepted to ICLR 2026 Workshop VerifAI-2. Agent-C: DSL for temporal properties→first-order logic→SMT solver interleaved with constrained generation; backtracks on non-compliant token sequences. Reports 100% conformance / 0% harm on retail and airline benchmarks. Omar Khattab, Arnav Singhvi, Paridhi ...

work page arXiv 2025

[3] [3]

How Computer Systems Embody Values

URLhttps://arxiv.org/abs/2603.28052. Submitted Mar 2026. Searches over harness code (storage / retrieval / presentation logic) using an agentic proposer that accesses source code, scores, and execution traces through a filesystem. Argues existing optimizers under-perform because they over-aggressively compress feedback. Hsien-Jyh Liao. Enforcing monotonic...

work page doi:10.1109/2 2026

[4] [4]

Octo: An Open-Source Generalist Robot Policy

URLhttps://github.com/guidance-ai/guidance. Open-source toolkit for constrained (token-level) LLM generation. Octo Model Team, Dibya Ghosh, Homer Walke, Karl Pertsch, Kevin Black, Oier Mees, et al. Octo: An open-source generalist robot policy.arXiv preprint arXiv:2405.12213, 2024. URL https://arxiv.org/abs/2405.12213. Reid Pryzant, Dan Iter, Jerry Li, Yin...

work page internal anchor Pith review arXiv 2024