Gaming-Resistant Insurance Contracts for Autonomous AI Agents: Strategy-Proof Toll Mechanism Design

Hao-Hsuan Chen

arxiv: 2606.16326 · v2 · pith:6ESKSZ7Fnew · submitted 2026-06-15 · 💻 cs.GT · cs.AI· q-fin.RM

Gaming-Resistant Insurance Contracts for Autonomous AI Agents: Strategy-Proof Toll Mechanism Design

Hao-Hsuan Chen This is my paper

Pith reviewed 2026-06-27 02:45 UTC · model grok-4.3

classification 💻 cs.GT cs.AIq-fin.RM

keywords insurance contractsautonomous AI agentsstrategy-proof mechanismsincentive compatibilityactuarial runtimetoll mechanismsgaming attacksside-effect pricing

0 comments

The pith

Contract clauses render AI-agent insurance gaming-resistant by closing a five-attack space with aggregation, compliance, and truthful reporting.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper extends an existing actuarial runtime that prices AI-agent actions against a safe default into a full insurance contract when the operator can act strategically. It maps a five-attack space, shows that two surfaces are already closed by minimal-authority and no-splitting rules, and supplies three new clauses to close the rest: common-control aggregation that preserves boundary tolls on total exposure, escalation fees that treat interface failures as contract events rather than zero-toll wins, and a model-identity menu whose componentwise-minimum penalties make truthful model reporting weakly dominant. Composing the clauses with the runtime yields joint incentive compatibility; a two-parameter premium family then satisfies operator individual rationality and weak budget balance at the truthful equilibrium.

Core claim

The actuarial runtime is gaming-resistant once augmented with common-control aggregation, an interface-compliance theorem, and a model-identity menu; the resulting mechanism achieves joint incentive compatibility over the five-attack space while a two-parameter premium family meets individual rationality and weak budget balance at truthful equilibrium.

What carries the argument

The five-attack space together with the three new clauses (common-control aggregation, interface-compliance escalation, and model-identity menu with componentwise-minimum penalties) that close the remaining surfaces.

If this is right

Joint incentive compatibility holds over the entire five-attack space.
Truthful model reporting is weakly dominant for the operator.
Interface failures incur escalation fees rather than zero toll.
Cross-boundary re-routing cannot reduce toll below the boundary potential applied to total exposure.
The two-parameter premium family satisfies individual rationality and weak budget balance at the truthful equilibrium.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same clause structure could be applied to other runtime-priced autonomous systems where the user controls model selection and routing.
If the companion traces miss certain failure modes, the escalation-fee clause would need re-validation on new data.
The mechanism supplies a concrete template for adding incentive layers to any actuarial control system that previously treated the operator as passive.
Simulation experiments with the premium family could check whether budget balance remains intact under small deviations from truth-telling.

Load-bearing premise

The interface-compliance theorem is validated only on committed cross-model traces from the companion empirical paper, so the validation transfers only if those traces capture the relevant failure modes and the companion work is independent.

What would settle it

An operator that successfully reduces total toll below the boundary potential by cross-boundary re-routing, by triggering interface failures without paying escalation, or by misreporting the model while still receiving the minimum penalty schedule would falsify joint incentive compatibility.

read the original abstract

Paper A defines a time-consistent actuarial runtime that prices each side-effect-bearing action against a contractually fixed safe default and gates execution against a reserve budget. It treats the operator as passive. This paper makes the operator strategic. We characterise a five-attack space for autonomous AI-agent insurance contracts and prove when the actuarial runtime is gaming-resistant. Two attack surfaces -- post-toll safe-default selection and within-boundary action splitting -- are closed by Paper A's minimal-authority and no-splitting clauses. The remaining three require new contract clauses. First, common-control aggregation prevents cross-boundary re-routing from reducing toll below the boundary potential applied to total exposure. Second, interface failures such as invalid JSON are contract-relevant events, not safety wins: treating them as zero-toll safe defaults can reward unreliable models, while escalation fees reverse the incentive. We validate this interface-compliance theorem on committed cross-model traces from the companion empirical paper. Third, a model-identity menu with a componentwise-minimum penalty schedule makes truthful reporting of the deployed model weakly dominant. We then compose these clauses with Paper A's runtime guarantees to obtain joint incentive compatibility over the five-attack space. Finally, a two-parameter premium family discharges operator individual rationality and weak budget balance at the truthful equilibrium. The result is an incentive-compatibility layer for actuarial control of autonomous-agent side effects.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper adds three concrete clauses to close strategic attacks on AI-agent insurance but the joint IC claim rests on validation from a companion paper whose coverage and independence are unconfirmed.

read the letter

The paper's main contribution is characterizing a five-attack space for operator gaming of AI side-effect insurance and adding three clauses—common-control aggregation, escalation fees on interface failures, and a model-identity menu—to close the remaining surfaces after Paper A's minimal-authority and no-splitting rules handle the first two. It then composes everything to claim joint incentive compatibility and supplies a two-parameter premium family that satisfies individual rationality and weak budget balance at the truthful point.

What works is the clean separation of attack surfaces and the targeted fixes. The aggregation clause prevents re-routing from undercutting the toll, the interface clause treats failures as contract events rather than free wins, and the menu makes truthful model reporting dominant. These are specific, not generic extensions.

The soft spot is the interface-compliance theorem. Its validation sits on committed cross-model traces from the companion empirical paper. If those traces miss relevant invalid patterns or if the companion shares authorship and is not independent, the empirical closure does not transfer and the joint IC result is incomplete for that surface. The other two new clauses are presented as theoretical and look fine on their own, but the overall claim invokes the validated theorem.

This is for mechanism-design researchers working on AI safety or autonomous-agent control. A reader who already knows Paper A will see the incremental value in the attack list and clauses. It deserves peer review because the new clauses are concrete and the composition is a distinct result, even though referees will need to examine the companion traces and any missing derivations.

Referee Report

1 major / 0 minor

Summary. The manuscript characterizes a five-attack space for autonomous AI-agent insurance contracts and claims to prove that the actuarial runtime from Paper A is gaming-resistant when augmented by three new clauses: common-control aggregation to prevent cross-boundary re-routing, interface-compliance with escalation fees for failures such as invalid JSON (validated on companion traces), and a model-identity menu with componentwise-minimum penalties. It composes these with Paper A's minimal-authority and no-splitting rules to obtain joint incentive compatibility, and introduces a two-parameter premium family that satisfies operator individual rationality and weak budget balance at the truthful equilibrium.

Significance. If the central claims hold, this work contributes to mechanism design for AI safety by supplying a strategy-proof toll layer that renders actuarial control of side effects robust to strategic operators. The explicit composition of contract clauses with existing runtime guarantees and the identification of a five-attack space represent a structured approach to incentive compatibility in this domain.

major comments (1)

Abstract: The joint incentive compatibility result over the five-attack space is obtained by composing the new clauses with Paper A's runtime guarantees, but the interface-compliance theorem (the third attack surface) is validated only on committed cross-model traces from the companion empirical paper. The manuscript provides no details on the specific failure modes covered by these traces or on the independence of the companion work, which is load-bearing for closing the interface-failure attack surface and thus for the overall claim.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the careful review and for highlighting the need for greater transparency around the companion empirical work. We address the single major comment below and commit to revisions that strengthen the manuscript without altering its core claims.

read point-by-point responses

Referee: Abstract: The joint incentive compatibility result over the five-attack space is obtained by composing the new clauses with Paper A's runtime guarantees, but the interface-compliance theorem (the third attack surface) is validated only on committed cross-model traces from the companion empirical paper. The manuscript provides no details on the specific failure modes covered by these traces or on the independence of the companion work, which is load-bearing for closing the interface-failure attack surface and thus for the overall claim.

Authors: We agree that the manuscript currently provides insufficient detail on the companion traces and their relationship to the interface-compliance theorem, leaving the closure of that attack surface less self-contained than it should be. In revision we will add a dedicated subsection (or short appendix) that (i) enumerates the concrete failure modes validated on the traces (invalid JSON, malformed outputs, non-compliant interface calls, and similar events) and (ii) states that the companion empirical paper is an independent study whose traces are committed and publicly referenceable, with no shared authorship or data leakage. These additions will make the joint incentive-compatibility argument fully transparent while preserving all existing theorems and proofs. revision: yes

Circularity Check

1 steps flagged

Joint IC result depends on interface-compliance theorem validated solely on companion empirical traces with overlapping authorship

specific steps

self citation load bearing [Abstract]
"We validate this interface-compliance theorem on committed cross-model traces from the companion empirical paper. ... We then compose these clauses with Paper A's runtime guarantees to obtain joint incentive compatibility over the five-attack space."

The joint incentive compatibility claim is obtained by composing the new clauses with Paper A's guarantees; one of the three attack surfaces is closed only via the interface-compliance theorem, which the paper states is validated on traces from the companion empirical paper. This makes the central result load-bearing on a citation whose authors overlap with the present work, with no independent verification supplied.

full rationale

The paper's central derivation composes three new clauses with Paper A's runtime guarantees to claim joint incentive compatibility over the five-attack space. Two surfaces are closed by existing clauses, but the interface-failure surface is closed only by the interface-compliance theorem, whose validation is explicitly stated to rest on committed cross-model traces from the companion empirical paper. Because the overall joint-IC claim invokes this validated theorem as a load-bearing step and the companion work shares authorship with Paper A, the result reduces to self-citation load-bearing rather than independent external support. No machine-checked proof, parameter-free external benchmark, or falsifiable claim outside the fitted traces is provided for that surface.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

The work rests on the five-attack space being exhaustive, the base runtime from Paper A being correctly gaming-resistant under passive operators, and the companion traces being representative. The two-parameter premium family is introduced to satisfy IR and budget balance.

free parameters (1)

two-parameter premium family
Discharges operator individual rationality and weak budget balance at the truthful equilibrium; parameters chosen to satisfy these properties.

axioms (2)

domain assumption The five-attack space exhausts relevant strategic behaviors for autonomous AI-agent insurance contracts.
Paper states it characterises the space and proves resistance when clauses are added; no independent justification given in abstract.
domain assumption Paper A's minimal-authority and no-splitting clauses close the post-toll safe-default and action-splitting attacks.
Invoked directly to reduce the remaining attack surfaces to three.

pith-pipeline@v0.9.1-grok · 5770 in / 1371 out tokens · 49925 ms · 2026-06-27T02:45:12.926405+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

44 extracted references · 17 canonical work pages · 5 internal anchors

[1]

Acharya, Lasse H

Viral V. Acharya, Lasse H. Pedersen, Thomas Philippon, and Matthew Richardson. Measuring systemic risk.The Review of Financial Studies, 30(1):2–47, 2017. doi: 10.1093/rfs/hhw088

work page doi:10.1093/rfs/hhw088 2017
[2]

A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification

Anastasios N. Angelopoulos and Stephen Bates. A gentle introduction to conformal prediction and distribution-free uncertainty quantification.arXiv preprint arXiv:2107.07511, 2021. doi: 10.48550/arXiv.2107.07511

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2107.07511 2021
[3]

Coherent measures of risk.Mathematical Finance, 9(3):203–228, 1999

Philippe Artzner, Freddy Delbaen, Jean-Marc Eber, and David Heath. Coherent measures of risk.Mathematical Finance, 9(3):203–228, 1999. doi: 10.1111/1467-9965.00068

work page doi:10.1111/1467-9965.00068 1999
[4]

Constitutional AI: Harmlessness from AI feedback.arXiv preprint arXiv:2212.08073, 2022

Yuntao Bai, Saurav Kadavath, Sandipan Kundu, Amanda Askell, et al. Constitutional AI: Harmlessness from AI feedback.arXiv preprint arXiv:2212.08073, 2022

Pith/arXiv arXiv 2022
[5]

Algorithmic insurance.arXiv preprint arXiv:2106.00839, 2021

Dimitris Bertsimas and Agni Orfanoudaki. Algorithmic insurance.arXiv preprint arXiv:2106.00839, 2021. doi: 10.48550/arXiv.2106.00839

work page doi:10.48550/arxiv.2106.00839 2021
[6]

Catastrophe insurance: An adaptive robust optimization approach

Dimitris Bertsimas et al. Catastrophe insurance: An adaptive robust optimization approach. arXiv preprint arXiv:2405.07068, 2024. doi: 10.48550/arXiv.2405.07068

work page doi:10.48550/arxiv.2405.07068 2024
[7]

Finance Stoch

Jocelyne Bion-Nadal. Dynamic risk measures: Time consistency and risk measures from bmo martingales.Finance and Stochastics, 12(2):219–244, 2008. doi: 10.1007/s00780-007-0057-1

work page doi:10.1007/s00780-007-0057-1 2008
[8]

Experience rating and credibility.ASTIN Bulletin, 4(3):199–207, 1967

Hans B¨ uhlmann. Experience rating and credibility.ASTIN Bulletin, 4(3):199–207, 1967

1967
[9]

Insuring Every Action: An Authority Frontier Framework for Runtime Actuarial Control of Autonomous AI Agents

Hao-Hsuan Chen. Insuring every action: An authority frontier framework for runtime actuarial control of autonomous AI agents.arXiv preprint arXiv:2605.25632, 2026. doi: 10.48550/arXiv.2605.25632. Companion empirical paper

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2605.25632 2026
[10]

Foundations of a Time-Consistent Counterfactual Actuarial Runtime for Autonomous AI Agents

Hao-Hsuan Chen. Foundations of a time-consistent counterfactual actuarial runtime for autonomous AI agents.arXiv preprint arXiv:2605.26508, 2026. doi: 10.48550/arXiv.2605.26508. Companion mathematical foundations paper; also posted on SSRN (id 6761960)

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2605.26508 2026
[11]

Electron

Patrick Cheridito, Freddy Delbaen, and Michael Kupper. Dynamic monetary risk measures for bounded discrete-time processes.Electronic Journal of Probability, 11:57–106, 2006. doi: 10.1214/EJP.v11-302. 27

work page doi:10.1214/ejp.v11-302 2006
[12]

Algorithms for CVaR optimization in MDPs

Yinlam Chow and Mohammad Ghavamzadeh. Algorithms for CVaR optimization in MDPs. In Advances in Neural Information Processing Systems, volume 27, 2014

2014
[13]

Edward H. Clarke. Multipart pricing of public goods.Public Choice, 11:17–33, 1971

1971
[14]

Finance Stoch

Kai Detlefsen and Giacomo Scandolo. Conditional and dynamic convex risk measures.Finance and Stochastics, 9:539–561, 2005. doi: 10.1007/s00780-005-0159-6

work page doi:10.1007/s00780-005-0159-6 2005
[15]

What do we know about cyber risk and cyber risk insurance? A systematization of literature.Journal of Risk Finance, 17(5):474–491, 2016

Martin Eling and Werner Schnell. What do we know about cyber risk and cyber risk insurance? A systematization of literature.Journal of Risk Finance, 17(5):474–491, 2016. doi: 10.1108/JRF-09-2016-0122

work page doi:10.1108/jrf-09-2016-0122 2016
[16]

Manipulation of voting schemes: A general result.Econometrica, 41(4): 587–601, 1973

Allan Gibbard. Manipulation of voting schemes: A general result.Econometrica, 41(4): 587–601, 1973

1973
[17]

Cand` es

Isaac Gibbs and Emmanuel J. Cand` es. Adaptive conformal inference under distribution shift. InAdvances in Neural Information Processing Systems, volume 34, 2021

2021
[18]

Incentives in teams.Econometrica, 41(4):617–631, 1973

Theodore Groves. Incentives in teams.Econometrica, 41(4):617–631, 1973

1973
[19]

Moral hazard and observability.The Bell Journal of Economics, 10(1): 74–91, 1979

Bengt Holmstr¨ om. Moral hazard and observability.The Bell Journal of Economics, 10(1): 74–91, 1979

1979
[20]

Quantifying Trust: Financial Risk Management for Trustworthy AI Agents

Wenyue Hua, Tianyi Peng, Chi Wang, Jiaxin Pei, Ian Kaufman, Bryan Lim, and Chandler Fang. Quantifying trust: Financial risk management for trustworthy AI agents.arXiv preprint arXiv:2604.03976, 2026. doi: 10.48550/arXiv.2604.03976

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2604.03976 2026
[21]

Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, and Karthik Narasimhan

Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, and Karthik Narasimhan. SWE-bench: Can language models resolve real-world GitHub issues? In International Conference on Learning Representations, 2024

2024
[22]

Boda Kang and Jerzy A. Filar. Time consistent dynamic risk measures.Mathematical Methods of Operations Research, 63(1):169–186, 2006. doi: 10.1007/s00186-005-0045-1

work page doi:10.1007/s00186-005-0045-1 2006
[23]

Kochenderfer, Tim A

Mykel J. Kochenderfer, Tim A. Wheeler, and Kyle H. Wray.Algorithms for Decision Making. MIT Press, 2022

2022
[24]

Princeton University Press, 2002

Jean-Jacques Laffont and David Martimort.The Theory of Incentives: The Principal-Agent Model. Princeton University Press, 2002

2002
[25]

AgentBench: Evaluating LLMs as agents

Xiao Liu, Hao Yu, Hanchen Zhang, Yifan Xu, Xuanyu Lei, et al. AgentBench: Evaluating LLMs as agents. InInternational Conference on Learning Representations, 2024

2024
[26]

McNeil, R¨ udiger Frey, and Paul Embrechts.Quantitative Risk Management: Concepts, Techniques and Tools

Alexander J. McNeil, R¨ udiger Frey, and Paul Embrechts.Quantitative Risk Management: Concepts, Techniques and Tools. Princeton University Press, revised edition, 2015

2015
[27]

Miller.Robust Composition: Towards a Unified Approach to Access Control and Concurrency Control

Mark S. Miller.Robust Composition: Towards a Unified Approach to Access Control and Concurrency Control. PhD thesis, Johns Hopkins University, 2006

2006
[28]

Roger B. Myerson. Optimal auction design.Mathematics of Operations Research, 6(1):58–73,
[29]

doi: 10.1287/moor.6.1.58

work page doi:10.1287/moor.6.1.58
[30]

Myerson and Mark A

Roger B. Myerson and Mark A. Satterthwaite. Efficient mechanisms for bilateral trading. Journal of Economic Theory, 29(2):265–281, 1983. 28

1983
[31]

Wainwright, et al

Long Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, et al. Training language models to follow instructions with human feedback. InAdvances in Neural Information Processing Systems, 2022

2022
[32]

OWASP top 10 for large language model applications

OWASP Foundation. OWASP top 10 for large language model applications. https://owasp.org/www-project-top-10-for-large-language-model-applications, 2024

2024
[33]

Cambridge University Press, 2 edition, 2009

Judea Pearl.Causality: Models, Reasoning, and Inference. Cambridge University Press, 2 edition, 2009

2009
[34]

NeMo Guardrails: A toolkit for controllable and safe LLM applications with programmable rails

Traian Rebedea, Razvan Dinu, Makesh Sreedhar, Christopher Parisien, and Jonathan Cohen. NeMo Guardrails: A toolkit for controllable and safe LLM applications with programmable rails. InProceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 431–445. Association for Computational Linguistics,

2023
[35]

Tyrrell Rockafellar and Stanislav Uryasev

R. Tyrrell Rockafellar and Stanislav Uryasev. Optimization of conditional value-at-risk. Journal of Risk, 2:21–42, 2000

2000
[36]

Berend Roorda and J. M. Schumacher. Time consistency conditions for acceptability measures, with an application to tail value at risk.Insurance: Mathematics and Economics, 40(2): 209–230, 2007. doi: 10.1016/j.insmatheco.2006.04.003

work page doi:10.1016/j.insmatheco.2006.04.003 2007
[37]

Risk-averse dynamic programming for Markov decision processes

Andrzej Ruszczy´ nski. Risk-averse dynamic programming for Markov decision processes. Mathematical Programming, 125(2):235–261, 2010. doi: 10.1007/s10107-010-0393-3

work page doi:10.1007/s10107-010-0393-3 2010
[38]

Strategy-proofness and Arrow’s conditions: Existence and correspondence theorems for voting procedures and social welfare functions.Journal of Economic Theory, 10(2):187–217, 1975

Mark Allen Satterthwaite. Strategy-proofness and Arrow’s conditions: Existence and correspondence theorems for voting procedures and social welfare functions.Journal of Economic Theory, 10(2):187–217, 1975

1975
[39]

SIAM, 2009

Alexander Shapiro, Darinka Dentcheva, and Andrzej Ruszczy´ nski.Lectures on Stochastic Programming: Modeling and Theory. SIAM, 2009

2009
[40]

Optimizing the CVaR via sampling

Aviv Tamar, Yonatan Glassner, and Shie Mannor. Optimizing the CVaR via sampling. In AAAI Conference on Artificial Intelligence, 2015

2015
[41]

Capital Allocation to Business Units and Sub-Portfolios: the Euler Principle

Dirk Tasche. Capital allocation to business units and sub-portfolios: The Euler principle. arXiv preprint arXiv:0708.2542, 2007. doi: 10.48550/arXiv.0708.2542

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.0708.2542 2007
[42]

Counterspeculation, auctions, and competitive sealed tenders.The Journal of Finance, 16(1):8–37, 1961

William Vickrey. Counterspeculation, auctions, and competitive sealed tenders.The Journal of Finance, 16(1):8–37, 1961

1961
[43]

Springer, 2005

Vladimir Vovk, Alexander Gammerman, and Glenn Shafer.Algorithmic Learning in a Random World. Springer, 2005

2005
[44]

Shunyu Yao, Noah Shinn, Pedram Razavi, and Karthik Narasimhan.τ-bench: A benchmark for tool-agent-user interaction in real-world domains.arXiv preprint arXiv:2406.12045, 2024. 29

Pith/arXiv arXiv 2024

[1] [1]

Acharya, Lasse H

Viral V. Acharya, Lasse H. Pedersen, Thomas Philippon, and Matthew Richardson. Measuring systemic risk.The Review of Financial Studies, 30(1):2–47, 2017. doi: 10.1093/rfs/hhw088

work page doi:10.1093/rfs/hhw088 2017

[2] [2]

A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification

Anastasios N. Angelopoulos and Stephen Bates. A gentle introduction to conformal prediction and distribution-free uncertainty quantification.arXiv preprint arXiv:2107.07511, 2021. doi: 10.48550/arXiv.2107.07511

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2107.07511 2021

[3] [3]

Coherent measures of risk.Mathematical Finance, 9(3):203–228, 1999

Philippe Artzner, Freddy Delbaen, Jean-Marc Eber, and David Heath. Coherent measures of risk.Mathematical Finance, 9(3):203–228, 1999. doi: 10.1111/1467-9965.00068

work page doi:10.1111/1467-9965.00068 1999

[4] [4]

Constitutional AI: Harmlessness from AI feedback.arXiv preprint arXiv:2212.08073, 2022

Yuntao Bai, Saurav Kadavath, Sandipan Kundu, Amanda Askell, et al. Constitutional AI: Harmlessness from AI feedback.arXiv preprint arXiv:2212.08073, 2022

Pith/arXiv arXiv 2022

[5] [5]

Algorithmic insurance.arXiv preprint arXiv:2106.00839, 2021

Dimitris Bertsimas and Agni Orfanoudaki. Algorithmic insurance.arXiv preprint arXiv:2106.00839, 2021. doi: 10.48550/arXiv.2106.00839

work page doi:10.48550/arxiv.2106.00839 2021

[6] [6]

Catastrophe insurance: An adaptive robust optimization approach

Dimitris Bertsimas et al. Catastrophe insurance: An adaptive robust optimization approach. arXiv preprint arXiv:2405.07068, 2024. doi: 10.48550/arXiv.2405.07068

work page doi:10.48550/arxiv.2405.07068 2024

[7] [7]

Finance Stoch

Jocelyne Bion-Nadal. Dynamic risk measures: Time consistency and risk measures from bmo martingales.Finance and Stochastics, 12(2):219–244, 2008. doi: 10.1007/s00780-007-0057-1

work page doi:10.1007/s00780-007-0057-1 2008

[8] [8]

Experience rating and credibility.ASTIN Bulletin, 4(3):199–207, 1967

Hans B¨ uhlmann. Experience rating and credibility.ASTIN Bulletin, 4(3):199–207, 1967

1967

[9] [9]

Insuring Every Action: An Authority Frontier Framework for Runtime Actuarial Control of Autonomous AI Agents

Hao-Hsuan Chen. Insuring every action: An authority frontier framework for runtime actuarial control of autonomous AI agents.arXiv preprint arXiv:2605.25632, 2026. doi: 10.48550/arXiv.2605.25632. Companion empirical paper

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2605.25632 2026

[10] [10]

Foundations of a Time-Consistent Counterfactual Actuarial Runtime for Autonomous AI Agents

Hao-Hsuan Chen. Foundations of a time-consistent counterfactual actuarial runtime for autonomous AI agents.arXiv preprint arXiv:2605.26508, 2026. doi: 10.48550/arXiv.2605.26508. Companion mathematical foundations paper; also posted on SSRN (id 6761960)

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2605.26508 2026

[11] [11]

Electron

Patrick Cheridito, Freddy Delbaen, and Michael Kupper. Dynamic monetary risk measures for bounded discrete-time processes.Electronic Journal of Probability, 11:57–106, 2006. doi: 10.1214/EJP.v11-302. 27

work page doi:10.1214/ejp.v11-302 2006

[12] [12]

Algorithms for CVaR optimization in MDPs

Yinlam Chow and Mohammad Ghavamzadeh. Algorithms for CVaR optimization in MDPs. In Advances in Neural Information Processing Systems, volume 27, 2014

2014

[13] [13]

Edward H. Clarke. Multipart pricing of public goods.Public Choice, 11:17–33, 1971

1971

[14] [14]

Finance Stoch

Kai Detlefsen and Giacomo Scandolo. Conditional and dynamic convex risk measures.Finance and Stochastics, 9:539–561, 2005. doi: 10.1007/s00780-005-0159-6

work page doi:10.1007/s00780-005-0159-6 2005

[15] [15]

What do we know about cyber risk and cyber risk insurance? A systematization of literature.Journal of Risk Finance, 17(5):474–491, 2016

Martin Eling and Werner Schnell. What do we know about cyber risk and cyber risk insurance? A systematization of literature.Journal of Risk Finance, 17(5):474–491, 2016. doi: 10.1108/JRF-09-2016-0122

work page doi:10.1108/jrf-09-2016-0122 2016

[16] [16]

Manipulation of voting schemes: A general result.Econometrica, 41(4): 587–601, 1973

Allan Gibbard. Manipulation of voting schemes: A general result.Econometrica, 41(4): 587–601, 1973

1973

[17] [17]

Cand` es

Isaac Gibbs and Emmanuel J. Cand` es. Adaptive conformal inference under distribution shift. InAdvances in Neural Information Processing Systems, volume 34, 2021

2021

[18] [18]

Incentives in teams.Econometrica, 41(4):617–631, 1973

Theodore Groves. Incentives in teams.Econometrica, 41(4):617–631, 1973

1973

[19] [19]

Moral hazard and observability.The Bell Journal of Economics, 10(1): 74–91, 1979

Bengt Holmstr¨ om. Moral hazard and observability.The Bell Journal of Economics, 10(1): 74–91, 1979

1979

[20] [20]

Quantifying Trust: Financial Risk Management for Trustworthy AI Agents

Wenyue Hua, Tianyi Peng, Chi Wang, Jiaxin Pei, Ian Kaufman, Bryan Lim, and Chandler Fang. Quantifying trust: Financial risk management for trustworthy AI agents.arXiv preprint arXiv:2604.03976, 2026. doi: 10.48550/arXiv.2604.03976

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2604.03976 2026

[21] [21]

Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, and Karthik Narasimhan

Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, and Karthik Narasimhan. SWE-bench: Can language models resolve real-world GitHub issues? In International Conference on Learning Representations, 2024

2024

[22] [22]

Boda Kang and Jerzy A. Filar. Time consistent dynamic risk measures.Mathematical Methods of Operations Research, 63(1):169–186, 2006. doi: 10.1007/s00186-005-0045-1

work page doi:10.1007/s00186-005-0045-1 2006

[23] [23]

Kochenderfer, Tim A

Mykel J. Kochenderfer, Tim A. Wheeler, and Kyle H. Wray.Algorithms for Decision Making. MIT Press, 2022

2022

[24] [24]

Princeton University Press, 2002

Jean-Jacques Laffont and David Martimort.The Theory of Incentives: The Principal-Agent Model. Princeton University Press, 2002

2002

[25] [25]

AgentBench: Evaluating LLMs as agents

Xiao Liu, Hao Yu, Hanchen Zhang, Yifan Xu, Xuanyu Lei, et al. AgentBench: Evaluating LLMs as agents. InInternational Conference on Learning Representations, 2024

2024

[26] [26]

McNeil, R¨ udiger Frey, and Paul Embrechts.Quantitative Risk Management: Concepts, Techniques and Tools

Alexander J. McNeil, R¨ udiger Frey, and Paul Embrechts.Quantitative Risk Management: Concepts, Techniques and Tools. Princeton University Press, revised edition, 2015

2015

[27] [27]

Miller.Robust Composition: Towards a Unified Approach to Access Control and Concurrency Control

Mark S. Miller.Robust Composition: Towards a Unified Approach to Access Control and Concurrency Control. PhD thesis, Johns Hopkins University, 2006

2006

[28] [28]

Roger B. Myerson. Optimal auction design.Mathematics of Operations Research, 6(1):58–73,

[29] [29]

doi: 10.1287/moor.6.1.58

work page doi:10.1287/moor.6.1.58

[30] [30]

Myerson and Mark A

Roger B. Myerson and Mark A. Satterthwaite. Efficient mechanisms for bilateral trading. Journal of Economic Theory, 29(2):265–281, 1983. 28

1983

[31] [31]

Wainwright, et al

Long Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, et al. Training language models to follow instructions with human feedback. InAdvances in Neural Information Processing Systems, 2022

2022

[32] [32]

OWASP top 10 for large language model applications

OWASP Foundation. OWASP top 10 for large language model applications. https://owasp.org/www-project-top-10-for-large-language-model-applications, 2024

2024

[33] [33]

Cambridge University Press, 2 edition, 2009

Judea Pearl.Causality: Models, Reasoning, and Inference. Cambridge University Press, 2 edition, 2009

2009

[34] [34]

NeMo Guardrails: A toolkit for controllable and safe LLM applications with programmable rails

Traian Rebedea, Razvan Dinu, Makesh Sreedhar, Christopher Parisien, and Jonathan Cohen. NeMo Guardrails: A toolkit for controllable and safe LLM applications with programmable rails. InProceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 431–445. Association for Computational Linguistics,

2023

[35] [35]

Tyrrell Rockafellar and Stanislav Uryasev

R. Tyrrell Rockafellar and Stanislav Uryasev. Optimization of conditional value-at-risk. Journal of Risk, 2:21–42, 2000

2000

[36] [36]

Berend Roorda and J. M. Schumacher. Time consistency conditions for acceptability measures, with an application to tail value at risk.Insurance: Mathematics and Economics, 40(2): 209–230, 2007. doi: 10.1016/j.insmatheco.2006.04.003

work page doi:10.1016/j.insmatheco.2006.04.003 2007

[37] [37]

Risk-averse dynamic programming for Markov decision processes

Andrzej Ruszczy´ nski. Risk-averse dynamic programming for Markov decision processes. Mathematical Programming, 125(2):235–261, 2010. doi: 10.1007/s10107-010-0393-3

work page doi:10.1007/s10107-010-0393-3 2010

[38] [38]

Strategy-proofness and Arrow’s conditions: Existence and correspondence theorems for voting procedures and social welfare functions.Journal of Economic Theory, 10(2):187–217, 1975

Mark Allen Satterthwaite. Strategy-proofness and Arrow’s conditions: Existence and correspondence theorems for voting procedures and social welfare functions.Journal of Economic Theory, 10(2):187–217, 1975

1975

[39] [39]

SIAM, 2009

Alexander Shapiro, Darinka Dentcheva, and Andrzej Ruszczy´ nski.Lectures on Stochastic Programming: Modeling and Theory. SIAM, 2009

2009

[40] [40]

Optimizing the CVaR via sampling

Aviv Tamar, Yonatan Glassner, and Shie Mannor. Optimizing the CVaR via sampling. In AAAI Conference on Artificial Intelligence, 2015

2015

[41] [41]

Capital Allocation to Business Units and Sub-Portfolios: the Euler Principle

Dirk Tasche. Capital allocation to business units and sub-portfolios: The Euler principle. arXiv preprint arXiv:0708.2542, 2007. doi: 10.48550/arXiv.0708.2542

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.0708.2542 2007

[42] [42]

Counterspeculation, auctions, and competitive sealed tenders.The Journal of Finance, 16(1):8–37, 1961

William Vickrey. Counterspeculation, auctions, and competitive sealed tenders.The Journal of Finance, 16(1):8–37, 1961

1961

[43] [43]

Springer, 2005

Vladimir Vovk, Alexander Gammerman, and Glenn Shafer.Algorithmic Learning in a Random World. Springer, 2005

2005

[44] [44]

Shunyu Yao, Noah Shinn, Pedram Razavi, and Karthik Narasimhan.τ-bench: A benchmark for tool-agent-user interaction in real-world domains.arXiv preprint arXiv:2406.12045, 2024. 29

Pith/arXiv arXiv 2024