Agentic AI and Hallucinations

Ali I. Ozkes; Engin Iyidogan

arxiv: 2507.19183 · v1 · submitted 2025-07-25 · 💰 econ.TH

Agentic AI and Hallucinations

Engin Iyidogan , Ali I. Ozkes This is my paper

Pith reviewed 2026-05-19 03:12 UTC · model grok-4.3

classification 💰 econ.TH

keywords agentic AIhallucinationsreputational equilibriumverification effortAI marketaccuracy concernsgenerative models

0 comments

The pith

AI agents exert more verification effort when facing users who most fear hallucinations, raising prices in accuracy-sensitive sectors.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper models a competitive market where AI agents purchase answers from generative models and resell them to users who differ in accuracy concerns and hallucination fears. Agents can privately invest in costly verification to reduce risks. A hallucination ends the interaction permanently, so agents protect future rents by verifying more when the user base includes many accuracy-focused customers. Under discounting, this produces a unique reputational equilibrium. The model implies that sectors such as law and medicine will generate higher equilibrium effort and prices without external regulation.

Core claim

A unique reputational equilibrium exists under nontrivial discounting. The equilibrium effort, and thus the price, increases with the share of users who have high accuracy concerns, implying that hallucination-sensitive sectors, such as law and medicine, endogenously lead to more serious verification efforts in agentic AI markets.

What carries the argument

The reputational equilibrium created by permanent interaction halts after hallucinations, which disciplines agents through the loss of all future rents and creates incentives for costly verification.

If this is right

Equilibrium verification effort and prices rise as the share of high-accuracy users grows.
Sectors with strong accuracy demands, such as law and medicine, endogenously produce more reliable AI services.
Competitive pricing reflects differences in user accuracy concerns across market segments.
Nontrivial discounting is required for the equilibrium to be unique.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same permanent-loss mechanism could apply to other AI errors if they also terminate user relationships.
Raising user awareness of accuracy risks could improve overall verification levels across the market.
The market may segment over time into high-verification services for concerned users and lower-cost options for tolerant ones.

Load-bearing premise

A hallucination causes the interaction to halt permanently, thereby disciplining agents through the loss of all future rents from that user.

What would settle it

Data on whether AI agents serving legal or medical clients show measurably higher verification rates or prices than agents serving entertainment or casual users.

Figures

Figures reproduced from arXiv: 2507.19183 by Ali I. Ozkes, Engin Iyidogan.

**Figure 1.** Figure 1: a shows that greater patience relaxes the incentive-compatibility constraint, shifting the whole e ∗ (µ) schedule upward, while effort is strictly increasing in µ as established in Theorem 1. Verification plateaus once e ∗ ≈ 1.97, where the hallucination rate falls below 0.02. 00 0 0 0 0 0 [PITH_FULL_IMAGE:figures/full_fig_p008_1.png] view at source ↗

**Figure 2.** Figure 2: Upstream model choice. Notes: Total welfare W under Model A (h A 0 = 0.20, kA = 0.05, solid) and Model B (h B 0 = 0.13, kB = 0.30, dashed). References Asgari, E., N. Monta˜na-Brown, M. Dubois, S. Khalil, J. Balloch, J. A. Yeung, and D. Pimenta (2025). A Framework to Assess Clinical Safety and Hallucination Rates of LLMs for Medical Text Summarisation. npj Digital Medicine 8(1), 274. Athey, S. C., K. A. Bry… view at source ↗

read the original abstract

We model a competitive market where AI agents buy answers from upstream generative models and resell them to users who differ in how much they value accuracy and in how much they fear hallucinations. Agents can privately exert effort for costly verification to lower hallucination risks. Since interactions halt in the event of a hallucination, the threat of losing future rents disciplines effort. A unique reputational equilibrium exists under nontrivial discounting. The equilibrium effort, and thus the price, increases with the share of users who have high accuracy concerns, implying that hallucination-sensitive sectors, such as law and medicine, endogenously lead to more serious verification efforts in agentic AI markets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a clean reputational story for why verification effort rises in hallucination-sensitive sectors, but the whole result turns on the assumption that any detected error ends the relationship permanently.

read the letter

The core contribution is straightforward: in a market where AI agents resell model outputs, agents choose verification effort to avoid hallucinations, and users differ in how much they care about accuracy. Because a hallucination stops all future trade with that user, the loss of continuation value disciplines the agent. The model delivers a unique equilibrium under discounting and a comparative static that effort and price both increase with the fraction of high-accuracy users. That maps directly onto the claim that law and medicine will see stricter verification than less sensitive sectors. The setup is a standard repeated-game incentive constraint applied to this new market structure, and the logic is internally consistent on its own terms. The comparative static follows once the population share shifts the effective continuation payoff in the incentive-compatibility condition. What the paper does well is keep the mechanism transparent and tie it to a policy-relevant observation about sector differences. The main soft spot is the permanent-halt assumption itself. If users can switch agents, forgive, or continue with positive probability after an error, the continuation loss shrinks and the incentive to verify weakens; the comparative static could flatten or reverse. The abstract states existence and uniqueness without showing the derivation steps or functional forms, so it is hard to judge how fragile those results are to small changes in the setup. The free parameters are the usual ones in these models—discount factor and user-type distribution—so nothing exotic is hidden there. This is the kind of applied-theory note that belongs in a field journal or as a short paper in an AI-economics session. It is worth sending to referees who know repeated games and platform design; they can check whether the equilibrium derivation holds up and whether the halt assumption needs relaxation or robustness checks. I would bring it to a reading group for the discussion on modeling choices, but I would not cite it in my own work yet.

Referee Report

3 major / 2 minor

Summary. The paper models a competitive market in which AI agents purchase outputs from upstream generative models and resell them to heterogeneous users who differ in their valuation of accuracy and their aversion to hallucinations. Agents privately choose costly verification effort that reduces the probability of hallucinations. The central modeling assumption is that a detected hallucination permanently terminates the relationship with that user, so that the loss of all future rents disciplines effort. Under nontrivial discounting the model yields a unique reputational equilibrium; equilibrium verification effort and the resulting market price both rise with the measure of high-accuracy-concern users. The paper concludes that hallucination-sensitive sectors such as law and medicine will therefore induce higher endogenous verification effort.

Significance. If the equilibrium characterization is correct, the paper supplies a clean reputational mechanism that links user heterogeneity to verification incentives in agentic-AI markets. The comparative-static result offers a theoretical rationale for why professional domains may observe more careful agent behavior without external mandates. The contribution lies in embedding a standard repeated-game discipline device inside a competitive market for AI services; its value depends on whether the permanent-halt assumption can be relaxed without overturning the main prediction.

major comments (3)

[§2] §2 (model primitives): the incentive-compatibility constraint for verification effort equates marginal cost to the discounted loss of all future rents only because the continuation value after a hallucination is set to zero for that user-agent pair. The paper should derive the equilibrium effort level explicitly under a positive continuation probability (e.g., user forgiveness or switching) and show whether the comparative static on the share of high-accuracy users survives; without this check the load-bearing role of the permanent-halt assumption remains untested.
[§3, Proposition 1] §3, Proposition 1: uniqueness of the reputational equilibrium is asserted under nontrivial discounting, yet the proof sketch is not supplied. The argument appears to rely on a contraction-mapping or fixed-point property of the best-reply correspondence; an explicit statement of the mapping and the conditions on the discount factor that guarantee uniqueness would allow readers to assess whether the result is robust to small changes in the user-type distribution.
[§4] §4 (comparative statics): the claim that equilibrium effort rises with the share of high-accuracy users follows from the shift in the effective continuation value when the population composition changes. The paper should report the derivative of equilibrium effort with respect to this share (or the relevant cross-partial) and confirm that it remains positive when the hallucination probability is allowed to depend on effort in a non-linear way.

minor comments (2)

[§2] Notation for the user-type distribution and the effort cost function should be introduced once in §2 and used consistently thereafter; several equations reuse symbols without redefinition.
[Abstract] The abstract states existence and the comparative static without any functional forms or proof outline; a one-sentence sketch of the key incentive constraint in the abstract would improve accessibility.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive and detailed comments. We respond to each major point below and indicate the revisions we will make to strengthen the manuscript.

read point-by-point responses

Referee: [§2] §2 (model primitives): the incentive-compatibility constraint for verification effort equates marginal cost to the discounted loss of all future rents only because the continuation value after a hallucination is set to zero for that user-agent pair. The paper should derive the equilibrium effort level explicitly under a positive continuation probability (e.g., user forgiveness or switching) and show whether the comparative static on the share of high-accuracy users survives; without this check the load-bearing role of the permanent-halt assumption remains untested.

Authors: We agree that the zero-continuation-value assumption after a detected hallucination is central to generating strict incentives for verification. This choice is motivated by the permanent reputational damage observed in professional domains. To address the concern directly, we will add an extension in the revised Section 2 that introduces a continuation probability γ ∈ (0,1) (capturing forgiveness or user switching). We derive the modified incentive-compatibility condition and show analytically that the equilibrium effort remains strictly increasing in the share of high-accuracy users provided γ is below a threshold that depends on the discount factor. The main comparative static therefore survives for plausible values of γ. revision: yes
Referee: [§3, Proposition 1] §3, Proposition 1: uniqueness of the reputational equilibrium is asserted under nontrivial discounting, yet the proof sketch is not supplied. The argument appears to rely on a contraction-mapping or fixed-point property of the best-reply correspondence; an explicit statement of the mapping and the conditions on the discount factor that guarantee uniqueness would allow readers to assess whether the result is robust to small changes in the user-type distribution.

Authors: We acknowledge that the uniqueness argument was only sketched. In the revision we will supply a complete proof in an appendix. We define the best-reply mapping T that sends a candidate effort profile to the optimal verification level given the induced continuation values. We then show that T is a contraction with modulus β(1-ε) < 1 whenever the discount factor β is sufficiently close to 1 and the type distribution satisfies a mild continuity condition. This establishes uniqueness and also clarifies the robustness to small perturbations in the user-type measure. revision: yes
Referee: [§4] §4 (comparative statics): the claim that equilibrium effort rises with the share of high-accuracy users follows from the shift in the effective continuation value when the population composition changes. The paper should report the derivative of equilibrium effort with respect to this share (or the relevant cross-partial) and confirm that it remains positive when the hallucination probability is allowed to depend on effort in a non-linear way.

Authors: We will report the explicit derivative de*/dμ in the revised Section 4, where μ denotes the measure of high-accuracy users. The sign is positive because an increase in μ raises the average continuation value that disciplines effort. For non-linear hallucination technologies (e.g., exponential or convex specifications), we will add a short analytical argument and a numerical illustration confirming that the relevant cross-partial remains positive under standard convexity of the verification cost function. These additions will be included in the revised manuscript. revision: partial

Circularity Check

0 steps flagged

No circularity: standard repeated-game IC derivation from explicit modeling assumptions

full rationale

The paper posits a competitive market for AI agents who exert costly verification effort, with the explicit assumption that a hallucination permanently halts the interaction for that user-agent pair. This converts the problem into an infinitely repeated moral-hazard setting whose incentive-compatibility constraint equates marginal verification cost to the discounted loss of continuation rents. The claimed unique equilibrium under nontrivial discounting and the comparative static (effort and price rising in the measure of high-accuracy users) are direct consequences of solving that IC constraint under the stated population composition; they are not definitions or renamings of the inputs. No self-citation, fitted parameter, or ansatz is invoked to close the derivation. The model is therefore self-contained against its own primitives.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The model rests on repeated-game assumptions standard in industrial organization; without the full text, the precise free parameters and domain assumptions cannot be enumerated exhaustively.

free parameters (2)

discount factor
Nontrivial discounting is required for the reputational equilibrium to exist; its specific value is not stated in the abstract.
share of high-accuracy users
The comparative static result is stated with respect to this share; it functions as a parameter that shifts equilibrium effort.

axioms (2)

domain assumption A hallucination terminates the relationship and eliminates all future rents from that user.
This premise generates the incentive for verification effort and is stated directly in the abstract.
domain assumption Agents can privately choose costly verification effort that reduces hallucination probability.
Core modeling choice that links effort to risk reduction.

pith-pipeline@v0.9.0 · 5626 in / 1431 out tokens · 51743 ms · 2026-05-19T03:12:34.312416+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Since interactions halt in the event of a hallucination, the threat of losing future rents disciplines effort. ... c(e) ≤ δ[h(m,0)−h(m,e)] VC
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Equilibrium effort e* is strictly increasing in the share of high-type users μ

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

9 extracted references · 9 canonical work pages

[1]

Monta \ n a-Brown, M

Asgari, E., N. Monta \ n a-Brown, M. Dubois, S. Khalil, J. Balloch, J. A. Yeung, and D. Pimenta (2025). A Framework to Assess Clinical Safety and Hallucination Rates of LLMs for Medical Text Summarisation . npj Digital Medicine\/ 8\/ (1), 274

work page 2025
[2]

Athey, S. C., K. A. Bryan, and J. S. Gans (2020). The allocation of decision authority to human and artificial intelligence. AEA Papers and Proceedings\/ 110 , 80--84

work page 2020
[3]

Li, and L

Brynjolfsson, E., D. Li, and L. Raymond (2025). Generative AI at Work . The Quarterly Journal of Economics\/ 140\/ (2), 889--942

work page 2025
[4]

Canayaz, M. (2025). AI Agency . Available at SSRN 5109326\/

work page 2025
[5]

Colback, L. (2025). AI Agents: From Co-pilot to Autopilot . https://www.ft.com/content/3e862e23-6e2c-4670-a68c-e204379fe01f. [Accessed 22-07-2025]

work page 2025
[6]

Magesh, M

Dahl, M., V. Magesh, M. Suzgun, and D. E. Ho (2024). Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models . Journal of Legal Analysis\/ 16\/ (1), 64--93

work page 2024
[7]

Levin, J. (2003). Relational incentive contracts. American Economic Review\/ 93\/ (3), 835--857

work page 2003
[8]

Rothschild, D. M., M. Mobius, J. M. Hofman, E. W. Dillon, D. G. Goldstein, N. Immorlica, S. Jaffe, B. Lucier, A. Slivkins, and M. Vogel (2025). The Agentic Economy . arXiv preprint arXiv:2505.15799\/

work page arXiv 2025
[9]

Agarwal, M

Shavit, Y., S. Agarwal, M. Brundage, S. Adler, C. O’Keefe, R. Campbell, T. Lee, P. Mishkin, T. Eloundou, A. Hickey, et al. (2023). Practices for Governing Agentic AI Systems . Research Paper, OpenAI\/

work page 2023

[1] [1]

Monta \ n a-Brown, M

Asgari, E., N. Monta \ n a-Brown, M. Dubois, S. Khalil, J. Balloch, J. A. Yeung, and D. Pimenta (2025). A Framework to Assess Clinical Safety and Hallucination Rates of LLMs for Medical Text Summarisation . npj Digital Medicine\/ 8\/ (1), 274

work page 2025

[2] [2]

Athey, S. C., K. A. Bryan, and J. S. Gans (2020). The allocation of decision authority to human and artificial intelligence. AEA Papers and Proceedings\/ 110 , 80--84

work page 2020

[3] [3]

Li, and L

Brynjolfsson, E., D. Li, and L. Raymond (2025). Generative AI at Work . The Quarterly Journal of Economics\/ 140\/ (2), 889--942

work page 2025

[4] [4]

Canayaz, M. (2025). AI Agency . Available at SSRN 5109326\/

work page 2025

[5] [5]

Colback, L. (2025). AI Agents: From Co-pilot to Autopilot . https://www.ft.com/content/3e862e23-6e2c-4670-a68c-e204379fe01f. [Accessed 22-07-2025]

work page 2025

[6] [6]

Magesh, M

Dahl, M., V. Magesh, M. Suzgun, and D. E. Ho (2024). Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models . Journal of Legal Analysis\/ 16\/ (1), 64--93

work page 2024

[7] [7]

Levin, J. (2003). Relational incentive contracts. American Economic Review\/ 93\/ (3), 835--857

work page 2003

[8] [8]

Rothschild, D. M., M. Mobius, J. M. Hofman, E. W. Dillon, D. G. Goldstein, N. Immorlica, S. Jaffe, B. Lucier, A. Slivkins, and M. Vogel (2025). The Agentic Economy . arXiv preprint arXiv:2505.15799\/

work page arXiv 2025

[9] [9]

Agarwal, M

Shavit, Y., S. Agarwal, M. Brundage, S. Adler, C. O’Keefe, R. Campbell, T. Lee, P. Mishkin, T. Eloundou, A. Hickey, et al. (2023). Practices for Governing Agentic AI Systems . Research Paper, OpenAI\/

work page 2023