pith. sign in

arxiv: 2605.13998 · v1 · pith:X6KSKWCJnew · submitted 2026-05-13 · 💱 q-fin.CP · cs.LG

Synthetic American Option Pricing via Jump-HMM-Driven Heston Implied Volatility

Pith reviewed 2026-05-15 02:25 UTC · model grok-4.3

classification 💱 q-fin.CP cs.LG
keywords synthetic option pricingimplied volatilityHeston modelJump Hidden Markov ModelAmerican optionsbinomial latticeregime-dependent variancesynthetic data generation
0
0 comments X p. Extension
pith:X6KSKWCJ Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{X6KSKWCJ}

Prints a linked pith:X6KSKWCJ badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

The pith

A structural equity model generates implied volatility surfaces to price synthetic American options without market calibration.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper aims to break the circular dependency where realistic synthetic option prices require implied volatility, yet implied volatility is derived from observed option prices. It does so by building a pipeline in which a Jump Hidden Markov Model first simulates equity price paths with stylized facts and cross-asset tail dependence. A modified Heston variance process then converts those paths into implied-volatility surfaces by letting the mean-reversion target depend on regime, expiration, moneyness, and market mood; variance is initialized at that target so that smile, skew, and term structure appear automatically. American options are finally priced on a recombining binomial lattice. The resulting framework is calibrated hierarchically on real multi-ticker data and released as open-source software, enabling consistent forward simulation of prices, Greeks, and P&L.

Core claim

Implied volatility emerges as an output of a structural model of equity returns rather than as an input derived from observed option prices. A Jump Hidden Markov Model produces multi-asset price paths; a modified Heston variance process whose mean-reversion target depends on regime state, days to expiration, moneyness, and a market-mood indicator converts those paths into implied-volatility surfaces; and a recombining binomial lattice prices American options from the resulting surface. Initializing variance at its mean-reversion target for each strike-expiration pair lets the volatility smile and term structure form without external calibration.

What carries the argument

The modified Heston variance process whose mean-reversion target depends on regime, expiration, moneyness, and market-mood indicator, initialized at that target and driven by Jump Hidden Markov Model equity paths, then priced on a recombining binomial lattice.

If this is right

  • Synthetic American option prices, finite-difference Greeks, and terminal short-premium P&L can be recovered from one coherent simulation of equity paths.
  • Calibration via a hierarchy of parametric baseline, globally shared neural surrogate, and sector-specific neural surrogate allows adaptation while preserving cross-ticker robustness.
  • Calendar-derived earnings-distance and same-sector peer-coupling features recover the anticipatory component of scheduled-event effects on generalization error.
  • The same pipeline can be re-run on underlyings from different sectors and volatility regimes to confirm consistency of generated surfaces and Greeks.
  • An open-source Julia implementation makes the full pipeline available for generating large volumes of consistent synthetic option data.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Large-scale machine-learning models for option pricing or risk management could be trained on internally consistent synthetic datasets without circularity artifacts.
  • Extending the regime and mood indicators to include macroeconomic variables might improve performance during crisis periods not represented in the original calibration.
  • Because the lattice prices American options directly, the framework could be used to study early-exercise boundaries under realistic stochastic volatility without separate numerical solvers.

Load-bearing premise

The modified Heston variance process with regime-, expiration-, moneyness-, and mood-dependent mean-reversion target, when initialized at that target, produces realistic implied volatility surfaces that generalize beyond the calibration data.

What would settle it

Generate implied-volatility surfaces from the model on a hold-out period containing unscheduled events and check whether the surfaces deviate systematically from observed market surfaces in skew, term structure, or level.

Figures

Figures reproduced from arXiv: 2605.13998 by Jeffrey D. Varner, Jiawei Zhang, Julia Sun, Zheyu Jin.

Figure 1
Figure 1. Figure 1: Per-ticker ψNN smile fits on the fifteen-date pooled corpus. Representative IV smile panels at mid-DTE for six qualified tickers spanning four sectors (SPY, NVDA, MSFT, LLY, GS, AVGO). The per-ticker network (solid black) tracked the smile shape more tightly than the sector network (dashed gray) on every panel, with the largest gap on NVDA and MSFT where the shared Tech surface flattened to accommodate AVG… view at source ↗
Figure 2
Figure 2. Figure 2: Model-vs-market dollar pricing error (CRR American at the calibrated IV minus market [PITH_FULL_IMAGE:figures/full_fig_p010_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Forward GS share-price paths and short option premium paths for the real GS 2026-05-29 [PITH_FULL_IMAGE:figures/full_fig_p013_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Path-conditional implied volatility at the put strike (left) and call strike (right) over the [PITH_FULL_IMAGE:figures/full_fig_p014_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Terminal P&L distributions per contract (1,000 paths, log-y) for the GS scenario. Both [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: LLY terminal P&L distributions per contract (1,000 paths, log-y), parallel to Fig. [PITH_FULL_IMAGE:figures/full_fig_p015_6.png] view at source ↗
read the original abstract

Generating realistic synthetic option prices requires implied volatility as an input, yet implied volatility is itself derived from observed option prices, creating a circular dependency that limits synthetic data for machine-learning and risk-analysis applications. We break this circularity with a pipeline in which implied volatility emerges as an output of a structural model of equity returns. A Jump Hidden Markov Model produces multi-asset price paths with realistic stylized facts and cross-asset tail dependence; a modified Heston variance process, whose mean-reversion target depends on regime state, days to expiration, moneyness, and a market-mood indicator, converts those paths into implied-volatility paths; and a recombining binomial lattice prices American options from the resulting surface. Initializing variance at its mean-reversion target for each strike-expiration pair lets smile, skew, and term structure emerge without external calibration. We calibrate the shape function through a hierarchy spanning a parametric baseline, a globally shared neural surrogate, and a sector-specific neural surrogate fit to a multi-ticker, multi-sector option ladder. A temporal holdout on a multi-day capture isolated scheduled corporate events as the dominant source of test-time generalization error, and calendar-derived earnings-distance and same-sector peer-coupling features recovered the anticipatory portion of that signal. We then apply the framework as a synthetic-data generator on real near-the-money put and call contracts, forward-simulating price paths, and recovering path-conditional implied volatility, finite-difference American Greeks, and terminal short-premium profit and loss from one coherent simulation, and confirm cross-ticker robustness by re-running on a second underlying from a different sector and volatility regime. The framework is released as an open-source Julia package.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript proposes a pipeline for generating synthetic American option prices that aims to break the circularity between observed option prices and implied volatility. It uses a Jump Hidden Markov Model to generate multi-asset price paths, a modified Heston variance process with mean-reversion targets depending on regime, expiration, moneyness, and mood (initialized at the target), and a recombining binomial lattice for pricing. The shape function for the mean-reversion target is calibrated using neural network surrogates fit to real option data from multiple tickers, with a temporal holdout validation. The framework is implemented as an open-source Julia package.

Significance. If the model consistency can be established, this work could provide a valuable tool for generating synthetic data for machine learning applications in quantitative finance, particularly for American options. The open-source release and cross-ticker robustness testing are positive aspects. The approach attempts to derive implied volatility structurally from equity return models rather than directly from market data.

major comments (3)
  1. [Modified Heston variance process] The mean-reversion target theta in the modified Heston variance process is specified as a function of moneyness. This implies that each strike has a different long-run variance level. No single Itô process for the variance can produce this, so the resulting implied volatility surface is not the marginal distribution implied by any arbitrage-free model of the underlying asset. This is load-bearing for the claim that the surfaces emerge structurally from the equity-return paths.
  2. [Calibration hierarchy] The abstract states that smile, skew, and term structure emerge 'without external calibration', yet the shape function is calibrated via neural surrogates to real multi-ticker option data. This appears contradictory and requires clarification on what aspects are calibrated versus emergent.
  3. [Validation experiments] The temporal holdout experiment is described, but no quantitative error metrics (such as RMSE on implied vols or option prices) or checks for no-arbitrage violations (e.g., butterfly positivity, calendar monotonicity) are provided. This makes it difficult to assess whether the generated surfaces are usable for pricing.
minor comments (2)
  1. [Abstract] The abstract is dense with technical details; consider breaking it into clearer sentences or adding a short overview paragraph.
  2. [Software release] The manuscript should include a direct link or repository identifier for the open-source Julia package to facilitate reproducibility.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive and detailed comments, which help clarify key aspects of our framework. We address each major comment below with clarifications and proposed revisions where appropriate.

read point-by-point responses
  1. Referee: [Modified Heston variance process] The mean-reversion target theta in the modified Heston variance process is specified as a function of moneyness. This implies that each strike has a different long-run variance level. No single Itô process for the variance can produce this, so the resulting implied volatility surface is not the marginal distribution implied by any arbitrage-free model of the underlying asset. This is load-bearing for the claim that the surfaces emerge structurally from the equity-return paths.

    Authors: We acknowledge this modeling choice creates strike-dependent variance dynamics rather than a single Itô process. The modification to theta (depending on regime, expiration, moneyness, and mood) is deliberate to let the IV surface emerge from the Jump-HMM equity paths while capturing observed stylized facts. The structural claim rests on generating paths first and then mapping to IV via this process, not on claiming a unified arbitrage-free SDE for the underlying across all strikes. We will add explicit discussion of this approximation and its limitations in a new subsection on model consistency, without altering the core pipeline. revision: partial

  2. Referee: [Calibration hierarchy] The abstract states that smile, skew, and term structure emerge 'without external calibration', yet the shape function is calibrated via neural surrogates to real multi-ticker option data. This appears contradictory and requires clarification on what aspects are calibrated versus emergent.

    Authors: The phrase 'without external calibration' refers specifically to the per-simulation generation step: once the shape function has been fit once via the hierarchy (parametric baseline to sector-specific neural surrogate) on historical data, new IV surfaces emerge directly from fresh equity paths without refitting to option prices. The calibration is a fixed preprocessing step, not repeated for each synthetic surface. We agree the abstract wording is imprecise and will revise it, the introduction, and the methods section to distinguish pre-calibration of the shape function from the emergent path-driven IV generation. revision: yes

  3. Referee: [Validation experiments] The temporal holdout experiment is described, but no quantitative error metrics (such as RMSE on implied vols or option prices) or checks for no-arbitrage violations (e.g., butterfly positivity, calendar monotonicity) are provided. This makes it difficult to assess whether the generated surfaces are usable for pricing.

    Authors: We agree that quantitative metrics and arbitrage checks are necessary for evaluating usability. In the revised manuscript we will add RMSE values for both implied volatilities and American option prices on the temporal holdout set. We will also report diagnostics for no-arbitrage conditions, including butterfly spread positivity and calendar-spread monotonicity, computed on the generated surfaces. These results will be included in the validation experiments section. revision: yes

Circularity Check

1 steps flagged

Moneyness-dependent mean-reversion target calibrated to real option ladders, so IV surface is fitted construction rather than structural emergence

specific steps
  1. fitted input called prediction [Abstract]
    "Initializing variance at its mean-reversion target for each strike-expiration pair lets smile, skew, and term structure emerge without external calibration. We calibrate the shape function through a hierarchy spanning a parametric baseline, a globally shared neural surrogate, and a sector-specific neural surrogate fit to a multi-ticker, multi-sector option ladder."

    The mean-reversion target is defined via the shape function that is fitted directly to real option data; initializing variance at this target therefore constructs the IV surface by the calibration rather than deriving it from the Jump-HMM paths or a single consistent variance process. The claim of emergence 'without external calibration' is immediately followed by the calibration step that supplies the target, making the output statistically forced by the fitted inputs.

full rationale

The paper claims to break the IV circularity by letting implied volatility emerge from Jump-HMM equity paths through a modified Heston process. However, the mean-reversion target theta is explicitly a calibrated shape function of regime, expiration, moneyness and mood; variance is initialized at this target for each strike-expiration pair. The shape function itself is obtained by fitting parametric and neural surrogates to multi-ticker observed option ladders. Consequently the generated surfaces and the American prices derived from them are produced by construction from the calibration step rather than from unmodified dynamics of the underlying. The temporal-holdout experiment only tests generalization of the fitted shape, not independent structural derivation. This matches the fitted-input-called-prediction pattern and yields partial circularity on the central claim.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The framework rests on standard financial modeling assumptions for HMM and Heston dynamics plus fitted neural parameters; no new entities are postulated.

free parameters (2)
  • neural surrogate weights
    Weights of globally shared and sector-specific neural networks fitted to multi-ticker option ladder data for the shape function.
  • regime-dependent mean-reversion targets
    Parameters controlling how mean-reversion level varies with regime, expiration, moneyness, and market-mood indicator.
axioms (2)
  • domain assumption Equity returns follow a Jump Hidden Markov Model that reproduces stylized facts and cross-asset tail dependence.
    Invoked to generate the input price paths.
  • domain assumption Modified Heston process with state-dependent mean-reversion target produces realistic implied volatility when initialized at that target.
    Core modeling choice that lets smile and term structure emerge.

pith-pipeline@v0.9.0 · 5612 in / 1510 out tokens · 143147 ms · 2026-05-15T02:25:38.857877+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

21 extracted references · 21 canonical work pages · 1 internal anchor

  1. [1]

    The Review of Financial Studies , volume=

    A closed-form solution for options with stochastic volatility with applications to bond and currency options , author=. The Review of Financial Studies , volume=. 1993 , publisher=

  2. [2]

    Journal of Financial Economics , volume=

    Option pricing: A simplified approach , author=. Journal of Financial Economics , volume=. 1979 , publisher=

  3. [3]

    arXiv preprint arXiv:2603.10202 , year=

    Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion , author=. arXiv preprint arXiv:2603.10202 , year=

  4. [4]

    Simple and efficient simulation of the

    Andersen, Leif , journal=. Simple and efficient simulation of the. 2008 , doi=

  5. [5]

    A parsimonious arbitrage-free implied volatility parameterization with application to the valuation of volatility derivatives , author=

  6. [6]

    Journal of Political Economy , volume=

    The pricing of options and corporate liabilities , author=. Journal of Political Economy , volume=. 1973 , publisher=

  7. [7]

    Quantitative Finance , volume=

    Empirical properties of asset returns: stylized facts and statistical issues , author=. Quantitative Finance , volume=. 2001 , publisher=

  8. [8]

    Risk , volume=

    Pricing with a smile , author=. Risk , volume=

  9. [9]

    Risk , volume=

    Riding on a smile , author=. Risk , volume=

  10. [10]

    Wilmott Magazine , month=

    Managing smile risk , author=. Wilmott Magazine , month=

  11. [11]

    International Statistical Review , volume=

    The t copula and related copulas , author=. International Statistical Review , volume=. 2005 , publisher=

  12. [12]

    Quantitative Finance , volume=

    Deep hedging , author=. Quantitative Finance , volume=. 2019 , publisher=

  13. [13]

    Quantitative Finance , volume=

    Dynamics of implied volatility surfaces , author=. Quantitative Finance , volume=. 2002 , publisher=

  14. [14]

    Quantitative Finance , volume=

    Deep learning volatility: a deep neural network perspective on pricing and calibration in (rough) volatility models , author=. Quantitative Finance , volume=. 2021 , publisher=

  15. [15]

    Deep calibration of rough stochastic volatility models

    Deep calibration of rough stochastic volatility models , author=. arXiv preprint arXiv:1810.03399 , year=

  16. [16]

    Journal of Financial Economics , volume=

    Approximate option valuation for arbitrary stochastic processes , author=. Journal of Financial Economics , volume=. 1982 , publisher=

  17. [17]

    Journal of Futures Markets , volume=

    A modified lattice approach to option pricing , author=. Journal of Futures Markets , volume=. 1993 , publisher=

  18. [18]

    Applied Mathematical Finance , volume=

    Binomial models for option valuation--examining and improving convergence , author=. Applied Mathematical Finance , volume=. 1996 , publisher=

  19. [19]

    Wiese, Magnus and Knobloch, Robert and Korn, Ralf and Kretschmer, Peter , journal=. Quant. 2020 , publisher=

  20. [20]

    2004 , publisher=

    Monte Carlo Methods in Financial Engineering , author=. 2004 , publisher=

  21. [21]

    Review of Financial Studies , volume=

    American option valuation: New bounds, approximations, and a comparison of existing methods , author=. Review of Financial Studies , volume=. 1996 , publisher=