A structural causal framework for interventions on evolutionary accumulation models

Iain G. Johnston; \'I\~nigo R\'ios-Arroyo; Ramon Diaz-Uriarte

arxiv: 2606.12597 · v2 · pith:BVZHOK7Lnew · submitted 2026-06-10 · 🧬 q-bio.QM · q-bio.PE

A structural causal framework for interventions on evolutionary accumulation models

Ramon Diaz-Uriarte , \'I\~nigo R\'ios-Arroyo , Iain G. Johnston This is my paper

Pith reviewed 2026-06-27 07:27 UTC · model grok-4.3

classification 🧬 q-bio.QM q-bio.PE

keywords evolutionary accumulation modelsstructural causal modelsinterventionscancer progression modelstherapeutic targetsmutation orderdo-operator

0 comments

The pith

Evolutionary accumulation models support well-defined interventions by recasting them as structural causal models and applying the do-operator.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes a procedure for extracting intervention predictions from EvAMs instead of relying on simple conditioning, which produces incorrect results. This matters because the models are already used to infer mutation order from data and have been proposed as tools for choosing therapeutic targets in tumors. For each of the listed model families the work supplies concrete parameter changes that realize an intervention, identifies when different procedures are equivalent, and checks whether the modularity needed for the do-operator to apply is defensible. It further separates two intervention types that standard EvAM representations collapse. The outcome is a ranking protocol that lets users score candidate targets under three explicit objectives.

Core claim

EvAMs can be equipped with intervention semantics by treating them as structural causal models. For each method the intervention is realized through explicit parameter modifications; equivalent implementations are noted; and the modularity assumption is examined per model. Underlying individual-level fitness DAGs reveal that standard representations conflate killing interventions (removing a mutation's effect) with inactivating interventions (preventing its occurrence), and the framework supplies distinct operators for each. The problem is then recast as ranking targets according to three defined intervention objectives, together with an evaluation protocol.

What carries the argument

Application of Pearl's do-operator to EvAM parameter sets, together with the distinction between killing and inactivating interventions drawn from explicit fitness DAGs.

If this is right

Each of the seven EvAM families admits at least one explicit parameter change that implements a given intervention.
Killing and inactivating interventions produce distinct post-intervention distributions and must be handled separately.
Modularity can be verified model-by-model, determining for which methods the intervention semantics are justified.
Targets can be ranked by any of the three stated objectives once the intervention operator is defined.
The same do-operator treatment applies to any computational model that can be read as a structural causal model.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The ranking protocol could be validated by holding out subsets of tumors that experienced a known mutational loss and measuring how often the model places the lost mutation high in the target list.
The same causal-intervention machinery could be applied to non-oncological order-of-acquisition models, such as those describing trait evolution in phylogenies.
Once implemented, the framework makes it possible to compare EvAMs on their ability to recover known intervention effects from simulated trajectories.

Load-bearing premise

The modularity assumption holds, so that an intervention on one variable leaves the rest of the model structure unchanged in the required way.

What would settle it

Generate synthetic cross-sectional data under a known ground-truth fitness landscape, apply an experimental removal of one mutation, and check whether the post-intervention mutation frequencies predicted by the do-operator on the fitted EvAM match the frequencies observed in the intervened data.

read the original abstract

Evolutionary accumulation models (EvAMs), also known as cancer progression models (CPMs), infer dependencies in the order of accumulation of mutations during tumor progression from cross-sectional data. It has been suggested that EvAMs could be used to identify therapeutic targets, but there is no procedure in the literature for how to extract predictions under intervention from these models. A simple approach of conditioning on the absence of a mutation gives incorrect predictions. We address this gap by formalizing what "intervene" means for all currently available EvAM methods (OT, OncoBN, CBN, H-ESBCN, MHN, HyperHMM, HyperTraPS), using Pearl's do operator and conditional interventions. For each model, we show how to implement the intervention (in most cases as specific parameter modifications), identify equivalent implementation procedures, and analyze whether the modularity assumption -- required for the intervention to be well-defined -- is justified. Drawing on individual-level causal DAGs that make fitness an explicit variable, we distinguish two types of intervention (killing and inactivating) that are conflated in standard EvAM representations. Since the goal is to prioritize intervention candidates, we recast the problem as one of ranking: we define three intervention objectives and provide a protocol for evaluating how well EvAMs rank targets. Our framework is not specific to cancer or EvAMs; it applies wherever fitted computational models can be interpreted as structural causal models. Code available from https://github.com/rdiaz02/scm-interv-evams.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a usable way to apply do-interventions to EvAMs by mapping them to parameter changes and separating killing from inactivating, but the modularity checks stay inside the models.

read the letter

The core contribution is a consistent way to turn fitted EvAMs into intervention predictors. They apply Pearl's do-operator across OT, OncoBN, CBN, H-ESBCN, MHN, HyperHMM, and HyperTraPS, show the matching parameter edits for each, and separate two intervention types that standard representations had blurred. They also turn the output into a ranking task with three explicit objectives and a protocol for checking how well the models order targets.

They handle the full set of current methods without skipping any, lay out equivalent implementation routes, and check the modularity assumption case by case. Adding fitness as an explicit node in the DAG is a clean move that makes the killing/inactivating split concrete. Releasing the code is a practical plus.

The soft spot is the modularity justification. It rests on each model's internal structure rather than any test against data-generating processes that could violate independent mechanisms. If an intervention on one mutation changes another mechanism through unmodeled fitness effects, the claimed equivalence between the parameter change and the do-operator no longer holds, and the ranking protocol would give unreliable priorities. The abstract supplies no derivations or external checks, so the implementations' numerical stability is still open.

This is for people already working with EvAMs who need to move from order inference to intervention ranking. A reader who knows both the cancer progression literature and basic causal models will get the most from it. The framework is general enough that it could apply beyond cancer, but the immediate value is in that subfield.

It deserves a serious referee. The gap it fills is real, the code makes the claims checkable, and the modularity issue is the kind of point referees can pressure-test with the full text.

Referee Report

3 major / 2 minor

Summary. The paper claims to fill a gap in using evolutionary accumulation models (EvAMs) for therapeutic target identification by formalizing interventions via Pearl's do-operator applied to models including OT, OncoBN, CBN, H-ESBCN, MHN, HyperHMM, and HyperTraPS. It shows how to implement interventions (typically as parameter modifications), analyzes the modularity assumption per method, distinguishes killing versus inactivating interventions via auxiliary fitness-explicit DAGs, and recasts target prioritization as a ranking problem with three objectives and an evaluation protocol. The framework is presented as general to any fitted computational model interpretable as a structural causal model, with code provided.

Significance. If the central formalization holds, the work supplies a principled, model-specific procedure for extracting intervention predictions from existing EvAMs, which could increase their applicability to cancer therapy prioritization. The explicit distinction between intervention types and the provision of reproducible code are concrete strengths that address a previously unformalized step in the literature.

major comments (3)

[modularity analysis section] Abstract and modularity analysis section: The claim that the do-operator application is justified rests on an internal structural analysis of each EvAM's modularity assumption. However, this analysis does not include tests against data-generating processes that violate modularity (e.g., unmodeled direct fitness interactions between mutations), which is load-bearing for the asserted equivalence between parameter modification and do(·). Without such external validation, the intervention definitions remain conditional on model-internal assumptions that may not transfer.
[implementation sections for each EvAM] Implementation sections for each EvAM: The manuscript states that interventions are implemented as specific parameter modifications but provides no explicit derivations showing that these modifications are equivalent to the do-operator under the model's equations. For instance, the mapping from do(M_i = 0) to changes in the model's parameters (e.g., for MHN or HyperTraPS) is asserted without step-by-step reduction from the likelihood or structural equations, leaving the central implementation claim without visible algebraic support.
[ranking protocol section] Ranking protocol section: The protocol for evaluating how well EvAMs rank intervention targets is defined in terms of three objectives, yet the manuscript contains no application to simulated or real data that would demonstrate the protocol's behavior or sensitivity to model misspecification. This absence weakens the claim that the framework enables reliable prioritization.

minor comments (2)

[abstract] The abstract mentions that a simple conditioning approach gives incorrect predictions, but a concrete numerical counter-example comparing conditioning versus do-intervention on one of the listed models would improve motivation.
Notation for the two intervention types (killing vs. inactivating) is introduced via auxiliary DAGs; a small summary table listing the parameter changes for each type across the seven EvAMs would aid readability.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments. We respond to each major point below and indicate the revisions we will incorporate.

read point-by-point responses

Referee: [modularity analysis section] Abstract and modularity analysis section: The claim that the do-operator application is justified rests on an internal structural analysis of each EvAM's modularity assumption. However, this analysis does not include tests against data-generating processes that violate modularity (e.g., unmodeled direct fitness interactions between mutations), which is load-bearing for the asserted equivalence between parameter modification and do(·). Without such external validation, the intervention definitions remain conditional on model-internal assumptions that may not transfer.

Authors: The modularity analysis is performed under each model's own structural assumptions, which is the appropriate scope for justifying the do-operator within the fitted EvAM. External tests against violating DGPs would require specifying alternative generative processes outside the models considered, a task that lies beyond the paper's focus on formalization. We will add an explicit discussion of this scope limitation and a small illustrative simulation in the revision to make the conditional nature of the results clearer. revision: yes
Referee: [implementation sections for each EvAM] Implementation sections for each EvAM: The manuscript states that interventions are implemented as specific parameter modifications but provides no explicit derivations showing that these modifications are equivalent to the do-operator under the model's equations. For instance, the mapping from do(M_i = 0) to changes in the model's parameters (e.g., for MHN or HyperTraPS) is asserted without step-by-step reduction from the likelihood or structural equations, leaving the central implementation claim without visible algebraic support.

Authors: The parameter modifications follow directly from the structural equations of each EvAM, but we agree that explicit algebraic derivations are not shown in full for every model. In the revision we will add a supplementary appendix containing step-by-step derivations for the key cases (including MHN and HyperTraPS) that reduce the intervention to the corresponding parameter change. revision: yes
Referee: [ranking protocol section] Ranking protocol section: The protocol for evaluating how well EvAMs rank intervention targets is defined in terms of three objectives, yet the manuscript contains no application to simulated or real data that would demonstrate the protocol's behavior or sensitivity to model misspecification. This absence weakens the claim that the framework enables reliable prioritization.

Authors: The manuscript defines the ranking protocol as the methodological contribution. To illustrate its behavior and sensitivity properties we will include a section applying the protocol to simulated data under controlled misspecification in the revised manuscript. revision: yes

Circularity Check

0 steps flagged

No significant circularity: standard causal formalism applied to existing models

full rationale

The paper formalizes interventions on EvAMs by directly applying Pearl's do-operator and conditional interventions to pre-existing methods (OT, OncoBN, CBN, etc.), showing parameter modifications as implementations and analyzing modularity per model. No equations reduce a claimed prediction or first-principles result to a fitted input by construction, no self-citations serve as load-bearing justifications for uniqueness or ansatzes, and no known empirical patterns are renamed as new derivations. The framework is an interpretive mapping rather than a self-referential derivation, remaining self-contained against external causal modeling benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the applicability of Pearl's structural causal models to EvAMs and the justification of the modularity assumption for interventions to be well-defined.

axioms (1)

domain assumption Modularity assumption required for the intervention to be well-defined
Explicitly stated as necessary; the paper analyzes whether it is justified for each EvAM method.

pith-pipeline@v0.9.1-grok · 5822 in / 1237 out tokens · 26401 ms · 2026-06-27T07:27:03.876178+00:00 · methodology

A structural causal framework for interventions on evolutionary accumulation models

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)