SCOUT: Cyclic Causal Discovery Under Soft Interventions with Unknown Targets

Alpar Turkoglu; Faramarz Fekri; Muralikrishnna G. Sethuraman

arxiv: 2605.16620 · v1 · pith:EBI7PWWQnew · submitted 2026-05-15 · 💻 cs.LG


Pith reviewed 2026-05-20 19:50 UTC · model grok-4.3

classification 💻 cs.LG

keywords causal discovery · cyclic causal graphs · soft interventions · unknown intervention targets · normalizing flows · nonlinear models · graph recovery


The pith

SCOUT recovers nonlinear cyclic causal graphs and unknown intervention targets from soft interventional data using normalizing flows.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces SCOUT to learn nonlinear cyclic causal relationships from soft interventional data where intervention targets are unknown. Most prior methods assume acyclic structures, Gaussian noise, or known targets, limiting their applicability to real systems. SCOUT maximizes the data log-likelihood with contractive residual flows and neural spline flows to recover both the graph and the targets. This matters because it enables causal discovery in more realistic settings that violate standard assumptions.

Core claim

SCOUT recovers the cyclic causal graph structure and the unknown targets of soft interventions by maximizing the log-likelihood of the observed data using two normalizing flow architectures: contractive residual flows and neural spline flows.

What carries the argument

Contractive residual flows combined with neural spline flows for modeling the likelihood under cyclic nonlinear causal models with unknown soft intervention targets.
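As a hedged sketch of how such a likelihood is usually written (the symbols f, E and p_E are notation assumed here, not quoted from the paper): if the system at equilibrium satisfies X = f(X) + E with a contractive mechanism f, the noise is recovered by the residual map id − f, and the change-of-variables formula gives the data log-density. The second line is the standard power-series identity for the log-determinant that residual-flow models rely on; whether SCOUT uses exactly this estimator is not stated on this page.

```latex
\begin{aligned}
X = f(X) + E \;&\Longleftrightarrow\; E = (\mathrm{id} - f)(X), \\
\log p_X(x) &= \log p_E\bigl(x - f(x)\bigr) + \log\bigl|\det\bigl(I - J_f(x)\bigr)\bigr|, \\
\log\det\bigl(I - J_f(x)\bigr) &= -\sum_{k=1}^{\infty} \frac{\operatorname{tr}\bigl(J_f(x)^k\bigr)}{k}
\qquad \text{when } \lVert J_f(x) \rVert < 1 .
\end{aligned}
```

Contractivity is what makes id − f invertible and keeps the series convergent. In this reading, the noise density p_E is the part a neural spline flow would model, which is how non-Gaussian noise enters the objective.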

If this is right

SCOUT identifies causal graphs containing cycles from interventional data.
It recovers the specific targets of interventions without them being provided.
It applies to nonlinear relationships without requiring Gaussian noise assumptions.
It outperforms existing methods at recovering both graphs and targets across interventional and noise settings.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

SCOUT could be applied to real-world problems like gene regulatory networks that exhibit cyclic behavior.
Improving the flow models might allow handling of higher-dimensional data.
Similar likelihood maximization techniques could be explored for other causal discovery challenges with unknown interventions.

Load-bearing premise

The data-generating process can be accurately captured by maximizing likelihood under the chosen contractive residual flow and neural spline flow architectures.

What would settle it

Running SCOUT on a synthetic dataset with a known cyclic nonlinear structure and soft interventions on unknown targets, and finding that the recovered graph or targets do not match the ground truth.
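A minimal sketch of how such a test dataset could be generated, using only the Python standard library. Everything here is an illustrative assumption rather than the paper's protocol: the mechanism `tanh(W x)`, the uniform noise, the dense weight matrix, and the helper names `make_cyclic_sem`, `solve_equilibrium` and `sample` are all hypothetical. The weights are rescaled so the mechanism is a contraction, which guarantees that fixed-point iteration reaches the equilibrium.

```python
import math
import random


def make_cyclic_sem(d, scale=0.4, seed=0):
    """Dense random weights with zero diagonal (d >= 2), rescaled so that every
    absolute row sum is at most `scale` < 1. Since tanh is 1-Lipschitz, the map
    x -> tanh(W x) is then a contraction in the max norm."""
    rng = random.Random(seed)
    W = [[0.0 if i == j else rng.uniform(-1.0, 1.0) for j in range(d)]
         for i in range(d)]
    max_row_sum = max(sum(abs(w) for w in row) for row in W)
    return [[scale * w / max_row_sum for w in row] for row in W]


def solve_equilibrium(W, shift, e, n_iter=200):
    """Solve x = tanh(W x) + shift + e by fixed-point iteration."""
    d = len(W)
    x = [0.0] * d
    for _ in range(n_iter):
        x = [math.tanh(sum(W[i][j] * x[j] for j in range(d))) + shift[i] + e[i]
             for i in range(d)]
    return x


def sample(W, shift, n, seed=0):
    """Draw n equilibrium samples with uniform (non-Gaussian) exogenous noise."""
    rng = random.Random(seed)
    d = len(W)
    return [solve_equilibrium(W, shift, [rng.uniform(-1.0, 1.0) for _ in range(d)])
            for _ in range(n)]


W = make_cyclic_sem(d=4)
observational = sample(W, shift=[0.0, 0.0, 0.0, 0.0], n=200, seed=1)
# Shift intervention on node 2; the target is known here but hidden from the learner.
intervened = sample(W, shift=[0.0, 0.0, 1.5, 0.0], n=200, seed=2)
```

A discovery method would receive only `observational` and `intervened`; `W` and the shifted node serve as the ground truth against which the recovered graph and targets are checked.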

Figures

Figures reproduced from arXiv: 2605.16620 by Alpar Turkoglu, Faramarz Fekri, Muralikrishnna G. Sethuraman.

**Figure 1.** Graph recovery performance comparison between SCOUT and baselines under non-linear SEM and various interventional and exogenous noise settings, evaluated using AUPRC (the higher the better). The box plots show the median and interquartile ranges across ten independent trials. In all cases, the number of nodes is fixed at d = 10. … 3, SCOUT’s structure recovery performance remains relatively high, whereas o…

**Figure 2.** Graph recovery performance comparison between SCOUT and baselines under non-linear SEM and shift interventions. The number of nodes is varied from d = 10 to 70.

**Figure 3.** Graph recovery performance comparison between SCOUT and baselines under non-linear SEM and scale interventions. The number of nodes is varied from d = 10 to 70. While SCOUT-noNSF retains the ability to model soft interventions and includes an interventional target matrix for handling unknown targets, it lacks the flexibility to transform non-Gaussian noise distributions. In contrast, SCOUT incorporates t…

**Figure 4.** Graph recovery performance comparison between SCOUT, SCOUT-noNSF, and NODAGS under nonlinear SEM and various interventional and exogenous noise settings, evaluated using AUPRC. In all cases, the number of nodes is fixed at d = 10. Additional experiments, including performance evaluations on non-contractive SEMs (DAGs) in Appendix C.1, linear SEMs in Appendix C.2, hard interventions in Appendix C.4, and kno…

**Figure 5.** Graph recovery performance comparison between SCOUT, SCOUT-noNSF, and NODAGS for known intervention targets under nonlinear SEM and various interventional and exogenous noise settings, evaluated using AUPRC. In all cases, the number of nodes is fixed at d = 10. … contains gene expressions taken from 218,331 melanoma cells split over three different cell conditions: (i) control, (ii) co-culture, and (iii) IF…

**Figure 6.**

**Figure 8.** Illustration of the augmented graph G^I corresponding to the set of interventional targets I = {∅, {X_3}, {X_4}}. To integrate multiple interventional settings in a single causal graph, we adopt the idea of a joint causal model proposed by Mooij et al. (2016) by introducing a new set of context variables C_I = (C_1, ..., C_K), each representing another interventional setting. (The scenario where C_k = ∅ for all…

**Figure 9.** Graph recovery performance comparison between SCOUT and baselines under non-contractive DAGs and various interventional and exogenous noise settings, evaluated using AUPRC. In all cases, the number of nodes is fixed at d = 10.

**Figure 10.** Graph recovery performance comparison between SCOUT and baselines under linear SEM and various interventional and exogenous noise settings, evaluated using AUPRC. In all cases, the number of nodes is fixed at d = 10.

**Figure 11.** Graph recovery performance comparison between SCOUT and baselines for known intervention targets under nonlinear SEM and various interventional and exogenous noise settings, evaluated using AUPRC. In all cases, the number of nodes is fixed at d = 10. C.4. Experiments for Hard (Perfect) Interventions. We run experiments on SCOUT and the baselines to evaluate their structures and target recovery performance …

**Figure 12.** Graph recovery performance comparison between SCOUT and baselines under nonlinear SEM and hard interventions with various exogenous noise settings, evaluated using AUPRC. In all cases, the number of nodes is fixed at d = 10. C.5. Ablation Studies. C.5.1. Impact of Number of Maximum Interventional Targets. In this section, we evaluated SCOUT’s performance alongside baselines while varying the maximum number …

**Figure 13.** Graph recovery performance comparison between SCOUT and baselines under non-linear SEM and shift interventions. The number of maximum intervention targets per experiment is varied from 1 to 5.

**Figure 14.** Graph recovery performance comparison between SCOUT and baselines under non-linear SEM and shift interventions. The number of samples per experiment is varied from 250 to 1500.

**Figure 15.** Graph recovery performance comparison between SCOUT and baselines under non-linear SEM and shift interventions. The expected outgoing edge density is varied from 1 to 4.

**Figure 16.** Graph recovery performance comparison between SCOUT and baselines under non-linear SEM and shift interventions. The shift parameter is varied from 0 (observational case) to 2. C.5.5. Impact of Scale Parameter. In this section, we vary the scale parameter to see its effect on the performance of SCOUT and baselines. We change it from 0.25 to 2 (0.5 is the observational case). The results are given in …

**Figure 17.** Graph recovery performance comparison between SCOUT and baselines under non-linear SEM and shift interventions. The scale parameter is varied from 0.25 to 2.

**Figure 18.** Graph recovery performance comparison between SCOUT and baselines under non-linear SEM and shift interventions. The number of cycles is varied from 0 to 8.

**Figure 19.** KL-divergence comparison between noisy function, shift, and scale interventions under all the single node interventions for a graph with d = 10 nodes. C.7. Additional Experiments on Perturb-CITE-seq Dataset. We test how well SCOUT performs compared to other baselines when the intervention targets are known on the Perturb-CITE-seq dataset (Frangieh et al., 2021). Additionally, we compared the baselines’ per…

**Figure 20.**

**Figure 21.**

**Figure 22.**

**Figure 23.** The adjacency matrix learnt by SCOUT for co-culture cell condition of Perturb-CITE-seq dataset (Frangieh et al., 2021). … |α| ≤ 1. Then, the combined interventional causal mechanism f^(I_k) ≜ U_k f + (1_d − U_k) f̃ remains contractive. Proof. We should show that f^(I_k) ≜ U_k f + (1_d − U_k) f̃ will still be contractive if f is contractive and f̃ = αf where |α| ≤ 1. f^(I_k)(x) = (U_k + α(I − U_k)) f(x) = A f(x), where…

**Figure 24.** Figure 24 reports the training times of SCOUT and the baseline methods. In contrast to the gradient-based approaches, LLC and BACKSHIFT require no stochastic optimization; as a result, they are substantially faster. NODAGS-Flow has lower runtime than SCOUT, but its formulation does not support unknown-target estimation or neural spline flows for exogenous noise transformation. All runtimes are measured on graphs wi…
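The proof fragment attached to Figure 23 breaks off after introducing A. One way the argument could be completed is sketched below, under two assumptions that are not stated in the excerpt: U_k is a diagonal 0/1 selection matrix, and f is L-Lipschitz with L < 1 in the Euclidean norm.

```latex
\begin{aligned}
A &= U_k + \alpha\,(I - U_k) \quad\text{is diagonal with entries in } \{1, \alpha\},
\qquad\text{so}\quad \lVert A \rVert_2 = \max_i |A_{ii}| \le \max(1, |\alpha|) = 1, \\
\bigl\lVert f^{(I_k)}(x) - f^{(I_k)}(y) \bigr\rVert_2
  &= \bigl\lVert A\,\bigl(f(x) - f(y)\bigr) \bigr\rVert_2
  \;\le\; \lVert A \rVert_2 \,\lVert f(x) - f(y) \rVert_2
  \;\le\; L\,\lVert x - y \rVert_2 .
\end{aligned}
```

Under these assumptions the intervened mechanism inherits the Lipschitz constant L < 1 of f, so a scale intervention with |α| ≤ 1 cannot break contractivity.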

Original abstract

Learning causal relationships between variables from data is a fundamental research area with many applications across disciplines. Most existing causal discovery algorithms rely on the assumptions that (i) the underlying system is acyclic, (ii) the exogenous noise variables are Gaussian, and (iii) the intervention targets for the data-generating experiments are known. While these assumptions simplify the analysis, they are violated in real-life systems. Most existing methods that address these issues either assume the underlying model is linear or are constrained to operate in limited interventional settings. To that end, we propose SCOUT, a novel causal discovery framework for learning nonlinear cyclic causal relationships from soft interventional data with unknown targets. Our approach maximizes the data log-likelihood to recover the graph structure, using two normalizing-flow architectures: contractive residual flows and neural spline flows. Through experiments on synthetic and real-world data, we show that SCOUT outperforms state-of-the-art methods in both causal graph recovery and unknown target recovery across various interventional and noise settings.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

SCOUT combines contractive residual flows and neural spline flows to recover nonlinear cyclic graphs and unknown soft intervention targets via likelihood maximization, but unique identifiability is not established.


SCOUT recovers cyclic causal graphs from soft interventional data when targets are unknown by maximizing likelihood under two normalizing flow architectures. The approach directly targets three common real-world violations at once: cycles, nonlinearity, and unknown intervention targets. That framing fills a practical gap, since most prior methods assume acyclicity or known targets and often stay linear. The paper shows the method outperforming baselines on synthetic data across noise and intervention regimes plus some real-world examples, which gives a concrete sense of where it helps. The architecture choice itself is a reasonable engineering step for handling the required flexibility in the density model.

The main soft spot is identifiability. Nothing in the abstract or stress-test description supplies a theorem showing that the true graph plus targets uniquely maximize the likelihood once the flows are expressive enough. Multiple cyclic structures can often produce similar joint distributions, and contractive residuals plus splines do not automatically rule out observational equivalence. Without either a proof or systematic checks that the optimizer lands on the ground-truth structure rather than an equivalent one, the recovery claim rests on an assumption that may not hold in general. Experiments would also need clear reporting on baseline fairness, hyperparameter search, and whether any data splits or exclusions were post-hoc.

This work is aimed at researchers who need causal models for feedback systems with partial intervention information, such as in systems biology or econometrics. Readers already comfortable with flow models or interventional causal discovery will extract the most value from the specific combination and the reported gains.

It deserves a serious referee because the problem is relevant, the method is technically specified, and the empirical results are presented, even though identifiability and experimental transparency will likely require revision.
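To make the observational-equivalence concern concrete, here is the simplest textbook case, offered as an illustration and not taken from the paper. It is linear, Gaussian and acyclic, so it is far easier than SCOUT's setting, but it shows why a likelihood computed from purely observational data cannot by itself pick out a causal direction. Take two zero-mean Gaussian models with opposite edge directions:

```latex
\begin{aligned}
\text{Model A } (X_1 \to X_2):\quad & X_1 = E_1,\; X_2 = b\,X_1 + E_2,\quad \operatorname{Var}(E_1) = \operatorname{Var}(E_2) = 1
 \;\Rightarrow\; \Sigma_A = \begin{pmatrix} 1 & b \\ b & 1 + b^2 \end{pmatrix}, \\[4pt]
\text{Model B } (X_2 \to X_1):\quad & X_2 = E_2',\; X_1 = c\,X_2 + E_1',\quad c = \tfrac{b}{1+b^2},\;
 \operatorname{Var}(E_2') = 1 + b^2,\; \operatorname{Var}(E_1') = \tfrac{1}{1+b^2}, \\
& \operatorname{Var}(X_1) = c^2 (1+b^2) + \tfrac{1}{1+b^2} = 1,\qquad
 \operatorname{Cov}(X_1, X_2) = c\,(1+b^2) = b
 \;\Rightarrow\; \Sigma_B = \Sigma_A .
\end{aligned}
```

Both models induce the same Gaussian distribution and therefore the same likelihood on any observational dataset. Interventional data or restrictions on the function and noise classes are what can break such ties, which is why the note asks for an explicit identifiability argument.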

Referee Report

2 major / 2 minor

Summary. The manuscript proposes SCOUT, a causal discovery framework for recovering nonlinear cyclic causal graphs from soft interventional data when intervention targets are unknown. The method maximizes the data log-likelihood using two normalizing-flow architectures (contractive residual flows and neural spline flows) to jointly infer the graph structure and the unknown targets, and reports outperformance relative to existing methods on both synthetic and real-world datasets across multiple interventional and noise regimes.

Significance. If the empirical and algorithmic claims hold, the work would constitute a meaningful advance by relaxing the standard assumptions of acyclicity, Gaussian noise, and known targets that constrain most prior causal discovery algorithms. The use of expressive flow models to perform likelihood-based structure recovery in cyclic nonlinear settings is a technically interesting direction, though its reliability rests on whether the fitted flows can distinguish the true structure from observationally equivalent alternatives.

major comments (2)

[Abstract] Abstract: the claim that SCOUT 'outperforms state-of-the-art methods in both causal graph recovery and unknown target recovery' is presented without any description of experimental design, choice of baselines, number of runs, error bars, or data-exclusion criteria. This absence makes it impossible to evaluate whether the reported gains are robust or sensitive to post-hoc decisions.
[Method] The central recovery procedure (likelihood maximization under the contractive residual flow and neural spline flow models) assumes that the maximum-likelihood parameters correspond to the ground-truth cyclic graph and target assignment. In nonlinear cyclic models with soft interventions, multiple distinct graphs can induce the same joint distribution once the flows are sufficiently expressive; the manuscript provides neither an identifiability theorem nor explicit diagnostics (e.g., multiple random initializations or likelihood comparisons across candidate graphs) to rule out observationally equivalent solutions. This assumption is load-bearing for the claim that the recovered graph and targets are causally meaningful.
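The restart diagnostic mentioned in the second comment could be as simple as the following sketch. It assumes each training run yields a thresholded binary adjacency matrix as a nested list; the function names and the entrywise definition of structural Hamming distance (a reversed edge counts as two differences) are choices made here for illustration, not the paper's.

```python
from itertools import combinations


def shd(a, b):
    """Entrywise structural Hamming distance between two binary adjacency matrices."""
    return sum(x != y for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b))


def restart_agreement(graphs):
    """Largest pairwise SHD among graphs recovered from different random seeds.
    A value of 0 means every restart returned the same structure."""
    return max((shd(a, b) for a, b in combinations(graphs, 2)), default=0)


# Hypothetical outputs of three restarts on a 3-node problem.
run_a = [[0, 1, 0], [0, 0, 1], [1, 0, 0]]
run_b = [[0, 1, 0], [0, 0, 1], [1, 0, 0]]
run_c = [[0, 1, 0], [0, 0, 1], [0, 0, 0]]  # misses the edge 2 -> 0
print(restart_agreement([run_a, run_b, run_c]))
```

Agreement across restarts does not prove identifiability, since all runs could converge to the same wrong member of an equivalence class, but disagreement is a cheap warning sign worth reporting.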

minor comments (2)

The precise parameterization of the contractive residual flows (e.g., contraction constant, residual block architecture) and the neural spline flows (e.g., number of bins, tail bound) should be stated explicitly, together with any regularization used to enforce contractivity.
Notation for the soft intervention parameters and the mapping from flow parameters to graph edges should be introduced once and used consistently; currently the transition from flow outputs to adjacency matrix is described only at a high level.
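For readers unfamiliar with how contractivity is commonly enforced, the sketch below estimates a weight matrix's spectral norm by power iteration and rescales the matrix so that the norm stays below a chosen constant `c` < 1. This is the generic spectral-normalization idea, written in plain Python for illustration; the function names and the default `c = 0.9` are assumptions, and the paper's actual parameterization is exactly what the first minor comment asks the authors to state.

```python
import math
import random


def spectral_norm(W, n_iter=100, seed=0):
    """Estimate the largest singular value of W by power iteration on W^T W."""
    rng = random.Random(seed)
    rows, cols = len(W), len(W[0])
    v = [rng.gauss(0.0, 1.0) for _ in range(cols)]
    for _ in range(n_iter):
        u = [sum(W[i][j] * v[j] for j in range(cols)) for i in range(rows)]  # u = W v
        v = [sum(W[i][j] * u[i] for i in range(rows)) for j in range(cols)]  # v = W^T u
        norm = math.sqrt(sum(t * t for t in v))
        if norm == 0.0:
            return 0.0
        v = [t / norm for t in v]
    u = [sum(W[i][j] * v[j] for j in range(cols)) for i in range(rows)]
    return math.sqrt(sum(t * t for t in u))  # ||W v|| for the unit vector v


def enforce_contraction(W, c=0.9):
    """Rescale W so its spectral norm is at most c; leave it unchanged if already below c."""
    sigma = spectral_norm(W)
    factor = min(1.0, c / sigma) if sigma > 0 else 1.0
    return [[factor * w for w in row] for row in W]


W = [[3.0, 0.0], [0.0, 1.0]]
W_contractive = enforce_contraction(W, c=0.9)
```

With 1-Lipschitz activations, the Lipschitz constant of a feed-forward network is bounded by the product of its layers' spectral norms, so applying this rescaling to every layer keeps the whole mechanism contractive.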

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback. We address each major comment point by point below, outlining specific revisions where appropriate. Our goal is to improve the clarity of the experimental claims and to strengthen the discussion around the recovery procedure.


Referee: [Abstract] Abstract: the claim that SCOUT 'outperforms state-of-the-art methods in both causal graph recovery and unknown target recovery' is presented without any description of experimental design, choice of baselines, number of runs, error bars, or data-exclusion criteria. This absence makes it impossible to evaluate whether the reported gains are robust or sensitive to post-hoc decisions.

Authors: We agree that the abstract would benefit from additional context to allow readers to assess the reported results. In the revised manuscript we will expand the final sentence of the abstract to briefly describe the experimental protocol: synthetic data generated under multiple noise regimes and intervention densities, real-world datasets, comparison against representative baselines from the cyclic and interventional causal discovery literature, and aggregation over multiple independent runs with standard errors. These additions will be kept concise while providing the necessary information on design and variability.
revision: yes
Referee: [Method] The central recovery procedure (likelihood maximization under the contractive residual flow and neural spline flow models) assumes that the maximum-likelihood parameters correspond to the ground-truth cyclic graph and target assignment. In nonlinear cyclic models with soft interventions, multiple distinct graphs can induce the same joint distribution once the flows are sufficiently expressive; the manuscript provides neither an identifiability theorem nor explicit diagnostics (e.g., multiple random initializations or likelihood comparisons across candidate graphs) to rule out observationally equivalent solutions. This assumption is load-bearing for the claim that the recovered graph and targets are causally meaningful.

Authors: We acknowledge the theoretical subtlety raised. The manuscript currently relies on empirical recovery performance rather than a formal identifiability result. In revision we will add a dedicated paragraph in the method section that (i) explains how the contractive residual flow architecture restricts the function class in a manner that reduces the set of observationally equivalent graphs, (ii) reports results from multiple random initializations showing convergence to the same recovered graph and target set, and (iii) includes likelihood comparisons between the recovered structure and a small set of plausible alternatives. We will also explicitly note that the current claims rest on these empirical diagnostics and that a complete identifiability theorem remains an open question for future work. These changes address the concern without overstating theoretical guarantees.
revision: partial

Circularity Check

0 steps flagged

No significant circularity; likelihood maximization recovers structure from external data without definitional reduction.

full rationale

The paper proposes SCOUT as a method that maximizes data log-likelihood under contractive residual flows and neural spline flows to jointly recover cyclic graph structure and unknown soft intervention targets. This is an empirical fitting procedure applied to synthetic and real-world datasets, with performance measured by comparison to ground-truth structures and baselines. No step in the provided abstract or description reduces a claimed prediction or recovery result to a quantity defined by the fitted parameters themselves, nor does it rely on load-bearing self-citations or imported uniqueness theorems that would make the output equivalent to the input by construction. The approach is self-contained against external benchmarks and does not exhibit any of the enumerated circularity patterns.
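The comparison to ground-truth structures is reported as AUPRC in the figure captions. A common way to estimate it is average precision over all candidate directed edges, sketched below in plain Python. The function name `edge_auprc` and the choice of this particular estimator are assumptions; the page does not say how the paper computes the metric, and ties between scores are not handled specially here.

```python
def edge_auprc(scores, truth):
    """Average precision over off-diagonal entries.

    scores[i][j] is the predicted strength of the edge i -> j and
    truth[i][j] is 1 if that edge is in the ground-truth graph, else 0.
    """
    d = len(truth)
    pairs = [(scores[i][j], truth[i][j])
             for i in range(d) for j in range(d) if i != j]
    n_pos = sum(t for _, t in pairs)
    if n_pos == 0:
        raise ValueError("ground-truth graph has no edges")
    pairs.sort(key=lambda p: p[0], reverse=True)
    hits, total = 0, 0.0
    for rank, (_, t) in enumerate(pairs, start=1):
        if t:
            hits += 1
            total += hits / rank  # precision at the rank of each true edge
    return total / n_pos


# Ground truth: the 3-cycle 0 -> 1 -> 2 -> 0.
truth = [[0, 1, 0], [0, 0, 1], [1, 0, 0]]
# Hypothetical scores that rank the false edge 0 -> 2 above all true edges.
scores = [[0.0, 0.8, 0.9], [0.1, 0.0, 0.7], [0.6, 0.1, 0.0]]
value = edge_auprc(scores, truth)
```

Because the metric is threshold-free, it works for cyclic graphs exactly as for DAGs: each ordered pair of nodes is scored independently and no acyclicity constraint is involved.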

Axiom & Free-Parameter Ledger

0 free parameters · 1 axiom · 0 invented entities

Only the abstract is available, so the ledger is necessarily incomplete; the approach rests on standard causal discovery modeling assumptions plus the representational capacity of the two flow families.

axioms (1)

Domain assumption: The observed data is generated by a structural causal model that may contain cycles and is compatible with soft interventions whose targets are unobserved.
Invoked in the problem setup to justify the likelihood-based recovery procedure.



Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean · washburn_uniqueness_aczel · unclear
Relation between the paper passage and the cited Recognition theorem: unclear.

We use neural networks to parametrize the function f, and the contractivity assumption can be conserved with spectral normalization... p_{I_k,G}(X) = ... det(J_{id − f^(I_k)}(X)) ...

IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean · reality_from_one_distinction · unclear
Relation between the paper passage and the cited Recognition theorem: unclear.

Theorem 3.1 ... Ĝ is I*-Markov equivalent to G* ...

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

21 extracted references · 21 canonical work pages · 4 internal anchors

[1]

Behrmann, J., Grathwohl, W., Chen, R

URL https://proceedings.mlr.press/v124/amendola20a.html. Behrmann, J., Grathwohl, W., Chen, R. T. Q., Duvenaud, D., and Jacobsen, J.-H. Invertible residual networks. In Chaudhuri, K. and Salakhutdinov, R. (eds.), Proceedings of the 36th International Conference on Machine Learning, volume 97 of Proceedings of Machine Learning Research, pp. 573–582. PML...

doi:10.1002/9781118619179 · 2019
[2]

cc/paper_files/paper/2019/file/ 5d0d5594d24f0f955548f0fc0ff83d10-Paper

URL https://proceedings.neurips.cc/paper_files/paper/2019/file/5d0d5594d24f0f955548f0fc0ff83d10-Paper.pdf. Dibaeinia, P. and Sinha, S. Sergio: A single-cell expression simulator guided by gene regulatory networks. Cell Systems, 11, 08 2020. doi: 10.1016/j.cels.2020.08.003. Dixit, A., Parnas, O., Li, B., Chen, J., Fulco, C. P., Jerby-Arnon, L., Marjano...

doi:10.1016/j.cels.2020.08.003 · 2019
[3]

Drton, M., Fox, C., and Wang, Y

URL https://www.sciencedirect.com/science/article/pii/S0092867416316105. Drton, M., Fox, C., and Wang, Y. S. Computation of maximum likelihood estimates in cyclic structural equation models. The Annals of Statistics, 47(2):663–690,
[4]

URL https://doi

doi: 10.1214/17-AOS1602. URL https://doi.org/10.1214/17-AOS1602. Durkan, C., Bekasov, A., Murray, I., and Papamakarios, G. Neural spline flows. In Wallach, H., Larochelle, H., Beygelzimer, A., d'Alché-Buc, F., Fox, E., and Garnett, R. (eds.), Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc.,

doi:10.1214/17-aos1602
[5]

Markov Properties for Graphical Models with Cycles and Latent Variables

URL https://proceedings.neurips.cc/paper_files/paper/2019/file/7ac71d433f282034e088473244df8c02-Paper.pdf. Forré, P. and Mooij, J. M. Markov properties for graphical models with cycles and latent variables, 2017. URL https://arxiv.org/abs/1710.08775. Frangieh, C., Melms, J., Thakore, P., Geiger-Schuller, K., Ho, P., Luoma, A., Cleary, B., Jerby-Arnon,...

doi:10.1038/s41588-021-00779-1 · 2019
[6]

Publisher Copyright: © 2022, The Author(s), under exclusive licence to Springer Nature America, Inc

doi: 10.1038/s41588-022-01106-y. Publisher Copyright: © 2022, The Author(s), under exclusive licence to Springer Nature America, Inc. Gamella, J. L. and Heinze-Deml, C. Active invariant causal prediction: Experiment selection through stability. arXiv: Methodology, 2020. URL https://api.semanticscholar.org/CorpusID:219558684. Hägele, A., Rothfuss, J., ...

doi:10.1038/s41588-022-01106-y · 2022
[7]

Hauser, A

URL https://openreview.net/forum?id=gbgPtVkztWn. Hauser, A. and Bühlmann, P. Characterization and greedy learning of interventional markov equivalence classes of directed acyclic graphs. J. Mach. Learn. Res., 13(1):2409–2464, August 2012. ISSN 1532-4435. Heinze-Deml, C., Peters, J., and Meinshausen, N. Invariant causal prediction for nonlinear models....

doi:10.1515/jci-2017-0016 · 2012
[8]

Hyttinen, A., Eberhardt, F., and Hoyer, P

URL https://proceedings.mlr.press/v124/huetter20a.html. Hyttinen, A., Eberhardt, F., and Hoyer, P. O. Learning linear cyclic causal models with latent variables. Journal of Machine Learning Research, 13(109):3387–3439, 2012. URL http://jmlr.org/papers/v13/hyttinen12a.html. Jaber, A., Kocaoglu, M., Shanmugam, K., and Bareinboim, E. Causal discovery ...

2012
[9]

Adam: A Method for Stochastic Optimization

URL https://proceedings.neurips.cc/paper_files/paper/2020/file/6cd9313ed34ef58bad3fdd504355e72c-Paper.pdf. Jang, E., Gu, S., and Poole, B. Categorical reparameterization with gumbel-softmax. In International Conference on Learning Representations, 2017. URL https://openreview.net/forum?id=rkE3y85ee. Kingma, D. P. and Ba, J. Adam: A method for stoch...

2020
[10]

Discovering Cyclic Causal Models by Independent Components Analysis

URL https://proceedings.neurips.cc/paper_files/paper/2019/file/c3d96fbd5b1b45096ff04c04038fff5d-Paper.pdf. Lacerda, G., Spirtes, P., Ramsey, J., and Hoyer, P. O. Discovering cyclic causal models by independent components analysis. ArXiv, abs/1206.3273,

2019
[11]

org/CorpusID:84256

URL https://api.semanticscholar.org/CorpusID:84256. Lee, H.-C., Danieletto, M., Miotto, R., Cherng, S., and Dudley, J. T. Scaling structural learning with no-bears to infer causal transcriptome networks. Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing, 25:391–402,
[12]

org/CorpusID:207773853

URL https://api.semanticscholar.org/CorpusID:207773853. M. Mooij, J. and Claassen, T. Constraint-based causal discovery using partial ancestral graphs in the presence of cycles. In Peters, J. and Sontag, D. (eds.), Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI), volume 124 of Proceedings of Machine Learning Research,...
[13]

URL https://proceedings.mlr.press/v124/m-mooij20a.html. Meek, C. Graphical Models: Selecting causal and statistical models. 1 1997. doi: 10.1184/R1/22696393.v1. URL https://kilthub.cmu.edu/articles/thesis/Graphical_Models_Selecting_causal_and_statistical_models/22696393. Mokhtarian, E., Salehkaleybar, S., Ghassami, A., and Kiyavash, N. A unified exper...

doi:10.1184/r1/22696393.v1 · 1997
[14]

Directed Cyclic Graphical Representations of Feedback Models

URL https://proceedings.mlr.press/v206/sethuraman23a.html. Solus, L., Wang, Y., and Uhler, C. Consistency guarantees for greedy permutation-based causal inference algorithms. Biometrika, 2017. URL https://api.semanticscholar.org/CorpusID:234167535. Spirtes, P., Glymour, C., and Scheines, R. Causation, Prediction, and Search. 01 2001. ISBN 9780262284...

doi:10.7551/mitpress/1754.001.0001 · 2017
[15]

Wang, Y., Solus, L., Yang, K., and Uhler, C

URL https://proceedings.mlr.press/v180/varici22a.html. Wang, Y., Solus, L., Yang, K., and Uhler, C. Permutation-based causal inference algorithms with interventions. In Guyon, I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., and Garnett, R. (eds.), Advances in Neural Information Processing Systems, volume 30. Curran Associat...
[16]

cc/paper_files/paper/2017/file/275d7fb2fd45098ad5c3ece2ed4a2824-Paper

URL https://proceedings.neurips.cc/paper_files/paper/2017/file/275d7fb2fd45098ad5c3ece2ed4a2824-Paper.pdf. Yang, Y., Salehkaleybar, S., and Kiyavash, N. Learning unknown intervention targets in structural causal models from heterogeneous data. In International Conference on...

2017
[17]

org/CorpusID:53217974

URL https://api.semanticscholar.org/CorpusID:53217974. Zheng, X., Dan, C., Aragam, B., Ravikumar, P., and Xing, E. Learning sparse nonparametric dags. In Chiappa, S. and Calandra, R. (eds.), Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, volume 108 of Proceedings of Machine Learning Research, pp. 3414–34...
[18]

URL https://proceedings.mlr.press/v108/zheng20a.html. The appendix is organized as follows: Appendix A discusses the solvability of cyclic systems under equilibrium and justifies the contractivity assumption on observed and intervened causal mechanisms. Appendix B develops the...

2021
[19]

The path π is said to be σ-open given C if it is not σ-blocked

the first node of π, i_0 ∈ C, or its last node i_n ∈ C, or 2. π contains a collider i_k ∉ an_G(C), or 3. π contains a non-collider i_k ∈ C that points towards a neighbor that is not in the same strongly connected component as i_k in G, i.e., such that i_{k−1} ← i_k in π and i_{k−1} ∉ sc_G(i_k), or i_k → i_{k+1} in π and i_{k+1} ∉ sc_G(i_k). The path π is said to be σ-open given C if it is not σ-blo...

2017
[20]

Let π be a discriminating path for a node v in G1, and let π′ be the corresponding path to π in G2. If π′ is also a discriminating path for v, then v is a collider on π in G1 if and only if it is a collider on π′ in G2. Hence, by Theorem B.16, two directed graphs G1 and G2 are I-Markov equivalent if and only if their corresponding σ-MAGs, σ-MAG(G1^I) and σ-MAG...

2023
[21]

We then split the data by experimental condition (co-culture, IFN-γ, and control), training and evaluating models separately within each condition

selected from the full set of measured genes. We then split the data by experimental condition (co-culture, IFN-γ, and control), training and evaluating models separately within each condition. . 30 SCOUT: Cyclic Causal Discovery Under Soft Interventions with Unknown Targets Table 12.The selected gene set from the Perturb-CITE-seq dataset (Frangieh et al....

work page 2021
