pith. sign in

arxiv: 2605.19343 · v1 · pith:PQNZ4BQFnew · submitted 2026-05-19 · 💻 cs.LG

What Makes a Representation Good for Single-Cell Perturbation Prediction?

Pith reviewed 2026-05-20 08:03 UTC · model grok-4.3

classification 💻 cs.LG
keywords single-cell perturbationrepresentation learningcausal representationsvariational autoencoderdisentanglementgene expressionout-of-distribution predictionperturbation modeling
0
0 comments X

The pith

PerturbedVAE separates sparse perturbation signals from dominant invariant gene expression to recover causal representations for accurate single-cell response prediction.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Gene expression data is mostly unchanged by perturbations, so the actual response signals are sparse and easily lost or mixed with the background. Current approaches either blend the two kinds of information into non-generalizable predictors or suppress the perturbation signals, which hurts performance especially when predicting responses to new combinations of perturbations. PerturbedVAE is built to keep the invariant structure separate while isolating and using the perturbation-specific part through causal representations. The paper supplies an identifiability analysis that states the conditions under which the sparse effects can be recovered reliably. On standard benchmarks this separation produces state-of-the-art results and clear gains on out-of-distribution combinatorial tasks along with interpretable response programs.

Core claim

PerturbedVAE explicitly separates perturbation-specific information from dominant invariant structure and recovers causal representations to effectively utilize such information for prediction, achieving state-of-the-art performance on a widely used benchmark with significant gains on out-of-distribution combinatorial predictions.

What carries the argument

PerturbedVAE, a variational autoencoder that disentangles invariant cellular structure from sparse perturbation effects while recovering causal factors, guided by an identifiability analysis that specifies when the separation is possible.

If this is right

  • Predictions become more accurate for unseen combinations of genetic perturbations.
  • Recovered representations reveal interpretable perturbation-response programs.
  • Causal structure in the latent space supports more reliable downstream use of the learned representations.
  • The framework generalizes across multiple evaluation settings on standard single-cell benchmarks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Similar separation of dominant background from sparse signals could apply to other high-dimensional biological datasets where most variation is unrelated to the intervention of interest.
  • The identifiability conditions might be tested directly by measuring how well recovered factors align with known causal perturbation pathways in new datasets.
  • If the separation holds, it suggests a route to more parameter-efficient models that do not need to encode the full invariant background for every prediction task.

Load-bearing premise

The identifiability analysis correctly characterizes the conditions under which sparse perturbation effects can be reliably recovered from the data.

What would settle it

A controlled experiment in which representations learned without the explicit separation match or exceed PerturbedVAE performance on out-of-distribution combinatorial perturbation predictions would falsify the central claim.

Figures

Figures reproduced from arXiv: 2605.19343 by Ehsan Abbasnejad, Erdun Gao, Javen Qinfeng Shi, Jiayi Dong, Lina Yao, Wenkang Jiang, Yichao Cai, Yuhang Liu.

Figure 1
Figure 1. Figure 1: Linear probe accuracy (↑) for predicting perturbation labels. FM representations exhibit weaker linear decodability of perturbation labels than a PCA baseline, suggesting limited ac￾cessibility of perturbation-related information. In contrast, Per￾turbedVAE achieves competitive linear probing accuracy, indicat￾ing improved accessibility of perturbation-specific signals. As shown in [PITH_FULL_IMAGE:figure… view at source ↗
Figure 2
Figure 2. Figure 2: A latent generative model of single-cell perturba￾tions. zι denotes perturbation-invariant variable and zν denotes perturbation-responsive variables. Under the above generative assumptions, the joint prior dis￾tribution over the latent variables factorizes as p(zν, zι | u) = p(zν | u, zι) p(zι), (1) where the perturbation-invariant variables zι are in￾dependent of the perturbation condition u, while the pe… view at source ↗
Figure 3
Figure 3. Figure 3: Framework of the proposed PerturbedVAE. Perturbed x are used to learn the perturbation-responsive block zν, which captures the effects of perturbations indexed by u. In parallel, unperturbed x (u0) are leveraged for contrastive alignment of the perturbation￾invariant block zι, encouraging invariant background programs to be separated from perturbation-induced variation. variation. The contrastive alignment… view at source ↗
Figure 4
Figure 4. Figure 4: R 2 scores for genetic perturbation prediction across dif￾ferent latent dimensions dim(z). Left: single-gene perturbations. Right: double-gene perturbations. The proposed method consis￾tently achieves higher R 2 and remains stable as dim(z) increases. Shaded areas denote variation across runs. Double-Gene Perturbation. We further evaluate 112 double-gene perturbations, which constitute a zero-shot predicti… view at source ↗
Figure 5
Figure 5. Figure 5: Learned causal structure over the perturbation￾responsive latent variables zν. Nodes correspond to latent pro￾grams assigned to target genes via maximal intervention effect, and directed edges indicate inferred causal dependencies. Several known regulatory relationships are recovered. 6. Conclusion We showed that the failure of foundation models in per￾turbation prediction is not primarily a matter of scal… view at source ↗
Figure 6
Figure 6. Figure 6: t-SNE visualization of the invariant block zι for single-gene (a) and double-gene (b) perturbations in the test set. Together, these results provide additional evidence that zι captures perturbation-invariant variation in gene expression, validating the disentanglement between the invariant background block zι and the perturbation-responsive block zν. 27 [PITH_FULL_IMAGE:figures/full_fig_p027_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Perturbed gene hits on identifiable causal components. Learned Causal Structure. To further examine the structure of the learned latent representation, we visualize the di￾rected dependencies among the identifiable components zν [PITH_FULL_IMAGE:figures/full_fig_p028_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Visualization of the learned causal graph among identifiable components zν. Left: full adjacency matrix before thresholding. Right: sparse graph after thresholding (τ = 0.25). Semantic Grounding of Latent Programs. We next assess whether the latent components correspond to coherent bio￾logical programs [PITH_FULL_IMAGE:figures/full_fig_p028_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Inferred causal structure among latent programs. Biological Plausibility of Inferred Edges. Finally, we examine whether representative inferred causal edges are con￾sistent with known biological mechanisms [PITH_FULL_IMAGE:figures/full_fig_p029_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Performance on DE genes for double-gene perturbations. Interpretation: Information Preservation under Invariance. These patterns are consistent with the intended design of the model: perturbation-invariant background variation is stabilized in the invariant block zι , while perturbation-specific information is retained in the perturbed block zν rather than being suppressed. 30 [PITH_FULL_IMAGE:figures/fu… view at source ↗
Figure 11
Figure 11. Figure 11: Performance on DE genes for single-gene perturbations. 31 [PITH_FULL_IMAGE:figures/full_fig_p031_11.png] view at source ↗
read the original abstract

Single-cell perturbation modeling is fundamental for understanding and predicting cellular responses to genetic perturbations. However, existing approaches, from causal representation learning to foundation models, often struggle with an overlooked challenge: gene expression is dominated by perturbation-invariant information, while perturbation-specific signals are intrinsically sparse. As a result, learned representations either entangle invariant and perturbation-specific information, leading to spurious and non-generalizable predictors, or suppress perturbation-specific signals altogether, rendering them ineffective for prediction. To address this, we propose PerturbedVAE, a general framework designed to resolve this signal imbalance. The framework explicitly separates perturbation-specific information from dominant invariant structure and recovers causal representations to effectively utilize such information for prediction. We further provide an identifiability analysis that characterizes the conditions under which sparse perturbation effects can be reliably recovered, thereby clarifying how the framework can be concretely specified under such conditions. Empirically, PerturbedVAE achieves state-of-the-art performance on a widely used benchmark across multiple evaluation settings, yielding significant gains on out-of-distribution combinatorial predictions and uncovering interpretable perturbation-response programs.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The paper introduces PerturbedVAE, a VAE-based framework for single-cell perturbation prediction that explicitly separates sparse perturbation-specific signals from dominant invariant gene expression structure. It provides an identifiability analysis characterizing conditions for recovering causal representations under sparsity assumptions and reports state-of-the-art empirical performance on a standard benchmark, with particular gains on out-of-distribution combinatorial perturbation predictions.

Significance. If the identifiability results and empirical claims hold under realistic data conditions, the work could advance causal representation learning for perturbation modeling by directly addressing signal imbalance, potentially improving generalization in single-cell biology applications. The combination of theoretical analysis and benchmark results on combinatorial OOD tasks represents a meaningful contribution if the separation mechanism is shown to be load-bearing.

major comments (2)
  1. [§3] §3 (Identifiability Analysis): The analysis relies on assumptions of independent sparse perturbations and a specific mixing model to guarantee recovery of causal representations. However, the manuscript does not verify whether the high-dimensional single-cell benchmark datasets satisfy these conditions (e.g., via correlation analysis or simulation under batch effects), leaving open the possibility that recovered latents remain entangled rather than causally separated.
  2. [§5.2, Table 2] §5.2, Table 2 (OOD combinatorial results): The reported gains over baselines are presented as evidence for effective use of perturbation-specific information, but the paper lacks an ablation removing the explicit separation component while keeping other regularization choices fixed. Without this, it is unclear whether the improvements stem from causal recovery or from general VAE regularization benefits.
minor comments (3)
  1. [Abstract] The abstract claims 'significant gains' on OOD predictions but does not include even approximate quantitative values (e.g., percentage improvement or delta in a metric); adding a brief quantification would improve readability.
  2. [§2.2] Notation in §2.2 for the latent decomposition (invariant vs. perturbation-specific) could be made more explicit by introducing distinct symbols for each component rather than relying on context.
  3. [Figure 3] Figure 3 caption does not specify the exact number of runs or error bars used for the performance curves, which is needed to assess variability of the SOTA claims.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback on our manuscript. We address the major comments point by point below, providing clarifications and outlining planned revisions to strengthen the paper.

read point-by-point responses
  1. Referee: [§3] §3 (Identifiability Analysis): The analysis relies on assumptions of independent sparse perturbations and a specific mixing model to guarantee recovery of causal representations. However, the manuscript does not verify whether the high-dimensional single-cell benchmark datasets satisfy these conditions (e.g., via correlation analysis or simulation under batch effects), leaving open the possibility that recovered latents remain entangled rather than causally separated.

    Authors: We agree that explicitly verifying the assumptions on the benchmark datasets would enhance the robustness of the identifiability claims. The analysis assumes independent sparse perturbations and a linear mixing model, which are motivated by the biological sparsity of perturbation effects. In the revised manuscript, we will include additional analyses, such as computing correlations between different perturbations in the datasets and performing simulations that incorporate batch effects, to assess how well the conditions hold in practice. revision: yes

  2. Referee: [§5.2, Table 2] §5.2, Table 2 (OOD combinatorial results): The reported gains over baselines are presented as evidence for effective use of perturbation-specific information, but the paper lacks an ablation removing the explicit separation component while keeping other regularization choices fixed. Without this, it is unclear whether the improvements stem from causal recovery or from general VAE regularization benefits.

    Authors: We acknowledge the value of an ablation that isolates the contribution of the explicit separation mechanism. While the current manuscript compares against various baselines and includes some regularization ablations, it does not specifically remove only the separation component. We will add this ablation study in the revised version to demonstrate that the performance gains on OOD combinatorial predictions are attributable to the perturbation-specific separation rather than general VAE benefits. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation is self-contained

full rationale

The paper's central derivation introduces PerturbedVAE as a framework that explicitly separates perturbation-specific signals from invariant structure, then supplies its own identifiability analysis characterizing recovery conditions for sparse effects. No load-bearing step reduces by construction to a fitted parameter renamed as prediction, nor to a prior self-citation whose content is unverified; the analysis is presented as part of the current work rather than imported from overlapping prior authorship. The empirical SOTA claims rest on benchmark evaluation rather than tautological re-expression of inputs. This is the normal case of an independent modeling contribution.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 1 invented entities

The approach rests on the domain premise of dominant invariant signals with sparse perturbation effects and introduces a new VAE-based framework to exploit this structure.

free parameters (1)
  • VAE hyperparameters such as latent dimension and regularization strength
    Standard in variational models and required to balance separation of signals.
axioms (2)
  • domain assumption Gene expression is dominated by perturbation-invariant information while perturbation-specific signals are intrinsically sparse.
    This is the central overlooked challenge stated as the motivation for the framework.
  • domain assumption Sparse perturbation effects are recoverable under conditions characterized by the identifiability analysis.
    Directly invoked to justify concrete specification of the framework.
invented entities (1)
  • PerturbedVAE no independent evidence
    purpose: Framework that separates perturbation-specific information from invariant structure to recover causal representations.
    New model proposed to resolve the signal imbalance.

pith-pipeline@v0.9.0 · 5740 in / 1440 out tokens · 45818 ms · 2026-05-20T08:03:16.360859+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

  • IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear
    ?
    unclear

    Relation between the paper passage and the cited Recognition theorem.

    We propose PerturbedVAE... explicitly separates perturbation-specific information from dominant invariant structure... identifiability analysis that characterizes the conditions under which sparse perturbation effects can be reliably recovered

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

124 extracted references · 124 canonical work pages · 3 internal anchors

  1. [1]

    Science , volume=

    Exploring genetic interaction manifolds constructed from rich single-cell phenotypes , author=. Science , volume=. 2019 , publisher=

  2. [2]

    Cell , volume=

    Genome-scale CRISPR-mediated control of gene repression and activation , author=. Cell , volume=. 2014 , publisher=

  3. [3]

    Understanding intermediate layers using linear classifier probes

    Understanding intermediate layers using linear classifier probes , author=. arXiv preprint arXiv:1610.01644 , year=

  4. [4]

    Language models represent space and time,

    Language models represent space and time , author=. arXiv preprint arXiv:2310.02207 , year=

  5. [5]

    Advances in Neural Information Processing Systems , volume=

    Inference-time intervention: Eliciting truthful answers from a language model , author=. Advances in Neural Information Processing Systems , volume=

  6. [6]

    Nature medicine , volume=

    A visual--language foundation model for pathology image analysis using medical twitter , author=. Nature medicine , volume=. 2023 , publisher=

  7. [7]

    Nature Methods , volume=

    Deep-learning-based gene perturbation effect prediction does not yet outperform simple linear baselines , author=. Nature Methods , volume=. 2025 , publisher=

  8. [8]

    Machine learning , volume=

    Random forests , author=. Machine learning , volume=. 2001 , publisher=

  9. [9]

    IEEE transactions on information theory , volume=

    Nearest neighbor pattern classification , author=. IEEE transactions on information theory , volume=. 1967 , publisher=

  10. [10]

    Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=

    Regularization and variable selection via the elastic net , author=. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume=. 2005 , publisher=

  11. [11]

    BMC genomics , volume=

    Benchmarking foundation cell models for post-perturbation RNA-seq prediction , author=. BMC genomics , volume=. 2025 , publisher=

  12. [12]

    Nature , volume=

    Transfer learning enables predictions in network biology , author=. Nature , volume=. 2023 , publisher=

  13. [13]

    Nature Machine Intelligence , volume=

    scBERT as a large-scale pretrained deep language model for cell type annotation of single-cell RNA-seq data , author=. Nature Machine Intelligence , volume=. 2022 , publisher=

  14. [14]

    Nature methods , volume=

    Large-scale foundation model on single-cell transcriptomics , author=. Nature methods , volume=. 2024 , publisher=

  15. [15]

    International conference on machine learning , pages=

    A theoretical analysis of contrastive unsupervised representation learning , author=. International conference on machine learning , pages=. 2019 , organization=

  16. [16]

    International conference on machine learning , pages=

    Understanding contrastive representation learning through alignment and uniformity on the hypersphere , author=. International conference on machine learning , pages=. 2020 , organization=

  17. [17]

    Causal representation learning from multiple distributions: A general setting

    Causal representation learning from multiple distributions: A general setting , author=. arXiv preprint arXiv:2402.05052 , year=

  18. [18]

    Nature methods , volume=

    scGPT: toward building a foundation model for single-cell multi-omics using generative AI , author=. Nature methods , volume=. 2024 , publisher=

  19. [19]

    Advances in Neural Information Processing Systems , volume=

    Identifiability guarantees for causal disentanglement from soft interventions , author=. Advances in Neural Information Processing Systems , volume=

  20. [20]

    arXiv preprint arXiv:2206.06169 , year=

    Causal representation learning for instantaneous and temporal effects in interactive systems , author=. arXiv preprint arXiv:2206.06169 , year=

  21. [21]

    International Conference on Machine Learning , pages=

    Citris: Causal identifiability from temporal intervened sequences , author=. International Conference on Machine Learning , pages=. 2022 , organization=

  22. [22]

    Advances in Neural Information Processing Systems , volume=

    Weakly supervised causal representation learning , author=. Advances in Neural Information Processing Systems , volume=

  23. [23]

    Journal of machine learning research , volume=

    Visualizing data using t-SNE , author=. Journal of machine learning research , volume=

  24. [24]

    Advances in neural information processing systems , volume=

    Self-supervised learning with data augmentations provably isolates content from style , author=. Advances in neural information processing systems , volume=

  25. [25]

    Neural networks , volume=

    Nonlinear independent component analysis: Existence and uniqueness results , author=. Neural networks , volume=. 1999 , publisher=

  26. [26]

    Natural Image Statistics: A Probabilistic Approach to Early Computational Vision , pages=

    Independent component analysis , author=. Natural Image Statistics: A Probabilistic Approach to Early Computational Vision , pages=. 2001 , publisher=

  27. [27]

    Journal of Machine Learning Research , volume=

    Identifying weight-variant latent causal models , author=. Journal of Machine Learning Research , volume=

  28. [28]

    The Fourteenth International Conference on Learning Representations , year=

    I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data? , author=. The Fourteenth International Conference on Learning Representations , year=

  29. [29]

    Yuhang Liu and Zhen Zhang and Dong Gong and Erdun Gao and Biwei Huang and Mingming Gong and Anton van den Hengel and Kun Zhang and Javen Qinfeng Shi , booktitle=. Beyond

  30. [30]

    Transactions on Machine Learning Research , issn=

    Latent Covariate Shift: Unlocking Partial Identifiability for Multi-Source Domain Adaptation , author=. Transactions on Machine Learning Research , issn=

  31. [31]

    arXiv preprint arXiv:2403.15711 , year=

    Towards Identifiable Latent Additive Noise Models , author=. arXiv preprint arXiv:2403.15711 , year=

  32. [32]

    Advances in neural information processing systems , volume=

    Unsupervised feature extraction by time-contrastive learning and nonlinear ica , author=. Advances in neural information processing systems , volume=

  33. [33]

    Bioinformatics , volume=

    Predicting single-cell cellular responses to perturbations using cycle consistency learning , author=. Bioinformatics , volume=. 2024 , publisher=

  34. [34]

    International conference on machine learning , pages=

    Linear causal disentanglement via interventions , author=. International conference on machine learning , pages=. 2023 , organization=

  35. [35]

    Artificial intelligence and statistics , pages=

    Nonlinear ICA of temporally dependent stationary sources , author=. Artificial intelligence and statistics , pages=. 2017 , organization=

  36. [36]

    The Twelfth International Conference on Learning Representations , year=

    Identifiable Latent Polynomial Causal Models through the Lens of Change , author=. The Twelfth International Conference on Learning Representations , year=

  37. [37]

    arXiv preprint arXiv:2001.04872 , year=

    Disentanglement by nonlinear ica with general incompressible-flow networks (gin) , author=. arXiv preprint arXiv:2001.04872 , year=

  38. [38]

    Probabilistic and causal inference: The works of Judea Pearl , pages=

    Causality for machine learning , author=. Probabilistic and causal inference: The works of Judea Pearl , pages=

  39. [39]

    International conference on artificial intelligence and statistics , pages=

    Variational autoencoders and nonlinear ica: A unifying framework , author=. International conference on artificial intelligence and statistics , pages=. 2020 , organization=

  40. [40]

    2013 , publisher=

    Auto-encoding variational bayes , author=. 2013 , publisher=

  41. [41]

    International conference on machine learning , pages=

    Stochastic backpropagation and approximate inference in deep generative models , author=. International conference on machine learning , pages=. 2014 , organization=

  42. [42]

    Advances in neural information processing systems , volume=

    Dags with no tears: Continuous optimization for structure learning , author=. Advances in neural information processing systems , volume=

  43. [43]

    Advances in Neural Information Processing Systems , volume=

    Learning identifiable factorized causal representations of cellular responses , author=. Advances in Neural Information Processing Systems , volume=

  44. [44]

    Advances in Neural Information Processing Systems , volume=

    Nonparametric identifiability of causal representations from unknown interventions , author=. Advances in Neural Information Processing Systems , volume=

  45. [45]

    Nature methods , volume=

    scGen predicts single-cell perturbation responses , author=. Nature methods , volume=. 2019 , publisher=

  46. [46]

    Molecular systems biology , volume=

    Predicting cellular responses to complex perturbations in high-throughput screens , author=. Molecular systems biology , volume=

  47. [47]

    Nature Biotechnology , volume=

    Predicting transcriptional outcomes of novel multigene perturbations , author=. Nature Biotechnology , volume=. 2024 , publisher=

  48. [48]

    Advances in Neural Information Processing Systems , volume=

    Modelling cellular perturbations with the sparse additive mechanism shift variational autoencoder , author=. Advances in Neural Information Processing Systems , volume=

  49. [49]

    arXiv preprint arXiv:2506.12439 , year=

    Interpretable Causal Representation Learning for Biological Data in the Pathway Space , author=. arXiv preprint arXiv:2506.12439 , year=

  50. [50]

    bioRxiv , pages=

    Multi-ContrastiveVAE disentangles perturbation effects in single cell images from optical pooled screens , author=. bioRxiv , pages=. 2023 , publisher=

  51. [51]

    International conference on machine learning , pages=

    Interventional causal representation learning , author=. International conference on machine learning , pages=. 2023 , organization=

  52. [52]

    Advances in Neural Information Processing Systems , volume=

    Predicting cellular responses to novel drug perturbations at a single-cell resolution , author=. Advances in Neural Information Processing Systems , volume=

  53. [53]

    GPT-4 Technical Report

    Gpt-4 technical report , author=. arXiv preprint arXiv:2303.08774 , year=

  54. [54]

    IEEE transactions on pattern analysis and machine intelligence , volume=

    Representation learning: A review and new perspectives , author=. IEEE transactions on pattern analysis and machine intelligence , volume=. 2013 , publisher=

  55. [55]

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context , author=. arXiv preprint arXiv:2403.05530 , year=

  56. [56]

    Journal of Machine Learning Research , volume=

    Underspecification presents challenges for credibility in modern machine learning , author=. Journal of Machine Learning Research , volume=

  57. [57]

    Accounts of chemical research , volume=

    The chemical space project , author=. Accounts of chemical research , volume=. 2015 , publisher=

  58. [58]

    Cell , volume=

    Mapping information-rich genotype-phenotype landscapes with genome-scale Perturb-seq , author=. Cell , volume=. 2022 , publisher=

  59. [59]

    International conference on learning representations , year=

    beta-vae: Learning basic visual concepts with a constrained variational framework , author=. International conference on learning representations , year=

  60. [60]

    Learning in graphical models , pages=

    An introduction to variational methods for graphical models , author=. Learning in graphical models , pages=. 1998 , publisher=

  61. [61]

    Journal of the American statistical Association , volume=

    Variational inference: A review for statisticians , author=. Journal of the American statistical Association , volume=. 2017 , publisher=

  62. [62]

    2006 , publisher=

    Pattern recognition and machine learning , author=. 2006 , publisher=

  63. [63]

    arXiv preprint arXiv:2104.08835 , year=

    Crossfit: A few-shot learning challenge for cross-task generalization in nlp , author=. arXiv preprint arXiv:2104.08835 , year=

  64. [64]

    Journal of Machine Learning Research , volume=

    Score-based causal representation learning: Linear and general transformations , author=. Journal of Machine Learning Research , volume=

  65. [65]

    NeurIPS 2022 Workshop on Causality for Real-world Impact , year=

    Learning Causal Representations of Single Cells via Sparse Mechanism Shift Modeling , author=. NeurIPS 2022 Workshop on Causality for Real-world Impact , year=

  66. [66]

    Disentanglement via Mechanism Sparsity Regularization: A New Principle for Nonlinear

    Sebastien Lachapelle and Pau Rodriguez and Yash Sharma and Katie E Everett and R. Disentanglement via Mechanism Sparsity Regularization: A New Principle for Nonlinear. First Conference on Causal Learning and Reasoning , year=

  67. [67]

    Nature Machine Intelligence , volume=

    Shortcut learning in deep neural networks , author=. Nature Machine Intelligence , volume=. 2020 , publisher=

  68. [68]

    Proceedings of the IEEE , volume=

    Toward causal representation learning , author=. Proceedings of the IEEE , volume=. 2021 , publisher=

  69. [69]

    2009 , publisher=

    Causality , author=. 2009 , publisher=

  70. [70]

    Nature , volume=

    Dissecting cell identity via network inference and in silico gene perturbation , author=. Nature , volume=. 2023 , publisher=

  71. [71]

    Annual Review of Statistics and Its Application , volume=

    Causal inference in the social sciences , author=. Annual Review of Statistics and Its Application , volume=. 2024 , publisher=

  72. [72]

    The Econometrics Journal , volume=

    Causal inference and data fusion in econometrics , author=. The Econometrics Journal , volume=. 2025 , publisher=

  73. [73]

    Cell , volume=

    Toward a foundation model of causal cell and tissue biology with a Perturbation Cell and Tissue Atlas , author=. Cell , volume=. 2024 , publisher=

  74. [74]

    Cell , volume=

    How to build the virtual cell with artificial intelligence: Priorities and opportunities , author=. Cell , volume=. 2024 , publisher=

  75. [75]

    Nature biotechnology , volume=

    Combinatorial single-cell CRISPR screens by direct guide RNA capture and targeted sequencing , author=. Nature biotechnology , volume=. 2020 , publisher=

  76. [76]

    Conference on Causal Learning and Reasoning , pages=

    Learning causal representations of single cells via sparse mechanism shift modeling , author=. Conference on Causal Learning and Reasoning , pages=. 2023 , organization=

  77. [77]

    Nature Genetics , pages=

    Causal machine learning for single-cell genomics , author=. Nature Genetics , pages=. 2025 , publisher=

  78. [78]

    arXiv preprint arXiv:2410.03380 , year=

    Identifying biological perturbation targets through causal differential networks , author=. arXiv preprint arXiv:2410.03380 , year=

  79. [79]

    bioRxiv , pages=

    Learning Genetic Perturbation Effects with Variational Causal Inference , author=. bioRxiv , pages=. 2025 , publisher=

  80. [80]

    European Conference on Computer Vision (ECCV) , pages =

    CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts , author =. European Conference on Computer Vision (ECCV) , pages =

Showing first 80 references.