Learning Causal Orderings for In-Context Tabular Prediction
Pith reviewed 2026-05-22 07:19 UTC · model grok-4.3
The pith
A tabular model learns an unsupervised causal ordering of variables and restricts predictions to use only earlier features in that order.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
TabOrder learns a topological ordering of variables in an unsupervised fashion through a likelihood-based objective and then performs prediction and imputation by constraining attention so that each target only receives information from features that precede it in the ordering; the authors show that this recovers accurate orderings, supports both prediction and imputation, and yields interpretable insights on biological data collected under interventions.
What carries the argument
Causal order-constrained attention that bases each prediction solely on variables earlier in a learned topological ordering of the features.
If this is right
- Accurate variable orderings can be recovered even when tabular samples contain missing entries.
- The same learned ordering supports both prediction and imputation tasks without separate training.
- The approach supplies causal insight on real biological data recorded after interventions.
- The likelihood objective for ordering learning is justified under common functional model classes such as additive noise.
Where Pith is reading between the lines
- The same constrained-attention idea could be tested on time-series or graph-structured data where a natural ordering exists.
- Combining the ordering learner with existing causal-discovery algorithms might reduce the need for purely unsupervised likelihood training.
- If the ordering remains stable across multiple interventions, the method could serve as a lightweight way to detect which variables are causes versus effects in new domains.
Load-bearing premise
An optimal causal ordering of the variables exists and can be recovered from unlabeled data by maximizing likelihood under standard functional assumptions, and enforcing that ordering will still help prediction when some samples contain missing values.
What would settle it
On a dataset with known ground-truth causal directions and an intervened test distribution, the learned ordering either fails to match the true directions or produces no improvement in prediction accuracy over an unconstrained baseline.
Figures
read the original abstract
In-context learning for tabular data sets strong predictive standards in observational settings; it however primarily relies on correlational structure, which becomes unreliable under distribution shift or intervention. While established methods to discover causal structure exist, they are often focused on structure identifiability and decoupled from the predictive architectures that could benefit from them. To bridge these perspectives, we study how to simultaneously infer and enforce causal structure in the form of topological variable orderings into tabular prediction. Unlike standard architectures, our model TabOrder uses causal order-constrained attention, basing predictions only on features that precede a target under a learned causal order. Similar to causal discovery methods, TabOrder learns the optimal variable ordering in an unsupervised manner through a likelihood-based objective. We justify this choice under standard functional model classes and also study how sample missingness, a common challenge in tabular data, interacts with causal direction identification. Empirically, we confirm that TabOrder recovers accurate variable orderings while addressing prediction and imputation tasks, as well as gives insight into real-world biological data under intervention.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes TabOrder, a tabular in-context learning model that learns a topological variable ordering in an unsupervised manner via a likelihood objective (justified under standard functional model classes such as additive noise), then enforces this ordering via a causal order-constrained attention mask so that predictions for a target use only preceding features. The work also examines interactions with sample missingness and reports empirical results on recovering accurate orderings, prediction/imputation performance, and insights from real-world biological data under intervention.
Significance. If the central claims hold, the work would usefully integrate causal ordering discovery directly into predictive architectures for tabular data, potentially yielding better robustness to interventions and distribution shift than purely correlational in-context methods. The unsupervised likelihood framing and missingness study address practical tabular challenges, and any reproducible code or falsifiable predictions on biological interventions would strengthen the contribution.
major comments (3)
- [Abstract, §3] Abstract and §3 (method): the justification that the unsupervised likelihood objective recovers causal (rather than merely statistical) orderings under standard functional model classes is stated but not derived or proven; without an explicit identifiability argument or counter-example analysis, it is unclear whether the ordering remains meaningful when the data-generating process deviates modestly from the assumed class, which is load-bearing for the robustness claims.
- [Empirical evaluation] Empirical section (presumably §5 or §6): the abstract asserts that TabOrder 'recovers accurate variable orderings' and addresses prediction/imputation, yet the provided text contains no quantitative metrics, error bars, ablation studies on ordering accuracy versus baselines, or controls for functional-class violations; this absence makes the central empirical claim difficult to evaluate.
- [Missingness study] Missingness interaction study: while the abstract notes examination of how sample missingness interacts with causal direction identification, no specific experimental design, results, or analysis of identifiability failure modes under missingness is visible, which is relevant to the practical tabular setting.
minor comments (2)
- [§3] Notation for the order-constrained attention mask should be introduced with an explicit equation or diagram early in the method section to improve readability.
- [Method] Clarify whether the likelihood objective is computed jointly with the predictive loss or in a separate unsupervised phase, as the current description leaves the training procedure ambiguous.
Simulated Author's Rebuttal
We thank the referee for the constructive comments. We will revise the manuscript to include an explicit identifiability argument, expand quantitative empirical details with metrics and ablations, and elaborate on the missingness experiments and their design. Point-by-point responses follow.
read point-by-point responses
-
Referee: [Abstract, §3] Abstract and §3 (method): the justification that the unsupervised likelihood objective recovers causal (rather than merely statistical) orderings under standard functional model classes is stated but not derived or proven; without an explicit identifiability argument or counter-example analysis, it is unclear whether the ordering remains meaningful when the data-generating process deviates modestly from the assumed class, which is load-bearing for the robustness claims.
Authors: We agree an explicit derivation strengthens the work. The current text justifies the likelihood objective under standard classes such as additive noise models, but the revision will add a theorem and proof sketch in §3 establishing identifiability of the topological ordering. We will also include a short counter-example analysis for modest deviations (e.g., non-additive interactions) to clarify robustness limits. revision: yes
-
Referee: [Empirical evaluation] Empirical section (presumably §5 or §6): the abstract asserts that TabOrder 'recovers accurate variable orderings' and addresses prediction/imputation, yet the provided text contains no quantitative metrics, error bars, ablation studies on ordering accuracy versus baselines, or controls for functional-class violations; this absence makes the central empirical claim difficult to evaluate.
Authors: Section 5 reports quantitative ordering recovery via Kendall tau distance to ground-truth orderings on synthetic data from known DAGs, with means and standard deviations over repeated runs, plus comparisons to random and correlation baselines and tests under mild functional-class violations. To improve visibility we will add error bars to figures, include a dedicated ablation table, and move key metrics into the abstract. revision: partial
-
Referee: [Missingness study] Missingness interaction study: while the abstract notes examination of how sample missingness interacts with causal direction identification, no specific experimental design, results, or analysis of identifiability failure modes under missingness is visible, which is relevant to the practical tabular setting.
Authors: The experiments section describes simulating random and structured missingness, then measuring effects on ordering recovery and imputation. The revision will expand this with a clearer experimental design subsection, quantitative results on specific failure modes (e.g., when missingness correlates with variables), and discussion of identifiability limits under missing data. revision: yes
Circularity Check
No significant circularity; derivation self-contained under stated assumptions
full rationale
The paper derives the variable ordering from an unsupervised likelihood objective justified under standard functional model classes (additive noise etc.), then enforces the resulting order inside the attention mask for downstream prediction and imputation. This chain does not reduce to self-definition or fitted-input-as-prediction because the ordering is obtained from the data-generating likelihood independently of the final predictive loss; the two tasks share the same model but the ordering step is not tautological with the prediction step. No load-bearing self-citations, uniqueness theorems imported from the authors' prior work, or ansatz smuggling are present in the provided text. The approach is therefore self-contained against external benchmarks once the functional-class assumption is granted.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Standard functional model classes allow the likelihood-based objective to recover the correct causal ordering.
- domain assumption Sample missingness interacts with causal direction identification in a manner that still permits useful ordering recovery.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.leanLogicNat recovery and embed_strictMono echoes?
echoesECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.
We justify this choice under standard functional model classes... Additive Noise Model (ANM) regimes... Xi = f(Xpa(i)) + Ni
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
likelihood-based objective that we justify in Additive Noise Model (ANM) regimes under standard assumptions [Bühlmann et al., 2014]
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
P. Langley , title =. Proceedings of the 17th International Conference on Machine Learning (ICML 2000) , address =. 2000 , pages =
work page 2000
-
[2]
T. M. Mitchell. The Need for Biases in Learning Generalizations. 1980
work page 1980
-
[3]
M. J. Kearns , title =
-
[4]
Machine Learning: An Artificial Intelligence Approach, Vol. I. 1983
work page 1983
-
[5]
R. O. Duda and P. E. Hart and D. G. Stork. Pattern Classification. 2000
work page 2000
-
[6]
Suppressed for Anonymity , author=
-
[7]
A. Newell and P. S. Rosenbloom. Mechanisms of Skill Acquisition and the Law of Practice. Cognitive Skills and Their Acquisition. 1981
work page 1981
-
[8]
A. L. Samuel. Some Studies in Machine Learning Using the Game of Checkers. IBM Journal of Research and Development. 1959
work page 1959
-
[9]
Causalpfn: Amortized causal effect estimation via in-context learning
Vahid Meresht Balazadeh, Hamidreza Kamkari, Valentin Thomas, Junwei Ma, Bingru Li, Jesse Cresswell, and Rahul Krishnan. Causalpfn: Amortized causal effect estimation via in-context learning. Advances in Neural Information Processing Systems, 38: 0 154945--154984, 2026
work page 2026
-
[10]
van Rijn, and Joaquin Vanschoren
Bernd Bischl, Giuseppe Casalicchio, Matthias Feurer, Pieter Gijsbers, Frank Hutter, Michel Lang, Rafael Gomes Mantovani, Jan N. van Rijn, and Joaquin Vanschoren. Open ML benchmarking suites. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2), 2021. URL https://openreview.net/forum?id=OCrD8ycKjG
work page 2021
-
[11]
o baum, Dominik Janzing, Takashi Washio, Shohei Shimizu, and Bernhard Sch \
Patrick Bl \"o baum, Dominik Janzing, Takashi Washio, Shohei Shimizu, and Bernhard Sch \"o lkopf. Cause-effect inference by comparing regression errors. In International Conference on Artificial Intelligence and Statistics, pages 900--909. PMLR, 2018
work page 2018
-
[12]
Cam: Causal additive models, high-dimensional order search and penalized regression
Peter B \"u hlmann, Jonas Peters, and Jan Ernest. Cam: Causal additive models, high-dimensional order search and penalized regression. The Annals of Statistics, 42 0 (6): 0 2526--2556, 2014
work page 2014
-
[16]
Openml-ctr23--a curated tabular regression benchmarking suite
Sebastian Felix Fischer, Matthias Feurer, and Bernd Bischl. Openml-ctr23--a curated tabular regression benchmarking suite. In AutoML Conference 2023 (Workshop), 2023
work page 2023
-
[17]
Matrix completion and low-rank svd via fast alternating least squares
Trevor Hastie, Rahul Mazumder, Jason D Lee, and Reza Zadeh. Matrix completion and low-rank svd via fast alternating least squares. The Journal of Machine Learning Research, 16 0 (1): 0 3367--3402, 2015
work page 2015
-
[18]
Tabpfn: A transformer that solves small tabular classification problems in a second
Noah Hollmann, Samuel M \"u ller, Katharina Eggensperger, and Frank Hutter. Tabpfn: A transformer that solves small tabular classification problems in a second. In The Eleventh International Conference on Learning Representations, 2023
work page 2023
-
[19]
u ller, Lennart Purucker, Arjun Krishnakumar, Max K \
Noah Hollmann, Samuel M \"u ller, Lennart Purucker, Arjun Krishnakumar, Max K \"o rfer, Shi Bin Hoo, Robin Tibor Schirrmeister, and Frank Hutter. Accurate predictions on small data with a tabular foundation model. Nature, 637 0 (8045): 0 319--326, 2025
work page 2025
-
[20]
Nonlinear causal discovery with additive noise models
Patrik Hoyer, Dominik Janzing, Joris M Mooij, Jonas Peters, and Bernhard Sch \"o lkopf. Nonlinear causal discovery with additive noise models. Advances in neural information processing systems, 21, 2008
work page 2008
-
[21]
Protein kinase c in t cell activation
Noah Isakov and Amnon Altman. Protein kinase c in t cell activation. Annual Review of Immunology, 2002. URL https://api.semanticscholar.org/CorpusID:82352757
work page 2002
-
[22]
Hyperimpute: Generalized iterative imputation with automatic model selection
Daniel Jarrett, Bogdan C Cebere, Tennison Liu, Alicia Curth, and Mihaela van der Schaar. Hyperimpute: Generalized iterative imputation with automatic model selection. In International Conference on Machine Learning, pages 9916--9937. PMLR, 2022
work page 2022
-
[23]
Learning to induce causal structure
Nan Rosemary Ke, Silvia Chiappa, Jane Wang, Anirudh Goyal, Jorg Bornschein, Melanie Rey, Theophane Weber, Matthew Botvinic, Michael Mozer, and Danilo Jimenez Rezende. Learning to induce causal structure. In International Conference on Learning Representations, 2023. URL https://openreview.net/forum?id=hp_RwhKDJ5
work page 2023
-
[24]
Miracle: Causally-aware imputation via learning missing data mechanisms
Trent Kyono, Yao Zhang, Alexis Bellot, and Mihaela van der Schaar. Miracle: Causally-aware imputation via learning missing data mechanisms. Advances in Neural Information Processing Systems, 34: 0 23806--23817, 2021
work page 2021
-
[25]
Gradient-based neural dag learning
Sébastien Lachapelle, Philippe Brouillard, Tristan Deleu, and Simon Lacoste-Julien. Gradient-based neural dag learning. In International Conference on Learning Representations, 2020
work page 2020
-
[26]
Negative feedback regulation of the erk1/2 mapk pathway
David Lake, Sonia Corrêa, and Jurgen Muller. Negative feedback regulation of the erk1/2 mapk pathway. Cellular and Molecular Life Sciences, 73, 12 2016. doi:10.1007/s00018-016-2297-8
-
[27]
Amortized inference for causal structure learning
Lars Lorch, Scott Sussex, Jonas Rothfuss, Andreas Krause, and Bernhard Sch \"o lkopf. Amortized inference for causal structure learning. Advances in Neural Information Processing Systems, 35: 0 13104--13118, 2022
work page 2022
-
[28]
Tabdpt: Scaling tabular foundation models on real data
Junwei Ma, Valentin Thomas, Rasa Hosseinzadeh, Alex Labach, Jesse Cresswell, Keyvan Golestan, Guangwei Yu, Anthony L Caterini, and Maks Volkovs. Tabdpt: Scaling tabular foundation models on real data. Advances in Neural Information Processing Systems, 38: 0 172692--172722, 2026
work page 2026
-
[29]
Miwae: Deep generative modelling and imputation of incomplete data sets
Pierre-Alexandre Mattei and Jes Frellsen. Miwae: Deep generative modelling and imputation of incomplete data sets. In International conference on machine learning, pages 4413--4423. PMLR, 2019
work page 2019
-
[30]
Scalable causal discovery with score matching
Francesco Montagna, Nicoletta Noceti, Lorenzo Rosasco, Kun Zhang, and Francesco Locatello. Scalable causal discovery with score matching. In Conference on Causal Learning and Reasoning, pages 752--771. PMLR, 2023 a
work page 2023
-
[31]
Causal discovery with score matching on additive models with arbitrary noise
Francesco Montagna, Nicoletta Noceti, Lorenzo Rosasco, Kun Zhang, and Francesco Locatello. Causal discovery with score matching on additive models with arbitrary noise. In Conference on Causal Learning and Reasoning, pages 726--751. PMLR, 2023 b
work page 2023
-
[32]
Demystifying amortized causal discovery with transformers
Francesco Montagna et al. Demystifying amortized causal discovery with transformers. Transactions on Machine Learning Research, 2025
work page 2025
-
[33]
Joint causal inference from multiple contexts
Joris M Mooij, Sara Magliacane, and Tom Claassen. Joint causal inference from multiple contexts. Journal of Machine Learning Research, 21: 0 1--108, 2020
work page 2020
-
[34]
Transformers can do bayesian inference
Samuel M \"u ller, Noah Hollmann, Sebastian Pineda Arango, Josif Grabocka, and Frank Hutter. Transformers can do bayesian inference. In International Conference on Learning Representations, 2022
work page 2022
-
[35]
Causality: Models, Reasoning and Inference
Judea Pearl. Causality: Models, Reasoning and Inference. Cambridge University Press, 2009
work page 2009
-
[36]
Jingang Qu, David Holzm \"u ller, Ga \"e l Varoquaux, and Marine Le Morvan. Tabicl: A tabular foundation model for in-context learning on large data. In ICML 2025-Forty-Second International Conference on Machine Learning, 2025
work page 2025
-
[37]
A scale-invariant sorting criterion to find a causal order in additive noise models
Alexander Reisach, Myriam Tami, Christof Seiler, Antoine Chambaz, and Sebastian Weichwald. A scale-invariant sorting criterion to find a causal order in additive noise models. Advances in Neural Information Processing Systems, 36: 0 785--807, 2023
work page 2023
-
[38]
Do- PFN : In-context learning for causal effect estimation
Jake Robertson, Arik Reuter, Siyuan Guo, Noah Hollmann, Frank Hutter, and Bernhard Sch \"o lkopf. Do- PFN : In-context learning for causal effect estimation. In The Thirty-ninth Annual Conference on Neural Information Processing Systems, 2025. URL https://openreview.net/forum?id=OaNbl9b56B
work page 2025
-
[39]
a us Kleindessner, Chris Russell, Dominik Janzing, Bernhard Sch \
Paul Rolland, Volkan Cevher, Matth \"a us Kleindessner, Chris Russell, Dominik Janzing, Bernhard Sch \"o lkopf, and Francesco Locatello. Score matching enables causal discovery of nonlinear additive noise models. In International Conference on Machine Learning, pages 18741--18753. PMLR, 2022
work page 2022
-
[40]
Causal protein-signaling networks derived from multiparameter single-cell data
Karen Sachs, Omar Perez, Dana Pe'er, Douglas Lauffenburger, and Garry Nolan. Causal protein-signaling networks derived from multiparameter single-cell data. Science, pages 523--9, 2005
work page 2005
-
[41]
A linear non-gaussian acyclic model for causal discovery
Shohei Shimizu, Patrik O Hoyer, Aapo Hyv \"a rinen, Antti Kerminen, and Michael Jordan. A linear non-gaussian acyclic model for causal discovery. Journal of Machine Learning Research, 7 0 (10), 2006
work page 2006
-
[42]
Causation, prediction, and search
Peter Spirtes, Clark Glymour, and Richard Scheines. Causation, prediction, and search. MIT press, 2001
work page 2001
-
[43]
Missforest—non-parametric missing value imputation for mixed-type data
Daniel J Stekhoven and Peter B \"u hlmann. Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics, 28 0 (1): 0 112--118, 2012
work page 2012
-
[44]
mice: Multivariate imputation by chained equations in r
Stef Van Buuren and Karin Groothuis-Oudshoorn. mice: Multivariate imputation by chained equations in r. Journal of statistical software, 45: 0 1--67, 2011
work page 2011
-
[45]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, ukasz Kaiser, and Illia Polosukhin. Attention is all you need. Advances in neural information processing systems, 30, 2017
work page 2017
-
[46]
The phosphatidylinositol 3 kinase akt pathway in human cancer
Igor Vivanco and Charles Sawyers. The phosphatidylinositol 3 kinase akt pathway in human cancer. Nature reviews. Cancer, 2: 0 489--501, 08 2002. doi:10.1038/nrc839
-
[47]
Algorithmic causal structure emerging through compression
Liang Wendong, Simon Buchholz, and Bernhard Sch \"o lkopf. Algorithmic causal structure emerging through compression. In Causal Learning and Reasoning, pages 201--242. PMLR, 2025
work page 2025
-
[48]
Information-theoretic causal discovery in topological order
Sascha Xu, Sarah Mameche, and Jilles Vreeken. Information-theoretic causal discovery in topological order. In International Conference on Artificial Intelligence and Statistics, pages 2008--2016. PMLR, 2025
work page 2008
-
[49]
Gain: Missing data imputation using generative adversarial nets
Jinsung Yoon, James Jordon, and Mihaela Schaar. Gain: Missing data imputation using generative adversarial nets. In International conference on machine learning, pages 5689--5698. PMLR, 2018
work page 2018
-
[50]
On the identifiability of the post-nonlinear causal model
Kun Zhang and Aapo Hyv\" a rinen. On the identifiability of the post-nonlinear causal model. In Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, UAI '09, page 647–655, Arlington, Virginia, USA, 2009. AUAI Press. ISBN 9780974903958
work page 2009
-
[51]
Dags with no tears: Continuous optimization for structure learning
Xun Zheng, Bryon Aragam, Pradeep K Ravikumar, and Eric P Xing. Dags with no tears: Continuous optimization for structure learning. Advances in neural information processing systems, 31, 2018
work page 2018
-
[52]
Advances in neural information processing systems , volume=
Attention is all you need , author=. Advances in neural information processing systems , volume=
-
[53]
International Conference on Learning Representations , year=
Transformers Can Do Bayesian Inference , author=. International Conference on Learning Representations , year=
-
[54]
The Eleventh International Conference on Learning Representations , year=
TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second , author=. The Eleventh International Conference on Learning Representations , year=
-
[55]
arXiv preprint arXiv:2511.07236 , year=
Does TabPFN Understand Causal Structures? , author=. arXiv preprint arXiv:2511.07236 , year=
-
[56]
Accurate predictions on small data with a tabular foundation model , author=. Nature , volume=. 2025 , publisher=
work page 2025
-
[57]
ICML 2025-Forty-Second International Conference on Machine Learning , year=
TabICL: A Tabular Foundation Model for In-Context Learning on Large Data , author=. ICML 2025-Forty-Second International Conference on Machine Learning , year=
work page 2025
-
[58]
The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=
ConTextTab: A Semantics-Aware Tabular In-Context Learner , author=. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=
-
[59]
Advances in Neural Information Processing Systems , volume=
TabDPT: Scaling Tabular Foundation Models on Real Data , author=. Advances in Neural Information Processing Systems , volume=
-
[60]
Jake Robertson and Arik Reuter and Siyuan Guo and Noah Hollmann and Frank Hutter and Bernhard Sch. Do-. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=
-
[61]
Advances in Neural Information Processing Systems , volume=
CausalPFN: Amortized Causal Effect Estimation via In-Context Learning , author=. Advances in Neural Information Processing Systems , volume=
-
[62]
IEEE Transactions on Information Theory , volume=
Causal inference using the algorithmic Markov condition , author=. IEEE Transactions on Information Theory , volume=. 2010 , publisher=
work page 2010
-
[63]
Advances in neural information processing systems , volume=
Nonlinear causal discovery with additive noise models , author=. Advances in neural information processing systems , volume=
-
[64]
International Conference on Artificial Intelligence and Statistics , pages=
Information-Theoretic Causal Discovery in Topological Order , author=. International Conference on Artificial Intelligence and Statistics , pages=. 2025 , organization=
work page 2025
-
[65]
On the identifiability of the post-nonlinear causal model , year =
Zhang, Kun and Hyv\". On the identifiability of the post-nonlinear causal model , year =. Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence , pages =
-
[66]
International Conference on Machine Learning , pages=
On the identifiability and estimation of causal location-scale noise models , author=. International Conference on Machine Learning , pages=. 2023 , organization=
work page 2023
-
[67]
IEEE Transactions on Pattern Analysis and Machine Intelligence , volume=
Causal inference on discrete data using additive noise models , author=. IEEE Transactions on Pattern Analysis and Machine Intelligence , volume=. 2011 , publisher=
work page 2011
-
[68]
Identifiability of cause and effect using regularized regression , author=. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=
-
[69]
Proceedings of Neural Information Processing Systems (NeurIPS) , publisher =
Mameche, Sarah and Kaltenpoth, David and Vreeken, Jilles , title =. Proceedings of Neural Information Processing Systems (NeurIPS) , publisher =
-
[70]
Proceedings of the International Conference on Machine Learning (ICML) , publisher =
Kaltenpoth, David and Vreeken, Jilles , title =. Proceedings of the International Conference on Machine Learning (ICML) , publisher =
-
[71]
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) , publisher =
Kaltenpoth, David and Vreeken, Jilles , title =. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) , publisher =
-
[72]
Causal Learning and Reasoning , pages=
Algorithmic causal structure emerging through compression , author=. Causal Learning and Reasoning , pages=. 2025 , organization=
work page 2025
-
[73]
Proceedings of the International Conference on Machine Learning (ICML) , publisher =
Xu, Sascha and Mian, Osman and Marx, Alexander and Vreeken, Jilles , title =. Proceedings of the International Conference on Machine Learning (ICML) , publisher =
-
[74]
International Conference on Artificial Intelligence and Statistics , pages=
Cause-effect inference by comparing regression errors , author=. International Conference on Artificial Intelligence and Statistics , pages=. 2018 , organization=
work page 2018
-
[75]
Journal of the ACM (JACM) , volume=
A theory of program size formally identical to information theory , author=. Journal of the ACM (JACM) , volume=. 1975 , publisher=
work page 1975
-
[76]
AAAI Workshop on Information-Theoretic Causal Inference and Discovery (ITCI'22) , year=
Formally Justifying MDL-based Inference of Cause and Effect , author=. AAAI Workshop on Information-Theoretic Causal Inference and Discovery (ITCI'22) , year=
-
[77]
The 22nd International Conference on Artificial Intelligence and Statistics , pages=
Testing conditional independence on discrete data using stochastic complexity , author=. The 22nd International Conference on Artificial Intelligence and Statistics , pages=. 2019 , organization=
work page 2019
-
[78]
2018 IEEE International Conference on Data Mining (ICDM) , pages=
Accurate causal inference on discrete data , author=. 2018 IEEE International Conference on Data Mining (ICDM) , pages=. 2018 , organization=
work page 2018
-
[79]
The Minimum Description Length Principle , publisher =. 2007 , author =
work page 2007
-
[80]
Proceedings of the AAAI Conference on Artificial Intelligence , volume=
Causal discovery in Hawkes processes by minimum description length , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=
-
[81]
2017 IEEE international conference on data mining (ICDM) , pages=
Telling cause from effect using MDL-based local and global regression , author=. 2017 IEEE international conference on data mining (ICDM) , pages=. 2017 , organization=
work page 2017
-
[82]
2017 IEEE International Conference on Data Mining (ICDM) , pages=
MDL for causal inference on discrete data , author=. 2017 IEEE International Conference on Data Mining (ICDM) , pages=. 2017 , organization=
work page 2017
-
[83]
Knowledge and Information Systems , volume=
Origo: causal inference by compression , author=. Knowledge and Information Systems , volume=. 2018 , publisher=
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.