Preserving Data Privacy in Learning Causal Structure with Fully Homomorphic Encryption

Jian Yang; Qinbin Li; Xiaofang Zhou; Yuan Tong; Zeyi Wen

arxiv: 2606.05129 · v1 · pith:OPKVFTNFnew · submitted 2026-06-03 · 💻 cs.CR · cs.LG

Preserving Data Privacy in Learning Causal Structure with Fully Homomorphic Encryption

Jian Yang , Yuan Tong , Qinbin Li , Zeyi Wen , Xiaofang Zhou This is my paper

Pith reviewed 2026-06-28 05:20 UTC · model grok-4.3

classification 💻 cs.CR cs.LG

keywords fully homomorphic encryptioncausal structure learningprivacy preservationNewton-Raphson approximationTaylor expansiondifferential privacyencrypted computation

0 comments

The pith

Causal structure learning can run on fully encrypted data with accuracy matching plaintext versions by approximating division and logarithm operations.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes a method for learning causal structures from distributed data while keeping everything encrypted via fully homomorphic encryption. It tackles the barriers of high cost and missing support for division and logs by simplifying circuits, using Newton-Raphson for reciprocal approximation, Taylor expansion for logs, and SIMD batching for speed. The resulting encrypted process yields causal graphs that stay consistent with those from unencrypted data on the tested sets. The same techniques also transfer to differential privacy settings. Overall the approach finishes in tens of minutes under encryption.

Core claim

All steps of causal structure learning can be executed directly on ciphertexts by replacing unsupported operations with Newton-Raphson reciprocal and Taylor expansions, together with circuit simplification and SIMD batching, so that the recovered structures match the plaintext versions in consistency and accuracy.

What carries the argument

Newton-Raphson reciprocal approximation for division and Taylor expansion for logarithm, applied inside a circuit-simplified, SIMD-batched FHE workflow that keeps data encrypted throughout.

If this is right

Causal discovery becomes possible on distributed encrypted data without any party decrypting intermediates.
The same approximation pipeline supports differential privacy as an alternative protection layer.
Learning completes in tens of minutes even under full FHE protection.
High consistency with plaintext results holds across the datasets examined.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Organizations could pool encrypted records for joint causal analysis without exposing raw values.
The method might scale to larger graphs if the approximation error stays bounded as node count grows.
Applying the same approximations to other graph-learning tasks that rely on division and logs could be tested directly.

Load-bearing premise

The numerical approximations for division and logarithm keep enough accuracy that downstream causal discovery still recovers the correct structure.

What would settle it

A dataset where the plaintext algorithm recovers the true causal graph but the FHE version with approximations recovers a different graph would show the accuracy claim fails.

Figures

Figures reproduced from arXiv: 2606.05129 by Jian Yang, Qinbin Li, Xiaofang Zhou, Yuan Tong, Zeyi Wen.

**Figure 1.** Figure 1: shows a causal structure for this traffic prediction example. Each column in the table represents a variable and opposites a node in the causal structure, while each row is a sample that records the values of these variables. In most samples, when Heavy Rain is true, Bad Vision and Slippery Road are true too, indicating Heavy Rain may lead to Bad Vision and Slippery Road. Hence, two edges “Heavy Rain”- “Ba… view at source ↗

**Figure 2.** Figure 2: The pipeline of our proposed privacy-preserving dis [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 4.** Figure 4: Taylor expansion circuit for log(x). 3) Calculation for χ 2 Test: After building 1/x and log(x) circuits, we can implement χ 2 and G2 CI test circuits. To calculate the value of χ 2 , we substitute Equation (3) into Equation (1). If we calculate the value χ 2 first, we need to conduct reciprocal twice. First, we compute 1/N++z, and then we require to calculate 1/Exyz. The complexity of arithmetic grows up … view at source ↗

**Figure 3.** Figure 3: Newton-Raphson reciprocal circuit for 1/x. 2) Arithmetic Circuit for log(x): The calculation of G2 involves computing log Nxyz Exyz which can be approximated by Taylor expansion of log in x0 = 1, because Nxyz/Exyz is around 1. Once we finish these steps, we directly compute the product of log NXyz Exyz with Nxyz and sum the product over every xyz tuple. More specifically, the three-term Taylor expansion fo… view at source ↗

**Figure 5.** Figure 5: The χ 2 Calculation Circuit. G 2 = 2 X x,y,z Nxyz log Nxyz Exyz = 2 X x,y,z Nxyz log Nxyz Nx+zN+yz N++z = 2 X x,y,z Nxyz log NxyzN++z Nx+zN+yz [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗

**Figure 6.** Figure 6: describes details of the arithmetic circuit. N+yz, Nx+z and N++z can be obtained from Nxyz. Then, they are fed into the circuit shown in the figure. After that, we calculate NxyzN++z and 1 Nx+zN+yz simultaneously and multiply them to get NxyzN++z Nx+zN+yz . NxyzN++z Nx+zN+yz is input to the log module and multiply with Nxyz to obtain G2 finally. 2 [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗

**Figure 7.** Figure 7: SHD and CI tests consistency. a separating set, which leads to shorter computation process, which has less precision loss than a longer process. From different tests perspective, the CI tests based on χ 2 is more consistent than those based on G2 in level 0, but worsen at level 1. This is because square operations have better precision and more computation requirement. At level 0, it has high precision due… view at source ↗

**Figure 8.** Figure 8: Time comparison. (a) is the total time for [PITH_FULL_IMAGE:figures/full_fig_p009_8.png] view at source ↗

**Figure 9.** Figure 9: Time and memory for a large batch with G2 and χ 2 . complete running in 6 to 30 minutes under privacy protection. Note that causal structure learning is an offline process, and hence 30 minutes of learning time is often acceptable. TABLE IV: Elapsed time (sec) comparison in χ 2 between our method and the plaintext version. Dataset Plaintext Ours child 0.18 1545 insurance 0.24 1845 water 0.07 698 alarm 0.12… view at source ↗

**Figure 11.** Figure 11: Memory consumption during learning. E. Hyperparameter Study Hyperparameters are crucial configurations in our method. We study three important hyperparameters in this subsection. First, we discuss the effect of batching size on elapsed time and memory usage. Then, we analyze the times of iterations for the Newton-Raphson reciprocal circuit and the series for Taylor expansion. We test the memory usage and … view at source ↗

**Figure 14.** Figure 14: Elapsed time, communication, and memory analysis [PITH_FULL_IMAGE:figures/full_fig_p011_14.png] view at source ↗

**Figure 15.** Figure 15: The SHD in three datasets in different ϵ. H. Experimental Summary Here, we summarize the key findings of our experiment. First of all, Our method achieves high CI test consistency and comparable structure with the plaintext version. Our method can be extended to differential privacy settings, which also achieves good quality precision compared with the plaintext version. Second, 84% of the total time of o… view at source ↗

read the original abstract

Preserving data privacy is an important topic in structural data management and data mining. However, the issue of privacy leakage in distributed causal structure learning is a persistent challenge, especially in cases where data transmission and computation are required. In this paper, we propose a method based on fully homomorphic encryption (FHE) that performs calculations on ciphertexts, keeping data encrypted in transition and computation. Nevertheless, adopting FHE to causal structure learning is challenging due to the high computation cost and limited support on division as well as logarithm operations in FHE. To tackle this challenge, we propose a series of novel techniques including (i) circuit simplification for better efficiency, (ii) approximation of division and logarithm through Newton-Raphson Reciprocal and Taylor expansion, and (iii) a batching technique with SIMD-acceleration to enhance the whole learning process. Additionally, our method can be easily extended beyond FHE by demonstration of its portability to support differential privacy. Empirical results show that our method achieves high consistency and comparable causal structure with the plaintext version in the datasets tested. Last, our method is efficient and practical to complete learning causal structures in tens of minutes even under the privacy protection of FHE.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper shows how to run causal discovery under FHE by approximating the missing operations and reports that the recovered structures stay close to the plaintext versions on their test sets.

read the letter

The core contribution is a set of practical adaptations: circuit simplifications, Newton-Raphson for reciprocals, Taylor expansions for division and log, plus SIMD batching. These let standard causal algorithms run on ciphertexts without changing the high-level workflow. The authors also note the same approximations can be swapped for differential privacy, which is a useful portability point.

The experiments are the strongest part. They claim high consistency and comparable structures to the unencrypted baseline, and they give runtime numbers in the tens of minutes. That directly tests whether the numerical approximations are good enough for the downstream task, which is the obvious risk.

The soft spots are modest but real. The abstract does not list the datasets, sample sizes, or exact metrics, so it is difficult to judge how far the consistency claim travels. Causal discovery methods can be sensitive to small perturbations in conditional independence tests or score functions; without an error-propagation argument or ablation on approximation precision, readers have to take the empirical result at face value. The work also inherits the usual FHE cost and key-size constraints, which the batching mitigates but does not remove.

This is useful reading for anyone who needs to run causal analysis across parties that cannot share raw data. It is not foundational, but it is a concrete engineering step that addresses a recurring practical barrier. The thinking is clear and the empirical check is present, so the paper is worth sending to referees even if revisions will be needed on the evaluation details.

Referee Report

2 major / 1 minor

Summary. The paper proposes an FHE-based approach for privacy-preserving causal structure learning. It tackles FHE limitations on division and logarithm via Newton-Raphson reciprocal approximation and Taylor expansions, plus circuit simplification and SIMD batching for efficiency. The method is claimed to extend to differential privacy, and empirical results assert high consistency with plaintext causal discovery on tested datasets, with runtimes in tens of minutes.

Significance. If the numerical approximations preserve sufficient fidelity for downstream causal algorithms, the work would be a practical contribution to secure distributed causal discovery. The engineering focus on circuit simplification, batching, and portability to DP is a strength, as is the direct empirical test of the core assumption that the approximations do not degrade structure recovery.

major comments (2)

[Abstract / Experiments] Abstract and experimental results: the central claim that the method 'achieves high consistency and comparable causal structure with the plaintext version in the datasets tested' is load-bearing, yet the provided text supplies no datasets, sample sizes, metrics (e.g., SHD, precision/recall on edges), error bars, or ablation on approximation parameters. This prevents evaluation of whether the Newton-Raphson/Taylor approximations meet the weakest-assumption threshold.
[Method (approximations subsection)] Approximation methods: the Newton-Raphson reciprocal and Taylor expansions for division and logarithm lack any reported error bounds, convergence analysis, or propagation study showing that the induced numerical error does not alter the output of the causal discovery procedure. Because the paper's correctness rests on these approximations preserving structure, a concrete sensitivity test (e.g., graph difference under varying iteration counts or polynomial degree) is required.

minor comments (1)

[Implementation details] Notation for the batching and SIMD parameters could be clarified with an explicit table of ciphertext packing sizes and their effect on throughput.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. The comments highlight areas where the manuscript can be strengthened with additional experimental details and analysis. We will revise accordingly and address each point below.

read point-by-point responses

Referee: [Abstract / Experiments] Abstract and experimental results: the central claim that the method 'achieves high consistency and comparable causal structure with the plaintext version in the datasets tested' is load-bearing, yet the provided text supplies no datasets, sample sizes, metrics (e.g., SHD, precision/recall on edges), error bars, or ablation on approximation parameters. This prevents evaluation of whether the Newton-Raphson/Taylor approximations meet the weakest-assumption threshold.

Authors: We agree the manuscript lacks these specifics in the abstract and experimental reporting. In revision we will expand the experiments section to list the exact datasets (standard benchmarks such as Asia and Sachs), sample sizes, quantitative metrics including SHD and edge precision/recall with error bars across runs, and ablations varying approximation parameters (iteration counts, polynomial degrees) to show that structure recovery remains comparable to plaintext. revision: yes
Referee: [Method (approximations subsection)] Approximation methods: the Newton-Raphson reciprocal and Taylor expansions for division and logarithm lack any reported error bounds, convergence analysis, or propagation study showing that the induced numerical error does not alter the output of the causal discovery procedure. Because the paper's correctness rests on these approximations preserving structure, a concrete sensitivity test (e.g., graph difference under varying iteration counts or polynomial degree) is required.

Authors: The current text describes the approximations but omits explicit bounds and sensitivity results. We will add a dedicated analysis subsection reporting error bounds for the Newton-Raphson reciprocal and Taylor series, convergence behavior, and propagation through the causal algorithm. This will include sensitivity experiments measuring graph differences (SHD, edge changes) under different iteration counts and polynomial degrees to confirm the approximations preserve the recovered structure. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper presents an empirical method for causal structure learning under FHE, using circuit simplification, Newton-Raphson reciprocal, Taylor expansions for division/log, and SIMD batching. Its central claim is that the encrypted version achieves high consistency and comparable structure to the plaintext version on tested datasets. This is validated by direct external comparison rather than any internal derivation that reduces to fitted parameters, self-definitions, or self-citation chains. No load-bearing step equates a prediction to its own inputs by construction; the numerical fidelity of the approximations is treated as an empirical question tested outside the method itself.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies insufficient detail to enumerate free parameters or axioms; the method implicitly assumes standard FHE semantic security and that the chosen approximations do not alter causal discovery semantics.

pith-pipeline@v0.9.1-grok · 5748 in / 924 out tokens · 24307 ms · 2026-06-28T05:20:34.507099+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

48 extracted references · 8 canonical work pages · 6 internal anchors

[1]

Probabilistic reasoning in intelligent systems: networks of plausible inference,

J. Pearl, “Probabilistic reasoning in intelligent systems: networks of plausible inference,” 1988

1988
[2]

Citywide traffic volume estimation using trajectory data,

X. Zhan, Y . Zheng, X. Yi, and S. V . Ukkusuri, “Citywide traffic volume estimation using trajectory data,”IEEE Transactions on Knowledge and Data Engineering, vol. 29, no. 2, pp. 272–285, 2016

2016
[3]

A rear-end collision risk evaluation and control scheme using a bayesian network model,

C. Chen, X. Liu, H.-H. Chen, M. Li, and L. Zhao, “A rear-end collision risk evaluation and control scheme using a bayesian network model,” IEEE Transactions on Intelligent Transportation Systems, vol. 20, no. 1, pp. 264–284, 2018

2018
[4]

Darpa’s explainable artificial intelligence (XAI) program,

D. Gunning and D. Aha, “Darpa’s explainable artificial intelligence (XAI) program,”AI Magazine, vol. 40, no. 2, pp. 44–58, 2019

2019
[5]

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead,

C. Rudin, “Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead,”Nature Machine Intelligence, vol. 1, no. 5, pp. 206–215, 2019

2019
[6]

Order-independent constraint- based causal structure learning

D. Colombo, M. H. Maathuiset al., “Order-independent constraint- based causal structure learning.”Journal of Machine Learning Research, vol. 15, no. 1, pp. 3741–3782, 2014

2014
[7]

Inferring gene regulatory networks from gene expression data by path consistency algorithm based on conditional mutual information,

X. Zhang, X.-M. Zhao, K. He, L. Lu, Y . Cao, J. Liu, J.-K. Hao, Z.- P. Liu, and L. Chen, “Inferring gene regulatory networks from gene expression data by path consistency algorithm based on conditional mutual information,”Bioinformatics, vol. 28, no. 1, pp. 98–104, 2012. 12

2012
[8]

Predicting causal effects in large-scale systems from observational data,

M. H. Maathuis, D. Colombo, M. Kalisch, and P. B ¨uhlmann, “Predicting causal effects in large-scale systems from observational data,”Nature methods, vol. 7, no. 4, pp. 247–248, 2010

2010
[9]

Learning Bayesian Networks with the bnlearn R Package

M. Scutari, “Learning Bayesian networks with the bnlearn R package,” arXiv preprint arXiv:0908.3817, 2009

work page internal anchor Pith review Pith/arXiv arXiv 2009
[10]

Tetrad—a toolbox for causal discovery,

J. D. Ramsey, K. Zhang, M. Glymour, R. S. Romero, B. Huang, I. Ebert- Uphoff, S. Samarasinghe, E. A. Barnes, and C. Glymour, “Tetrad—a toolbox for causal discovery,” inInternational Workshop on Climate Informatics, 2018

2018
[11]

Fully homomorphic encryption using ideal lattices,

C. Gentry, “Fully homomorphic encryption using ideal lattices,” in Proceedings of the forty-first annual ACM symposium on Theory of computing, 2009, pp. 169–178

2009
[12]

A review of homomorphic encryption and software tools for encrypted statistical machine learning

L. J. Aslett, P. M. Esperanc ¸a, and C. C. Holmes, “A review of homomorphic encryption and software tools for encrypted statistical machine learning,”arXiv preprint arXiv:1508.06574, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015
[13]

Spirtes, C

P. Spirtes, C. N. Glymour, R. Scheines, and D. Heckerman,Causation, prediction, and search. MIT press, 2000

2000
[14]

Learning high-dimensional directed acyclic graphs with latent and selection variables,

D. Colombo, M. H. Maathuis, M. Kalisch, and T. S. Richardson, “Learning high-dimensional directed acyclic graphs with latent and selection variables,”The Annals of Statistics, pp. 294–321, 2012

2012
[15]

Causal Inference and Causal Explanation with Background Knowledge

C. Meek, “Causal inference and causal explanation with background knowledge,”arXiv preprint arXiv:1302.4972, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013
[16]

cupc: Cuda-based parallel PC algorithm for causal structure learning on gpu,

B. Zarebavani, F. Jafarinejad, M. Hashemi, and S. Salehkaleybar, “cupc: Cuda-based parallel PC algorithm for causal structure learning on gpu,” IEEE Transactions on Parallel and Distributed Systems, vol. 31, no. 3, pp. 530–542, 2019

2019
[17]

Differential privacy,

C. Dwork, “Differential privacy,” inAutomata, Languages and Program- ming: 33rd International Colloquium, ICALP 2006, Venice, Italy, July 10-14, 2006, Proceedings, Part II 33. Springer, 2006, pp. 1–12

2006
[18]

Privacy-preserving deep learning,

R. Shokri and V . Shmatikov, “Privacy-preserving deep learning,” in Proceedings of the 22nd ACM SIGSAC conference on computer and communications security, 2015, pp. 1310–1321

2015
[19]

Deep learning with differential privacy,

M. Abadi, A. Chu, I. Goodfellow, H. B. McMahan, I. Mironov, K. Talwar, and L. Zhang, “Deep learning with differential privacy,” in Proceedings of the 2016 ACM SIGSAC conference on computer and communications security, 2016, pp. 308–318

2016
[20]

The exponential mechanism for social welfare: Private, truthful, and nearly optimal,

Z. Huang and S. Kannan, “The exponential mechanism for social welfare: Private, truthful, and nearly optimal,” in2012 IEEE 53rd Annual Symposium on Foundations of Computer Science. IEEE, 2012, pp. 140– 149

2012
[21]

Somewhat practical fully homomorphic encryption,

J. Fan and F. Vercauteren, “Somewhat practical fully homomorphic encryption,”Cryptology ePrint Archive, 2012

2012
[22]

(leveled) fully ho- momorphic encryption without bootstrapping,

Z. Brakerski, C. Gentry, and V . Vaikuntanathan, “(leveled) fully ho- momorphic encryption without bootstrapping,”ACM Transactions on Computation Theory (TOCT), vol. 6, no. 3, pp. 1–36, 2014

2014
[23]

Homomorphic encryption for arithmetic of approximate numbers,

J. H. Cheon, A. Kim, M. Kim, and Y . Song, “Homomorphic encryption for arithmetic of approximate numbers,” inAdvances in Cryptology– ASIACRYPT 2017: 23rd International Conference on the Theory and Applications of Cryptology and Information Security, Hong Kong, China, December 3-7, 2017, Proceedings, Part I 23. Springer, 2017, pp. 409– 437

2017
[24]

Fully homomorphic encryption over the integers,

M. Van Dijk, C. Gentry, S. Halevi, and V . Vaikuntanathan, “Fully homomorphic encryption over the integers,” inAdvances in Cryptology– EUROCRYPT 2010: 29th Annual International Conference on the The- ory and Applications of Cryptographic Techniques, French Riviera, May 30–June 3, 2010. Proceedings 29. Springer, 2010, pp. 24–43

2010
[25]

Choosing starting values for newton- raphson computation of reciprocals, square-roots and square-root recip- rocals,

P. Kornerup and J.-M. Muller, “Choosing starting values for newton- raphson computation of reciprocals, square-roots and square-root recip- rocals,” Ph.D. dissertation, INRIA, LIP, 2003

2003
[26]

Microsoft SEAL (release 4.1),

“Microsoft SEAL (release 4.1),” https://github.com/Microsoft/SEAL, Jan. 2023, microsoft Research, Redmond, W A

2023
[27]

Eva: An encrypted vector arithmetic language and compiler for efficient homomorphic computation,

R. Dathathri, B. Kostova, O. Saarikivi, W. Dai, K. Laine, and M. Musuvathi, “Eva: An encrypted vector arithmetic language and compiler for efficient homomorphic computation,” inProceedings of the 41st ACM SIGPLAN Conference on Programming Language Design and Implementation, ser. PLDI 2020. New York, NY , USA: Association for Computing Machinery, 2020, p. ...

work page doi:10.1145/3385412.3386023 2020
[28]

Bayesian Network Constraint-Based Structure Learning Algorithms: Parallel and Optimised Implementations in the bnlearn R Package

M. Scutari, “Bayesian network constraint-based structure learning al- gorithms: Parallel and optimised implementations in the bnlearn R package,”arXiv preprint arXiv:1406.7648, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[29]

Generalized hamming distance,

A. Bookstein, V . A. Kulyukin, and T. Raita, “Generalized hamming distance,”Information Retrieval, vol. 5, pp. 353–375, 2002

2002
[30]

Fast parallel bayesian network structure learning,

J. Jiang, Z. Wen, and A. Mian, “Fast parallel bayesian network structure learning,” in2022 IEEE International Parallel and Distributed Process- ing Symposium (IPDPS), 2022, pp. 617–627

2022
[31]

Searching for Bayesian network structures in the space of restricted acyclic partially directed graphs,

S. Acid and L. M. de Campos, “Searching for Bayesian network structures in the space of restricted acyclic partially directed graphs,” Journal of Artificial Intelligence Research, vol. 18, pp. 445–490, 2003

2003
[32]

A Branch-and-Bound Algorithm for MDL Learning Bayesian Networks

J. Tian, “A branch-and-bound algorithm for mdl learning Bayesian networks,”arXiv preprint arXiv:1301.3897, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013
[33]

Learning Bayesian Networks from Incomplete Data with Stochastic Search Algorithms

J. W. Myers, K. B. Laskey, and T. S. Levitt, “Learning Bayesian networks from incomplete data with stochastic search algorithms,”arXiv preprint arXiv:1301.6726, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013
[34]

Efficient and scalable structure learning for bayesian net- works: Algorithms and applications,

R. Zhu, A. Pfadler, Z. Wu, Y . Han, X. Yang, F. Ye, Z. Qian, J. Zhou, and B. Cui, “Efficient and scalable structure learning for bayesian net- works: Algorithms and applications,” in2021 IEEE 37th International Conference on Data Engineering (ICDE). IEEE, 2021, pp. 2613–2624

2021
[35]

Counting unlabeled acyclic digraphs,

R. W. Robinson, “Counting unlabeled acyclic digraphs,” inCombinato- rial Mathematics V. Springer, 1977, pp. 28–43

1977
[36]

Who learns better Bayesian network structures: Accuracy and speed of structure learning algorithms,

M. Scutari, C. E. Graafland, and J. M. Guti ´errez, “Who learns better Bayesian network structures: Accuracy and speed of structure learning algorithms,”International Journal of Approximate Reasoning, vol. 115, pp. 235–253, 2019

2019
[37]

Secure multi-party computation,

O. Goldreich, “Secure multi-party computation,”Manuscript. Prelimi- nary version, vol. 78, no. 110, 1998

1998
[38]

Practical secure aggregation for privacy-preserving machine learning,

K. Bonawitz, V . Ivanov, B. Kreuter, A. Marcedone, H. B. McMahan, S. Patel, D. Ramage, A. Segal, and K. Seth, “Practical secure aggregation for privacy-preserving machine learning,” inproceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, 2017, pp. 1175–1191

2017
[39]

A survey of homomorphic encryption for nonspecialists,

C. Fontaine and F. Galand, “A survey of homomorphic encryption for nonspecialists,”EURASIP Journal on Information Security, vol. 2007, pp. 1–10, 2007

2007
[40]

Multiparty differential privacy via ag- gregation of locally trained classifiers,

M. Pathak, S. Rane, and B. Raj, “Multiparty differential privacy via ag- gregation of locally trained classifiers,”Advances in neural information processing systems, vol. 23, 2010

2010
[41]

Privacy-preserving deep learning via additively homomorphic encryption,

Y . Aono, T. Hayashi, L. Wang, S. Moriaiet al., “Privacy-preserving deep learning via additively homomorphic encryption,”IEEE Transactions on Information Forensics and Security, vol. 13, no. 5, pp. 1333–1345, 2017

2017
[42]

Privlava: synthesizing relational data with foreign keys under differential privacy,

K. Cai, X. Xiao, and G. Cormode, “Privlava: synthesizing relational data with foreign keys under differential privacy,”Proceedings of the ACM on Management of Data, vol. 1, no. 2, pp. 1–25, 2023

2023
[43]

Better than composition: How to answer multiple relational queries under differential privacy,

W. Dong, D. Sun, and K. Yi, “Better than composition: How to answer multiple relational queries under differential privacy,”Proceedings of the ACM on Management of Data, vol. 1, no. 2, pp. 1–26, 2023

2023
[44]

Heda: Multi-attribute unbounded aggregation over homo- morphically encrypted database,

X. Ren, L. Su, Z. Gu, S. Wang, F. Li, Y . Xie, S. Bian, C. Li, and F. Zhang, “Heda: Multi-attribute unbounded aggregation over homo- morphically encrypted database,”Proceedings of the VLDB Endowment, vol. 16, no. 4, pp. 601–614, 2022

2022
[45]

Toward efficient homo- morphic encryption for outsourced databases through parallel caching,

O. T. Tawose, J. Dai, L. Yang, and D. Zhao, “Toward efficient homo- morphic encryption for outsourced databases through parallel caching,” Proceedings of the ACM on Management of Data, vol. 1, no. 1, pp. 1–23, 2023

2023
[46]

Batchcrypt: Efficient homomorphic encryption for cross-silo federated learning,

C. Zhang, S. Li, J. Xia, W. Wang, F. Yan, and Y . Liu, “Batchcrypt: Efficient homomorphic encryption for cross-silo federated learning,” inProceedings of the 2020 USENIX Annual Technical Conference (USENIX ATC 2020), 2020

2020
[47]

Towards federated bayesian network structure learning with continuous optimization,

I. Ng and K. Zhang, “Towards federated bayesian network structure learning with continuous optimization,” inInternational Conference on Artificial Intelligence and Statistics. PMLR, 2022, pp. 8095–8111

2022
[48]

Towards privacy- aware causal structure learning in federated setting,

J. Huang, K. Yu, X. Guo, F. Cao, and J. Liang, “Towards privacy- aware causal structure learning in federated setting,”arXiv preprint arXiv:2211.06919, 2022. 13

work page arXiv 2022

[1] [1]

Probabilistic reasoning in intelligent systems: networks of plausible inference,

J. Pearl, “Probabilistic reasoning in intelligent systems: networks of plausible inference,” 1988

1988

[2] [2]

Citywide traffic volume estimation using trajectory data,

X. Zhan, Y . Zheng, X. Yi, and S. V . Ukkusuri, “Citywide traffic volume estimation using trajectory data,”IEEE Transactions on Knowledge and Data Engineering, vol. 29, no. 2, pp. 272–285, 2016

2016

[3] [3]

A rear-end collision risk evaluation and control scheme using a bayesian network model,

C. Chen, X. Liu, H.-H. Chen, M. Li, and L. Zhao, “A rear-end collision risk evaluation and control scheme using a bayesian network model,” IEEE Transactions on Intelligent Transportation Systems, vol. 20, no. 1, pp. 264–284, 2018

2018

[4] [4]

Darpa’s explainable artificial intelligence (XAI) program,

D. Gunning and D. Aha, “Darpa’s explainable artificial intelligence (XAI) program,”AI Magazine, vol. 40, no. 2, pp. 44–58, 2019

2019

[5] [5]

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead,

C. Rudin, “Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead,”Nature Machine Intelligence, vol. 1, no. 5, pp. 206–215, 2019

2019

[6] [6]

Order-independent constraint- based causal structure learning

D. Colombo, M. H. Maathuiset al., “Order-independent constraint- based causal structure learning.”Journal of Machine Learning Research, vol. 15, no. 1, pp. 3741–3782, 2014

2014

[7] [7]

Inferring gene regulatory networks from gene expression data by path consistency algorithm based on conditional mutual information,

X. Zhang, X.-M. Zhao, K. He, L. Lu, Y . Cao, J. Liu, J.-K. Hao, Z.- P. Liu, and L. Chen, “Inferring gene regulatory networks from gene expression data by path consistency algorithm based on conditional mutual information,”Bioinformatics, vol. 28, no. 1, pp. 98–104, 2012. 12

2012

[8] [8]

Predicting causal effects in large-scale systems from observational data,

M. H. Maathuis, D. Colombo, M. Kalisch, and P. B ¨uhlmann, “Predicting causal effects in large-scale systems from observational data,”Nature methods, vol. 7, no. 4, pp. 247–248, 2010

2010

[9] [9]

Learning Bayesian Networks with the bnlearn R Package

M. Scutari, “Learning Bayesian networks with the bnlearn R package,” arXiv preprint arXiv:0908.3817, 2009

work page internal anchor Pith review Pith/arXiv arXiv 2009

[10] [10]

Tetrad—a toolbox for causal discovery,

J. D. Ramsey, K. Zhang, M. Glymour, R. S. Romero, B. Huang, I. Ebert- Uphoff, S. Samarasinghe, E. A. Barnes, and C. Glymour, “Tetrad—a toolbox for causal discovery,” inInternational Workshop on Climate Informatics, 2018

2018

[11] [11]

Fully homomorphic encryption using ideal lattices,

C. Gentry, “Fully homomorphic encryption using ideal lattices,” in Proceedings of the forty-first annual ACM symposium on Theory of computing, 2009, pp. 169–178

2009

[12] [12]

A review of homomorphic encryption and software tools for encrypted statistical machine learning

L. J. Aslett, P. M. Esperanc ¸a, and C. C. Holmes, “A review of homomorphic encryption and software tools for encrypted statistical machine learning,”arXiv preprint arXiv:1508.06574, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015

[13] [13]

Spirtes, C

P. Spirtes, C. N. Glymour, R. Scheines, and D. Heckerman,Causation, prediction, and search. MIT press, 2000

2000

[14] [14]

Learning high-dimensional directed acyclic graphs with latent and selection variables,

D. Colombo, M. H. Maathuis, M. Kalisch, and T. S. Richardson, “Learning high-dimensional directed acyclic graphs with latent and selection variables,”The Annals of Statistics, pp. 294–321, 2012

2012

[15] [15]

Causal Inference and Causal Explanation with Background Knowledge

C. Meek, “Causal inference and causal explanation with background knowledge,”arXiv preprint arXiv:1302.4972, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013

[16] [16]

cupc: Cuda-based parallel PC algorithm for causal structure learning on gpu,

B. Zarebavani, F. Jafarinejad, M. Hashemi, and S. Salehkaleybar, “cupc: Cuda-based parallel PC algorithm for causal structure learning on gpu,” IEEE Transactions on Parallel and Distributed Systems, vol. 31, no. 3, pp. 530–542, 2019

2019

[17] [17]

Differential privacy,

C. Dwork, “Differential privacy,” inAutomata, Languages and Program- ming: 33rd International Colloquium, ICALP 2006, Venice, Italy, July 10-14, 2006, Proceedings, Part II 33. Springer, 2006, pp. 1–12

2006

[18] [18]

Privacy-preserving deep learning,

R. Shokri and V . Shmatikov, “Privacy-preserving deep learning,” in Proceedings of the 22nd ACM SIGSAC conference on computer and communications security, 2015, pp. 1310–1321

2015

[19] [19]

Deep learning with differential privacy,

M. Abadi, A. Chu, I. Goodfellow, H. B. McMahan, I. Mironov, K. Talwar, and L. Zhang, “Deep learning with differential privacy,” in Proceedings of the 2016 ACM SIGSAC conference on computer and communications security, 2016, pp. 308–318

2016

[20] [20]

The exponential mechanism for social welfare: Private, truthful, and nearly optimal,

Z. Huang and S. Kannan, “The exponential mechanism for social welfare: Private, truthful, and nearly optimal,” in2012 IEEE 53rd Annual Symposium on Foundations of Computer Science. IEEE, 2012, pp. 140– 149

2012

[21] [21]

Somewhat practical fully homomorphic encryption,

J. Fan and F. Vercauteren, “Somewhat practical fully homomorphic encryption,”Cryptology ePrint Archive, 2012

2012

[22] [22]

(leveled) fully ho- momorphic encryption without bootstrapping,

Z. Brakerski, C. Gentry, and V . Vaikuntanathan, “(leveled) fully ho- momorphic encryption without bootstrapping,”ACM Transactions on Computation Theory (TOCT), vol. 6, no. 3, pp. 1–36, 2014

2014

[23] [23]

Homomorphic encryption for arithmetic of approximate numbers,

J. H. Cheon, A. Kim, M. Kim, and Y . Song, “Homomorphic encryption for arithmetic of approximate numbers,” inAdvances in Cryptology– ASIACRYPT 2017: 23rd International Conference on the Theory and Applications of Cryptology and Information Security, Hong Kong, China, December 3-7, 2017, Proceedings, Part I 23. Springer, 2017, pp. 409– 437

2017

[24] [24]

Fully homomorphic encryption over the integers,

M. Van Dijk, C. Gentry, S. Halevi, and V . Vaikuntanathan, “Fully homomorphic encryption over the integers,” inAdvances in Cryptology– EUROCRYPT 2010: 29th Annual International Conference on the The- ory and Applications of Cryptographic Techniques, French Riviera, May 30–June 3, 2010. Proceedings 29. Springer, 2010, pp. 24–43

2010

[25] [25]

Choosing starting values for newton- raphson computation of reciprocals, square-roots and square-root recip- rocals,

P. Kornerup and J.-M. Muller, “Choosing starting values for newton- raphson computation of reciprocals, square-roots and square-root recip- rocals,” Ph.D. dissertation, INRIA, LIP, 2003

2003

[26] [26]

Microsoft SEAL (release 4.1),

“Microsoft SEAL (release 4.1),” https://github.com/Microsoft/SEAL, Jan. 2023, microsoft Research, Redmond, W A

2023

[27] [27]

Eva: An encrypted vector arithmetic language and compiler for efficient homomorphic computation,

R. Dathathri, B. Kostova, O. Saarikivi, W. Dai, K. Laine, and M. Musuvathi, “Eva: An encrypted vector arithmetic language and compiler for efficient homomorphic computation,” inProceedings of the 41st ACM SIGPLAN Conference on Programming Language Design and Implementation, ser. PLDI 2020. New York, NY , USA: Association for Computing Machinery, 2020, p. ...

work page doi:10.1145/3385412.3386023 2020

[28] [28]

Bayesian Network Constraint-Based Structure Learning Algorithms: Parallel and Optimised Implementations in the bnlearn R Package

M. Scutari, “Bayesian network constraint-based structure learning al- gorithms: Parallel and optimised implementations in the bnlearn R package,”arXiv preprint arXiv:1406.7648, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[29] [29]

Generalized hamming distance,

A. Bookstein, V . A. Kulyukin, and T. Raita, “Generalized hamming distance,”Information Retrieval, vol. 5, pp. 353–375, 2002

2002

[30] [30]

Fast parallel bayesian network structure learning,

J. Jiang, Z. Wen, and A. Mian, “Fast parallel bayesian network structure learning,” in2022 IEEE International Parallel and Distributed Process- ing Symposium (IPDPS), 2022, pp. 617–627

2022

[31] [31]

Searching for Bayesian network structures in the space of restricted acyclic partially directed graphs,

S. Acid and L. M. de Campos, “Searching for Bayesian network structures in the space of restricted acyclic partially directed graphs,” Journal of Artificial Intelligence Research, vol. 18, pp. 445–490, 2003

2003

[32] [32]

A Branch-and-Bound Algorithm for MDL Learning Bayesian Networks

J. Tian, “A branch-and-bound algorithm for mdl learning Bayesian networks,”arXiv preprint arXiv:1301.3897, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013

[33] [33]

Learning Bayesian Networks from Incomplete Data with Stochastic Search Algorithms

J. W. Myers, K. B. Laskey, and T. S. Levitt, “Learning Bayesian networks from incomplete data with stochastic search algorithms,”arXiv preprint arXiv:1301.6726, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013

[34] [34]

Efficient and scalable structure learning for bayesian net- works: Algorithms and applications,

R. Zhu, A. Pfadler, Z. Wu, Y . Han, X. Yang, F. Ye, Z. Qian, J. Zhou, and B. Cui, “Efficient and scalable structure learning for bayesian net- works: Algorithms and applications,” in2021 IEEE 37th International Conference on Data Engineering (ICDE). IEEE, 2021, pp. 2613–2624

2021

[35] [35]

Counting unlabeled acyclic digraphs,

R. W. Robinson, “Counting unlabeled acyclic digraphs,” inCombinato- rial Mathematics V. Springer, 1977, pp. 28–43

1977

[36] [36]

Who learns better Bayesian network structures: Accuracy and speed of structure learning algorithms,

M. Scutari, C. E. Graafland, and J. M. Guti ´errez, “Who learns better Bayesian network structures: Accuracy and speed of structure learning algorithms,”International Journal of Approximate Reasoning, vol. 115, pp. 235–253, 2019

2019

[37] [37]

Secure multi-party computation,

O. Goldreich, “Secure multi-party computation,”Manuscript. Prelimi- nary version, vol. 78, no. 110, 1998

1998

[38] [38]

Practical secure aggregation for privacy-preserving machine learning,

K. Bonawitz, V . Ivanov, B. Kreuter, A. Marcedone, H. B. McMahan, S. Patel, D. Ramage, A. Segal, and K. Seth, “Practical secure aggregation for privacy-preserving machine learning,” inproceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, 2017, pp. 1175–1191

2017

[39] [39]

A survey of homomorphic encryption for nonspecialists,

C. Fontaine and F. Galand, “A survey of homomorphic encryption for nonspecialists,”EURASIP Journal on Information Security, vol. 2007, pp. 1–10, 2007

2007

[40] [40]

Multiparty differential privacy via ag- gregation of locally trained classifiers,

M. Pathak, S. Rane, and B. Raj, “Multiparty differential privacy via ag- gregation of locally trained classifiers,”Advances in neural information processing systems, vol. 23, 2010

2010

[41] [41]

Privacy-preserving deep learning via additively homomorphic encryption,

Y . Aono, T. Hayashi, L. Wang, S. Moriaiet al., “Privacy-preserving deep learning via additively homomorphic encryption,”IEEE Transactions on Information Forensics and Security, vol. 13, no. 5, pp. 1333–1345, 2017

2017

[42] [42]

Privlava: synthesizing relational data with foreign keys under differential privacy,

K. Cai, X. Xiao, and G. Cormode, “Privlava: synthesizing relational data with foreign keys under differential privacy,”Proceedings of the ACM on Management of Data, vol. 1, no. 2, pp. 1–25, 2023

2023

[43] [43]

Better than composition: How to answer multiple relational queries under differential privacy,

W. Dong, D. Sun, and K. Yi, “Better than composition: How to answer multiple relational queries under differential privacy,”Proceedings of the ACM on Management of Data, vol. 1, no. 2, pp. 1–26, 2023

2023

[44] [44]

Heda: Multi-attribute unbounded aggregation over homo- morphically encrypted database,

X. Ren, L. Su, Z. Gu, S. Wang, F. Li, Y . Xie, S. Bian, C. Li, and F. Zhang, “Heda: Multi-attribute unbounded aggregation over homo- morphically encrypted database,”Proceedings of the VLDB Endowment, vol. 16, no. 4, pp. 601–614, 2022

2022

[45] [45]

Toward efficient homo- morphic encryption for outsourced databases through parallel caching,

O. T. Tawose, J. Dai, L. Yang, and D. Zhao, “Toward efficient homo- morphic encryption for outsourced databases through parallel caching,” Proceedings of the ACM on Management of Data, vol. 1, no. 1, pp. 1–23, 2023

2023

[46] [46]

Batchcrypt: Efficient homomorphic encryption for cross-silo federated learning,

C. Zhang, S. Li, J. Xia, W. Wang, F. Yan, and Y . Liu, “Batchcrypt: Efficient homomorphic encryption for cross-silo federated learning,” inProceedings of the 2020 USENIX Annual Technical Conference (USENIX ATC 2020), 2020

2020

[47] [47]

Towards federated bayesian network structure learning with continuous optimization,

I. Ng and K. Zhang, “Towards federated bayesian network structure learning with continuous optimization,” inInternational Conference on Artificial Intelligence and Statistics. PMLR, 2022, pp. 8095–8111

2022

[48] [48]

Towards privacy- aware causal structure learning in federated setting,

J. Huang, K. Yu, X. Guo, F. Cao, and J. Liang, “Towards privacy- aware causal structure learning in federated setting,”arXiv preprint arXiv:2211.06919, 2022. 13

work page arXiv 2022