Bayesian Synthesis of Probabilistic Programs for Automatic Data Modeling

Feras A. Saad; Marco F. Cusumano-Towner; Martin C. Rinard; Ulrich Schaechtle; Vikash K. Mansinghka

Bayesian synthesis over domain-specific languages builds probabilistic programs that infer data structure and make accurate predictions.

Reviewed by Pith at T0; open to challenge. T0 means a machine referee read the full paper against a public rubric. the ladder, T0–T4 →

Challenge this review Re-run · record.json Download PDF Read on arXiv ↗

T0 review · grok-4.3

2026-05-24 21:34 UTC pith:YEHHC6E6

load-bearing objection The paper gives a clean new formulation of Bayesian synthesis over PCFG DSLs with soundness results, plus experiments on real data, but leaves the key conditions unverified for the actual grammars used. the 1 major comments →

arxiv 1907.06249 v1 pith:YEHHC6E6 submitted 2019-07-14 cs.PL cs.AIcs.LGstat.CO

Bayesian Synthesis of Probabilistic Programs for Automatic Data Modeling

Feras A. Saad , Marco F. Cusumano-Towner , Ulrich Schaechtle , Martin C. Rinard , Vikash K. Mansinghka This is my paper

classification cs.PL cs.AIcs.LGstat.CO

keywords Bayesian synthesisprobabilistic programsautomatic data modelingprobabilistic context-free grammarstime seriesmultivariate dataprogram synthesis

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

The pith

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a method to automatically construct probabilistic programs for data analysis by using Bayesian inference to select programs from a modeling language that fits the observed data. It gives sufficient conditions for this synthesis procedure to be sound when the language is specified by a probabilistic context-free grammar. The approach is demonstrated on time series and multivariate tabular data, where the resulting programs can be inspected for the probability of qualitative features such as trends or clusters and can be translated into executable models for forecasting new observations. If the method works as described, it replaces manual model selection with an automated search that directly incorporates uncertainty over program structure.

Core claim

Bayesian synthesis identifies a sound procedure for inferring probabilistic programs in domain-specific languages given data. For languages defined by probabilistic context-free grammars, the procedure uses posterior inference over program structures to produce models whose qualitative properties can be read off as probabilities and whose predictions can be obtained by translation into a general-purpose probabilistic programming system.

What carries the argument

Bayesian synthesis, which performs posterior inference over programs in a domain-specific probabilistic modeling language to find those that best explain the observed data.

Load-bearing premise

The chosen domain-specific languages are expressive enough to represent the important features of the data-generating processes that occur in practice.

What would settle it

On held-out real-world time series or tabular datasets, the predictions from the synthesized programs are no more accurate than those from standard baseline methods such as ARIMA or Gaussian processes.

Watch this falsifier — get emailed when new claim-graph text bears on it.

If this is right

Probabilities can be computed for whether the underlying process exhibits specific qualitative properties such as periodicity or clustering.
The synthesized programs can be translated and run in a general probabilistic programming system to forecast new data points.
The same synthesis procedure applies to any domain-specific language whose syntax is captured by a probabilistic context-free grammar.
Empirical results on multiple datasets show accurate recovery of structure and better forecasting performance than conventional methods.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The method could be applied to new data domains by defining additional domain-specific languages that encode the relevant generative assumptions.
Soundness guarantees may help users trust the inferred programs when the grammar accurately reflects the space of plausible models.
The translation step suggests that specialized modeling languages can serve as front-ends that compile into more general inference engines.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit.

Desk Editor's Note

The paper gives a clean new formulation of Bayesian synthesis over PCFG DSLs with soundness results, plus experiments on real data, but leaves the key conditions unverified for the actual grammars used.

read the letter

The main takeaway is that this work supplies a precise Bayesian formulation for synthesizing probabilistic programs from data, along with general algorithms and soundness results for any PCFG-specified DSL that meets a short list of conditions. That formulation and the associated algorithms look like the actual new piece, not just an incremental tweak on prior synthesis work. The experiments then apply the method to time-series and tabular modeling languages, infer qualitative structure from real datasets, and show better forecasting performance than standard baselines after translating the programs into Venture. Those results are concrete and directly tied to the synthesis procedure. The paper does a reasonable job connecting the theory to usable outputs like structure probabilities and predictions. The soft spot is exactly the one flagged in the stress-test note. Soundness is established only when the DSL satisfies the stated conditions (identifiability of the prior, proper support, etc.). The paper never checks whether the concrete time-series and tabular grammars meet those conditions, so the guarantee does not automatically transfer to the reported experiments. That is a genuine gap rather than a minor omission, though it is fixable with a short additional section. The work is aimed at researchers in probabilistic programming and automated statistics who already care about program synthesis for data modeling. A reader in that niche would get value from both the formal setup and the empirical demonstrations. It is coherent on its own terms and shows clear engagement with the relevant literature, so it deserves a serious referee even if the condition-verification issue needs to be resolved in revision. I would send it to peer review.

Referee Report

1 major / 0 minor

Summary. The paper introduces Bayesian synthesis for automatically generating probabilistic programs in domain-specific modeling languages (DSLs) from data, identifies sufficient conditions for soundness of the procedure, proves soundness for DSLs whose syntax is given by probabilistic context-free grammars (PCFGs) meeting those conditions, applies the method to time-series and multivariate tabular DSLs, extracts posterior probabilities over qualitative structural properties, translates the programs to Venture for forecasting, and reports that the approach infers structure accurately on real datasets while outperforming standard methods.

Significance. If the central claims hold, the work provides a principled, sound foundation for automating probabilistic data modeling via program synthesis, with the ability to quantify uncertainty over structural properties and generate predictions. The explicit identification of sufficient conditions and the soundness result for the PCFG class are notable strengths, as is the end-to-end demonstration on real data including translation to a general-purpose PPL.

major comments (1)

[Soundness theorem and experimental sections (application to time-series and tabular DSLs)] The soundness theorem is stated to apply to any PCFG DSL satisfying the listed sufficient conditions (identifiability of the prior, proper support, etc.). However, the manuscript does not verify that the concrete time-series and tabular grammars used for the experimental results satisfy these conditions. Because the experimental claims of accurate qualitative structure inference and outperformance on forecasting rest on the soundness guarantee transferring to these grammars, this verification step is load-bearing for the central claims.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their careful reading and constructive comments. We address the major comment below.

read point-by-point responses

Referee: [Soundness theorem and experimental sections (application to time-series and tabular DSLs)] The soundness theorem is stated to apply to any PCFG DSL satisfying the listed sufficient conditions (identifiability of the prior, proper support, etc.). However, the manuscript does not verify that the concrete time-series and tabular grammars used for the experimental results satisfy these conditions. Because the experimental claims of accurate qualitative structure inference and outperformance on forecasting rest on the soundness guarantee transferring to these grammars, this verification step is load-bearing for the central claims.

Authors: We agree that the manuscript does not explicitly verify that the time-series and tabular grammars satisfy the sufficient conditions for the soundness theorem. This is a valid point, as the experimental claims rely on the theorem applying to these grammars. In the revised manuscript, we will add an explicit verification (in the experimental sections or an appendix) confirming that both grammars meet the conditions, including identifiability of the prior and proper support. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation is self-contained.

full rationale

The paper formulates Bayesian synthesis, identifies sufficient conditions for soundness, derives synthesis algorithms for PCFG-specified DSLs, and proves soundness under those conditions using standard Bayesian principles. Experimental application to time-series and tabular grammars is presented separately from the theorem. No quoted reduction shows any prediction or result equivalent to its inputs by construction, no load-bearing self-citation chain, and no fitted parameter renamed as prediction. The central claims rest on independent mathematical derivation rather than self-referential definitions or data fits.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Based solely on the abstract, no specific free parameters, axioms, or invented entities are identifiable; the work relies on standard Bayesian inference and PCFG formalisms from prior literature.

pith-pipeline@v0.9.0 · 5768 in / 1121 out tokens · 32822 ms · 2026-05-24T21:34:26.658831+00:00 · methodology

0 comments

read the original abstract

We present new techniques for automatically constructing probabilistic programs for data analysis, interpretation, and prediction. These techniques work with probabilistic domain-specific data modeling languages that capture key properties of a broad class of data generating processes, using Bayesian inference to synthesize probabilistic programs in these modeling languages given observed data. We provide a precise formulation of Bayesian synthesis for automatic data modeling that identifies sufficient conditions for the resulting synthesis procedure to be sound. We also derive a general class of synthesis algorithms for domain-specific languages specified by probabilistic context-free grammars and establish the soundness of our approach for these languages. We apply the techniques to automatically synthesize probabilistic programs for time series data and multivariate tabular data. We show how to analyze the structure of the synthesized programs to compute, for key qualitative properties of interest, the probability that the underlying data generating process exhibits each of these properties. Second, we translate probabilistic programs in the domain-specific language into probabilistic programs in Venture, a general-purpose probabilistic programming system. The translated Venture programs are then executed to obtain predictions of new time series data and new multivariate data records. Experimental results show that our techniques can accurately infer qualitative structure in multiple real-world data sets and outperform standard data analysis methods in forecasting and predicting new data.

Figures

Figures reproduced from arXiv: 1907.06249 by Feras A. Saad, Marco F. Cusumano-Towner, Martin C. Rinard, Ulrich Schaechtle, Vikash K. Mansinghka.

**Figure 2.** Figure 2: Components of Bayesian synthesis of probabilistic programs for automatic data modeling. [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗

**Figure 3.** Figure 3: Components of domain-specific language for automatic data modeling of time series using Gaussian [PITH_FULL_IMAGE:figures/full_fig_p017_3.png] view at source ↗

**Figure 4.** Figure 4: Components of domain-specific language for automatic data modeling of multivariate tabular data [PITH_FULL_IMAGE:figures/full_fig_p018_4.png] view at source ↗

**Figure 5.** Figure 5: Examples of mutation operators applied to a mixture modeling DSL program during Bayesian synthesis. [PITH_FULL_IMAGE:figures/full_fig_p019_5.png] view at source ↗

**Figure 6.** Figure 6: Detecting probable temporal structures in multiple time series with varying characteristics. In each of [PITH_FULL_IMAGE:figures/full_fig_p020_6.png] view at source ↗

**Figure 7.** Figure 7: Quantitative evaluation of forecasting using Gaussian process programs from Bayesian synthesis, as [PITH_FULL_IMAGE:figures/full_fig_p021_7.png] view at source ↗

**Figure 8.** Figure 8: Synthesis runtime versus held-out predictive likelihood on the time series benchmarked in Figure [PITH_FULL_IMAGE:figures/full_fig_p022_8.png] view at source ↗

**Figure 9.** Figure 9: Detecting probable predictive relationships between pairs of variables for widely varying dependence [PITH_FULL_IMAGE:figures/full_fig_p023_9.png] view at source ↗

**Figure 10.** Figure 10: Comparing the quality of predictive models discovered by Bayesian synthesis to commonly-used [PITH_FULL_IMAGE:figures/full_fig_p024_10.png] view at source ↗

**Figure 11.** Figure 11: Comparing the error in the predictive probabilities of held-out data according to Bayesian synthesis [PITH_FULL_IMAGE:figures/full_fig_p025_11.png] view at source ↗

discussion (0)

Reference graph

Works this paper leans on

30 extracted references · 30 canonical work pages · 6 internal anchors

[1]

IEEE Trans

Applying probability measures to abstract languages. IEEE Trans. Comput. 22, 5 (1973), 442–450. Matko Bošnak, Pushmeet Kohli, Tejas Kulkrani, Sebastian Riedel, Tim Rocktäschel, Dawn Song, and Robert Zinkov (Eds.)

work page 1973
[2]

A transformation system for developing recursive programs. J. ACM 24, 1 (1977), 44–67. Bob Carpenter, Andrew Gelman, Matthew Hoffman, Daniel Lee, Ben Goodrich, Michael Betancourt, Marcus Brubaker, Jiqiang Guo, Peter Li, and Allen Riddell

work page 1977
[3]

Journal of Statistical Software 76, 1 (2017), 1–32

Stan: A probabilistic programming language. Journal of Statistical Software 76, 1 (2017), 1–32. Sarah Chasins and Phitchaya M. Phothilimthana

work page 2017
[4]

http://archive.ics.uci.edu/ml

UCI Machine Learning Repository. http://archive.ics.uci.edu/ml. (2017). Norman R. Draper and Harry Smith

work page 2017
[5]

Mathematical and Computer Modelling 52, 3 (2010), 490–500

Consistency of stochastic context-free grammars. Mathematical and Computer Modelling 52, 3 (2010), 490–500. Andrew Gelman and Jennifer Hill

work page 2010
[6]

http://dippl.org

The Design and Implementation of Probabilistic Programming Languages. http://dippl.org. (2014). Alex Graves, Greg Wayne, and Ivo Danihelka

work page 2014
[7]

Neural Turing machines. (2014). arXiv:1410.5401 Roger Grosse, Ruslan Salakhutdinov, William Freeman, and Joshua B. Tenenbaum

work page internal anchor Pith review Pith/arXiv arXiv 2014
[8]

Inducing probabilistic programs by Bayesian program merging. (2011). arXiv:1110.5667 Rob Hyndman and Yeasmin Khandakar

work page internal anchor Pith review Pith/arXiv arXiv 2011
[9]

Journal of Statistical Software 27, 3 (2008), 1–22

Automatic time series forecasting: The forecast package for R. Journal of Statistical Software 27, 3 (2008), 1–22. Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani

work page 2008
[10]

Journal of Machine Learning Research 14, Feb (2013), 673–701

Bayesian nonparametric hidden semi-Markov models. Journal of Machine Learning Research 14, Feb (2013), 673–701. John R. Koza

work page 2013
[11]

IEEE Transactions on Evolutionary Computation 1, 2 (1997), 109–128

Automated synthesis of analog electrical circuits by means of genetic programming. IEEE Transactions on Evolutionary Computation 1, 2 (1997), 109–128. Woosuk Lee, Kihong Heo, Rajeev Alur, and Mayur Naik

work page 1997
[12]

https://github.com/jamesrobertlloyd/gpss-research/tree/ master/data

Kernel structure discovery research code. https://github.com/jamesrobertlloyd/gpss-research/tree/ master/data. (2014). James R. Lloyd, David Duvenaud, Roger Grosse, Joshua B. Tenenbaum, and Zoubin Ghahramani

work page 2014
[13]

IEEE Transactions on Software Engineering 5, 4 (1979), 294–328

Synthesis: Dreams to programs. IEEE Transactions on Software Engineering 5, 4 (1979), 294–328. Zohar Manna and Richard Waldinger

work page 1979
[14]

ACM Transactions on Programming Languages and Systems 2, 1 (1980), 90–121

A deductive approach to program synthesis. ACM Transactions on Programming Languages and Systems 2, 1 (1980), 90–121. Vikash Mansinghka, Charles Kemp, Thomas Griffiths, and Joshua B. Tenenbaum

work page 1980
[15]

Venture: A higher-order probabilistic programming platform with programmable inference. (2014). arXiv:1404.0099 Vikash Mansinghka, Patrick Shafto, Eric Jonas, Cap Petschulat, Max Gasner, and Joshua B. Tenenbaum

work page internal anchor Pith review Pith/arXiv arXiv 2014
[16]

Journal of Machine Learning Research 17, 138 (2016), 1–49

CrossCat: A fully Bayesian nonparametric method for analyzing heterogeneous, high dimensional data. Journal of Machine Learning Research 17, 138 (2016), 1–49. Vikash K. Mansinghka, Ulrich Schaechtle, Shivam Handa, Alexey Radul, Yutian Chen, and Martin Rinard

work page 2016
[17]

Journal of Machine Learning Research 12, Oct (2011), 2825–2830

Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12, Oct (2011), 2825–2830. Yura N. Perov and Frank D. Wood

work page 2011
[18]

Learning probabilistic programs. (2014). arXiv:1407.2646 Avi Pfeffer

work page internal anchor Pith review Pith/arXiv arXiv 2014
[19]

Journal of Machine Learning Research 6, Dec (2005), 1939–1959

A unifying view of sparse approximate Gaussian process regression. Journal of Machine Learning Research 6, Dec (2005), 1939–1959. Jeff Racine and Qi Li

work page 2005
[20]

Journal of Econometrics 119, 1 (2004), 99–130

Nonparametric estimation of regression functions with both categorical and continuous data. Journal of Econometrics 119, 1 (2004), 99–130. Carl Rasmussen and Christopher Williams

work page 2004
[21]

Scott Reed and Nando de Freitas

Gaussian processes for machine learning (GPML) toolbox.Journal of Machine Learning Research 11, Nov (2010), 3011–3015. Scott Reed and Nando de Freitas

work page 2010
[22]

Probabilistic data analysis with probabilistic programming. (2016). arXiv:1608.05347 Feras Saad and Vikash Mansinghka

work page internal anchor Pith review Pith/arXiv arXiv 2016
[23]

PeerJ Computer Science 2 (2016), e55

Probabilistic programming in python using PyMC3. PeerJ Computer Science 2 (2016), e55. Eric Schkufza, Rahul Sharma, and Alex Aiken

work page 2016
[24]

Über die Bausteine der mathematischen Logik. Math. Ann. 92 (1924), 305–316. David W. Scott

work page 1924
[25]

The American Statistician 72, 1 (2018), 37–45

Forecasting at scale. The American Statistician 72, 1 (2018), 37–45. Luke Tierney

work page 2018
[26]

The Annals of Statistics 22, 4 (1994), 1701–1728

Markov chains for exploring posterior distributions. The Annals of Statistics 22, 4 (1994), 1701–1728. Anh Tong and Jaesik Choi

work page 1994
[27]

Automatic generation of probabilistic programming from time series data. (2016). arXiv:1607.00710 Dustin Tran, Matthew D. Hoffman, Rif A. Saurous, Eugene Brevdo, Kevin Murphy, and David M. Blei

work page internal anchor Pith review Pith/arXiv arXiv 2016
[28]

In particular, we prove the following lemma: Lemma A.1

Bayesian Synthesis of Probabilistic Programs for Automatic Data Modeling 37:33 A PROOFS FOR GAUSSIAN PROCESS DOMAIN-SPECIFIC LANGUAGE Section 5 of the main text states that the prior and likelihood semantics of the Gaussian process domain-specific language satisfy the theoretical preconditions required for Bayesian synthesis. In particular, we prove the f...

work page 2010
[29]

The conclusion follows from Theorem 4.4 in the main text

37:34 Saad, Cusumano-Towner, Schaechtle, Rinard, and Mansinghka The eigenvalues of M are 0.78512481, −0.67128139, and −0.11384342 and they are all less than 1 in absolute value. The conclusion follows from Theorem 4.4 in the main text. □ Proof for Condition 3.2. The semantic function Lik JKK ((x, y)) is normalized since the ex- pression represents a stand...

work page 1992
[30]

Publication date: January 2019

work page 2019

[1] [1]

IEEE Trans

Applying probability measures to abstract languages. IEEE Trans. Comput. 22, 5 (1973), 442–450. Matko Bošnak, Pushmeet Kohli, Tejas Kulkrani, Sebastian Riedel, Tim Rocktäschel, Dawn Song, and Robert Zinkov (Eds.)

work page 1973

[2] [2]

A transformation system for developing recursive programs. J. ACM 24, 1 (1977), 44–67. Bob Carpenter, Andrew Gelman, Matthew Hoffman, Daniel Lee, Ben Goodrich, Michael Betancourt, Marcus Brubaker, Jiqiang Guo, Peter Li, and Allen Riddell

work page 1977

[3] [3]

Journal of Statistical Software 76, 1 (2017), 1–32

Stan: A probabilistic programming language. Journal of Statistical Software 76, 1 (2017), 1–32. Sarah Chasins and Phitchaya M. Phothilimthana

work page 2017

[4] [4]

http://archive.ics.uci.edu/ml

UCI Machine Learning Repository. http://archive.ics.uci.edu/ml. (2017). Norman R. Draper and Harry Smith

work page 2017

[5] [5]

Mathematical and Computer Modelling 52, 3 (2010), 490–500

Consistency of stochastic context-free grammars. Mathematical and Computer Modelling 52, 3 (2010), 490–500. Andrew Gelman and Jennifer Hill

work page 2010

[6] [6]

http://dippl.org

The Design and Implementation of Probabilistic Programming Languages. http://dippl.org. (2014). Alex Graves, Greg Wayne, and Ivo Danihelka

work page 2014

[7] [7]

Neural Turing machines. (2014). arXiv:1410.5401 Roger Grosse, Ruslan Salakhutdinov, William Freeman, and Joshua B. Tenenbaum

work page internal anchor Pith review Pith/arXiv arXiv 2014

[8] [8]

Inducing probabilistic programs by Bayesian program merging. (2011). arXiv:1110.5667 Rob Hyndman and Yeasmin Khandakar

work page internal anchor Pith review Pith/arXiv arXiv 2011

[9] [9]

Journal of Statistical Software 27, 3 (2008), 1–22

Automatic time series forecasting: The forecast package for R. Journal of Statistical Software 27, 3 (2008), 1–22. Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani

work page 2008

[10] [10]

Journal of Machine Learning Research 14, Feb (2013), 673–701

Bayesian nonparametric hidden semi-Markov models. Journal of Machine Learning Research 14, Feb (2013), 673–701. John R. Koza

work page 2013

[11] [11]

IEEE Transactions on Evolutionary Computation 1, 2 (1997), 109–128

Automated synthesis of analog electrical circuits by means of genetic programming. IEEE Transactions on Evolutionary Computation 1, 2 (1997), 109–128. Woosuk Lee, Kihong Heo, Rajeev Alur, and Mayur Naik

work page 1997

[12] [12]

https://github.com/jamesrobertlloyd/gpss-research/tree/ master/data

Kernel structure discovery research code. https://github.com/jamesrobertlloyd/gpss-research/tree/ master/data. (2014). James R. Lloyd, David Duvenaud, Roger Grosse, Joshua B. Tenenbaum, and Zoubin Ghahramani

work page 2014

[13] [13]

IEEE Transactions on Software Engineering 5, 4 (1979), 294–328

Synthesis: Dreams to programs. IEEE Transactions on Software Engineering 5, 4 (1979), 294–328. Zohar Manna and Richard Waldinger

work page 1979

[14] [14]

ACM Transactions on Programming Languages and Systems 2, 1 (1980), 90–121

A deductive approach to program synthesis. ACM Transactions on Programming Languages and Systems 2, 1 (1980), 90–121. Vikash Mansinghka, Charles Kemp, Thomas Griffiths, and Joshua B. Tenenbaum

work page 1980

[15] [15]

Venture: A higher-order probabilistic programming platform with programmable inference. (2014). arXiv:1404.0099 Vikash Mansinghka, Patrick Shafto, Eric Jonas, Cap Petschulat, Max Gasner, and Joshua B. Tenenbaum

work page internal anchor Pith review Pith/arXiv arXiv 2014

[16] [16]

Journal of Machine Learning Research 17, 138 (2016), 1–49

CrossCat: A fully Bayesian nonparametric method for analyzing heterogeneous, high dimensional data. Journal of Machine Learning Research 17, 138 (2016), 1–49. Vikash K. Mansinghka, Ulrich Schaechtle, Shivam Handa, Alexey Radul, Yutian Chen, and Martin Rinard

work page 2016

[17] [17]

Journal of Machine Learning Research 12, Oct (2011), 2825–2830

Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12, Oct (2011), 2825–2830. Yura N. Perov and Frank D. Wood

work page 2011

[18] [18]

Learning probabilistic programs. (2014). arXiv:1407.2646 Avi Pfeffer

work page internal anchor Pith review Pith/arXiv arXiv 2014

[19] [19]

Journal of Machine Learning Research 6, Dec (2005), 1939–1959

A unifying view of sparse approximate Gaussian process regression. Journal of Machine Learning Research 6, Dec (2005), 1939–1959. Jeff Racine and Qi Li

work page 2005

[20] [20]

Journal of Econometrics 119, 1 (2004), 99–130

Nonparametric estimation of regression functions with both categorical and continuous data. Journal of Econometrics 119, 1 (2004), 99–130. Carl Rasmussen and Christopher Williams

work page 2004

[21] [21]

Scott Reed and Nando de Freitas

Gaussian processes for machine learning (GPML) toolbox.Journal of Machine Learning Research 11, Nov (2010), 3011–3015. Scott Reed and Nando de Freitas

work page 2010

[22] [22]

Probabilistic data analysis with probabilistic programming. (2016). arXiv:1608.05347 Feras Saad and Vikash Mansinghka

work page internal anchor Pith review Pith/arXiv arXiv 2016

[23] [23]

PeerJ Computer Science 2 (2016), e55

Probabilistic programming in python using PyMC3. PeerJ Computer Science 2 (2016), e55. Eric Schkufza, Rahul Sharma, and Alex Aiken

work page 2016

[24] [24]

Über die Bausteine der mathematischen Logik. Math. Ann. 92 (1924), 305–316. David W. Scott

work page 1924

[25] [25]

The American Statistician 72, 1 (2018), 37–45

Forecasting at scale. The American Statistician 72, 1 (2018), 37–45. Luke Tierney

work page 2018

[26] [26]

The Annals of Statistics 22, 4 (1994), 1701–1728

Markov chains for exploring posterior distributions. The Annals of Statistics 22, 4 (1994), 1701–1728. Anh Tong and Jaesik Choi

work page 1994

[27] [27]

Automatic generation of probabilistic programming from time series data. (2016). arXiv:1607.00710 Dustin Tran, Matthew D. Hoffman, Rif A. Saurous, Eugene Brevdo, Kevin Murphy, and David M. Blei

work page internal anchor Pith review Pith/arXiv arXiv 2016

[28] [28]

In particular, we prove the following lemma: Lemma A.1

Bayesian Synthesis of Probabilistic Programs for Automatic Data Modeling 37:33 A PROOFS FOR GAUSSIAN PROCESS DOMAIN-SPECIFIC LANGUAGE Section 5 of the main text states that the prior and likelihood semantics of the Gaussian process domain-specific language satisfy the theoretical preconditions required for Bayesian synthesis. In particular, we prove the f...

work page 2010

[29] [29]

The conclusion follows from Theorem 4.4 in the main text

37:34 Saad, Cusumano-Towner, Schaechtle, Rinard, and Mansinghka The eigenvalues of M are 0.78512481, −0.67128139, and −0.11384342 and they are all less than 1 in absolute value. The conclusion follows from Theorem 4.4 in the main text. □ Proof for Condition 3.2. The semantic function Lik JKK ((x, y)) is normalized since the ex- pression represents a stand...

work page 1992

[30] [30]

Publication date: January 2019

work page 2019