Compositional Inference Metaprogramming with Convergence Guarantees

Martin Rinard; Shivam Handa; Vikash Mansinghka

arxiv: 1907.05451 · v2 · pith:JKTXZUTQnew · submitted 2019-07-11 · 💻 cs.PL

Compositional Inference Metaprogramming with Convergence Guarantees

Shivam Handa , Vikash Mansinghka , Martin Rinard This is my paper

Pith reviewed 2026-05-24 22:25 UTC · model grok-4.3

classification 💻 cs.PL

keywords probabilistic programminginference metaprogrammingMarkov chain Monte Carloasymptotic convergencehybrid inferencesubproblem decomposition

0 comments

The pith

Hybrid probabilistic inference algorithms that apply separate MCMC kernels to independent subproblems of a program converge asymptotically to the target distribution.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a mathematical framework for hybrid inference that decomposes probabilistic program executions into subproblems and applies different Markov-Chain Monte Carlo algorithms to each. It distinguishes independent subproblem inference, where each kernel operates without reference to the full program trace, from entangled approaches. The authors prove that when subproblems meet an independence condition, the composed algorithm preserves the stationary distribution and ergodicity needed for convergence. This supplies the first asymptotic convergence guarantee for inference algorithms defined through subproblem-based metaprogramming.

Core claim

By formalizing independent subproblem inference, the paper shows that hybrid MCMC algorithms defined by metaprogramming can be composed from separate kernels while still satisfying the conditions for asymptotic convergence to the correct posterior distribution.

What carries the argument

The formalism of independent subproblem inference, which separates the action of each MCMC kernel from the full program trace so that kernels can be composed without losing overall stationarity or ergodicity.

If this is right

Metaprogrammed hybrid inference algorithms can be deployed on suitable decompositions with a provable guarantee of asymptotic correctness.
Different MCMC methods can be mixed across program subproblems without invalidating convergence.
Convergence analysis extends directly to programs that use inference metaprogramming constructs.
The same composition rules apply to any collection of kernels that individually satisfy standard MCMC conditions on their subproblems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The framework may support automated static checks that verify whether a user-supplied subproblem decomposition meets the independence condition.
Similar compositional arguments could be applied to other modular inference settings such as sequential Monte Carlo or variational methods on factored models.
If the independence condition can be checked efficiently, it would allow safe runtime selection among alternative inference strategies.

Load-bearing premise

The chosen decomposition into subproblems must satisfy an independence condition that lets the separate MCMC kernels compose while preserving the overall stationary distribution and ergodicity.

What would settle it

An explicit probabilistic program together with a subproblem decomposition obeying the independence condition for which the composed MCMC chain fails to converge to the target distribution.

Figures

Figures reproduced from arXiv: 1907.05451 by Martin Rinard, Shivam Handa, Vikash Mansinghka.

**Figure 1.** Figure 1: Probabilistic Lambda Calculus Dist(e) can be seen as a set of probabilistic lambda calculus expressions {ed |ed ∈ Dist(e) ⊆ Ev }. Based on the parameter expression e, Dist(e) makes a stochastic choice and returns an expression ev ∈ Dist(e). We define: Dist(e)[x/y] = Dist′ (e[x/y]) = {ed [x/y]|ed ∈ Dist(e[x/y])} FreeVariables(Dist(e)) = Ø ed ∈Dist(e)∪{e } FreeVariables(ed ) Traces: When a program executes, … view at source ↗

**Figure 2.** Figure 2: Traces We define the execution, including the generation of valid traces t, with the transition relation ⇒s⊆ Σv × Σid ×P → T ( [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Valid Traces [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Rolling back Traces to Probabilistic Program [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 5.** Figure 5: Dependency Graph for a Trace t [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗

**Figure 6.** Figure 6: Equivalence Check for correct Inference Wingate et al. 2011], ts and t ′ s are valid traces of the same program ps (the subprogram for the subtracests and t ′ s ). The key challenge is converting the entangled subproblem (which is typically incomplete and therefore not a valid trace of any program) into a valid trace by transforming the subproblem to include external dependences and correctly scope both in… view at source ↗

**Figure 7.** Figure 7: Extraction Relation augmented expressions which are not within the subproblem and converts them into observe statements. This transformation constrains the value of these stochastic choices to the values present in the original trace. It leaves the stochastic choices in the subproblem in place and therefore accessible to the inference algorithm. For augmented expressions of the form (ae1 ae2)y = ae3, whe… view at source ↗

**Figure 8.** Figure 8: Stitching Transition Relation within the subproblem. If we keep the augmented expression as is, the inference algorithm may unroll ae3 and execute it again, changing some stochastic choices in ae3 but not in the subproblem. If we modify ae3, the constraint of ae3 being a valid lambda application breaks. We solve this problem by introducing assume statements and correctly scoping the resulting dependences … view at source ↗

**Figure 9.** Figure 9: Entangled vs Independent Subproblem Inference [PITH_FULL_IMAGE:figures/full_fig_p012_9.png] view at source ↗

**Figure 10.** Figure 10: Probabilistic measure over traces We use the shorthand SS ⊢ t ≡ t ′ to denote SS(t) ⊢ t ≡ t ′ . Theorem 6. A reversible subproblem selection strategy SS divides the trace space of program p into equivalence classes. We present the proof of this theorem in Appendix A.4 (Theorem 15). A reversible subproblem selection strategy SS divides the trace spaceTp into a countable number of equivalence classes where … view at source ↗

**Figure 11.** Figure 11: Inference Metaprogramming language Proof. ExtractTrace is one-to-one function from Tci to Tps and the push forward measure of vi(ci , .) is µps . Definition 25 (Generalized Markov Kernel). A Generalized Markov Kernel K is a parameterized Markov kernel which, when parameterized with a probabilistic programp, defines the probability space (Tp , Σp, µp ) (as defined above), and returns a Markov Kernel K(p) … view at source ↗

**Figure 12.** Figure 12: Execution Semantics for Inference Metaprograms [PITH_FULL_IMAGE:figures/full_fig_p024_12.png] view at source ↗

read the original abstract

Inference metaprogramming enables effective probabilistic programming by supporting the decomposition of executions of probabilistic programs into subproblems and the deployment of hybrid probabilistic inference algorithms that apply different probabilistic inference algorithms to different subproblems. We introduce the concept of independent subproblem inference (as opposed to entangled subproblem inference in which the subproblem inference algorithm operates over the full program trace) and present a mathematical framework for studying convergence properties of hybrid inference algorithms that apply different Markov-Chain Monte Carlo algorithms to different parts of the inference problem. We then use this formalism to prove asymptotic convergence results for probablistic programs with inference metaprogramming. To the best of our knowledge this is the first asymptotic convergence result for hybrid probabilistic inference algorithms defined by (subproblem-based) inference metaprogramming.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper claims the first convergence theorem for hybrid MCMC built by metaprogramming on independent subproblems, but the result stands or falls on whether the decomposition actually satisfies the independence condition needed for kernel composition.

read the letter

This paper introduces a distinction between independent and entangled subproblem inference and supplies a framework to prove that composing separate MCMC kernels on independent subproblems yields an overall chain that converges asymptotically to the target posterior. The main new element is the formal treatment of inference metaprogramming as a way to define hybrid algorithms and the claim that this is the first asymptotic convergence result for that style of construction. It gives a clean way to talk about when you can safely apply different inference methods to different parts of a program without breaking the stationary distribution or ergodicity properties. That is useful for anyone who already decomposes models this way in practice. The central assumption is that the subproblem split must be independent enough for each kernel to leave its marginal invariant and for the product measure to be preserved. When that holds, the composition argument goes through; when it does not, the guarantee does not apply, which the abstract itself flags by contrasting the two cases. Without the full derivation it is hard to judge how restrictive the independence condition turns out to be in realistic programs or whether hidden assumptions appear later in the proof. The work is aimed at people working on probabilistic programming systems and MCMC theory who need grounding for compositional techniques. It is worth sending to referees because it puts a concrete mathematical claim on the table that can be checked or extended, even if the independence requirement limits its scope.

Referee Report

2 major / 1 minor

Summary. The paper introduces the concept of independent subproblem inference (contrasted with entangled inference) for probabilistic programs, presents a mathematical framework for analyzing convergence properties of hybrid MCMC algorithms that apply different kernels to different subproblems, and uses this framework to prove asymptotic convergence results for inference metaprogramming. It claims this is the first such convergence result for subproblem-based hybrid algorithms.

Significance. If the central theorem holds, the framework would supply the first asymptotic convergence guarantees for compositional hybrid inference algorithms in probabilistic programming, strengthening the theoretical foundation for metaprogrammed inference and enabling safer deployment of mixed MCMC strategies on decomposed programs.

major comments (2)

[Main convergence theorem / framework definition] The convergence theorem depends on an independence condition for subproblem decompositions that ensures each subproblem MCMC kernel leaves the appropriate marginal invariant and that the composed kernel preserves the overall stationary distribution and ergodicity. The manuscript must explicitly define this condition (likely in the framework section) and demonstrate that it holds for the program decompositions to which the theorem is applied; without this, the invariance and convergence claims do not follow.
[Framework / Definitions] The distinction between independent and entangled subproblem inference must be formalized with precise statements of how the product measure is preserved under kernel composition. If the paper only assumes independence without deriving the required invariance from the program semantics, the reduction from the hybrid algorithm to the target posterior is incomplete.

minor comments (1)

[Abstract] Abstract contains the typo 'probablistic' (should be 'probabilistic').

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful review and for highlighting opportunities to strengthen the clarity of our framework. We address each major comment below and will revise the manuscript to make the required definitions and derivations explicit.

read point-by-point responses

Referee: [Main convergence theorem / framework definition] The convergence theorem depends on an independence condition for subproblem decompositions that ensures each subproblem MCMC kernel leaves the appropriate marginal invariant and that the composed kernel preserves the overall stationary distribution and ergodicity. The manuscript must explicitly define this condition (likely in the framework section) and demonstrate that it holds for the program decompositions to which the theorem is applied; without this, the invariance and convergence claims do not follow.

Authors: We agree that the independence condition should be stated more prominently. In the revised manuscript we will add an explicit definition (new Definition 2.4) of the subproblem independence condition in the framework section, together with a short lemma (Lemma 2.5) proving that the condition holds for the decompositions to which the main convergence theorem is applied. This will directly establish invariance of the marginals and preservation of the overall stationary distribution under kernel composition. revision: yes
Referee: [Framework / Definitions] The distinction between independent and entangled subproblem inference must be formalized with precise statements of how the product measure is preserved under kernel composition. If the paper only assumes independence without deriving the required invariance from the program semantics, the reduction from the hybrid algorithm to the target posterior is incomplete.

Authors: The manuscript already contrasts the two notions in Section 3 and defines independent inference via kernels that act on disjoint subproblem traces. To address the referee's concern, we will insert a new proposition (Proposition 3.2) that derives preservation of the product measure directly from the probabilistic program semantics under the independence condition, thereby making the reduction to the target posterior fully explicit. revision: yes

Circularity Check

0 steps flagged

No circularity: new formalism and standard MCMC convergence applied to defined subproblem class

full rationale

The paper defines independent subproblem inference as a distinct concept from entangled inference, then constructs a mathematical framework and applies it to prove asymptotic convergence of hybrid MCMC algorithms. The independence condition is an explicit definitional requirement on the program decomposition (ensuring each subproblem kernel preserves the appropriate marginal and the composed chain is ergodic), not a fitted quantity renamed as a prediction or a result smuggled in via self-citation. The derivation therefore rests on standard Markov chain theory applied to the newly delineated class rather than reducing to its own inputs by construction. No load-bearing self-citations, ansatzes, or renamings of known empirical patterns appear in the provided derivation chain.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The proof necessarily rests on standard Markov-chain convergence theory and on the definition of independence for subproblems; no free parameters or invented entities are visible in the abstract.

axioms (2)

standard math Standard ergodicity and convergence properties of individual MCMC kernels
The hybrid convergence result composes kernels whose individual convergence is taken as given.
domain assumption The subproblem decomposition preserves the joint stationary distribution
Required for the composed sampler to target the correct distribution.

pith-pipeline@v0.9.0 · 5659 in / 1213 out tokens · 26269 ms · 2026-05-24T22:25:55.925157+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

29 extracted references · 29 canonical work pages · 6 internal anchors

[1]

Machine learning 50, 1-2 (2003), 5–43

An introduction to MCMC for machine learning. Machine learning 50, 1-2 (2003), 5–43. Anonymous

work page 2003
[2]

(2020). Nimar S. Arora, Rodrigo de Salvo Braz, Erik B. Sudderth, and Stuar t J. Russell

work page 2020
[3]

Gibbs Sampling in Open-Universe Stochastic Languages

Gibbs Sampling in Open-Universe Stochastic Languages. CoRR abs/1203.3464 (2012). arXiv: 1203.3464 http://arxiv.org/abs/1203.3464 Krishna B Athreya, Hani Doss, Jayaram Sethuraman, et al

work page internal anchor Pith review Pith/arXiv arXiv 2012
[4]

The Annals of Statistics 24, 1 (1996), 69–100

On the convergence of the Markov chain simulation method. The Annals of Statistics 24, 1 (1996), 69–100. Eric Atkinson and Michael Carbin

work page 1996
[5]

Typesafety for Explicitly-Co ded Probabilistic Inference Procedures. (2017). Patrizia Berti, Luca Pratelli, Pietro Rigo, et al

work page 2017
[6]

The Annals of Probability 36, 6 (2008), 2215–2234

Trivial inters ection of σ -ﬁelds and Gibbs sampling. The Annals of Probability 36, 6 (2008), 2215–2234. Johannes Borgström, Ugo Dal Lago, Andrew D Gordon, and Marcin Szymc zak

work page 2008
[7]

Journal of Statistical Software 20 (2016), 1–37

Stan: A probabilistic pr ogramming language. Journal of Statistical Software 20 (2016), 1–37. Siddhartha Chib and Edward Greenberg

work page 2016
[8]

The american statistician 49, 4 (1995), 327–335

Understanding the metro polis-hastings algorithm. The american statistician 49, 4 (1995), 327–335. Ugo Dal Lago and Margherita Zorzi

work page 1995
[9]

RAIRO-Theoretical Informatics and Applications 46, 3 (2012), 413–450

Probabilistic operational s emantics for the lambda calculus. RAIRO-Theoretical Informatics and Applications 46, 3 (2012), 413–450. Persi Diaconis and Daniel Stroock

work page 2012
[10]

The Annals of Applied Probability (1991), 36–61

Geometric bounds for eigenval ues of Markov chains. The Annals of Applied Probability (1991), 36–61. David A Forsyth and Jean Ponce

work page 1991
[11]

Course notes, Spring Quarter (1998)

Markov chain Monte Carlo lecture notes. Course notes, Spring Quarter (1998). Noah Goodman, Vikash Mansinghka, Daniel M Roy, Keith Bonawitz, and Joshu a B Tenenbaum

work page 1998
[12]

http://dippl.org

The Design and Im plementation of Probabilistic Programming Lan- guages. http://dippl.org. Accessed: 2017-11-15. Andrew D. Gordon, Thore Graepel, Nicolas Rolland, Claudio Russo, Johannes Borgstrom, and John Guiver. 2014a. Tabular: a schema-driven probabilistic programming language. In ACM SIGPLAN Notices, Vol

work page 2017
[13]

CoRR abs/1701.02547 (2017)

A Convenient Category for Higher-Order Probability Theory. CoRR abs/1701.02547 (2017). arXiv: 1701.02547 http://arxiv.org/abs/1701.02547 David M Higdon

work page arXiv 2017
[14]

Journal of the American statistical Association 93, 442 (1998), 585–595

Auxiliary variable methods for Markov chain Mo nte Carlo with applications. Journal of the American statistical Association 93, 442 (1998), 585–595. Daniel Huang, Jean-Baptiste Tristan, and Greg Morrisett

work page 1998
[15]

In Foundations of Computer Science, 1979., 20th Annual Symposium on

Semantics of probabilistic programs. In Foundations of Computer Science, 1979., 20th Annual Symposium on. IEEE, 101–114. Uber AI Labs

work page 1979
[16]

Venture: a higher-order probabilistic programming platform with programmable inference

Venture: a hig her-order probabilistic programming platform with programmable inference. arXiv preprint arXiv:1404.0099 (2014). Vikash K Mansinghka, Ulrich Schaechtle, Shivam Handa, Alexey Radul , Yutian Chen, and Martin Rinard

work page internal anchor Pith review Pith/arXiv arXiv 2014
[17]

Statistical relational learning (2007),

BLOG: Probabilistic models with unknown objects. Statistical relational learning (2007),

work page 2007
[18]

Extending Bayesian networks to the open-universe case. (2010). David Monniaux

work page 2010
[19]

An Abstract Monte-Carlo Method for the Analysis of Probabilistic Programs

An abstract Monte-Carlo method for the analy sis of probabilistic programs. arXiv preprint cs/0701195 (2007). Kevin P Murphy

work page internal anchor Pith review Pith/arXiv arXiv 2007
[20]

Bayesian State-Space Modelling on High-Performance Hardware Using LibBi

Bayesian state-space modelling on high- performance hardware using LibBi. arXiv preprint arXiv:1306.3277 (2013). Norman Ramsey and Avi Pfeﬀer

work page internal anchor Pith review Pith/arXiv arXiv 2013
[21]

Probability surveys 1 (2004), 20–71

General stat e space Markov chains and MCMC algorithms. Probability surveys 1 (2004), 20–71. Gareth O Roberts and Adrian FM Smith

work page 2004
[22]

Stochastic processes and their applications 49, 2 (1994), 207–216

Simple conditions for the c onvergence of the Gibbs sampler and Metropolis- Hastings algorithms. Stochastic processes and their applications 49, 2 (1994), 207–216. Stuart J. Russell and Peter Norvig

work page 1994
[23]

Proceedings of the ACM on Programming Languages (PACMPL) 2, ICFP (30 7 2018), 83:1–83:29

Functional Pr ogramming for Modular Bayesian Inference. Proceedings of the ACM on Programming Languages (PACMPL) 2, ICFP (30 7 2018), 83:1–83:29. https://doi.org/10.1145/3236778 Adam Ścibior, Ohad Kammar, Matthijs Vákár, Sam Staton, Hongseok Y ang, Yufei Cai, Klaus Ostermann, Sean K. Moss, Chris Heunen, and Zoubin Ghahramani

work page doi:10.1145/3236778 2018
[24]

Denotational Validation of Hig her-order Bayesian Inference. Proc. ACM Program. Lang. 2, POPL, Article 60 (Dec. 2017), 29 pages. https://doi.org/10.1145/3158148 Micha Sharir, Amir Pnueli, and Sergiu Hart

work page doi:10.1145/3158148 2017
[25]

Veriﬁcation of proba bilistic programs. SIAM J. Comput. 13, 2 (1984), 292–314. Sam Staton, Hongseok Yang, Frank Wood, Chris Heunen, and Ohad Kammar

work page 1984
[26]

the Annals of Statistics (1994), 1701–1728

Markov chains for exploring posterior distributio ns. the Annals of Statistics (1994), 1701–1728. David Tolpin, Jan-Willem van de Meent, and Frank Wood

work page 1994
[27]

Deep Probabilistic Programming

Deep probabilistic programming. arXiv preprint arXiv:1701.03757 (2017). Jean-Baptiste Tristan, Daniel Huang, Joseph Tassarotti, Adam C. Poc ock, Stephen Green, and Guy L. Steele

work page internal anchor Pith review Pith/arXiv arXiv 2017
[28]

Proceed- ings of the ACM on Programming Languages (PACMPL) 3, POPL (1 1 2019), 36:1–36:29

A Domain Theory for Statistical Probabilistic Programming. Proceed- ings of the ACM on Programming Languages (PACMPL) 3, POPL (1 1 2019), 36:1–36:29. https://doi.org/10.1145/3290349 David Wingate, Andreas Stuhlmüller, and Noah Goodman

work page doi:10.1145/3290349 2019
[29]

Swift: Compiled Inference for Probabilistic Programming Languages

Swift: Compiled Inference for Probabilistic Programming Lan- guages. CoRR abs/1606.09242 (2016). arXiv: 1606.09242 http://arxiv.org/abs/1606.09242 1 Shivam Handa, Vikash Mansinghka, and Martin Rinard A APPENDIX A.1 Convergence L/e.sc/m.sc/m.sc/a.sc 7.For all t∈ T and A∈ Σ, K n f(t, A)= K n i(x)(/y.alt, V)I(x, U) where f(t)=⟨x, /y.alt⟩∈ Xi× Yi and U× V = f...

work page internal anchor Pith review Pith/arXiv arXiv 2016

[1] [1]

Machine learning 50, 1-2 (2003), 5–43

An introduction to MCMC for machine learning. Machine learning 50, 1-2 (2003), 5–43. Anonymous

work page 2003

[2] [2]

(2020). Nimar S. Arora, Rodrigo de Salvo Braz, Erik B. Sudderth, and Stuar t J. Russell

work page 2020

[3] [3]

Gibbs Sampling in Open-Universe Stochastic Languages

Gibbs Sampling in Open-Universe Stochastic Languages. CoRR abs/1203.3464 (2012). arXiv: 1203.3464 http://arxiv.org/abs/1203.3464 Krishna B Athreya, Hani Doss, Jayaram Sethuraman, et al

work page internal anchor Pith review Pith/arXiv arXiv 2012

[4] [4]

The Annals of Statistics 24, 1 (1996), 69–100

On the convergence of the Markov chain simulation method. The Annals of Statistics 24, 1 (1996), 69–100. Eric Atkinson and Michael Carbin

work page 1996

[5] [5]

Typesafety for Explicitly-Co ded Probabilistic Inference Procedures. (2017). Patrizia Berti, Luca Pratelli, Pietro Rigo, et al

work page 2017

[6] [6]

The Annals of Probability 36, 6 (2008), 2215–2234

Trivial inters ection of σ -ﬁelds and Gibbs sampling. The Annals of Probability 36, 6 (2008), 2215–2234. Johannes Borgström, Ugo Dal Lago, Andrew D Gordon, and Marcin Szymc zak

work page 2008

[7] [7]

Journal of Statistical Software 20 (2016), 1–37

Stan: A probabilistic pr ogramming language. Journal of Statistical Software 20 (2016), 1–37. Siddhartha Chib and Edward Greenberg

work page 2016

[8] [8]

The american statistician 49, 4 (1995), 327–335

Understanding the metro polis-hastings algorithm. The american statistician 49, 4 (1995), 327–335. Ugo Dal Lago and Margherita Zorzi

work page 1995

[9] [9]

RAIRO-Theoretical Informatics and Applications 46, 3 (2012), 413–450

Probabilistic operational s emantics for the lambda calculus. RAIRO-Theoretical Informatics and Applications 46, 3 (2012), 413–450. Persi Diaconis and Daniel Stroock

work page 2012

[10] [10]

The Annals of Applied Probability (1991), 36–61

Geometric bounds for eigenval ues of Markov chains. The Annals of Applied Probability (1991), 36–61. David A Forsyth and Jean Ponce

work page 1991

[11] [11]

Course notes, Spring Quarter (1998)

Markov chain Monte Carlo lecture notes. Course notes, Spring Quarter (1998). Noah Goodman, Vikash Mansinghka, Daniel M Roy, Keith Bonawitz, and Joshu a B Tenenbaum

work page 1998

[12] [12]

http://dippl.org

The Design and Im plementation of Probabilistic Programming Lan- guages. http://dippl.org. Accessed: 2017-11-15. Andrew D. Gordon, Thore Graepel, Nicolas Rolland, Claudio Russo, Johannes Borgstrom, and John Guiver. 2014a. Tabular: a schema-driven probabilistic programming language. In ACM SIGPLAN Notices, Vol

work page 2017

[13] [13]

CoRR abs/1701.02547 (2017)

A Convenient Category for Higher-Order Probability Theory. CoRR abs/1701.02547 (2017). arXiv: 1701.02547 http://arxiv.org/abs/1701.02547 David M Higdon

work page arXiv 2017

[14] [14]

Journal of the American statistical Association 93, 442 (1998), 585–595

Auxiliary variable methods for Markov chain Mo nte Carlo with applications. Journal of the American statistical Association 93, 442 (1998), 585–595. Daniel Huang, Jean-Baptiste Tristan, and Greg Morrisett

work page 1998

[15] [15]

In Foundations of Computer Science, 1979., 20th Annual Symposium on

Semantics of probabilistic programs. In Foundations of Computer Science, 1979., 20th Annual Symposium on. IEEE, 101–114. Uber AI Labs

work page 1979

[16] [16]

Venture: a higher-order probabilistic programming platform with programmable inference

Venture: a hig her-order probabilistic programming platform with programmable inference. arXiv preprint arXiv:1404.0099 (2014). Vikash K Mansinghka, Ulrich Schaechtle, Shivam Handa, Alexey Radul , Yutian Chen, and Martin Rinard

work page internal anchor Pith review Pith/arXiv arXiv 2014

[17] [17]

Statistical relational learning (2007),

BLOG: Probabilistic models with unknown objects. Statistical relational learning (2007),

work page 2007

[18] [18]

Extending Bayesian networks to the open-universe case. (2010). David Monniaux

work page 2010

[19] [19]

An Abstract Monte-Carlo Method for the Analysis of Probabilistic Programs

An abstract Monte-Carlo method for the analy sis of probabilistic programs. arXiv preprint cs/0701195 (2007). Kevin P Murphy

work page internal anchor Pith review Pith/arXiv arXiv 2007

[20] [20]

Bayesian State-Space Modelling on High-Performance Hardware Using LibBi

Bayesian state-space modelling on high- performance hardware using LibBi. arXiv preprint arXiv:1306.3277 (2013). Norman Ramsey and Avi Pfeﬀer

work page internal anchor Pith review Pith/arXiv arXiv 2013

[21] [21]

Probability surveys 1 (2004), 20–71

General stat e space Markov chains and MCMC algorithms. Probability surveys 1 (2004), 20–71. Gareth O Roberts and Adrian FM Smith

work page 2004

[22] [22]

Stochastic processes and their applications 49, 2 (1994), 207–216

Simple conditions for the c onvergence of the Gibbs sampler and Metropolis- Hastings algorithms. Stochastic processes and their applications 49, 2 (1994), 207–216. Stuart J. Russell and Peter Norvig

work page 1994

[23] [23]

Proceedings of the ACM on Programming Languages (PACMPL) 2, ICFP (30 7 2018), 83:1–83:29

Functional Pr ogramming for Modular Bayesian Inference. Proceedings of the ACM on Programming Languages (PACMPL) 2, ICFP (30 7 2018), 83:1–83:29. https://doi.org/10.1145/3236778 Adam Ścibior, Ohad Kammar, Matthijs Vákár, Sam Staton, Hongseok Y ang, Yufei Cai, Klaus Ostermann, Sean K. Moss, Chris Heunen, and Zoubin Ghahramani

work page doi:10.1145/3236778 2018

[24] [24]

Denotational Validation of Hig her-order Bayesian Inference. Proc. ACM Program. Lang. 2, POPL, Article 60 (Dec. 2017), 29 pages. https://doi.org/10.1145/3158148 Micha Sharir, Amir Pnueli, and Sergiu Hart

work page doi:10.1145/3158148 2017

[25] [25]

Veriﬁcation of proba bilistic programs. SIAM J. Comput. 13, 2 (1984), 292–314. Sam Staton, Hongseok Yang, Frank Wood, Chris Heunen, and Ohad Kammar

work page 1984

[26] [26]

the Annals of Statistics (1994), 1701–1728

Markov chains for exploring posterior distributio ns. the Annals of Statistics (1994), 1701–1728. David Tolpin, Jan-Willem van de Meent, and Frank Wood

work page 1994

[27] [27]

Deep Probabilistic Programming

Deep probabilistic programming. arXiv preprint arXiv:1701.03757 (2017). Jean-Baptiste Tristan, Daniel Huang, Joseph Tassarotti, Adam C. Poc ock, Stephen Green, and Guy L. Steele

work page internal anchor Pith review Pith/arXiv arXiv 2017

[28] [28]

Proceed- ings of the ACM on Programming Languages (PACMPL) 3, POPL (1 1 2019), 36:1–36:29

A Domain Theory for Statistical Probabilistic Programming. Proceed- ings of the ACM on Programming Languages (PACMPL) 3, POPL (1 1 2019), 36:1–36:29. https://doi.org/10.1145/3290349 David Wingate, Andreas Stuhlmüller, and Noah Goodman

work page doi:10.1145/3290349 2019

[29] [29]

Swift: Compiled Inference for Probabilistic Programming Languages

Swift: Compiled Inference for Probabilistic Programming Lan- guages. CoRR abs/1606.09242 (2016). arXiv: 1606.09242 http://arxiv.org/abs/1606.09242 1 Shivam Handa, Vikash Mansinghka, and Martin Rinard A APPENDIX A.1 Convergence L/e.sc/m.sc/m.sc/a.sc 7.For all t∈ T and A∈ Σ, K n f(t, A)= K n i(x)(/y.alt, V)I(x, U) where f(t)=⟨x, /y.alt⟩∈ Xi× Yi and U× V = f...

work page internal anchor Pith review Pith/arXiv arXiv 2016