Subspace accelerated measure transport methods for fast and scalable sequential experimental design, with application to photoacoustic imaging
Pith reviewed 2026-05-23 02:37 UTC · model grok-4.3
The pith
A derivative-based upper bound on incremental expected information gain enables subspace projectors and conditional transport maps for scalable sequential optimal experimental design in high-dimensional Bayesian inverse problems.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
A derivative-based upper bound for the incremental expected information gain guides design placement and simultaneously supplies projectors onto likelihood-informed subspaces; when these projectors are paired with conditional measure transport maps for the sequence of posteriors, the combined procedure yields a unified, scalable framework for sequential optimal experimental design and amortized inference that works in high- and infinite-dimensional settings.
What carries the argument
derivative-based upper bound for iEIG together with likelihood-informed subspace projectors and conditional measure transport maps
If this is right
- Design selection can be performed by maximizing the bound instead of the intractable iEIG at each stage.
- Subspace projectors built from the bound reduce the effective parameter dimension before each transport map is trained.
- Conditional transport maps enable amortized sampling from every posterior that arises during the sequential process.
- The same framework applies without modification to infinite-dimensional parameter spaces arising from PDE discretizations.
- Numerical tests on two PDE-governed inverse problems confirm that the resulting designs improve information gain.
Where Pith is reading between the lines
- The same bound-driven projectors might be reused across multiple related inverse problems that share the same forward operator.
- Transport-map amortization could support real-time sequential design when new data arrive continuously.
- The approach may reduce the cost of related tasks such as Bayesian optimal design under model discrepancy.
Load-bearing premise
The derivative-based upper bound on iEIG is sufficiently tight to both select good designs and produce projectors that meaningfully reduce the dimension for the transport maps.
What would settle it
On a low-dimensional synthetic inverse problem where the exact iEIG can be computed by quadrature, compare the designs chosen by maximizing the bound against those chosen by maximizing the true iEIG and check whether the selected design sequences coincide.
read the original abstract
We propose a novel approach for sequential optimal experimental design (sOED) for Bayesian inverse problems involving expensive models with high-dimensional unknown parameters. This work focuses on designs that maximize the expected information gain (EIG) from prior to posterior, a task that is computationally very challenging in non-Gaussian settings. This challenge is amplified in sOED, as the incremental expected information gain (iEIG) must be repeatedly approximated across distinct stages, with both prior and posterior distributions being intractable. To address this, we derive a general-purpose, derivative-based upper bound for the iEIG, which not only guides design placement but also enables the construction of projectors onto likelihood-informed subspaces, facilitating parameter dimension reduction. By combining this approach with conditional measure transport maps for the sequence of posteriors, we develop a unified sOED and amortized inference framework scalable to high- and infinite-dimensional problems. Numerical experiments for two inverse problems governed by partial differential equations (PDEs) demonstrate the effectiveness of designs by maximizing the proposed bound.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to derive a general-purpose derivative-based upper bound for the incremental expected information gain (iEIG) in sequential optimal experimental design (sOED) for high-dimensional Bayesian inverse problems with expensive PDE models. This bound is asserted to simultaneously guide design selection and enable projectors onto likelihood-informed subspaces for parameter dimension reduction. The approach is combined with conditional measure transport maps to create a unified, scalable framework for sOED and amortized inference. Effectiveness is demonstrated via numerical experiments on two PDE-governed inverse problems.
Significance. If the derivative-based upper bound produces subspaces that reliably preserve the structure of the sequence of posteriors for accurate conditional transport maps, the work would represent a meaningful advance in tractable sOED for non-Gaussian, high- and infinite-dimensional settings. The unification of design optimization with amortized inference via transport maps addresses a genuine computational bottleneck, and the PDE numerical results provide initial evidence of practicality. The contribution would be strengthened by explicit verification that the bound's subspaces remain effective beyond the reported examples.
major comments (2)
- [Abstract] Abstract (paragraph on the bound and projectors): the central claim that the derivative-based upper bound 'enables the construction of projectors onto likelihood-informed subspaces' that make transport maps tractable rests on an unproven alignment between the bound and posterior variability. No theorem or analysis is cited showing that directions emphasized by the bound (computed from derivatives) coincide with those carrying the essential non-Gaussian posterior mass in the sequence of posteriors; this is load-bearing for the scalability assertion in high-dimensional non-Gaussian PDE settings.
- [Numerical experiments] Numerical experiments section: while the two PDE examples are reported to demonstrate effectiveness of designs obtained by maximizing the bound, the manuscript provides no quantitative assessment (e.g., error in transport map approximation with vs. without the subspace reduction, or comparison of subspace quality against true posterior covariance or samples) that would confirm the subspaces preserve the structure needed for the subsequent transport step. Without such diagnostics, the experiments do not yet substantiate the unified framework's claimed scalability.
minor comments (2)
- Notation for the upper bound and the projectors should be introduced with explicit definitions and distinguished from related quantities (e.g., standard EIG bounds) to improve readability.
- The manuscript would benefit from a short table summarizing the computational complexity of each component (bound evaluation, projector construction, transport map training) across the reported examples.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our manuscript. We address each major comment below and outline the revisions we will make to strengthen the presentation.
read point-by-point responses
-
Referee: [Abstract] Abstract (paragraph on the bound and projectors): the central claim that the derivative-based upper bound 'enables the construction of projectors onto likelihood-informed subspaces' that make transport maps tractable rests on an unproven alignment between the bound and posterior variability. No theorem or analysis is cited showing that directions emphasized by the bound (computed from derivatives) coincide with those carrying the essential non-Gaussian posterior mass in the sequence of posteriors; this is load-bearing for the scalability assertion in high-dimensional non-Gaussian PDE settings.
Authors: The upper bound is constructed directly from the same derivative (gradient) information that defines likelihood-informed subspaces in the existing literature on dimension reduction for Bayesian inversion. Its form therefore identifies directions of high sensitivity to the data, which are the directions expected to carry the dominant posterior variability. While the manuscript does not contain a new standalone theorem proving exact equivalence in every non-Gaussian case, the alignment follows from the derivation of the bound itself. In the revision we will add a short subsection (or remark) that explicitly connects the bound to likelihood-informed projectors, cites the relevant prior works on this construction, and states the conditions under which the alignment is expected to hold, thereby making the reasoning transparent. revision: partial
-
Referee: [Numerical experiments] Numerical experiments section: while the two PDE examples are reported to demonstrate effectiveness of designs obtained by maximizing the bound, the manuscript provides no quantitative assessment (e.g., error in transport map approximation with vs. without the subspace reduction, or comparison of subspace quality against true posterior covariance or samples) that would confirm the subspaces preserve the structure needed for the subsequent transport step. Without such diagnostics, the experiments do not yet substantiate the unified framework's claimed scalability.
Authors: We agree that additional quantitative diagnostics would strengthen the numerical section. In the revised manuscript we will augment the experiments with (i) tables or plots comparing the transport-map approximation error (e.g., KL divergence or Wasserstein distance to reference samples) obtained with and without the subspace projectors, and (ii) comparisons of the retained subspace directions against the leading eigenvectors of the posterior covariance (where the latter is computable) or against empirical posterior samples. These additions will directly demonstrate that the subspaces preserve the structure required for accurate conditional transport. revision: yes
Circularity Check
No circularity detected in derivation chain
full rationale
The paper derives a new derivative-based upper bound for iEIG that simultaneously guides designs and constructs likelihood-informed projectors for dimension reduction, then combines this with conditional measure transport maps to form a unified sOED/amortized inference framework. No step in the provided abstract or description reduces a claimed prediction or result to a fitted parameter, self-citation, or input quantity by construction; the bound and subspace projectors are presented as newly derived, and the scalability claim rests on this independent combination rather than renaming or re-using prior fitted quantities. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Forward citations
Cited by 1 Pith paper
-
Conditional Sampling via Wasserstein Autoencoders and Triangular Transport
CWAEs use triangular decoders and latent independence in Wasserstein autoencoders to perform conditional simulation by capturing low-dimensional structure in conditioned and conditioning variables.
Reference graph
Works this paper leans on
- [1]
- [2]
-
[3]
R. Baptista, Y. Marzouk, and O. Zahm. On the representation and learning of monotone triangular transport maps. Foundations of Computational Mathematics , pages 1–46, 2023
work page 2023
-
[4]
J. Beck, B. Mansour Dia, L. Espath, and R. Tempone. Multilevel double loop Monte Carlo and stochastic collocation methods with importance sampling for Bayesian optimal experimental design. Interna- tional Journal for Numerical Methods in Engineering , 121(15):3482–3503, 2020
work page 2020
-
[5]
M. Brennan, D. Bigoni, O. Zahm, A. Spantini, and Y. Marzouk. Greedy inference with structure-exploiting lazy maps. Advances in Neural Information Processing Systems , 33:8330–8342, 2020
work page 2020
-
[6]
L. Cao, R. Baptista, J. Chen, F. Li, O. Ghattas, J. T. Oden, and Y. Marzouk. Bayesian model cali- 21 bration for block copolymer self-assembly: Likelihood-free inference and expected information gain computation via measure transport. Bulletin of the American Physical Society
- [7]
-
[8]
S. Chaturantabut and D. C. Sorensen. Nonlinear model reduction via discrete empirical interpolation. SIAM Journal on Scientific Computing , 32(5):2737–2764, 2010
work page 2010
-
[9]
R. T. Chen, Y. Rubanova, J. Bettencourt, and D. K. Duvenaud. Neural ordinary differential equations. Advances in Neural Information Processing Systems , 31, 2018
work page 2018
-
[10]
P. G. Constantine, C. Kent, and T. Bui-Thanh. Accelerating Markov chain Monte Carlo with active subspaces. SIAM Journal on Scientific Computing , 38(5):A2779–A2805, 2016
work page 2016
- [11]
-
[12]
B. Cox, J. G. Laufer, S. R. Arridge, and P. C. Beard. Quantitative spectroscopic photoacoustic imaging: a review. Journal of biomedical optics , 17(6):061202–061202, 2012
work page 2012
-
[13]
T. Cui. Deep Transport. https://github.com/DeepTransport/deep-tensor
-
[14]
T. Cui. Fast forward and inverse problems solver (fastfins). https://github.com/fastfins/fastfins.m
- [15]
-
[16]
T. Cui, S. Dolgov, and O. Zahm. Scalable conditional deep inverse Rosenblatt transports using tensor trains and gradient-based dimension reduction. Journal of Computational Physics , 485:112103, 2023
work page 2023
- [17]
-
[18]
T. Cui, K. J. Law, and Y. M. Marzouk. Dimension-independent likelihood-informed MCMC. Journal of Computational Physics, 304:109–137, 2016
work page 2016
-
[19]
T. Cui, J. Martin, Y. M. Marzouk, A. Solonen, and A. Spantini. Likelihood-informed dimension reduction for nonlinear inverse problems. Inverse Problems, 30(11):114015, 2014
work page 2014
- [20]
-
[21]
T. Cui, X. T. Tong, and O. Zahm. Prior normalization for certified likelihood-informed subspace detection of Bayesian inverse problems. Inverse Problems, 38(12):124002, 2022
work page 2022
- [22]
- [23]
-
[24]
J. Dong, C. Jacobsen, M. Khalloufi, M. Akram, W. Liu, K. Duraisamy, and X. Huan. Variational Bayesian optimal experimental design with normalizing flows. Comput. Methods Appl. Mech. Eng., 433:117457, 2025
work page 2025
-
[25]
C. C. Drovandi, J. M. McGree, and A. N. Pettitt. Sequential Monte Carlo for Bayesian sequentially designed experiments for discrete data. Comput. Stat. Data Anal. , 57(1):320–335, 2013
work page 2013
-
[26]
T. A. El Moselhy and Y. M. Marzouk. Bayesian inference with optimal maps. Journal of Computational Physics, 231(23):7815–7850, 2012
work page 2012
- [27]
-
[28]
M. Girolami and B. Calderhead. Riemann manifold Langevin and Hamiltonian Monte Carlo methods. Journal of the Royal Statistical Society Series B: Statistical Methodology , 73(2):123–214, 2011
work page 2011
- [29]
-
[30]
A. Gorodetsky, S. Karaman, and Y. Marzouk. A continuous analogue of the tensor-train decomposition. Computer Methods in Aapplied Mechanics and Engineering , 347:59–84, 2019
work page 2019
-
[31]
J. Gr¨ ohl, M. Schellenberg, K. Dreher, and L. Maier-Hein. Deep learning for biomedical photoacoustic imaging: A review. Photoacoustics, 22:100241, 2021
work page 2021
-
[32]
X. Huan, J. Jagalur, and Y. Marzouk. Optimal experimental design: Formulations and computations. Acta Numerica, 33:715–840, 2024
work page 2024
-
[33]
X. Huan and Y. M. Marzouk. Simulation-based optimal Bayesian experimental design for nonlinear systems. Journal of Computational Physics , 232(1):288–317
-
[34]
Sequential Bayesian optimal experimental design via approximate dynamic programming
X. Huan and Y. M. Marzouk. Sequential Bayesian optimal experimental design via approximate dynamic programming. arXiv preprint arXiv:1604.08320 , 2016
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[35]
J. Kaipio and V. Kolehmainen. Approximate marginalization over modeling errors and uncertainties in inverse problems. Bayesian theory and applications , pages 644–672, 2013
work page 2013
-
[36]
S. Kleinegesse, C. Drovandi, and M. U. Gutmann. Sequential Bayesian experimental design for implicit models via mutual information, 2020. 22
work page 2020
- [37]
- [38]
- [39]
- [40]
-
[41]
C. Lutzweiler and D. Razansky. Optoacoustic imaging and tomography: reconstruction approaches and outstanding challenges in image performance and quantification. Sensors, 13(6):7345–7384, 2013
work page 2013
- [42]
-
[43]
Y. Marzouk, T. Moselhy, M. Parno, and A. Spantini. Sampling via measure transport: An introduction. Handbook of uncertainty quantification, 1:2, 2016
work page 2016
-
[44]
J. C. Mattingly, N. S. Pillai, and A. Stuart. Diffusion limits of the random walk Metropolis algorithm in high dimensions. The Annals of Applied Probability , 22(3):881–930, 2012
work page 2012
- [45]
-
[46]
I. Oseledets and E. Tyrtyshnikov. TT-cross approximation for multidimensional arrays. Linear Algebra and its Applications , 432(1):70–88, 2010
work page 2010
-
[47]
G. Papamakarios, E. Nalisnick, D. J. Rezende, S. Mohamed, and B. Lakshminarayanan. Normalizing flows for probabilistic modeling and inference. J. Mach. Learn. Res. , 22(1):2617–2680, 2021
work page 2021
-
[48]
G. O. Roberts and J. S. Rosenthal. Optimal scaling for various Metropolis-Hastings algorithms. Statistical science, 16(4):351–367, 2001
work page 2001
-
[49]
E. G. Ryan, C. C. Drovandi, J. M. McGree, and A. N. Pettitt. A review of modern computational algorithms for Bayesian optimal design. International Statistical Review, 84(1):128–154, 2016
work page 2016
-
[50]
K. J. Ryan. Estimating expected information gains for experimental designs with application to the random fatigue-limit model. Journal of Computational and Graphical Statistics , 12(3):585–603, 2003
work page 2003
-
[51]
W. Shen and X. Huan. Bayesian sequential optimal experimental design for nonlinear models using policy gradient reinforcement learning. Comput. Methods Appl. Mech. Eng. , 416:116304, 2023
work page 2023
-
[52]
T. Tarvainen and B. Cox. Quantitative photoacoustic tomography: modeling and inverse problems.Journal of Biomedical Optics, 29(S1):S11509–S11509, 2024
work page 2024
-
[53]
J. Westermann and J. Zech. Measure transport via polynomial density surrogates. arXiv preprint arXiv:2311.04172, 2023
-
[54]
K. Wu, P. Chen, and O. Ghattas. A fast and scalable computational framework for large-scale high- dimensional Bayesian optimal experimental design. SIAM/ASA Journal on Uncertainty Quantifica- tion, 11(1):235–261, 2023
work page 2023
-
[55]
K. Wu, T. O’Leary-Roseberry, P. Chen, and O. Ghattas. Large-scale Bayesian optimal experimental design with derivative-informed projected neural network. J. Sci. Comput. , 95(1):30, 2023
work page 2023
-
[56]
O. Zahm, T. Cui, K. Law, A. Spantini, and Y. Marzouk. Certified dimension reduction in nonlinear Bayesian inverse problems. Mathematics of Computation , 91(336):1789–1835, 2022. 23
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.