Posterior Inference in Latent Space for Scalable Constrained Black-box Optimization
Pith reviewed 2026-05-19 07:04 UTC · model grok-4.3
The pith
Constrained black-box optimization can be recast as posterior inference over candidates in the latent space of flow-based generative models.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By training flow-based models to capture the data distribution together with surrogate models for objective and constraint predictions, and then casting candidate selection as posterior inference performed in the latent space and amortized by outsourced diffusion models, the approach generates promising points that simultaneously maximize the objective while respecting the constraints, and it demonstrates superior empirical performance on both synthetic benchmarks and real-world tasks.
What carries the argument
Posterior inference over candidates performed inside the latent space of flow-based generative models, with sampling amortized by outsourced diffusion models.
If this is right
- Candidate generation becomes a sampling problem from a posterior rather than an explicit constrained search in the input space.
- The method scales to high-dimensional inputs by shifting all search operations into a lower-dimensional latent representation.
- Surrogate models for the objective and constraints are used only to define the posterior, avoiding direct penalty or barrier terms.
- Amortized diffusion sampling in latent space reduces the risk of mode collapse compared with standard MCMC or variational inference in the original space.
Where Pith is reading between the lines
- The same latent-space inference pattern could be applied to other generative architectures such as VAEs or autoregressive models if they admit a suitable latent representation.
- The approach may be especially useful when the feasible set consists of multiple disconnected components that are difficult to discover by local search.
- It suggests a broader connection between black-box optimization and amortized inference techniques that could be explored on problems with mixed continuous-discrete variables.
Load-bearing premise
The latent space learned by the flow-based models preserves enough structure that posterior inference over it reliably identifies high-value feasible points without requiring explicit constraint handling in the original space.
What would settle it
On a test problem whose feasible region is poorly aligned with the structure captured by the flow model, the method would produce mostly infeasible or low-value samples despite the surrogate predictions.
Figures
read the original abstract
Optimizing high-dimensional black-box functions under black-box constraints is a pervasive task in a wide range of scientific and engineering problems. These problems are typically harder than unconstrained problems due to hard-to-find feasible regions. In this work, we reformulate constrained black-box optimization as posterior inference, and perform this inference in the latent space of generative models. Our method iterates through two stages. First, we train flow-based models to capture the data distribution and surrogate models that predict both function values and constraint violations. Second, we cast the candidate selection problem as a posterior inference problem to effectively search for promising candidates that have high objective values while not violating the constraints. Concretely, we utilize outsourced diffusion models to amortize the sampling from the posterior distribution in the latent space of flow-based models, which can bypass the issue of mode collapse. We empirically demonstrate that our method achieves superior performance across synthetic and real-world tasks. Our code is available \href{https://github.com/umkiyoung/CiBO}{here}.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a method for constrained black-box optimization that reformulates the task as posterior inference performed entirely in the latent space of flow-based generative models. The approach trains flow-based models on (presumably feasible) data along with surrogate models for the objective and constraint violations, then uses outsourced diffusion models to amortize sampling from the posterior over the latent space in order to identify high-value feasible points. The central claim is that this yields superior performance on synthetic and real-world tasks while avoiding explicit constraint handling in the original input space.
Significance. If the central construction holds, the work could provide a scalable route to high-dimensional constrained optimization by leveraging the structure captured in generative latent spaces and amortized diffusion sampling to sidestep mode collapse. The public release of code is a clear strength that supports reproducibility. The significance is tempered by the fact that the performance gain is not reduced to a quantity defined solely by the fitted parameters; it depends on an independent modeling step whose fidelity to the feasible set is not yet quantified.
major comments (2)
- [Method (abstract and §3)] The core construction (training flow models on feasible data, fitting surrogates, then performing posterior inference in latent space) assumes that the composition of latent posterior sampling followed by flow decoding maps high-posterior latent points to points that remain both high-value and feasible in the original space. No analysis, bound, or ablation is supplied on the mismatch between the learned density and the true feasible set, on imperfect invertibility of the flow, or on surrogate error in the constraint model; any such mismatch directly produces constraint-violating candidates and removes the claimed advantage of “no explicit constraint handling.”
- [Experiments] The empirical claim of superior performance is stated in the abstract and conclusion but is not accompanied, in the visible summary, by concrete metrics, baselines, or statistical significance tests. Without these details it is impossible to assess whether the reported gains are load-bearing for the central claim or could be explained by differences in hyper-parameter tuning or evaluation protocol.
minor comments (2)
- [§3.2] Clarify the precise form of the posterior that is being approximated by the outsourced diffusion model; the current description leaves open whether the surrogate constraint model enters the posterior as a hard indicator or as a soft penalty.
- [Related work] Add a short discussion of related latent-space Bayesian optimization and constrained generative-model methods to situate the contribution.
Simulated Author's Rebuttal
We thank the referee for their thoughtful and constructive comments. We address each major comment below with clarifications and indicate planned revisions to improve the manuscript.
read point-by-point responses
-
Referee: [Method (abstract and §3)] The core construction (training flow models on feasible data, fitting surrogates, then performing posterior inference in latent space) assumes that the composition of latent posterior sampling followed by flow decoding maps high-posterior latent points to points that remain both high-value and feasible in the original space. No analysis, bound, or ablation is supplied on the mismatch between the learned density and the true feasible set, on imperfect invertibility of the flow, or on surrogate error in the constraint model; any such mismatch directly produces constraint-violating candidates and removes the claimed advantage of “no explicit constraint handling.”
Authors: We appreciate the referee highlighting the need for explicit discussion of approximation quality. Our flow models are trained solely on feasible samples drawn from the problem's feasible set, so the support of the decoded distribution is intended to approximate feasible regions. Normalizing flows are bijective by construction, with the decoder being the exact inverse of the encoder (subject only to floating-point precision). Surrogate models for the objective and constraints are standard probabilistic regressors that incorporate predictive uncertainty into the posterior. We agree that a dedicated analysis of mismatch effects would strengthen the presentation. In the revision we will add a subsection to §3 that (i) states the modeling assumptions, (ii) provides a simple probabilistic bound on the probability of decoding a constraint-violating point when the flow density is close to the true feasible density in total variation, and (iii) reports an empirical ablation measuring the fraction of constraint violations among decoded candidates across the benchmark suite. revision: yes
-
Referee: [Experiments] The empirical claim of superior performance is stated in the abstract and conclusion but is not accompanied, in the visible summary, by concrete metrics, baselines, or statistical significance tests. Without these details it is impossible to assess whether the reported gains are load-bearing for the central claim or could be explained by differences in hyper-parameter tuning or evaluation protocol.
Authors: We thank the referee for noting that the experimental evidence should be more immediately visible. Section 4 of the full manuscript already contains the requested details: we evaluate on four synthetic constrained benchmarks and two real-world tasks, reporting mean and standard deviation (over 20 independent runs) of the best feasible objective value attained, the feasibility rate of returned candidates, and wall-clock time. Baselines include constrained BO with penalty and augmented Lagrangian formulations, evolutionary strategies, and prior latent-space optimization methods. Statistical significance of performance differences is assessed with paired t-tests (p < 0.05 reported). To address the referee's concern we will (a) insert a concise summary table of key metrics into the abstract and conclusion, (b) add an explicit paragraph describing the evaluation protocol and hyper-parameter selection procedure, and (c) include the full set of p-values in the revised experimental section. revision: partial
Circularity Check
No significant circularity; reformulation and empirical claims are independent of fitted inputs
full rationale
The paper describes a two-stage procedure: training flow-based generative models on observed data to learn a latent representation of the input distribution, training separate surrogate models for the objective and constraint violation, and then performing posterior inference over the latent variables using diffusion models to select candidates. This modeling choice and the subsequent empirical evaluation on synthetic and real-world tasks constitute an independent algorithmic contribution rather than any quantity being defined in terms of itself or a fitted parameter being relabeled as a prediction. No equations or self-citations are shown that would reduce the claimed performance advantage to a tautology or to a self-referential construction. The derivation chain therefore remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Flow-based models can faithfully capture the data distribution so that latent-space posterior inference corresponds to useful original-space candidates.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We cast the candidate selection problem as a posterior inference problem... amortize the sampling from the posterior distribution in the latent space of flow-based models
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Lagrangian relaxation of the objective as a reward function... rϕ(x) = μϕ(x) + γ·σϕ(x) − λ Σ max(0,g(m)ϕ(x))
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Bayesian optimization with inequality constraints
Jacob Gardner, Matt Kusner, Kilian Weinberger, John Cunningham, et al. Bayesian optimization with inequality constraints. In International Conference on Machine Learning, pages 937–945. PMLR, 2014
work page 2014
-
[2]
Constrained bayesian optimization for automatic chemical design using variational autoencoders
Ryan-Rhys Griffiths and José Miguel Hernández-Lobato. Constrained bayesian optimization for automatic chemical design using variational autoencoders. Chemical science, 11(2):577–586, 2020
work page 2020
-
[3]
Chembo: Bayesian optimization of small organic molecules with synthesizable recommendations
Ksenia Korovina, Sailun Xu, Kirthevasan Kandasamy, Willie Neiswanger, Barnabas Poczos, Jeff Schneider, and Eric Xing. Chembo: Bayesian optimization of small organic molecules with synthesizable recommendations. In International Conference on Artificial Intelligence and Statistics, pages 3393–3403. PMLR, 2020
work page 2020
-
[4]
Safe controller optimization for quadrotors with gaussian processes
Felix Berkenkamp, Angela P Schoellig, and Andreas Krause. Safe controller optimization for quadrotors with gaussian processes. In International conference on robotics and automation (ICRA), 2016
work page 2016
-
[5]
Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics
Felix Berkenkamp, Andreas Krause, and Angela P Schoellig. Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics. Machine Learning, 112(10):3713– 3747, 2023
work page 2023
-
[6]
MF Anjos and DR Jones. Mopta 2008 benchmark. URL http://www. miguelanjos. com/jones- benchmark, 2009
work page 2008
-
[7]
Hauke F Maathuis, Roeland De Breuker, and Saullo GP Castro. High-dimensional bayesian optimisation with large-scale constraints via latent space gaussian processes. arXiv preprint arXiv:2412.15679, 2024
-
[8]
Scalable constrained bayesian optimization
David Eriksson and Matthias Poloczek. Scalable constrained bayesian optimization. In Interna- tional conference on artificial intelligence and statistics, pages 730–738. PMLR, 2021
work page 2021
-
[9]
A Tutorial on Bayesian Optimization
Peter I Frazier. A tutorial on bayesian optimization. arXiv preprint arXiv:1807.02811, 2018
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[10]
Roman Garnett. Bayesian optimization. Cambridge University Press, 2023
work page 2023
-
[11]
Predictive entropy search for bayesian optimization with unknown constraints
José Miguel Hernández-Lobato, Michael Gelbart, Matthew Hoffman, Ryan Adams, and Zoubin Ghahramani. Predictive entropy search for bayesian optimization with unknown constraints. In International conference on machine learning, pages 1699–1707. PMLR, 2015
work page 2015
-
[12]
Bayesian optimiza- tion under mixed constraints with a slack-variable augmented lagrangian
Victor Picheny, Robert B Gramacy, Stefan Wild, and Sebastien Le Digabel. Bayesian optimiza- tion under mixed constraints with a slack-variable augmented lagrangian. Advances in neural information processing systems, 29, 2016
work page 2016
-
[13]
Admmbo: Bayesian op- timization with unknown constraints using admm
Setareh Ariafar, Jaume Coll-Font, Dana Brooks, and Jennifer Dy. Admmbo: Bayesian op- timization with unknown constraints using admm. Journal of Machine Learning Research, 20(123):1–26, 2019
work page 2019
-
[14]
Scalable global optimization via local bayesian optimization
David Eriksson, Michael Pearce, Jacob Gardner, Ryan D Turner, and Matthias Poloczek. Scalable global optimization via local bayesian optimization. InAdvances in Neural Information Processing Systems (NeurIPS), 2019
work page 2019
-
[15]
Diffusion models as constrained samplers for optimization with unknown constraints
Lingkai Kong, Yuanqi Du, Wenhao Mu, Kirill Neklyudov, Valentin De Bortoli, Dongxia Wu, Haorui Wang, Aaron M Ferber, Yian Ma, Carla P Gomes, and Chao Zhang. Diffusion models as constrained samplers for optimization with unknown constraints. In The 28th International Conference on Artificial Intelligence and Statistics, 2025
work page 2025
-
[16]
Black-box optimization with implicit constraints for public policy
Wenqian Xing, JungHo Lee, Chong Liu, and Shixiang Zhu. Black-box optimization with implicit constraints for public policy. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 39, pages 28511–28519, 2025
work page 2025
-
[17]
Masatoshi Uehara, Xingyu Su, Yulai Zhao, Xiner Li, Aviv Regev, Shuiwang Ji, Sergey Levine, and Tommaso Biancalani. Reward-guided iterative refinement in diffusion models at test-time with applications to protein and dna design. arXiv preprint arXiv:2502.14944, 2025. 10
-
[18]
Amortizing intractable inference in diffusion models for vision, language, and control
Siddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth, and Nikolay Malkin. Amortizing intractable inference in diffusion models for vision, language, and control. In The Thirty-eighth Annual Conferen...
work page 2024
-
[19]
Carles Domingo-Enrich, Michal Drozdzal, Brian Karrer, and Ricky T. Q. Chen. Adjoint matching: Fine-tuning flow and diffusion generative models with memoryless stochastic optimal control. In The Thirteenth International Conference on Learning Representations, 2025
work page 2025
-
[20]
Masatoshi Uehara, Yulai Zhao, Kevin Black, Ehsan Hajiramezanali, Gabriele Scalia, Nathaniel Lee Diamant, Alex M Tseng, Tommaso Biancalani, and Sergey Levine. Fine- tuning of continuous-time diffusion models as entropy-regularized control. arXiv preprint arXiv:2402.15194, 2024
-
[21]
Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models
Siddarth Venkatraman, Mohsin Hasan, Minsu Kim, Luca Scimeca, Marcin Sendera, Yoshua Bengio, Glen Berseth, and Nikolay Malkin. Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models. In International Conference on Machine Learning (ICML), 2025
work page 2025
-
[22]
Normalizing flow sampling with langevin dynamics in the latent space
Florentin Coeurdoux, Nicolas Dobigeon, and Pierre Chainais. Normalizing flow sampling with langevin dynamics in the latent space. arXiv preprint arXiv:2305.12149, 2023
-
[23]
Global versus local search in constrained optimization of computer models
Matthias Schonlau, William J Welch, and Donald R Jones. Global versus local search in constrained optimization of computer models. Lecture notes-monograph series, pages 11–25, 1998
work page 1998
-
[24]
Unexpected improvements to expected improvement for bayesian optimization
Sebastian Ament, Samuel Daulton, David Eriksson, Maximilian Balandat, and Eytan Bakshy. Unexpected improvements to expected improvement for bayesian optimization. Advances in Neural Information Processing Systems, 36:20577–20612, 2023
work page 2023
-
[25]
Principal component analysis for special types of data
Ian T Jolliffe. Principal component analysis for special types of data. Springer, 2002
work page 2002
-
[26]
The cma evolution strategy: a comparing review
Nikolaus Hansen. The cma evolution strategy: a comparing review. Towards a new evolutionary computation: Advances in the estimation of distribution algorithms, pages 75–102, 2006
work page 2006
-
[27]
Augmented lagrangian constraint handling for cma-es—case of a single linear constraint
Asma Atamna, Anne Auger, and Nikolaus Hansen. Augmented lagrangian constraint handling for cma-es—case of a single linear constraint. In International Conference on Parallel Problem Solving from Nature, pages 181–191. Springer, 2016
work page 2016
-
[28]
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu, and Mark Chen. Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125, 1(2):3, 2022
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[29]
Scaling rectified flow trans- formers for high-resolution image synthesis
Patrick Esser, Sumith Kulal, Andreas Blattmann, Rahim Entezari, Jonas Müller, Harry Saini, Yam Levi, Dominik Lorenz, Axel Sauer, Frederic Boesel, et al. Scaling rectified flow trans- formers for high-resolution image synthesis. In Forty-first international conference on machine learning, 2024
work page 2024
-
[30]
Posterior inference with diffusion models for high-dimensional black-box optimization
Taeyoung Yun, Kiyoung Om, Jaewoo Lee, Sujin Yun, and Jinkyoo Park. Posterior inference with diffusion models for high-dimensional black-box optimization. In International Conference on Machine Learning (ICML), 2025
work page 2025
-
[31]
Diffusion models for black-box optimization
Siddarth Krishnamoorthy, Satvik Mehul Mashkaria, and Aditya Grover. Diffusion models for black-box optimization. In International Conference on Machine Learning (ICML), 2023
work page 2023
-
[32]
Diff-BBO: Diffusion- based inverse modeling for black-box optimization
Dongxia Wu, Nikki Lijing Kuang, Ruijia Niu, Yian Ma, and Rose Yu. Diff-BBO: Diffusion- based inverse modeling for black-box optimization. In NeurIPS 2024 Workshop on Bayesian Decision-making and Uncertainty, 2024
work page 2024
-
[33]
Guided trajectory generation with diffusion models for offline model-based optimization
Taeyoung Yun, Sujin Yun, Jaewoo Lee, and Jinkyoo Park. Guided trajectory generation with diffusion models for offline model-based optimization. In The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024. 11
work page 2024
-
[34]
Paretoflow: Guided flows in multi-objective optimization
Ye Yuan, Can Chen, Christopher Pal, and Xue Liu. Paretoflow: Guided flows in multi-objective optimization. In The Thirteenth International Conference on Learning Representations, 2025
work page 2025
-
[35]
Biological sequence design with gflownets
Moksh Jain, Emmanuel Bengio, Alex Hernandez-Garcia, Jarrid Rector-Brooks, Bonaventure FP Dossou, Chanakya Ajit Ekbote, Jie Fu, Tianyu Zhang, Michael Kilgour, Dinghuai Zhang, et al. Biological sequence design with gflownets. In International Conference on Machine Learning, pages 9786–9801. PMLR, 2022
work page 2022
-
[36]
Improved off-policy reinforcement learning in biological sequence design
Hyeonah Kim, Minsu Kim, Taeyoung Yun, Sanghyeok Choi, Emmanuel Bengio, Alex Hernández-García, and Jinkyoo Park. Improved off-policy reinforcement learning in biological sequence design. In International Conference on Machine Learning (ICML), 2025
work page 2025
-
[37]
Diffusion models beat gans on image synthesis
Prafulla Dhariwal and Alexander Nichol. Diffusion models beat gans on image synthesis. Advances in neural information processing systems, 34:8780–8794, 2021
work page 2021
-
[38]
Classifier-free diffusion guidance
Jonathan Ho and Tim Salimans. Classifier-free diffusion guidance. In NeurIPS Workshop on Deep Generative Models and Downstream Applications, 2021
work page 2021
-
[39]
Solving inverse problems in medi- cal imaging with score-based generative models
Yang Song, Liyue Shen, Lei Xing, and Stefano Ermon. Solving inverse problems in medi- cal imaging with score-based generative models. In International Conference on Learning Representations, 2022
work page 2022
-
[40]
Diffusion posterior sampling for general noisy inverse problems
Hyungjin Chung, Jeongsol Kim, Michael Thompson Mccann, Marc Louis Klasky, and Jong Chul Ye. Diffusion posterior sampling for general noisy inverse problems. In The Eleventh Interna- tional Conference on Learning Representations, 2023
work page 2023
-
[41]
Dpok: Reinforcement learning for fine-tuning text-to-image diffusion models
Ying Fan, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, and Kimin Lee. Dpok: Reinforcement learning for fine-tuning text-to-image diffusion models. Advances in Neural Information Processing Systems, 36:79858–79885, 2023
work page 2023
-
[42]
Cheng Lu, Huayu Chen, Jianfei Chen, Hang Su, Chongxuan Li, and Jun Zhu. Contrastive energy prediction for exact energy-guided diffusion sampling in offline reinforcement learning. In International Conference on Machine Learning, pages 22825–22855. PMLR, 2023
work page 2023
-
[43]
Practi- cal and asymptotically exact conditional sampling in diffusion models
Luhuan Wu, Brian Trippe, Christian Naesseth, David Blei, and John P Cunningham. Practi- cal and asymptotically exact conditional sampling in diffusion models. Advances in Neural Information Processing Systems, 36:31372–31403, 2023
work page 2023
-
[44]
Monte carlo guided denoising diffusion models for bayesian linear inverse problems
Gabriel Cardoso, Sylvain Le Corff, Eric Moulines, et al. Monte carlo guided denoising diffusion models for bayesian linear inverse problems. In The Twelfth International Conference on Learning Representations, 2024
work page 2024
-
[45]
Training diffusion models with reinforcement learning
Kevin Black, Michael Janner, Yilun Du, Ilya Kostrikov, and Sergey Levine. Training diffusion models with reinforcement learning. In The Twelfth International Conference on Learning Representations, 2024
work page 2024
-
[46]
Yoshua Bengio, Salem Lahlou, Tristan Deleu, Edward J Hu, Mo Tiwari, and Emmanuel Bengio. Gflownet foundations. Journal of Machine Learning Research, 24(210):1–55, 2023
work page 2023
-
[47]
Flow matching for generative modeling
Yaron Lipman, Ricky TQ Chen, Heli Ben-Hamu, Maximilian Nickel, and Matthew Le. Flow matching for generative modeling. In The Eleventh International Conference on Learning Representations, 2023
work page 2023
-
[48]
Flow straight and fast: Learning to generate and transfer data with rectified flow
Xingchao Liu, Chengyue Gong, et al. Flow straight and fast: Learning to generate and transfer data with rectified flow. In The Eleventh International Conference on Learning Representations, 2023
work page 2023
-
[49]
Building normalizing flows with stochastic interpolants
Michael Samuel Albergo and Eric Vanden-Eijnden. Building normalizing flows with stochastic interpolants. In The Eleventh International Conference on Learning Representations, 2023
work page 2023
-
[50]
Improved off-policy training of diffusion samplers
Marcin Sendera, Minsu Kim, Sarthak Mittal, Pablo Lemos, Luca Scimeca, Jarrid Rector-Brooks, Alexandre Adam, Yoshua Bengio, and Nikolay Malkin. Improved off-policy training of diffusion samplers. Advances in Neural Information Processing Systems, 37:81016–81045, 2024. 12
work page 2024
-
[51]
Trajectory balance: Improved credit assignment in gflownets
Nikolay Malkin, Moksh Jain, Emmanuel Bengio, Chen Sun, and Yoshua Bengio. Trajectory balance: Improved credit assignment in gflownets. Advances in Neural Information Processing Systems, 35:5955–5967, 2022
work page 2022
-
[52]
Score-based generative modeling through stochastic differential equations
Yang Song, Jascha Sohl-Dickstein, Diederik P Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score-based generative modeling through stochastic differential equations. In International Conference on Learning Representations, 2021
work page 2021
-
[53]
Simple and scalable predictive uncertainty estimation using deep ensembles
Balaji Lakshminarayanan, Alexander Pritzel, and Charles Blundell. Simple and scalable predictive uncertainty estimation using deep ensembles. Advances in neural information processing systems, 30, 2017
work page 2017
-
[54]
Bootstrapped training of score- conditioned generator for offline design of biological sequences
Minsu Kim, Federico Berto, Sungsoo Ahn, and Jinkyoo Park. Bootstrapped training of score- conditioned generator for offline design of biological sequences. In Advances in Neural Information Processing Systems (NeurIPS), 2023
work page 2023
-
[55]
Model inversion networks for model-based optimization
Aviral Kumar and Sergey Levine. Model inversion networks for model-based optimization. In Advances in Neural Information Processing Systems (NeurIPS), 2020
work page 2020
-
[56]
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair, Abhishek Gupta, Murtaza Dalal, and Sergey Levine. Awac: Accelerating online reinforcement learning with offline datasets. arXiv preprint arXiv:2006.09359, 2020
work page internal anchor Pith review Pith/arXiv arXiv 2006
-
[57]
Batched large-scale bayesian optimization in high-dimensional spaces
Zi Wang, Clement Gehring, Pushmeet Kohli, and Stefanie Jegelka. Batched large-scale bayesian optimization in high-dimensional spaces. In International Conference on Artificial Intelligence and Statistics, pages 745–754. PMLR, 2018
work page 2018
-
[58]
Lassobench: A high- dimensional hyperparameter optimization benchmark suite for lasso
Kenan Šehi ´c, Alexandre Gramfort, Joseph Salmon, and Luigi Nardi. Lassobench: A high- dimensional hyperparameter optimization benchmark suite for lasso. In International Confer- ence on Automated Machine Learning, pages 2–1. PMLR, 2022
work page 2022
-
[59]
A general framework for constrained bayesian optimization using information-based search
José Miguel Hern, Michael A Gelbart, Ryan P Adams, Matthew W Hoffman, Zoubin Ghahra- mani, et al. A general framework for constrained bayesian optimization using information-based search. Journal of Machine Learning Research, 17(160):1–53, 2016
work page 2016
-
[60]
Improving and generalizing flow-based genera- tive models with minibatch optimal transport
Alexander Tong, Kilian FATRAS, Nikolay Malkin, Guillaume Huguet, Yanlei Zhang, Jarrid Rector-Brooks, Guy Wolf, and Yoshua Bengio. Improving and generalizing flow-based genera- tive models with minibatch optimal transport. Transactions on Machine Learning Research, 2024
work page 2024
-
[61]
Levine, Brandon M Wood, Bin Hu, Brandon Amos, Brian Karrer, Xiang Fu, Guan- Horng Liu, and Ricky T
Aaron J Havens, Benjamin Kurt Miller, Bing Yan, Carles Domingo-Enrich, Anuroop Sriram, Daniel S. Levine, Brandon M Wood, Bin Hu, Brandon Amos, Brian Karrer, Xiang Fu, Guan- Horng Liu, and Ricky T. Q. Chen. Adjoint sampling: Highly scalable diffusion samplers via adjoint matching. In Frontiers in Probabilistic Inference: Learning meets Sampling, 2025
work page 2025
-
[62]
Adaptive teachers for amortized samplers
Minsu Kim, Sanghyeok Choi, Taeyoung Yun, Emmanuel Bengio, Leo Feng, Jarrid Rector- Brooks, Sungsoo Ahn, Jinkyoo Park, Nikolay Malkin, and Yoshua Bengio. Adaptive teachers for amortized samplers. In The Thirteenth International Conference on Learning Representations, 2025
work page 2025
-
[63]
Nicola Demo, Marco Tezzele, and Gianluigi Rozza. A supervised learning approach involving active subspaces for an efficient genetic algorithm in high-dimensional optimization problems. SIAM Journal on Scientific Computing, 43(3):B831–B853, 2021
work page 2021
-
[64]
Learning search space partition for black- box optimization using monte carlo tree search
Linnan Wang, Rodrigo Fonseca, and Yuandong Tian. Learning search space partition for black- box optimization using monte carlo tree search. Advances in Neural Information Processing Systems, 33:19511–19522, 2020
work page 2020
-
[65]
Improving sample efficiency of high dimensional bayesian optimization with mcmc
Zeji Yi, Yunyue Wei, Chu Xin Cheng, Kaibo He, and Yanan Sui. Improving sample efficiency of high dimensional bayesian optimization with mcmc. In 6th Annual Learning for Dynamics & Control Conference, pages 813–824. PMLR, 2024
work page 2024
-
[66]
Zelda B Zabinsky and Robert L Smith. Hit-and-run methods. Encyclopedia of Operations Research and Management Science, pages 721–729, 2013. 13
work page 2013
-
[67]
Leonard Papenmeier, Luigi Nardi, and Matthias Poloczek. Increasing the scope as you learn: Adaptive bayesian optimization in nested subspaces.Advances in Neural Information Processing Systems, 35:11586–11601, 2022
work page 2022
-
[68]
Nikolaus Hansen, Youhei Akimoto, and Petr Baudis. CMA-ES/pycma on Github. Zenodo, DOI:10.5281/zenodo.2559634, February 2019
-
[69]
Gaussian Error Linear Units (GELUs)
Dan Hendrycks and Kevin Gimpel. Gaussian error linear units (gelus). arXiv preprint arXiv:1606.08415, 2016
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[70]
Adam: A method for stochastic optimization
Diederik P Kingma and Jimmy Ba. Adam: A method for stochastic optimization. In Interna- tional Conference on Learning Representations (ICLR), 2015
work page 2015
-
[71]
Yaron Lipman, Marton Havasi, Peter Holderrieth, Neta Shaul, Matt Le, Brian Karrer, Ricky TQ Chen, David Lopez-Paz, Heli Ben-Hamu, and Itai Gat. Flow matching guide and code. arXiv preprint arXiv:2412.06264, 2024
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[72]
Ricky T. Q. Chen. torchdiffeq, 2018
work page 2018
-
[73]
Ulf Grenander and Michael I Miller. Representations of knowledge in complex systems.Journal of the Royal Statistical Society: Series B (Methodological), 56(4):549–581, 1994
work page 1994
-
[74]
Simon Duane, Anthony D Kennedy, Brian J Pendleton, and Duncan Roweth. Hybrid monte carlo. Physics letters B, 195(2):216–222, 1987
work page 1987
-
[75]
J. H. Halton. Sequential monte carlo.Mathematical Proceedings of the Cambridge Philosophical Society, 58(1):57–78, 1962
work page 1962
-
[76]
A sequential particle filter method for static models
Nicolas Chopin. A sequential particle filter method for static models. Biometrika, 89(3):539– 552, 2002
work page 2002
-
[77]
Nested sampling for general bayesian computation
John Skilling. Nested sampling for general bayesian computation. 2006
work page 2006
-
[78]
Improving gradient-guided nested sampling for posterior inference
Pablo Lemos, Nikolay Malkin, Will Handley, Yoshua Bengio, Yashar Hezaveh, and Laurence Perreault-Levasseur. Improving gradient-guided nested sampling for posterior inference. In International Conference on Machine Learning, pages 27230–27253. PMLR, 2024
work page 2024
-
[79]
Path integral sampler: A stochastic control approach for sampling
Qinsheng Zhang and Yongxin Chen. Path integral sampler: A stochastic control approach for sampling. In International Conference on Learning Representations, 2022
work page 2022
-
[80]
Francisco Vargas, Will Sussman Grathwohl, and Arnaud Doucet. Denoising diffusion samplers. In The Eleventh International Conference on Learning Representations, 2023
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.