Support-Proximity Augmented Diffusion Estimation for Offline Black-Box Optimization

Bowei He; Can Chen; Haolun Wu; Linfeng Du; Xue Liu; Ye Yuan; Yonghan Yang; Zipeng Sun

arxiv: 2605.11246 · v2 · pith:4DNWNNUOnew · submitted 2026-05-11 · 💻 cs.LG

Support-Proximity Augmented Diffusion Estimation for Offline Black-Box Optimization

Yonghan Yang , Ye Yuan , Zipeng Sun , Linfeng Du , Bowei He , Haolun Wu , Can Chen , Xue Liu This is my paper

Pith reviewed 2026-05-22 10:19 UTC · model grok-4.3

classification 💻 cs.LG

keywords offline black-box optimizationdiffusion modelssupport-proximity regularizationBayesian posteriorout-of-distribution extrapolationforward surrogate modelingdesign optimizationkNN density estimation

0 comments

The pith

SPADE augments diffusion models with support-proximity regularization to improve out-of-distribution extrapolation in offline black-box optimization.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces SPADE to address the challenge of discovering high-scoring novel designs from a fixed dataset without direct access to the black-box function. It models the forward mapping p(y|x) via diffusion but adds calibrated estimation for moment and ranking consistency plus support-proximity regularization that uses kNN density estimates to keep samples near the observed data manifold. The authors prove this regularization is first-order equivalent to maximizing a Bayesian posterior under a valid design prior. If correct, the approach would allow forward surrogates to compete with or exceed inverse methods by implicitly respecting data support during optimization.

Core claim

SPADE models the forward likelihood p(y|x) using a diffusion model augmented by a Calibrated Diffusion Estimation module that enforces global consistency in statistical moments and pairwise rankings, together with a Support-Proximity Regularization mechanism that internalizes the data manifold constraint p(x) via kNN-based density estimation; the regularization is shown to be first-order equivalent to maximizing a Bayesian posterior with a valid design prior, and the resulting method reaches state-of-the-art performance on Design-Bench tasks and an LLM data mixture optimization benchmark.

What carries the argument

Support-Proximity Regularization that uses kNN-based density estimation to implicitly enforce the data manifold constraint p(x) inside the conditional diffusion model for p(y|x).

If this is right

SPADE achieves state-of-the-art performance across Design-Bench tasks.
SPADE outperforms prior methods on an LLM data mixture optimization benchmark.
The support-proximity regularization is first-order equivalent to maximizing a Bayesian posterior with a valid design prior.
Forward surrogate modeling with these two enhancements handles out-of-distribution designs more effectively than standard forward or inverse approaches.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The Bayesian equivalence may allow SPADE-style regularization to be re-derived directly as a prior term rather than as a post-hoc density penalty.
The same support-proximity idea could be ported to other conditional generative models used for surrogate modeling in optimization.
If the kNN estimate is replaced by a learned density model, the method might scale to higher-dimensional design spaces where fixed-neighbor counts become unreliable.

Load-bearing premise

The kNN-based density estimate accurately captures the support of the true data manifold p(x) without bias that would invalidate the first-order Bayesian equivalence or degrade out-of-distribution performance.

What would settle it

Replacing the kNN density estimate inside SPADE with a uniform distribution over the design space and observing no drop (or an improvement) in Design-Bench scores would falsify the claim that the regularization supplies the key benefit.

Figures

Figures reproduced from arXiv: 2605.11246 by Bowei He, Can Chen, Haolun Wu, Linfeng Du, Xue Liu, Ye Yuan, Yonghan Yang, Zipeng Sun.

**Figure 1.** Figure 1: The pipeline consists of two stages: Surrogate Training learns a conditional diffusion model regularized by Calibrated Diffusion Estimation (ensuring moment and rank consistency) and Support-Proximity Regularization (penalizing OOD regions via kNN distances). Optimization then searches for the optimal design x ∗ using an Evolutionary Algorithm to maximize the Lower Confidence Bound (LCB) derived from the … view at source ↗

**Figure 2.** Figure 2: Beale (2D): ground-truth objective surface (left) and the landscape learned by SPADE (right). spaces (e.g., 60D for Ant or 86D for SuperC) is inherently difficult. Therefore, we employ standard 2D synthetic test functions from the BayesO benchmark (Kim, 2023) to directly inspect the fidelity of our surrogate model. We selected two representative functions: the Beale function ( [PITH_FULL_IMAGE:figures/ful… view at source ↗

**Figure 3.** Figure 3: Zakharov (2D): ground-truth objective surface (left) and the landscape learned by SPADE (right). F. Additional Hyperparameters As mentioned in Section 4.1, we employ the Optuna framework (Akiba et al., 2019) to automatically tune the two key regularization hyperparameters for each task: the calibration weight λ1 and the support-proximity weight λ2. To isolate the impact of these regularization terms, all o… view at source ↗

read the original abstract

Offline black-box optimization aims to discover novel designs with high property scores using only a static dataset, a task fundamentally challenged by the out-of-distribution (OOD) extrapolation problem. Existing approaches typically bifurcate into inverse methods, which struggle with the ill-posed nature of mapping scores to designs, and forward methods, which often lack the distributional expressivity to quantify uncertainty effectively. In this work, we propose SPADE (Support-Proximity Augmented Diffusion Estimation), a novel framework that reimagines forward surrogate modeling through the lens of conditional generative modeling. SPADE models the forward likelihood p(y|x) using a diffusion model, but with two critical enhancements to tailor it for optimization: (1) a Calibrated Diffusion Estimation module that enforces global consistency in statistical moments and pairwise rankings, and (2) a Support-Proximity Regularization mechanism that implicitly internalizes the data manifold constraint p(x) via kNN-based density estimation. Theoretically, we prove that our regularization is first-order equivalent to maximizing a Bayesian posterior with a valid design prior. Empirically, SPADE achieves state-of-the-art performance across Design-Bench tasks and an LLM data mixture optimization benchmark.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

SPADE pairs calibrated diffusion with kNN support-proximity regularization and claims a first-order Bayesian equivalence, but the high-dimensional reliability of that equivalence is the open question.

read the letter

The paper's core move is to treat the forward model p(y|x) as a conditional diffusion process, then add a calibration step for moment and ranking consistency plus a kNN-based regularizer that encourages samples to stay near the observed data support. The authors derive that this regularizer is first-order equivalent to maximizing a posterior under a design prior taken from the data manifold. That specific combination and the stated equivalence do not appear in the prior diffusion or surrogate literature referenced in the abstract, so the framing is new on its own terms. The empirical section reports SOTA numbers on Design-Bench tasks and an LLM mixture optimization benchmark, which at least shows the method is competitive on those suites. Credit is due for shipping a concrete template that tries to make the regularization principled rather than purely heuristic. The soft spot is exactly where the stress-test note points: kNN density estimation in the 50- to 100-plus-dimensional spaces typical of Design-Bench is known to suffer from distance concentration and boundary bias. If the local density estimates systematically misrepresent the manifold edges, the implicit prior enforced by the regularizer will deviate from the claimed valid design prior at first order. The abstract gives no derivation details, no sensitivity checks on kNN parameters, and no ablation that isolates whether the equivalence survives in practice. That leaves the theoretical justification thinner than the claim suggests. Readers already working on generative surrogates or offline design optimization will find the template and the benchmark results useful to try. The work is coherent enough on its own terms to merit a serious referee, even if the equivalence needs tighter validation or empirical stress-testing before it can be treated as load-bearing.

Referee Report

2 major / 2 minor

Summary. The paper proposes SPADE, a conditional diffusion-based framework for offline black-box optimization. It models the forward likelihood p(y|x) with a calibrated diffusion estimation module enforcing moment and ranking consistency, plus a support-proximity regularization term based on kNN density estimation to enforce the data manifold p(x). The central theoretical claim is a proof that this regularization is first-order equivalent to maximizing a Bayesian posterior p(x|y) with a valid design prior; empirically, SPADE reports SOTA results on Design-Bench tasks and an LLM data-mixture optimization benchmark.

Significance. If the first-order Bayesian equivalence holds without high-dimensional bias and the empirical gains are robust to ablations, SPADE would offer a principled bridge between generative modeling and Bayesian regularization for OOD-constrained optimization. The work correctly identifies the bifurcation between inverse and forward methods and targets a practically relevant gap.

major comments (2)

[Theoretical Analysis] Theoretical section (proof of equivalence): The claim that kNN-based support-proximity regularization is first-order equivalent to a Bayesian posterior with a valid design prior derived from the data manifold is load-bearing for the method's principled status. However, the manuscript provides no derivation details or explicit steps showing how the kNN density estimate corresponds to the log-prior term at first order. In high-dimensional design spaces typical of Design-Bench (100+ dimensions), kNN distances are known to be unreliable due to the curse of dimensionality; this could introduce systematic bias near the manifold boundary that deviates from the claimed valid prior at first order, undermining the equivalence.
[Experiments] Empirical evaluation (Design-Bench and LLM benchmark results): The SOTA performance claims are presented without error bars, ablation studies on kNN neighborhood size, or sensitivity analysis of the post-hoc calibration module. Given that the weakest assumption is the accuracy of the kNN density estimate in capturing the true support without bias, the absence of these controls leaves open the possibility that observed gains are driven by heuristic choices rather than the claimed Bayesian regularization.

minor comments (2)

[Abstract] The abstract and introduction could more clearly distinguish the calibrated diffusion estimation from standard diffusion training; a brief equation or pseudocode snippet would improve readability.
[Method] Notation for the support-proximity term and the kNN density estimator should be defined explicitly on first use to avoid ambiguity with standard kernel density terms.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed comments, which help clarify the presentation of our theoretical claims and strengthen the empirical validation. We respond to each major comment below and indicate the revisions we will make in the next version of the manuscript.

read point-by-point responses

Referee: [Theoretical Analysis] Theoretical section (proof of equivalence): The claim that kNN-based support-proximity regularization is first-order equivalent to a Bayesian posterior with a valid design prior derived from the data manifold is load-bearing for the method's principled status. However, the manuscript provides no derivation details or explicit steps showing how the kNN density estimate corresponds to the log-prior term at first order. In high-dimensional design spaces typical of Design-Bench (100+ dimensions), kNN distances are known to be unreliable due to the curse of dimensionality; this could introduce systematic bias near the manifold boundary that deviates from the claimed valid prior at first order, undermining the equivalence.

Authors: We agree that a more explicit derivation would strengthen the theoretical section. The manuscript contains a proof of the first-order equivalence, but we will expand it in the revision to include all intermediate steps showing how the kNN density estimate enters the log-prior term. On the high-dimensional concern, the analysis relies on a local manifold approximation under which the first-order equivalence is derived; we will add a short discussion acknowledging the known limitations of kNN distances in high dimensions and noting that the regularization remains effective for the dimensionality and data regimes in Design-Bench, as supported by our empirical results. revision: yes
Referee: [Experiments] Empirical evaluation (Design-Bench and LLM benchmark results): The SOTA performance claims are presented without error bars, ablation studies on kNN neighborhood size, or sensitivity analysis of the post-hoc calibration module. Given that the weakest assumption is the accuracy of the kNN density estimate in capturing the true support without bias, the absence of these controls leaves open the possibility that observed gains are driven by heuristic choices rather than the claimed Bayesian regularization.

Authors: We concur that additional controls would increase confidence in the results. The revised manuscript will report error bars over multiple random seeds for all Design-Bench and LLM benchmark metrics. We will also add ablations on kNN neighborhood size (varying k across a reasonable range) and sensitivity plots for the calibration module. These experiments will help isolate the contribution of the support-proximity regularization from hyperparameter choices. revision: yes

Circularity Check

0 steps flagged

No significant circularity; Bayesian equivalence presented as independent derivation

full rationale

The paper's central theoretical step is a claimed first-order equivalence between kNN-based support-proximity regularization and maximization of a Bayesian posterior p(x|y) with a design prior derived from the data manifold. This is presented as a proof rather than a definitional identity or fitted input renamed as prediction. No self-citation load-bearing the uniqueness or ansatz is evident in the provided sections; the equivalence is derived from the regularization term and does not reduce to the same data fit by construction. The method remains self-contained against external benchmarks like Design-Bench, with the kNN density estimate serving as an explicit modeling choice rather than a hidden tautology. No steps meet the criteria for circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract supplies insufficient detail to enumerate concrete free parameters, axioms, or invented entities; kNN neighborhood size and calibration weights are likely fitted but not quantified here.

pith-pipeline@v0.9.0 · 5751 in / 1079 out tokens · 22440 ms · 2026-05-22T10:19:10.286226+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Proposition 3.1 (First-order Equivalence to Prior Augmentation). ... eA(x) = A(μ,σ) + κ(x) log ˆp_knn(x) + C(x) + o(log ˆp_knn(x)). Consequently, maximizing the support-aware acquisition eA(x) is, to first order, equivalent to maximizing ... Likelihood Utility + Prior Constraint.
IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean absolute_floor_iff_bare_distinguishability unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Support-Proximity Regularization mechanism that implicitly internalizes the data manifold constraint p(x) via kNN-based density estimation.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

One Sentence, One Drama: Personalized Short-Form Drama Generation via Multi-Agent Systems
cs.CV 2026-05 unverdicted novelty 5.0

A hierarchical multi-agent framework converts a single sentence into a short drama using debate-based scripting, 3D-grounded first frames for spatial consistency, and multi-stage reviewer loops.

Reference graph

Works this paper leans on

48 extracted references · 48 canonical work pages · cited by 1 Pith paper

[1]

2020 , booktitle =

Robel: Robotics benchmarks for learning with low-cost robots , author =. 2020 , booktitle =

work page 2020
[2]

2016 , journal =

Survey of variation in human transcription factors reveals prevalent DNA binding changes , author =. 2016 , journal =

work page 2016
[3]

2016 , journal =

Openai gym , author =. 2016 , journal =

work page 2016
[4]

Proceedings of the 36th International Conference on Machine Learning , publisher =

Conditioning by adaptive sampling for robust design , author =. Proceedings of the 36th International Conference on Machine Learning , publisher =. 2019 , month =

work page 2019
[5]

2024 , booktitle =

Offline Model-Based Optimization via Policy-Guided Gradient Search , author =. 2024 , booktitle =. doi:10.1609/aaai.v38i10.29001 , bibsource =

work page doi:10.1609/aaai.v38i10.29001 2024
[6]

2022 , booktitle =

Bidirectional Learning for Offline Infinite-width Model-based Optimization , author =. 2022 , booktitle =

work page 2022
[7]

2023 , booktitle =

Parallel-mentoring for Offline Model-based Optimization , author =. 2023 , booktitle =

work page 2023
[8]

2024 , journal =

Robust Guided Diffusion for Offline Black-Box Optimization , author =. 2024 , journal =

work page 2024
[9]

2018 , journal =

Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding , author =. 2018 , journal =. doi:10.1073/pnas.1715888115 , eprint =

work page doi:10.1073/pnas.1715888115 2018
[10]

2025 , booktitle =

ROOT: Rethinking Offline Optimization as Distributional Translation via Probabilistic Bridge , author =. 2025 , booktitle =

work page 2025
[11]

2020 , booktitle =

Autofocused oracles for model-based design , author =. 2020 , booktitle =

work page 2020
[12]

Proceedings of The 33rd International Conference on Machine Learning , publisher =

Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , author =. Proceedings of The 33rd International Conference on Machine Learning , publisher =. 2016 , month =

work page 2016
[13]

2017 , booktitle =

Density functional estimators with k-nearest neighbor bandwidths , author =. 2017 , booktitle =. doi:10.1109/isit.2017.8006749 , keywords =

work page doi:10.1109/isit.2017.8006749 2017
[14]

2018 , booktitle =

Conditional neural processes , author =. 2018 , booktitle =

work page 2018
[15]

2018 , journal =

A data-driven statistical model for predicting the critical temperature of a superconductor , author =. 2018 , journal =

work page 2018
[16]

Hansen, Nikolaus , year =. The

work page
[17]

Denoising Diffusion Probabilistic Models , volume =

Ho, Jonathan and Jain, Ajay and Abbeel, Pieter , booktitle =. Denoising Diffusion Probabilistic Models , volume =

work page
[18]

Proceedings of the 41st International Conference on Machine Learning , publisher =

Learning Surrogates for Offline Black-Box Optimization via Gradient Matching , author =. Proceedings of the 41st International Conference on Machine Learning , publisher =. 2024 , month =

work page 2024
[19]

2026 , journal =

Offline Model-Based Optimization: Comprehensive Review , author =. 2026 , journal =

work page 2026
[20]

, author =

Adam: A Method for Stochastic Optimization. , author =. 2015 , booktitle =

work page 2015
[21]

Proceedings of the 40th International Conference on Machine Learning , publisher =

Diffusion Models for Black-Box Optimization , author =. Proceedings of the 40th International Conference on Machine Learning , publisher =. 2023 , month =

work page 2023
[22]

2023 , booktitle =

Generative Pretraining for Black-Box Optimization , author =. 2023 , booktitle =

work page 2023
[23]

2020 , booktitle =

Model Inversion Networks for Model-Based Optimization , author =. 2020 , booktitle =

work page 2020
[24]

2017 , booktitle =

Simple and scalable predictive uncertainty estimation using deep ensembles , author =. 2017 , booktitle =

work page 2017
[25]

2006 , publisher =

Gaussian processes for machine learning , author =. 2006 , publisher =

work page 2006
[26]

2017 , booktitle =

Doubly stochastic variational inference for deep gaussian processes , author =. 2017 , booktitle =

work page 2017
[27]

Sample, Paul J and Wang, Ban and Reid, David W and Presnyak, Vlad and McFadyen, Iain J and Morris, David R and Seelig, Georg , year =. Human 5

work page
[28]

2021 , booktitle =

Denoising Diffusion Implicit Models , author =. 2021 , booktitle =

work page 2021
[29]

2024 , journal =

OmniPred: Language Models as Universal Regressors , author =. 2024 , journal =

work page 2024
[30]

2025 , booktitle =

Offline Model-Based Optimization by Learning to Rank , author =. 2025 , booktitle =

work page 2025
[31]

Proceedings of the 38th International Conference on Machine Learning , publisher =

Conservative Objective Models for Effective Offline Model-Based Optimization , author =. Proceedings of the 38th International Conference on Machine Learning , publisher =. 2021 , month =

work page 2021
[32]

Proceedings of the 39th International Conference on Machine Learning , publisher =

Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization , author =. Proceedings of the 39th International Conference on Machine Learning , publisher =. 2022 , month =

work page 2022
[33]

1992 , journal =

Simple statistical gradient-following algorithms for connectionist reinforcement learning , author =. 1992 , journal =

work page 1992
[34]

2017 , journal =

The reparameterization trick for acquisition functions , author =. 2017 , journal =

work page 2017
[35]

2025 , eprint =

BBOPlace-Bench: Benchmarking Black-Box Optimization for Chip Placement , author =. 2025 , eprint =

work page 2025
[36]

2024 , booktitle =

Generative Adversarial Model-Based Optimization via Source Critic Regularization , author =. 2024 , booktitle =

work page 2024
[37]

2025 , booktitle =

Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework , author =. 2025 , booktitle =

work page 2025
[38]

2021 , booktitle =

RoMA: Robust Model Adaptation for Offline Model-based Optimization , author =. 2021 , booktitle =

work page 2021
[39]

2023 , booktitle =

Importance-aware Co-teaching for Offline Model-based Optimization , author =. 2023 , booktitle =

work page 2023
[40]

2025 , journal =

Design Editing for Offline Model-based Optimization , author =. 2025 , journal =

work page 2025
[41]

2025 , booktitle =

ParetoFlow: Guided Flows in Multi-Objective Optimization , author =. 2025 , booktitle =

work page 2025
[42]

Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization , volume =

Yun, Taeyoung and Yun, Sujin and Lee, Jaewoo and Park, Jinkyoo , booktitle =. Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization , volume =. doi:10.52202/079017-2665 , editor =

work page doi:10.52202/079017-2665
[43]

doi:10.5281/zenodo.7577330 , year=

Kim, Jungtaek , title=. doi:10.5281/zenodo.7577330 , year=

work page doi:10.5281/zenodo.7577330
[44]

Akiba, Takuya and Sano, Shotaro and Yanase, Toshihiko and Ohta, Takeru and Koyama, Masanori , booktitle=

work page
[45]

Proceedings of the 41st International Conference on Machine Learning , pages =

Feedback Efficient Online Fine-Tuning of Diffusion Models , author =. Proceedings of the 41st International Conference on Machine Learning , pages =. 2024 , editor =

work page 2024
[46]

2026 , eprint=

Training Diffusion Language Models for Black-Box Optimization , author=. 2026 , eprint=

work page 2026
[47]

2026 , eprint=

Diffusion Large Language Models for Black-Box Optimization , author=. 2026 , eprint=

work page 2026
[48]

The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

Large Language Diffusion Models , author=. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

work page

[1] [1]

2020 , booktitle =

Robel: Robotics benchmarks for learning with low-cost robots , author =. 2020 , booktitle =

work page 2020

[2] [2]

2016 , journal =

Survey of variation in human transcription factors reveals prevalent DNA binding changes , author =. 2016 , journal =

work page 2016

[3] [3]

2016 , journal =

Openai gym , author =. 2016 , journal =

work page 2016

[4] [4]

Proceedings of the 36th International Conference on Machine Learning , publisher =

Conditioning by adaptive sampling for robust design , author =. Proceedings of the 36th International Conference on Machine Learning , publisher =. 2019 , month =

work page 2019

[5] [5]

2024 , booktitle =

Offline Model-Based Optimization via Policy-Guided Gradient Search , author =. 2024 , booktitle =. doi:10.1609/aaai.v38i10.29001 , bibsource =

work page doi:10.1609/aaai.v38i10.29001 2024

[6] [6]

2022 , booktitle =

Bidirectional Learning for Offline Infinite-width Model-based Optimization , author =. 2022 , booktitle =

work page 2022

[7] [7]

2023 , booktitle =

Parallel-mentoring for Offline Model-based Optimization , author =. 2023 , booktitle =

work page 2023

[8] [8]

2024 , journal =

Robust Guided Diffusion for Offline Black-Box Optimization , author =. 2024 , journal =

work page 2024

[9] [9]

2018 , journal =

Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding , author =. 2018 , journal =. doi:10.1073/pnas.1715888115 , eprint =

work page doi:10.1073/pnas.1715888115 2018

[10] [10]

2025 , booktitle =

ROOT: Rethinking Offline Optimization as Distributional Translation via Probabilistic Bridge , author =. 2025 , booktitle =

work page 2025

[11] [11]

2020 , booktitle =

Autofocused oracles for model-based design , author =. 2020 , booktitle =

work page 2020

[12] [12]

Proceedings of The 33rd International Conference on Machine Learning , publisher =

Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , author =. Proceedings of The 33rd International Conference on Machine Learning , publisher =. 2016 , month =

work page 2016

[13] [13]

2017 , booktitle =

Density functional estimators with k-nearest neighbor bandwidths , author =. 2017 , booktitle =. doi:10.1109/isit.2017.8006749 , keywords =

work page doi:10.1109/isit.2017.8006749 2017

[14] [14]

2018 , booktitle =

Conditional neural processes , author =. 2018 , booktitle =

work page 2018

[15] [15]

2018 , journal =

A data-driven statistical model for predicting the critical temperature of a superconductor , author =. 2018 , journal =

work page 2018

[16] [16]

Hansen, Nikolaus , year =. The

work page

[17] [17]

Denoising Diffusion Probabilistic Models , volume =

Ho, Jonathan and Jain, Ajay and Abbeel, Pieter , booktitle =. Denoising Diffusion Probabilistic Models , volume =

work page

[18] [18]

Proceedings of the 41st International Conference on Machine Learning , publisher =

Learning Surrogates for Offline Black-Box Optimization via Gradient Matching , author =. Proceedings of the 41st International Conference on Machine Learning , publisher =. 2024 , month =

work page 2024

[19] [19]

2026 , journal =

Offline Model-Based Optimization: Comprehensive Review , author =. 2026 , journal =

work page 2026

[20] [20]

, author =

Adam: A Method for Stochastic Optimization. , author =. 2015 , booktitle =

work page 2015

[21] [21]

Proceedings of the 40th International Conference on Machine Learning , publisher =

Diffusion Models for Black-Box Optimization , author =. Proceedings of the 40th International Conference on Machine Learning , publisher =. 2023 , month =

work page 2023

[22] [22]

2023 , booktitle =

Generative Pretraining for Black-Box Optimization , author =. 2023 , booktitle =

work page 2023

[23] [23]

2020 , booktitle =

Model Inversion Networks for Model-Based Optimization , author =. 2020 , booktitle =

work page 2020

[24] [24]

2017 , booktitle =

Simple and scalable predictive uncertainty estimation using deep ensembles , author =. 2017 , booktitle =

work page 2017

[25] [25]

2006 , publisher =

Gaussian processes for machine learning , author =. 2006 , publisher =

work page 2006

[26] [26]

2017 , booktitle =

Doubly stochastic variational inference for deep gaussian processes , author =. 2017 , booktitle =

work page 2017

[27] [27]

Sample, Paul J and Wang, Ban and Reid, David W and Presnyak, Vlad and McFadyen, Iain J and Morris, David R and Seelig, Georg , year =. Human 5

work page

[28] [28]

2021 , booktitle =

Denoising Diffusion Implicit Models , author =. 2021 , booktitle =

work page 2021

[29] [29]

2024 , journal =

OmniPred: Language Models as Universal Regressors , author =. 2024 , journal =

work page 2024

[30] [30]

2025 , booktitle =

Offline Model-Based Optimization by Learning to Rank , author =. 2025 , booktitle =

work page 2025

[31] [31]

Proceedings of the 38th International Conference on Machine Learning , publisher =

Conservative Objective Models for Effective Offline Model-Based Optimization , author =. Proceedings of the 38th International Conference on Machine Learning , publisher =. 2021 , month =

work page 2021

[32] [32]

Proceedings of the 39th International Conference on Machine Learning , publisher =

Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization , author =. Proceedings of the 39th International Conference on Machine Learning , publisher =. 2022 , month =

work page 2022

[33] [33]

1992 , journal =

Simple statistical gradient-following algorithms for connectionist reinforcement learning , author =. 1992 , journal =

work page 1992

[34] [34]

2017 , journal =

The reparameterization trick for acquisition functions , author =. 2017 , journal =

work page 2017

[35] [35]

2025 , eprint =

BBOPlace-Bench: Benchmarking Black-Box Optimization for Chip Placement , author =. 2025 , eprint =

work page 2025

[36] [36]

2024 , booktitle =

Generative Adversarial Model-Based Optimization via Source Critic Regularization , author =. 2024 , booktitle =

work page 2024

[37] [37]

2025 , booktitle =

Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework , author =. 2025 , booktitle =

work page 2025

[38] [38]

2021 , booktitle =

RoMA: Robust Model Adaptation for Offline Model-based Optimization , author =. 2021 , booktitle =

work page 2021

[39] [39]

2023 , booktitle =

Importance-aware Co-teaching for Offline Model-based Optimization , author =. 2023 , booktitle =

work page 2023

[40] [40]

2025 , journal =

Design Editing for Offline Model-based Optimization , author =. 2025 , journal =

work page 2025

[41] [41]

2025 , booktitle =

ParetoFlow: Guided Flows in Multi-Objective Optimization , author =. 2025 , booktitle =

work page 2025

[42] [42]

Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization , volume =

Yun, Taeyoung and Yun, Sujin and Lee, Jaewoo and Park, Jinkyoo , booktitle =. Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization , volume =. doi:10.52202/079017-2665 , editor =

work page doi:10.52202/079017-2665

[43] [43]

doi:10.5281/zenodo.7577330 , year=

Kim, Jungtaek , title=. doi:10.5281/zenodo.7577330 , year=

work page doi:10.5281/zenodo.7577330

[44] [44]

Akiba, Takuya and Sano, Shotaro and Yanase, Toshihiko and Ohta, Takeru and Koyama, Masanori , booktitle=

work page

[45] [45]

Proceedings of the 41st International Conference on Machine Learning , pages =

Feedback Efficient Online Fine-Tuning of Diffusion Models , author =. Proceedings of the 41st International Conference on Machine Learning , pages =. 2024 , editor =

work page 2024

[46] [46]

2026 , eprint=

Training Diffusion Language Models for Black-Box Optimization , author=. 2026 , eprint=

work page 2026

[47] [47]

2026 , eprint=

Diffusion Large Language Models for Black-Box Optimization , author=. 2026 , eprint=

work page 2026

[48] [48]

The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

Large Language Diffusion Models , author=. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

work page