Recognition: unknown
Large sample analysis of the median heuristic
read the original abstract
In kernel methods, the median heuristic has been widely used as a way of setting the bandwidth of RBF kernels. While its empirical performances make it a safe choice under many circumstances, there is little theoretical understanding of why this is the case. Our aim in this paper is to advance our understanding of the median heuristic by focusing on the setting of kernel two-sample test. We collect new findings that may be of interest for both theoreticians and practitioners. In theory, we provide a convergence analysis that shows the asymptotic normality of the bandwidth chosen by the median heuristic in the setting of kernel two-sample test. Systematic empirical investigations are also conducted in simple settings, comparing the performances based on the bandwidths chosen by the median heuristic and those by the maximization of test power.
This paper has not been read by Pith yet.
Forward citations
Cited by 9 Pith papers
-
BAMIFun: Bayesian Multiple Imputation for Functional Data
BAMIFun provides Bayesian multiple imputation for functional data via low-rank penalized spline models, achieving accurate imputation and improved coverage in simulations and real datasets compared to single-imputatio...
-
Detecting Changes in Causal Dependence with Kernels and Copulas
A kernel-copula embedding statistic equals zero exactly when causal dependence between X and Y is stable and is strictly positive otherwise, with a near-linear estimator and convergence rates provided.
-
Convex-Geometric Error Bounds for Positive-Weight Kernel Quadrature
Positive simplex weights for kernel quadrature achieve O(d/N) convex-hull approximation error in feature space, transferring to RKHS worst-case bounds that beat Monte Carlo under exponential spectral decay.
-
The Generalised Kernel Covariance Measure
GKCM generalizes kernel CI testing to arbitrary regression models, provides uniform asymptotic level guarantees under stated conditions, and outperforms state-of-the-art methods in simulations when using tree-based re...
-
LLM-XTM: Enhancing Cross-Lingual Topic Models with Large Language Models
LLM-XTM integrates LLM-guided topic refinement with self-consistency uncertainty quantification to improve coherence and alignment in cross-lingual topic models while reducing dependence on bilingual resources and rep...
-
Concentration and Calibration in Predictive Bayesian Inference
Predictive Bayesian inference posteriors concentrate onto a forward-model-dependent quantity and produce miscalibrated credible sets unless the predictive model contains the true data-generating process.
-
A unified perspective on fine-tuning and sampling with diffusion and flow models
A unified framework for exponential tilting in diffusion and flow models that includes bias-variance decompositions showing finite gradient variance for some methods, norm bounds on adjoint ODEs, and adapted losses wi...
-
Non-asymptotic two-sample kernel testing with the spectrally truncated normalized MMD
Derives exponential upper bounds under the null for the spectrally truncated normalized MMD and supplies a practical data-adaptive quantile estimator with hyperparameter tuning that does not require splitting.
-
Physics-informed neural particle flow for the Bayesian update step
A neural network approximates the velocity field of log-homotopy particle flow by enforcing a derived master PDE from the continuity equation, enabling unsupervised amortized Bayesian updates with reduced stiffness.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.