Sharp Inequalities between Total Variation and Hellinger Distances for Gaussian Mixtures

Chao Gao; Joonhyuk Jung

arXiv: 2602.03202 · v2 · pith:VOPF23VHnew · submitted 2026-02-03 · math.ST · stat.ML · stat.TH

classification math.ST · stat.ML · stat.TH

keywords mixtures · gaussian · hellinger · distance · total · variation · bound · distances

We study the relation between the total variation (TV) and Hellinger distances between two Gaussian location mixtures. Our first result establishes a general upper bound: for any two mixing distributions supported on a compact set, the Hellinger distance between the two mixtures is controlled by the TV distance raised to a power $1-o(1)$, where the $o(1)$ term is of order $1/\log\log(1/\mathrm{TV})$. We also construct two sequences of mixing distributions that demonstrate the sharpness of this bound. Taken together, our results resolve an open problem raised in Jia et al. (2023) and thus lead to an entropic characterization of learning Gaussian mixtures in total variation. Our inequality also yields optimal robust estimation of Gaussian mixtures in Hellinger distance, which has a direct implication for bounding the minimax regret of empirical Bayes under Huber contamination.
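To make the two distances in the abstract concrete, the following is a minimal numerical sketch, not the paper's method. It assumes one-dimensional mixtures with unit-variance Gaussian components and finitely supported mixing distributions, and it assumes the normalization $H^2(p,q) = \tfrac12\int(\sqrt{p}-\sqrt{q})^2$, under which the classical bounds $H^2 \le \mathrm{TV} \le \sqrt{2}\,H$ hold. The function names `mixture_density` and `tv_and_hellinger`, the grid parameters, and the example atoms and weights are all hypothetical choices made for illustration. The sketch only evaluates the two distances; it does not reproduce the sharp inequality established in the paper.

```python
import math


def mixture_density(x, atoms, weights):
    """Density at x of the location mixture sum_j weights[j] * N(atoms[j], 1).

    Unit variance is an assumption of this sketch.
    """
    c = 1.0 / math.sqrt(2.0 * math.pi)
    return sum(w * c * math.exp(-0.5 * (x - a) ** 2) for a, w in zip(atoms, weights))


def tv_and_hellinger(atoms_p, weights_p, atoms_q, weights_q, pad=10.0, n=20001):
    """Approximate TV and Hellinger distances with a Riemann sum on a uniform grid.

    Assumed conventions:
        TV  = (1/2) * integral |p - q|
        H^2 = (1/2) * integral (sqrt(p) - sqrt(q))^2
    The grid covers the atoms plus `pad` on each side, where the Gaussian
    tails are negligible.
    """
    lo = min(min(atoms_p), min(atoms_q)) - pad
    hi = max(max(atoms_p), max(atoms_q)) + pad
    h = (hi - lo) / (n - 1)
    tv_sum = 0.0
    hel_sum = 0.0
    for i in range(n):
        x = lo + i * h
        p = mixture_density(x, atoms_p, weights_p)
        q = mixture_density(x, atoms_q, weights_q)
        tv_sum += abs(p - q)
        hel_sum += (math.sqrt(p) - math.sqrt(q)) ** 2
    tv = 0.5 * tv_sum * h
    hellinger = math.sqrt(0.5 * hel_sum * h)
    return tv, hellinger


# Hypothetical example: two nearby two-component mixtures.
tv, hel = tv_and_hellinger([-1.0, 1.0], [0.5, 0.5], [-1.1, 0.9], [0.4, 0.6])
print(f"TV = {tv:.6f}, Hellinger = {hel:.6f}")
```

Comparing the printed values against $H^2 \le \mathrm{TV} \le \sqrt{2}\,H$ gives a quick sanity check of the classical two-sided relation, which the abstract's result refines in the direction of bounding the Hellinger distance by a power of TV close to one.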

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Weighted Chernoff information and optimal loss exponent in context-sensitive hypothesis testing
math.ST · 2026-03 · unverdicted · novelty 6.0

The optimal weighted total loss decays as exp(-n times weighted Chernoff information) when the context weight factors across observations.