pith. sign in

arxiv: 1902.01075 · v1 · pith:2PT3SVGGnew · submitted 2019-02-04 · 🧮 math.ST · stat.ME· stat.TH

Numerical performance of Penalized Comparison to Overfitting for multivariate kernel density estimation

classification 🧮 math.ST stat.MEstat.TH
keywords bandwidthdensityestimationkernelmethodselectionalgorithmicbeen
0
0 comments X
read the original abstract

Kernel density estimation is a well known method involving a smoothing parameter (the bandwidth) that needs to be tuned by the user. Although this method has been widely used the bandwidth selection remains a challenging issue in terms of balancing algorithmic performance and statistical relevance. The purpose of this paper is to compare a recently developped bandwidth selection method for kernel density estimation to those which are commonly used by now (at least those which are implemented in the R-package). This new method is called Penalized Comparison to Overfitting (PCO). It has been proposed by some of the authors of this paper in a previous work devoted to its statistical relevance from a purely theoretical perspective. It is compared here to other usual bandwidth selection methods for univariate and also multivariate kernel density estimation on the basis of intensive simulation studies. In particular, cross-validation and plug-in criteria are numerically investigated and compared to PCO. The take home message is that PCO can outperform the classical methods without algorithmic additionnal cost.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Estimating the division rate from indirect measurements of single cells

    math.ST 2019-07 unverdicted novelty 6.0

    Derives a reconstruction formula to estimate division rate from size measurements in the incremental bacterial growth model and validates it numerically on simulated and experimental data.