Statistical Analysis of Nearest Neighbor Methods for Anomaly Detection

Alessandro Rinaldo; Leman Akoglu; Xiaoyi Gu

arxiv: 1907.03813 · v1 · pith:PFMYJOC4new · submitted 2019-07-08 · 📊 stat.ML · cs.LG· math.ST· stat.TH

Statistical Analysis of Nearest Neighbor Methods for Anomaly Detection

Xiaoyi Gu , Leman Akoglu , Alessandro Rinaldo This is my paper

Pith reviewed 2026-05-25 00:40 UTC · model grok-4.3

classification 📊 stat.ML cs.LGmath.STstat.TH

keywords anomaly detectionnearest neighbordistance to measurefinite-sample guaranteesHuber's contamination modelmisclassification ratesrobust geometric inference

0 comments

The pith

Nearest-neighbor anomaly detection receives finite-sample uniform guarantees via empirical distance-to-measure bounds.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper studies nearest-neighbor methods for anomaly detection through both simulations and theory. Simulations indicate that NN procedures perform competitively with other algorithms on synthetic benchmarks, while real-data results vary with dimensionality. The main theoretical step supplies finite-sample uniform guarantees on the empirical distance-to-measure and converts those guarantees into explicit misclassification rates for anomalous points. A reader would care because such rates give concrete, non-asymptotic error control for a widely used detection technique under a standard contamination model.

Core claim

The paper claims that finite-sample uniform guarantees for the empirical DTM yield misclassification rates for anomalous observations under various settings, obtained by relying on Huber's contamination model together with mild geometric regularity assumptions on the data distribution.

What carries the argument

The distance-to-measure (DTM), a quantity from robust geometric inference whose empirical version supplies uniform convergence that is converted into anomaly misclassification bounds.

If this is right

NN-based anomaly detectors can be accompanied by explicit finite-sample error rates rather than only asymptotic statements.
The same DTM analysis supplies uniform deviation bounds that apply across multiple anomaly-detection settings.
Real-data performance of NN methods can be interpreted through the lens of dimensionality under the same regularity conditions.
Misclassification control holds for both clean and contaminated observations when the geometric assumptions are satisfied.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same uniform-convergence argument could be adapted to other NN-derived statistics used in unsupervised tasks if comparable regularity is assumed.
Empirical checks on synthetic data with controlled contamination fractions would directly test the tightness of the derived misclassification rates.
Because DTM originated in topological inference, the bounds may link anomaly detection error to persistence-diagram stability under contamination.

Load-bearing premise

The guarantees rest on the data obeying Huber's contamination model plus mild geometric regularity assumptions on the underlying distribution.

What would settle it

Generate data from a distribution that meets the geometric regularity conditions and Huber's model, then check whether the observed fraction of misclassified anomalies stays inside the derived finite-sample bound; exceeding the bound would refute the claim.

Figures

Figures reproduced from arXiv: 1907.03813 by Alessandro Rinaldo, Leman Akoglu, Xiaoyi Gu.

**Figure 2.** Figure 2: Performance of DTM when the separation distance between the normal instances and [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗

**Figure 3.** Figure 3: Examples of difficult datasets. Next, consider the right hand side of (13). We have that inf y∈S1 1 m Z m 0 rP,t(y) q dt = inf y∈S1 1 m Z ε 0 rP,t(y) q dt + 1 m Z m ε rP,t(y) q dt ≥ 0 + 1 m Z m ε η q dt = m − ε m η q Hence, (13) holds if b b + q m a(x)(1 − ε) q/b < m − ε m η q − h (14) ⇔ a(x) > m 1 − ε b + q b m − ε m η q − h −b/q (15) ⇔ x ∈ Aη (16) C Simulation Results on 23 Real Datasets from OD… view at source ↗

**Figure 4.** Figure 4: Performance on the difficult datasets. Case: ring [PITH_FULL_IMAGE:figures/full_fig_p015_4.png] view at source ↗

**Figure 5.** Figure 5: Performance on the difficult datasets. Case: local anomalies [PITH_FULL_IMAGE:figures/full_fig_p016_5.png] view at source ↗

**Figure 6.** Figure 6: Performance on the difficult datasets. Case: clustered anomalies [PITH_FULL_IMAGE:figures/full_fig_p017_6.png] view at source ↗

read the original abstract

Nearest-neighbor (NN) procedures are well studied and widely used in both supervised and unsupervised learning problems. In this paper we are concerned with investigating the performance of NN-based methods for anomaly detection. We first show through extensive simulations that NN methods compare favorably to some of the other state-of-the-art algorithms for anomaly detection based on a set of benchmark synthetic datasets. We further consider the performance of NN methods on real datasets, and relate it to the dimensionality of the problem. Next, we analyze the theoretical properties of NN-methods for anomaly detection by studying a more general quantity called distance-to-measure (DTM), originally developed in the literature on robust geometric and topological inference. We provide finite-sample uniform guarantees for the empirical DTM and use them to derive misclassification rates for anomalous observations under various settings. In our analysis we rely on Huber's contamination model and formulate mild geometric regularity assumptions on the underlying distribution of the data.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper turns existing DTM convergence results into misclassification bounds for NN anomaly detection under contamination and backs the methods with standard benchmark simulations.

read the letter

The paper turns existing finite-sample uniform convergence results for the empirical distance-to-measure into explicit misclassification rates for anomalous points under Huber's contamination model plus mild geometric assumptions. That step is the actual new piece; prior DTM work existed, but applying it directly to anomaly error rates is not described as already done. The simulations are straightforward and show nearest-neighbor methods comparing favorably to other anomaly detectors on synthetic benchmarks, with some checks on real data that tie performance to dimension. This gives a practical sense that the methods hold up where people actually use them. The derivations rely on standard concentration tools applied to the DTM definition, so the math follows the usual path without visible circularity or unsupported jumps. The assumptions stay reasonable and are stated up front. One soft spot is the lack of visible detail on simulation protocols, run counts, dataset sizes, or how contamination was generated; without those numbers the empirical claims are harder to weigh precisely. The theory side looks cleaner because it builds on external results rather than fitting quantities inside the paper. This work is for researchers in statistical machine learning who want theoretical support for nearest-neighbor anomaly detection or who already follow the DTM literature. A reader focused on robust unsupervised methods would get usable bounds and a clear empirical comparison. It deserves a serious referee because the core program is grounded and fills a narrow but real gap between geometric inference and anomaly detection practice, even if the experiments need more documentation.

Referee Report

2 major / 2 minor

Summary. The paper claims that nearest-neighbor (NN) methods for anomaly detection compare favorably to other state-of-the-art algorithms in extensive simulations on benchmark synthetic datasets, that their performance on real datasets relates to problem dimensionality, and that finite-sample uniform guarantees can be derived for the empirical distance-to-measure (DTM) under Huber's contamination model plus mild geometric regularity assumptions, which in turn yield misclassification rate bounds for anomalous points.

Significance. If the finite-sample uniform convergence results for the empirical DTM hold and convert cleanly into misclassification bounds, the work supplies a useful statistical justification for NN-based anomaly detection in contaminated settings. The reliance on existing concentration tools from the DTM literature and the explicit focus on misclassification rates are strengths; the empirical component on synthetic and real data provides a practical counterpart, though its quantitative support remains to be verified from the full manuscript.

major comments (2)

[Abstract] Abstract: the assertion that 'simulations favor NN methods' and that 'derivations yield misclassification rates' is presented without accompanying quantitative support (error bars, dataset sizes, number of Monte Carlo repetitions, or explicit bound statements), which is load-bearing for the central empirical and theoretical claims.
[Theoretical analysis] Theoretical section (implied by the finite-sample guarantees claim): the conversion from uniform DTM convergence to misclassification rates under Huber's model requires explicit control on the contamination fraction and the geometric regularity constants; without the precise dependence shown, it is unclear whether the resulting rates are informative for high-dimensional regimes mentioned in the real-data experiments.

minor comments (2)

Clarify the precise statement of the geometric regularity assumptions (e.g., reach, density bounds) already in the introduction rather than deferring entirely to the theory section.
Add a short table or paragraph summarizing the simulation settings (dimensions, sample sizes, contamination levels) to make the 'extensive simulations' claim reproducible from the text alone.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the thoughtful review and the recommendation of minor revision. We address each major comment below and will revise the manuscript accordingly to strengthen the presentation of quantitative details and theoretical dependencies.

read point-by-point responses

Referee: [Abstract] Abstract: the assertion that 'simulations favor NN methods' and that 'derivations yield misclassification rates' is presented without accompanying quantitative support (error bars, dataset sizes, number of Monte Carlo repetitions, or explicit bound statements), which is load-bearing for the central empirical and theoretical claims.

Authors: We agree that the abstract would be strengthened by including more specific quantitative information. The body of the manuscript reports 100 Monte Carlo repetitions, dataset sizes between 500 and 5000 points, and error bars in the simulation figures; the misclassification bounds are stated explicitly in Theorems 4.3 and 4.5. In the revised version we will incorporate a concise reference to these elements in the abstract. revision: yes
Referee: [Theoretical analysis] Theoretical section (implied by the finite-sample guarantees claim): the conversion from uniform DTM convergence to misclassification rates under Huber's model requires explicit control on the contamination fraction and the geometric regularity constants; without the precise dependence shown, it is unclear whether the resulting rates are informative for high-dimensional regimes mentioned in the real-data experiments.

Authors: The conversion is carried out explicitly in Section 4: the misclassification bound in Theorem 4.5 is stated as a function of the contamination fraction ε, the reach parameter, and the lower density bound appearing in the geometric regularity assumptions. The dimension dependence enters through the covering numbers used in the uniform convergence result (Proposition 3.2). The real-data section already relates empirical performance to dimensionality; we will add a short remark after Theorem 4.5 that makes this dependence and its implications for high-dimensional regimes more explicit. revision: partial

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper's central derivation provides finite-sample uniform convergence bounds for the empirical distance-to-measure (DTM) to its population version under Huber's contamination model plus mild geometric regularity conditions on the data distribution, then converts those bounds into misclassification rates for anomalies. These steps rely on the standard DTM definition together with external concentration tools (e.g., empirical process theory) rather than any internally fitted quantities, self-definitional loops, or load-bearing self-citations. The analysis is therefore self-contained against external benchmarks and contains no reductions by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claims rest on two standard modeling choices from robust statistics rather than new free parameters or invented entities.

axioms (2)

domain assumption Huber's contamination model
Models the observed data as a mixture of the clean distribution and a small anomalous component; invoked to define anomalous observations.
domain assumption mild geometric regularity assumptions on the underlying distribution of the data
Required to obtain uniform convergence of the empirical DTM; stated in the abstract as necessary for the theoretical results.

pith-pipeline@v0.9.0 · 5691 in / 1219 out tokens · 38383 ms · 2026-05-25T00:40:04.237663+00:00 · methodology

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

COMPASS: A Unified Decision-Intelligence System for Navigating Performance Trade-off in HPC
cs.PF 2026-04 conditional novelty 6.0

COMPASS formalizes HPC configuration questions as ML tasks on traces, quantifies recommendation trustworthiness, and delivers 65.93% lower average job turnaround time plus 80.93% lower node usage versus prior methods ...

Reference graph

Works this paper leans on

27 extracted references · 27 canonical work pages · cited by 1 Pith paper · 2 internal anchors

[1]

Alfred O. Hero. Geometric entropy minimization (gem) for anomaly detection and localization. In B. Schölkopf, J. C. Platt, and T. Hoffman, editors,Advances in Neural Information Processing Systems 19, pages 585–592. MIT Press, 2007

work page 2007
[2]

Kumar Sricharan and Alfred O. Hero. Efﬁcient anomaly detection using bipartite k-nn graphs. In J. Shawe-Taylor, R. S. Zemel, P. L. Bartlett, F. Pereira, and K. Q. Weinberger, editors,Advances in Neural Information Processing Systems 24, pages 478–486. Curran Associates, Inc., 2011

work page 2011
[3]

Prediction and outlier detection in classification problems

Leying Guan and Rob Tibshirani. Prediction and outlier detection: a distribution-free prediction set with a balanced objective. arXiv e-prints, page arXiv:1905.04396, May 2019

work page internal anchor Pith review Pith/arXiv arXiv 1905
[4]

JooSeuk Kim and Clayton D. Scott. Robust kernel density estimation. J. Mach. Learn. Res., 13(1):2529–2565, September 2012

work page 2012
[5]

Breunig, Hans-Peter Kriegel, Raymond T

Markus M. Breunig, Hans-Peter Kriegel, Raymond T. Ng, and Jörg Sander. Lof: Identifying density-based local outliers. SIGMOD Rec., 29(2):93–104, May 2000

work page 2000
[6]

Fast outlier detection in high dimensional spaces

Fabrizio Angiulli and Clara Pizzuti. Fast outlier detection in high dimensional spaces. In Principles of Data Mining and Knowledge Discovery, pages 15–27, Berlin, Heidelberg, 2002. Springer Berlin Heidelberg

work page 2002
[7]

Angle-based outlier detection algorithm with more stable relationships

Xiaojie Li, Jian Cheng Lv, and Dongdong Cheng. Angle-based outlier detection algorithm with more stable relationships. In Proceedings of the 18th Asia Paciﬁc Symposium on Intelligent and Evolutionary Systems, Volume 1 , pages 433–446, Cham, 2015. Springer International Publishing

work page 2015
[8]

Platt, John C

Bernhard Schölkopf, John C. Platt, John C. Shawe-Taylor, Alex J. Smola, and Robert C. Williamson. Estimating the support of a high-dimensional distribution. Neural Comput., 13(7):1443–1471, July 2001

work page 2001
[9]

Tax and Robert P.W

David M.J. Tax and Robert P.W. Duin. Support vector data description. Machine Learning, 54(1):45–66, Jan 2004

work page 2004
[10]

Aggarwal, and Deepak S

Jinghui Chen, Saket Sathe, Charu C. Aggarwal, and Deepak S. Turaga. Outlier detection with autoencoder ensembles. In SDM, 2017

work page 2017
[11]

Isolation forest

Fei Tony Liu, Kai Ming Ting, and Zhi-Hua Zhou. Isolation forest. In Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, ICDM ’08, pages 413–422, Washington, DC, USA, 2008. IEEE Computer Society

work page 2008
[12]

Loda: Lightweight on-line detector of anomalies.Machine Learning, 102(2):275– 304, Feb 2016

Tomáš Pevný. Loda: Lightweight on-line detector of anomalies.Machine Learning, 102(2):275– 304, Feb 2016

work page 2016
[13]

Geometric inference for probabil- ity measures

Frédéric Chazal, David Cohen-Steiner, and Quentin Mérigot. Geometric inference for probabil- ity measures. Foundations of Computational Mathematics, 11(6):733–751, Dec 2011

work page 2011
[14]

Robust topological inference: Distance to a measure and kernel distance

Frédéric Chazal, Brittany Fasy, Fabrizio Lecci, Bertr, Michel, Aless, ro Rinaldo, and Larry Wasserman. Robust topological inference: Distance to a measure and kernel distance. Journal of Machine Learning Research, 18(159):1–40, 2018

work page 2018
[15]

Efﬁcient algorithms for mining outliers from large data sets

Sridhar Ramaswamy, Rajeev Rastogi, and Kyuseok Shim. Efﬁcient algorithms for mining outliers from large data sets. In SIGMOD Conference, 2000

work page 2000
[16]

A comparative evaluation of unsupervised anomaly detection algorithms for multivariate data

Markus Goldstein and Seiichi Uchida. A comparative evaluation of unsupervised anomaly detection algorithms for multivariate data. PLOS ONE, 11(4):1–31, 04 2016. 9

work page 2016
[17]

Quantitative comparison of unsupervised anomaly detection algorithms for intrusion detection

Filipe Falcão, Tommaso Zoppi, Caio Barbosa Viera Silva, Anderson Santos, Baldoino Fon- seca, Andrea Ceccarelli, and Andrea Bondavalli. Quantitative comparison of unsupervised anomaly detection algorithms for intrusion detection. In Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing, SAC ’19, pages 318–327, New York, NY , USA, 2019. ACM

work page 2019
[18]

Campos, Arthur Zimek, Jörg Sander, Ricardo J

Guilherme O. Campos, Arthur Zimek, Jörg Sander, Ricardo J. G. B. Campello, Barbora Micenková, Erich Schubert, Ira Assent, and Michael E. Houle. On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study. Data Mining and Knowledge Discovery, 30(4):891–927, Jul 2016

work page 2016
[19]

A Meta-Analysis of the Anomaly Detection Problem

Andrew Emmott, Shubhomoy Das, Thomas Dietterich, Alan Fern, and Weng-Keen Wong. A Meta-Analysis of the Anomaly Detection Problem. arXiv e-prints, page arXiv:1503.01158, Mar 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015
[20]

ODDS library

Shebuti Rayana. ODDS library. http://odds.cs.stonybrook.edu, 2016

work page 2016
[21]

Frank and A

A. Frank and A. Asuncion. Uci machine learning repository. http://archive.ics.uci. edu/ml, 2010

work page 2010
[22]

Peter J. Huber. Robust Estimation of a Location Parameter, pages 492–518. Springer New York, New York, NY , 1992

work page 1992
[23]

Peter J. Huber. A robust version of the probability ratio test. Ann. Math. Statist., 36(6):1753– 1758, 12 1965

work page 1965
[24]

Convergence rates for persistence diagram estimation in topological data analysis

Frédéric Chazal, Marc Glisse, Catherine Labruère, and Bertrand Michel. Convergence rates for persistence diagram estimation in topological data analysis. Journal of Machine Learning Research, 16:3603–3635, 2015

work page 2015
[25]

A plug-in approach to support estimation

Antonio Cuevas and Ricardo Fraiman. A plug-in approach to support estimation. Ann. Statist., 25(6):2300–2312, 12 1997

work page 1997
[26]

Introduction to Statistical Learning Theory, pages 169–207

Olivier Bousquet, Stéphane Boucheron, and Gábor Lugosi. Introduction to Statistical Learning Theory, pages 169–207. Springer Berlin Heidelberg, Berlin, Heidelberg, 2004

work page 2004
[27]

Consistent procedures for cluster tree estimation and pruning

Kamalika Chaudhuri, Sanjoy Dasgupta, Samory Kpotufe, and Ulrike von Luxburg. Consistent procedures for cluster tree estimation and pruning. IEEE Transactions on Information Theory, 60:7900–7912, 2014. Appendices A Deﬁnition for DTMF2 Deﬁnition A.1. The DTM2 and DTMF2 scores are deﬁned as: DTM2(x) =   1 k ∑ Xi∈Nk(x) ∥Xi−x∥2   1/2 , DTMF2(x) = 1 |Nk(x)|...

work page 2014

[1] [1]

Alfred O. Hero. Geometric entropy minimization (gem) for anomaly detection and localization. In B. Schölkopf, J. C. Platt, and T. Hoffman, editors,Advances in Neural Information Processing Systems 19, pages 585–592. MIT Press, 2007

work page 2007

[2] [2]

Kumar Sricharan and Alfred O. Hero. Efﬁcient anomaly detection using bipartite k-nn graphs. In J. Shawe-Taylor, R. S. Zemel, P. L. Bartlett, F. Pereira, and K. Q. Weinberger, editors,Advances in Neural Information Processing Systems 24, pages 478–486. Curran Associates, Inc., 2011

work page 2011

[3] [3]

Prediction and outlier detection in classification problems

Leying Guan and Rob Tibshirani. Prediction and outlier detection: a distribution-free prediction set with a balanced objective. arXiv e-prints, page arXiv:1905.04396, May 2019

work page internal anchor Pith review Pith/arXiv arXiv 1905

[4] [4]

JooSeuk Kim and Clayton D. Scott. Robust kernel density estimation. J. Mach. Learn. Res., 13(1):2529–2565, September 2012

work page 2012

[5] [5]

Breunig, Hans-Peter Kriegel, Raymond T

Markus M. Breunig, Hans-Peter Kriegel, Raymond T. Ng, and Jörg Sander. Lof: Identifying density-based local outliers. SIGMOD Rec., 29(2):93–104, May 2000

work page 2000

[6] [6]

Fast outlier detection in high dimensional spaces

Fabrizio Angiulli and Clara Pizzuti. Fast outlier detection in high dimensional spaces. In Principles of Data Mining and Knowledge Discovery, pages 15–27, Berlin, Heidelberg, 2002. Springer Berlin Heidelberg

work page 2002

[7] [7]

Angle-based outlier detection algorithm with more stable relationships

Xiaojie Li, Jian Cheng Lv, and Dongdong Cheng. Angle-based outlier detection algorithm with more stable relationships. In Proceedings of the 18th Asia Paciﬁc Symposium on Intelligent and Evolutionary Systems, Volume 1 , pages 433–446, Cham, 2015. Springer International Publishing

work page 2015

[8] [8]

Platt, John C

Bernhard Schölkopf, John C. Platt, John C. Shawe-Taylor, Alex J. Smola, and Robert C. Williamson. Estimating the support of a high-dimensional distribution. Neural Comput., 13(7):1443–1471, July 2001

work page 2001

[9] [9]

Tax and Robert P.W

David M.J. Tax and Robert P.W. Duin. Support vector data description. Machine Learning, 54(1):45–66, Jan 2004

work page 2004

[10] [10]

Aggarwal, and Deepak S

Jinghui Chen, Saket Sathe, Charu C. Aggarwal, and Deepak S. Turaga. Outlier detection with autoencoder ensembles. In SDM, 2017

work page 2017

[11] [11]

Isolation forest

Fei Tony Liu, Kai Ming Ting, and Zhi-Hua Zhou. Isolation forest. In Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, ICDM ’08, pages 413–422, Washington, DC, USA, 2008. IEEE Computer Society

work page 2008

[12] [12]

Loda: Lightweight on-line detector of anomalies.Machine Learning, 102(2):275– 304, Feb 2016

Tomáš Pevný. Loda: Lightweight on-line detector of anomalies.Machine Learning, 102(2):275– 304, Feb 2016

work page 2016

[13] [13]

Geometric inference for probabil- ity measures

Frédéric Chazal, David Cohen-Steiner, and Quentin Mérigot. Geometric inference for probabil- ity measures. Foundations of Computational Mathematics, 11(6):733–751, Dec 2011

work page 2011

[14] [14]

Robust topological inference: Distance to a measure and kernel distance

Frédéric Chazal, Brittany Fasy, Fabrizio Lecci, Bertr, Michel, Aless, ro Rinaldo, and Larry Wasserman. Robust topological inference: Distance to a measure and kernel distance. Journal of Machine Learning Research, 18(159):1–40, 2018

work page 2018

[15] [15]

Efﬁcient algorithms for mining outliers from large data sets

Sridhar Ramaswamy, Rajeev Rastogi, and Kyuseok Shim. Efﬁcient algorithms for mining outliers from large data sets. In SIGMOD Conference, 2000

work page 2000

[16] [16]

A comparative evaluation of unsupervised anomaly detection algorithms for multivariate data

Markus Goldstein and Seiichi Uchida. A comparative evaluation of unsupervised anomaly detection algorithms for multivariate data. PLOS ONE, 11(4):1–31, 04 2016. 9

work page 2016

[17] [17]

Quantitative comparison of unsupervised anomaly detection algorithms for intrusion detection

Filipe Falcão, Tommaso Zoppi, Caio Barbosa Viera Silva, Anderson Santos, Baldoino Fon- seca, Andrea Ceccarelli, and Andrea Bondavalli. Quantitative comparison of unsupervised anomaly detection algorithms for intrusion detection. In Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing, SAC ’19, pages 318–327, New York, NY , USA, 2019. ACM

work page 2019

[18] [18]

Campos, Arthur Zimek, Jörg Sander, Ricardo J

Guilherme O. Campos, Arthur Zimek, Jörg Sander, Ricardo J. G. B. Campello, Barbora Micenková, Erich Schubert, Ira Assent, and Michael E. Houle. On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study. Data Mining and Knowledge Discovery, 30(4):891–927, Jul 2016

work page 2016

[19] [19]

A Meta-Analysis of the Anomaly Detection Problem

Andrew Emmott, Shubhomoy Das, Thomas Dietterich, Alan Fern, and Weng-Keen Wong. A Meta-Analysis of the Anomaly Detection Problem. arXiv e-prints, page arXiv:1503.01158, Mar 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015

[20] [20]

ODDS library

Shebuti Rayana. ODDS library. http://odds.cs.stonybrook.edu, 2016

work page 2016

[21] [21]

Frank and A

A. Frank and A. Asuncion. Uci machine learning repository. http://archive.ics.uci. edu/ml, 2010

work page 2010

[22] [22]

Peter J. Huber. Robust Estimation of a Location Parameter, pages 492–518. Springer New York, New York, NY , 1992

work page 1992

[23] [23]

Peter J. Huber. A robust version of the probability ratio test. Ann. Math. Statist., 36(6):1753– 1758, 12 1965

work page 1965

[24] [24]

Convergence rates for persistence diagram estimation in topological data analysis

Frédéric Chazal, Marc Glisse, Catherine Labruère, and Bertrand Michel. Convergence rates for persistence diagram estimation in topological data analysis. Journal of Machine Learning Research, 16:3603–3635, 2015

work page 2015

[25] [25]

A plug-in approach to support estimation

Antonio Cuevas and Ricardo Fraiman. A plug-in approach to support estimation. Ann. Statist., 25(6):2300–2312, 12 1997

work page 1997

[26] [26]

Introduction to Statistical Learning Theory, pages 169–207

Olivier Bousquet, Stéphane Boucheron, and Gábor Lugosi. Introduction to Statistical Learning Theory, pages 169–207. Springer Berlin Heidelberg, Berlin, Heidelberg, 2004

work page 2004

[27] [27]

Consistent procedures for cluster tree estimation and pruning

Kamalika Chaudhuri, Sanjoy Dasgupta, Samory Kpotufe, and Ulrike von Luxburg. Consistent procedures for cluster tree estimation and pruning. IEEE Transactions on Information Theory, 60:7900–7912, 2014. Appendices A Deﬁnition for DTMF2 Deﬁnition A.1. The DTM2 and DTMF2 scores are deﬁned as: DTM2(x) =   1 k ∑ Xi∈Nk(x) ∥Xi−x∥2   1/2 , DTMF2(x) = 1 |Nk(x)|...

work page 2014