Automatic Unsupervised Ensemble Outlier Model Selection--Extended Version

Bin Yang; Christian S. Jensen; Hong-Phuc Phan; Son Ha Xuan; Tuan-Anh Vu; Tung Kieu

arxiv: 2605.16567 · v1 · pith:LJYXT3E6new · submitted 2026-05-15 · 💻 cs.LG · cs.AI· cs.DB

Automatic Unsupervised Ensemble Outlier Model Selection--Extended Version

Hong-Phuc Phan , Tuan-Anh Vu , Tung Kieu , Son Ha Xuan , Bin Yang , Christian S. Jensen This is my paper

Pith reviewed 2026-05-20 19:32 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.DB

keywords unsupervised outlier detectionensemble model selectionmeta-learningmarginal gainssubmodular selectiondiversity regularizationoutlier ensembles

0 comments

The pith

MetaEns automatically selects compact high-quality outlier detection ensembles without labels by learning to predict marginal gains from meta-datasets.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes MetaEns as a way to form ensembles of outlier detectors when no ground-truth labels exist for the target data. It trains a predictor on labeled meta-datasets to estimate the expected improvement from adding each candidate model to a growing ensemble. This signal is combined with a proxy objective that encourages diversity and penalizes risk at the model-family level, allowing greedy selection to stop early when further additions yield little benefit. A sympathetic reader would care because outlier detection is typically unsupervised and single models can be unreliable, yet naively combining many models leads to redundancy and wasted computation. If the approach works, practitioners could obtain more accurate detection with smaller, more efficient model sets across varied real-world data.

Core claim

MetaEns learns a model on labeled meta-datasets to predict marginal ensemble gains and then, at test time on unlabeled data, uses this signal together with a submodular-inspired proxy objective that applies diversity-aware discounting and family-level risk regularization to drive greedy sequential selection with adaptive early stopping, thereby constructing compact high-quality ensembles.

What carries the argument

A meta-learned predictor of marginal ensemble gains combined with a submodular proxy objective enforcing diminishing returns through diversity discounting and family risk regularization.

Load-bearing premise

A model trained to predict marginal ensemble gains on labeled meta-datasets will produce accurate and useful signals when applied to new, unlabeled target datasets.

What would settle it

Testing the selected ensembles on additional unlabeled real-world datasets and finding that they fail to achieve higher average precision than state-of-the-art unsupervised selectors while also using more models would falsify the performance claims.

Figures

Figures reproduced from arXiv: 2605.16567 by Bin Yang, Christian S. Jensen, Hong-Phuc Phan, Son Ha Xuan, Tuan-Anh Vu, Tung Kieu.

**Figure 1.** Figure 1: Overview of MetaEns. (A) Offline Meta-Training: Oracle-greedy rollouts on labeled meta-datasets M generate state–gain pairs for partial ensembles. These pairs supervise a two-part gain predictor consisting of classifier fcls, which estimates whether a candidate will improve the ensemble, and regressor freg, which estimates the positive gain magnitude. Family-risk priors πF are computed from lower-tail orac… view at source ↗

**Figure 2.** Figure 2: Across all selectors, MetaEns consistently improves over the starting model, demonstrating that its partner selection mechanism is selector-agnostic and not tied to a particular initialization strategy. Improvements are especially pronounced in challenging cases where the primary model underperforms. Rather than propagating initial errors, MetaEns effectively recovers performance by selecting complementar… view at source ↗

**Figure 2.** Figure 2: Robustness analysis across four different primary selectors: ELECT, LOF, IForest, and Random Selection. Each panel compares the primary model’s performance (x-axis) against the final MetaEns ensemble (y-axis). Points above the diagonal indicate improvement. The shaded red “Rescue Zone” highlights where the primary model fails (AP < 0.4). MetaEns consistently rescues performance in these failure modes acros… view at source ↗

**Figure 3.** Figure 3: Model diversity visualization using t-SNE projection across four datasets. ELECT-10 selections tend to cluster within a single family, whereas MetaEns selects models spanning multiple families, indicating greater ensemble diversity. ods learn expressive representations of normality. These include autoencoders (AEs) (Goodge et al., 2020), variational autoencoders (VAEs) (Xu et al., 2018), and generative ad… view at source ↗

read the original abstract

Unsupervised outlier detection is attractive because it eliminates the need for labeled data. Moreover, forming multi-model ensembles can improve detection robustness. However, composing an ensemble without labeled data is challenging. Naively composed ensembles can suffer from ensemble saturation, where redundant or unreliable detection models degrade performance and incur unnecessary computation. We propose MetaEns, an automatic unsupervised framework for selecting ensembles of outlier detection models. Using labeled meta-datasets, MetaEns learns a model that predicts marginal ensemble gains, estimating the expected improvement from adding a candidate model to a partially constructed ensemble. At test time, this learned signal is combined with a submodular-inspired proxy objective that enforces diminishing returns through diversity-aware discounting and family-level risk regularization, thereby enabling greedy sequential selection with adaptive early stopping. As a result, MetaEns constructs compact, high-quality ensembles without access to ground-truth labels. Experiments on 39 real-world datasets show that MetaEns consistently outperforms state-of-the-art unsupervised selectors and ensemble baselines, achieving higher average precision while using fewer models.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

MetaEns meta-learns marginal gains for outlier ensemble selection and pairs them with a diversity-aware proxy, delivering compact high-AP ensembles on 39 datasets, but the zero-shot transfer step remains the weakest link.

read the letter

The main takeaway is that this paper gives a concrete recipe for building small, effective ensembles of unsupervised outlier detectors without any labels on the target data. It trains a predictor on labeled meta-datasets to estimate how much adding one more model will lift ensemble average precision, then feeds that signal into a greedy selector guided by a submodular-style objective that discounts for diversity and adds family-level risk penalties. That combination is the actual novelty; prior work on unsupervised selectors or simple ensembles does not use this exact meta-learned marginal-gain plus structured proxy setup. The experiments report consistent wins over baselines on 39 real-world datasets while keeping the selected ensembles smaller, which is useful for practitioners who want robustness without extra compute. The approach is clearly motivated by the saturation problem and the proxy adds some structure that makes the selection less ad-hoc. The soft spot is the transfer assumption. The predictor is fitted where true marginal gains are observable, yet it must rank models usefully on entirely new, unlabeled distributions whose statistics may differ. The end-to-end results look good, but without isolated tests showing that the chosen features stay predictive across dataset shifts, it is hard to tell how much of the reported gain is robust versus tied to the particular meta-collection. This work is for people already working on outlier detection ensembles who need a practical selection method. Readers who care about meta-learning for model choice or submodular proxies will find the framing relevant. The empirical support and the clear algorithmic contribution are enough to justify sending it to referees rather than rejecting it outright. I would recommend peer review so the transfer question and any post-hoc choices in the meta-training can be examined in detail.

Referee Report

1 major / 1 minor

Summary. The paper proposes MetaEns, an automatic unsupervised framework for selecting compact ensembles of outlier detection models. It trains a meta-predictor on labeled meta-datasets to estimate marginal ensemble gains (improvement in average precision when adding a candidate model to a partial ensemble), then at test time combines this signal with a submodular-inspired proxy objective incorporating diversity-aware discounting and family-level risk regularization to enable greedy sequential selection with adaptive early stopping on unlabeled target datasets. Experiments on 39 real-world datasets report that MetaEns outperforms state-of-the-art unsupervised selectors and ensemble baselines in average precision while using fewer models.

Significance. If the meta-predictor transfers reliably, the approach could meaningfully advance unsupervised outlier detection by automating the construction of robust, computationally efficient ensembles without requiring labels on the target data. The submodular proxy provides a structured way to enforce diminishing returns and diversity, which is a strength relative to purely heuristic selection methods.

major comments (1)

[Abstract and methodology description of meta-predictor training and test-time application] The central claim depends on zero-shot transfer of the meta-predictor (trained on labeled meta-datasets) to new unlabeled target datasets, yet no section isolates or validates the predictor's accuracy or ranking quality on held-out distributions whose statistics may differ from the meta-training data. The 39-dataset end-to-end results therefore do not rule out that reported gains arise from favorable meta-dataset selection or post-hoc choices rather than robust generalization of the marginal-gain estimates.

minor comments (1)

[Methodology] Clarify the exact input features to the meta-predictor (model-family statistics, data characteristics, etc.) and whether any normalization or invariance properties are assumed or enforced.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback and for recognizing the potential of MetaEns to advance unsupervised outlier detection. We address the major comment below.

read point-by-point responses

Referee: The central claim depends on zero-shot transfer of the meta-predictor (trained on labeled meta-datasets) to new unlabeled target datasets, yet no section isolates or validates the predictor's accuracy or ranking quality on held-out distributions whose statistics may differ from the meta-training data. The 39-dataset end-to-end results therefore do not rule out that reported gains arise from favorable meta-dataset selection or post-hoc choices rather than robust generalization of the marginal-gain estimates.

Authors: We agree that isolating the meta-predictor's performance on held-out distributions would provide stronger direct evidence for zero-shot transfer. The current manuscript emphasizes end-to-end results on 39 diverse real-world datasets to demonstrate practical utility, but these do not separately quantify the predictor's ranking quality or accuracy under distribution shift. In the revised version we will add a new subsection that holds out a subset of meta-datasets, evaluates the meta-predictor on those held-out sets (reporting correlation between predicted and observed marginal gains as well as top-k selection accuracy), and discusses how the meta-training distribution was constructed to promote generalization. revision: yes

Circularity Check

0 steps flagged

No significant circularity; meta-learning framework is empirically grounded

full rationale

The paper describes a meta-learning method that trains a predictor on separate labeled meta-datasets to estimate marginal ensemble gains, then applies the predictor plus a submodular proxy objective to select ensembles on new unlabeled target datasets. This structure does not reduce any claimed result to its inputs by construction: the predictor is explicitly fitted on distinct meta-data rather than self-defined, the test-time selection uses an independent proxy, and performance claims rest on end-to-end experiments across 39 real-world datasets rather than tautological renaming or self-citation chains. No equations or steps in the provided description exhibit fitted inputs relabeled as independent predictions or uniqueness imported from prior self-work. The transfer assumption is a methodological risk but does not constitute circularity under the specified patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Only the abstract is available, so the ledger is necessarily incomplete; the method rests on the representativeness of the meta-datasets and on the assumption that the learned predictor transfers to unseen data.

axioms (1)

domain assumption Labeled meta-datasets are sufficiently representative of the distribution of real-world outlier detection tasks encountered at test time.
The entire meta-learning step depends on this transfer assumption.

pith-pipeline@v0.9.0 · 5721 in / 1237 out tokens · 44903 ms · 2026-05-20T19:32:26.620116+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

180 extracted references · 180 canonical work pages · 1 internal anchor

[1]

Arthur Zimek and Ricardo J. G. B. Campello and J. Ensembles for unsupervised outlier detection: challenges and research questions a position paper , journal =

work page
[2]

Deep One-Class Classification , booktitle =

Lukas Ruff and Nico G. Deep One-Class Classification , booktitle =

work page
[3]

Deep Autoencoding Gaussian Mixture Model for Unsupervised Anomaly Detection , booktitle =

Bo Zong and Qi Song and Martin Renqiang Min and Wei Cheng and Cristian Lumezanu and Dae. Deep Autoencoding Gaussian Mixture Model for Unsupervised Anomaly Detection , booktitle =

work page
[4]

Breunig and Hans

Markus M. Breunig and Hans. Proceedings of the

work page
[5]

Isolation Forest , booktitle =

Fei Tony Liu and Kai Ming Ting and Zhi. Isolation Forest , booktitle =

work page
[6]

Zheng Li and Yue Zhao and Nicola Botta and Cezar Ionescu and Xiyang Hu , year = 2020, booktitle =

work page 2020
[7]

Yue Zhao and Zain Nasrullah and Zheng Li , year = 2019, journal =. PyOD:

work page 2019
[8]

CoRR , volume =

A Large-scale Study on Unsupervised Outlier Model Selection: Do Internal Strategies Suffice? , author =. CoRR , volume =

work page
[9]

Proceedings of the

Shebuti Rayana and Wen Zhong and Leman Akoglu , title =. Proceedings of the

work page
[10]

Marques and Ricardo J

Henrique O. Marques and Ricardo J. G. B. Campello and J. Internal Evaluation of Unsupervised Outlier Detection , journal =

work page
[11]

Varun Chandola and Arindam Banerjee and Vipin Kumar , title =

work page
[12]

ADBench: Anomaly Detection Benchmark , author =

work page
[13]

CoRR , volume =

Automating Outlier Detection via Meta-Learning , author =. CoRR , volume =

work page
[14]

Nemhauser and Laurence A

George L. Nemhauser and Laurence A. Wolsey and Marshall L. Fisher , title =. Math. Program. , volume =

work page
[15]

Alex Kulesza and Ben Taskar , title =. Found. Trends Mach. Learn. , volume =

work page
[16]

Rossi and Leman Akoglu , title =

Yue Zhao and Ryan A. Rossi and Leman Akoglu , title =. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

work page
[17]

Proceedings of the

Toward Unsupervised Outlier Model Selection , author =. Proceedings of the

work page
[18]

Proceedings of the

Li Cheng and Yijie Wang and Xinwang Liu and Bin Li , title =. Proceedings of the

work page
[19]

Hospedales and Antreas Antoniou and Paul Micaelli and Amos Storkey , title =

Timothy M. Hospedales and Antreas Antoniou and Paul Micaelli and Amos Storkey , title =

work page
[20]

Theoretical Foundations and Algorithms for Outlier Ensembles , author =

work page
[21]

CoRR , volume =

How to Evaluate the Quality of Unsupervised Anomaly Detection Algorithms? , author =. CoRR , volume =

work page
[22]

Kauffmann and Robert A

Lukas Ruff and Jacob R. Kauffmann and Robert A. Vandermeulen and Gr. A Unifying Review of Deep and Shallow Anomaly Detection , journal =

work page
[23]

Proceedings of the

AutoOD: Neural Architecture Search for Outlier Detection , author =. Proceedings of the

work page
[24]

Proceedings of the Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining (PAKDD) , pages =

An Unsupervised Boosting Strategy for Outlier Detection Ensembles , author =. Proceedings of the Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining (PAKDD) , pages =

work page
[25]

Hryniewicki and Zheng Li , year = 2019, booktitle =

Yue Zhao and Zain Nasrullah and Maciej K. Hryniewicki and Zheng Li , year = 2019, booktitle =

work page 2019
[26]

Less is More: Building Selective Anomaly Ensembles , author =

work page
[27]

Proceedings of the Conference on Annual Meeting of the Association for Computational Linguistics (ACL) , pages =

A Class of Submodular Functions for Document Summarization , author =. Proceedings of the Conference on Annual Meeting of the Association for Computational Linguistics (ACL) , pages =

work page
[28]

Near-Optimal Sensor Placements in Gaussian Processes: Theory, Efficient Algorithms and Empirical Studies , author =. J. Mach. Learn. Res. , volume = 9, pages =

work page
[29]

Proceedings of the International Conference on Machine Learning (ICML) , pages =

Near-optimal Batch Mode Active Learning and Adaptive Submodular Optimization , author =. Proceedings of the International Conference on Machine Learning (ICML) , pages =

work page
[30]

Data Min

On the Evaluation of Unsupervised Outlier Detection: Measures, Datasets, and an Empirical Study , author =. Data Min. Knowl. Discov. , volume = 30, pages =

work page
[31]

LightGBM:

Guolin Ke and Qi Meng and Thomas Finley and Taifeng Wang and Wei Chen and Weidong Ma and Qiwei Ye and Tie. LightGBM:. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

work page
[32]

Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy , author =

work page
[33]

Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

PyTorch: An Imperative Style, High-Performance Deep Learning Library , author =. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

work page
[34]

Sunil Aryal and Arbind Agrahari Baniya and Imran Razzak and KC Santosh , year = 2021, booktitle =

work page 2021
[35]

Chen , year = 2023, journal =

Zheng Li and Yue Zhao and Xiyang Hu and Nicola Botta and Cezar Ionescu and George H. Chen , year = 2023, journal =

work page 2023
[36]

Proceedings of the International Conference on Pattern Recognition (ICPR) , pages =

Outlier Detection Using k-Nearest Neighbour Graph , author =. Proceedings of the International Conference on Pattern Recognition (ICPR) , pages =

work page
[37]

Proceedings of the International Joint Conference on Neural Networks (IJCNN) , pages =

An Outlier Detection Algorithm based on KNN-kernel Density Estimation , author =. Proceedings of the International Joint Conference on Neural Networks (IJCNN) , pages =

work page
[38]

Proceedings of the

K-Nearest Neighbor Search and Outlier Detection via Minimax Distances , author =. Proceedings of the

work page
[39]

Anomaly Detection with Score Functions Based on the Reconstruction Error of the Kernel

Laetitia Chapel and Chlo. Anomaly Detection with Score Functions Based on the Reconstruction Error of the Kernel. Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD) , pages =

work page
[40]

Proceedings of the International Conference on World Wide Web (WWW) , pages =

Unsupervised Anomaly Detection via Variational Auto-Encoder for Seasonal KPIs in Web Applications , author =. Proceedings of the International Conference on World Wide Web (WWW) , pages =

work page
[41]

Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) , pages =

Robustness of Autoencoders for Anomaly Detection Under Adversarial Impact , author =. Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) , pages =

work page
[42]

Proceedings of the

Swee Kiat Lim and Yi Loo and Ngoc. Proceedings of the

work page
[43]

Proceedings of the

Transformer for Point Anomaly Detection , author =. Proceedings of the

work page
[44]

Proceedings of the

Anomaly Detection with Robust Deep Autoencoders , author =. Proceedings of the

work page
[45]

Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

Support Vector Method for Novelty Detection , author =. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

work page
[46]

Support Vector Data Description , author =. Mach. Learn. , volume = 54, number = 1, pages =

work page
[47]

Proceedings of the

Feature Bagging for Outlier Detection , author =. Proceedings of the

work page
[48]

Hryniewicki , year = 2018, booktitle =

Yue Zhao and Maciej K. Hryniewicki , year = 2018, booktitle =

work page 2018
[49]

Proceedings of the

Outlier Detection with Autoencoder Ensembles , author =. Proceedings of the

work page
[50]

Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) , pages =

Outlier Detection for Time Series with Recurrent Autoencoder Ensembles , author =. Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) , pages =

work page
[51]

Proceedings of the

Anomaly Detection Using an Ensemble of Feature Models , author =. Proceedings of the

work page
[52]

Proceedings of the

Xu Han and Xiaohui Chen and Li. Proceedings of the

work page
[53]

Goadrich , year = 2006, booktitle =

Jesse Davis and Mark H. Goadrich , year = 2006, booktitle =. The relationship between Precision-Recall and

work page 2006
[54]

Scikit-learn: Machine Learning in Python , author =. J. Mach. Learn. Res. , volume = 12, pages =

work page
[55]

Kybernetika , volume = 17, number = 6, pages =

Estimating the Dimension of a Linear Model , author =. Kybernetika , volume = 17, number = 6, pages =

work page
[56]

Estimating the Dimension of a Model , author =. Ann. Stat. , volume = 6, number = 2, pages =

work page
[57]

The Determination of the Order of an Autoregression , author =. J. R. Stat. Soc., B: Stat. Methodol. , volume = 41, number = 2, pages =

work page
[58]

Cross-Validatory Choice and Assessment of Statistical Predictions , author =. J. R. Stat. Soc., B: Stat. Methodol. , volume = 36, number = 1, pages =

work page
[59]

Estimating the Error Rate of a Prediction Rule: Improvement on Cross-validation , author =. J. Am. Stat. Assoc. , volume = 78, number = 382, pages =

work page
[60]

Statistical Learning Theory , author =

work page
[61]

Rademacher and Gaussian Complexities: Risk Bounds and Structural Results , author =. J. Mach. Learn. Res. , volume = 3, pages =

work page
[62]

Random Search for Hyper-Parameter Optimization , author =. J. Mach. Learn. Res. , volume = 13, pages =

work page
[63]

Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

Practical Bayesian Optimization of Machine Learning Algorithms , author =. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

work page
[64]

Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

Efficient and Robust Automated Machine Learning , author =. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

work page
[65]

Covariate Shift Adaptation by Importance Weighted Cross Validation , author =. J. Mach. Learn. Res. , volume = 8, pages =

work page
[66]

In Search of Lost Domain Generalization , author =

work page
[67]

A Perspective View and Survey of Meta-Learning , author =. Artif. Intell. Rev. , volume = 18, number = 2, pages =

work page
[68]

Extremely Randomized Trees , author =. Mach. Learn. , volume = 63, number = 1, pages =

work page
[69]

Visualizing Data using t-SNE , author =. J. Mach. Learn. Res. , volume = 9, number = 86, pages =

work page
[70]

Random Forests , author =. Mach. Learn. , volume = 45, number = 1, pages =

work page
[71]

2016 , journal =

Loda: Lightweight on-line detector of anomalies , author =. 2016 , journal =

work page 2016
[72]

2012 , journal=

Histogram-based outlier score (hbos): A fast unsupervised anomaly detection algorithm , author=. 2012 , journal=

work page 2012
[73]

2008 , booktitle =

Angle-based outlier detection in high-dimensional data , author =. 2008 , booktitle =

work page 2008
[74]

2002 , booktitle =

Enhancing Effectiveness of Outlier Detections for Low Density Patterns , author =. 2002 , booktitle =

work page 2002
[75]

2020 , journal =

Generative Adversarial Active Learning for Unsupervised Outlier Detection , author =. 2020 , journal =

work page 2020
[76]

Proceedings of the

XGBoost: A Scalable Tree Boosting System , author =. Proceedings of the

work page
[77]

Deep Learning , author =

work page
[78]

An Introduction to Statistical Learning--with Applications in R , author =

work page
[79]

Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , year =

Why do tree-based models still outperform deep learning on typical tabular data? , author =. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , year =

work page
[80]

2022 , journal =

Tabular data: Deep learning is not all you need , author =. 2022 , journal =

work page 2022

Showing first 80 references.

[1] [1]

Arthur Zimek and Ricardo J. G. B. Campello and J. Ensembles for unsupervised outlier detection: challenges and research questions a position paper , journal =

work page

[2] [2]

Deep One-Class Classification , booktitle =

Lukas Ruff and Nico G. Deep One-Class Classification , booktitle =

work page

[3] [3]

Deep Autoencoding Gaussian Mixture Model for Unsupervised Anomaly Detection , booktitle =

Bo Zong and Qi Song and Martin Renqiang Min and Wei Cheng and Cristian Lumezanu and Dae. Deep Autoencoding Gaussian Mixture Model for Unsupervised Anomaly Detection , booktitle =

work page

[4] [4]

Breunig and Hans

Markus M. Breunig and Hans. Proceedings of the

work page

[5] [5]

Isolation Forest , booktitle =

Fei Tony Liu and Kai Ming Ting and Zhi. Isolation Forest , booktitle =

work page

[6] [6]

Zheng Li and Yue Zhao and Nicola Botta and Cezar Ionescu and Xiyang Hu , year = 2020, booktitle =

work page 2020

[7] [7]

Yue Zhao and Zain Nasrullah and Zheng Li , year = 2019, journal =. PyOD:

work page 2019

[8] [8]

CoRR , volume =

A Large-scale Study on Unsupervised Outlier Model Selection: Do Internal Strategies Suffice? , author =. CoRR , volume =

work page

[9] [9]

Proceedings of the

Shebuti Rayana and Wen Zhong and Leman Akoglu , title =. Proceedings of the

work page

[10] [10]

Marques and Ricardo J

Henrique O. Marques and Ricardo J. G. B. Campello and J. Internal Evaluation of Unsupervised Outlier Detection , journal =

work page

[11] [11]

Varun Chandola and Arindam Banerjee and Vipin Kumar , title =

work page

[12] [12]

ADBench: Anomaly Detection Benchmark , author =

work page

[13] [13]

CoRR , volume =

Automating Outlier Detection via Meta-Learning , author =. CoRR , volume =

work page

[14] [14]

Nemhauser and Laurence A

George L. Nemhauser and Laurence A. Wolsey and Marshall L. Fisher , title =. Math. Program. , volume =

work page

[15] [15]

Alex Kulesza and Ben Taskar , title =. Found. Trends Mach. Learn. , volume =

work page

[16] [16]

Rossi and Leman Akoglu , title =

Yue Zhao and Ryan A. Rossi and Leman Akoglu , title =. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

work page

[17] [17]

Proceedings of the

Toward Unsupervised Outlier Model Selection , author =. Proceedings of the

work page

[18] [18]

Proceedings of the

Li Cheng and Yijie Wang and Xinwang Liu and Bin Li , title =. Proceedings of the

work page

[19] [19]

Hospedales and Antreas Antoniou and Paul Micaelli and Amos Storkey , title =

Timothy M. Hospedales and Antreas Antoniou and Paul Micaelli and Amos Storkey , title =

work page

[20] [20]

Theoretical Foundations and Algorithms for Outlier Ensembles , author =

work page

[21] [21]

CoRR , volume =

How to Evaluate the Quality of Unsupervised Anomaly Detection Algorithms? , author =. CoRR , volume =

work page

[22] [22]

Kauffmann and Robert A

Lukas Ruff and Jacob R. Kauffmann and Robert A. Vandermeulen and Gr. A Unifying Review of Deep and Shallow Anomaly Detection , journal =

work page

[23] [23]

Proceedings of the

AutoOD: Neural Architecture Search for Outlier Detection , author =. Proceedings of the

work page

[24] [24]

Proceedings of the Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining (PAKDD) , pages =

An Unsupervised Boosting Strategy for Outlier Detection Ensembles , author =. Proceedings of the Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining (PAKDD) , pages =

work page

[25] [25]

Hryniewicki and Zheng Li , year = 2019, booktitle =

Yue Zhao and Zain Nasrullah and Maciej K. Hryniewicki and Zheng Li , year = 2019, booktitle =

work page 2019

[26] [26]

Less is More: Building Selective Anomaly Ensembles , author =

work page

[27] [27]

Proceedings of the Conference on Annual Meeting of the Association for Computational Linguistics (ACL) , pages =

A Class of Submodular Functions for Document Summarization , author =. Proceedings of the Conference on Annual Meeting of the Association for Computational Linguistics (ACL) , pages =

work page

[28] [28]

Near-Optimal Sensor Placements in Gaussian Processes: Theory, Efficient Algorithms and Empirical Studies , author =. J. Mach. Learn. Res. , volume = 9, pages =

work page

[29] [29]

Proceedings of the International Conference on Machine Learning (ICML) , pages =

Near-optimal Batch Mode Active Learning and Adaptive Submodular Optimization , author =. Proceedings of the International Conference on Machine Learning (ICML) , pages =

work page

[30] [30]

Data Min

On the Evaluation of Unsupervised Outlier Detection: Measures, Datasets, and an Empirical Study , author =. Data Min. Knowl. Discov. , volume = 30, pages =

work page

[31] [31]

LightGBM:

Guolin Ke and Qi Meng and Thomas Finley and Taifeng Wang and Wei Chen and Weidong Ma and Qiwei Ye and Tie. LightGBM:. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

work page

[32] [32]

Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy , author =

work page

[33] [33]

Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

PyTorch: An Imperative Style, High-Performance Deep Learning Library , author =. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

work page

[34] [34]

Sunil Aryal and Arbind Agrahari Baniya and Imran Razzak and KC Santosh , year = 2021, booktitle =

work page 2021

[35] [35]

Chen , year = 2023, journal =

Zheng Li and Yue Zhao and Xiyang Hu and Nicola Botta and Cezar Ionescu and George H. Chen , year = 2023, journal =

work page 2023

[36] [36]

Proceedings of the International Conference on Pattern Recognition (ICPR) , pages =

Outlier Detection Using k-Nearest Neighbour Graph , author =. Proceedings of the International Conference on Pattern Recognition (ICPR) , pages =

work page

[37] [37]

Proceedings of the International Joint Conference on Neural Networks (IJCNN) , pages =

An Outlier Detection Algorithm based on KNN-kernel Density Estimation , author =. Proceedings of the International Joint Conference on Neural Networks (IJCNN) , pages =

work page

[38] [38]

Proceedings of the

K-Nearest Neighbor Search and Outlier Detection via Minimax Distances , author =. Proceedings of the

work page

[39] [39]

Anomaly Detection with Score Functions Based on the Reconstruction Error of the Kernel

Laetitia Chapel and Chlo. Anomaly Detection with Score Functions Based on the Reconstruction Error of the Kernel. Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD) , pages =

work page

[40] [40]

Proceedings of the International Conference on World Wide Web (WWW) , pages =

Unsupervised Anomaly Detection via Variational Auto-Encoder for Seasonal KPIs in Web Applications , author =. Proceedings of the International Conference on World Wide Web (WWW) , pages =

work page

[41] [41]

Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) , pages =

Robustness of Autoencoders for Anomaly Detection Under Adversarial Impact , author =. Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) , pages =

work page

[42] [42]

Proceedings of the

Swee Kiat Lim and Yi Loo and Ngoc. Proceedings of the

work page

[43] [43]

Proceedings of the

Transformer for Point Anomaly Detection , author =. Proceedings of the

work page

[44] [44]

Proceedings of the

Anomaly Detection with Robust Deep Autoencoders , author =. Proceedings of the

work page

[45] [45]

Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

Support Vector Method for Novelty Detection , author =. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

work page

[46] [46]

Support Vector Data Description , author =. Mach. Learn. , volume = 54, number = 1, pages =

work page

[47] [47]

Proceedings of the

Feature Bagging for Outlier Detection , author =. Proceedings of the

work page

[48] [48]

Hryniewicki , year = 2018, booktitle =

Yue Zhao and Maciej K. Hryniewicki , year = 2018, booktitle =

work page 2018

[49] [49]

Proceedings of the

Outlier Detection with Autoencoder Ensembles , author =. Proceedings of the

work page

[50] [50]

Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) , pages =

Outlier Detection for Time Series with Recurrent Autoencoder Ensembles , author =. Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) , pages =

work page

[51] [51]

Proceedings of the

Anomaly Detection Using an Ensemble of Feature Models , author =. Proceedings of the

work page

[52] [52]

Proceedings of the

Xu Han and Xiaohui Chen and Li. Proceedings of the

work page

[53] [53]

Goadrich , year = 2006, booktitle =

Jesse Davis and Mark H. Goadrich , year = 2006, booktitle =. The relationship between Precision-Recall and

work page 2006

[54] [54]

Scikit-learn: Machine Learning in Python , author =. J. Mach. Learn. Res. , volume = 12, pages =

work page

[55] [55]

Kybernetika , volume = 17, number = 6, pages =

Estimating the Dimension of a Linear Model , author =. Kybernetika , volume = 17, number = 6, pages =

work page

[56] [56]

Estimating the Dimension of a Model , author =. Ann. Stat. , volume = 6, number = 2, pages =

work page

[57] [57]

The Determination of the Order of an Autoregression , author =. J. R. Stat. Soc., B: Stat. Methodol. , volume = 41, number = 2, pages =

work page

[58] [58]

Cross-Validatory Choice and Assessment of Statistical Predictions , author =. J. R. Stat. Soc., B: Stat. Methodol. , volume = 36, number = 1, pages =

work page

[59] [59]

Estimating the Error Rate of a Prediction Rule: Improvement on Cross-validation , author =. J. Am. Stat. Assoc. , volume = 78, number = 382, pages =

work page

[60] [60]

Statistical Learning Theory , author =

work page

[61] [61]

Rademacher and Gaussian Complexities: Risk Bounds and Structural Results , author =. J. Mach. Learn. Res. , volume = 3, pages =

work page

[62] [62]

Random Search for Hyper-Parameter Optimization , author =. J. Mach. Learn. Res. , volume = 13, pages =

work page

[63] [63]

Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

Practical Bayesian Optimization of Machine Learning Algorithms , author =. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

work page

[64] [64]

Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

Efficient and Robust Automated Machine Learning , author =. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , pages =

work page

[65] [65]

Covariate Shift Adaptation by Importance Weighted Cross Validation , author =. J. Mach. Learn. Res. , volume = 8, pages =

work page

[66] [66]

In Search of Lost Domain Generalization , author =

work page

[67] [67]

A Perspective View and Survey of Meta-Learning , author =. Artif. Intell. Rev. , volume = 18, number = 2, pages =

work page

[68] [68]

Extremely Randomized Trees , author =. Mach. Learn. , volume = 63, number = 1, pages =

work page

[69] [69]

Visualizing Data using t-SNE , author =. J. Mach. Learn. Res. , volume = 9, number = 86, pages =

work page

[70] [70]

Random Forests , author =. Mach. Learn. , volume = 45, number = 1, pages =

work page

[71] [71]

2016 , journal =

Loda: Lightweight on-line detector of anomalies , author =. 2016 , journal =

work page 2016

[72] [72]

2012 , journal=

Histogram-based outlier score (hbos): A fast unsupervised anomaly detection algorithm , author=. 2012 , journal=

work page 2012

[73] [73]

2008 , booktitle =

Angle-based outlier detection in high-dimensional data , author =. 2008 , booktitle =

work page 2008

[74] [74]

2002 , booktitle =

Enhancing Effectiveness of Outlier Detections for Low Density Patterns , author =. 2002 , booktitle =

work page 2002

[75] [75]

2020 , journal =

Generative Adversarial Active Learning for Unsupervised Outlier Detection , author =. 2020 , journal =

work page 2020

[76] [76]

Proceedings of the

XGBoost: A Scalable Tree Boosting System , author =. Proceedings of the

work page

[77] [77]

Deep Learning , author =

work page

[78] [78]

An Introduction to Statistical Learning--with Applications in R , author =

work page

[79] [79]

Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , year =

Why do tree-based models still outperform deep learning on typical tabular data? , author =. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) , year =

work page

[80] [80]

2022 , journal =

Tabular data: Deep learning is not all you need , author =. 2022 , journal =

work page 2022