MacrOData supplies three large, curated benchmark suites totaling 2,446 datasets for tabular outlier detection, complete with standardized splits, metadata, and a public leaderboard.
An Open Source AutoML Benchmark
3 Pith papers cite this work. Polarity classification is still indexing.
abstract
In recent years, an active field of research has developed around automated machine learning (AutoML). Unfortunately, comparing different AutoML systems is hard and often done incorrectly. We introduce an open, ongoing, and extensible benchmark framework which follows best practices and avoids common mistakes. The framework is open-source, uses public datasets and has a website with up-to-date results. We use the framework to conduct a thorough comparison of 4 AutoML systems across 39 datasets and analyze the results.
representative citing papers
Experimental comparison of 15 HPO and NAS algorithms for automated feature preprocessing on 45 tabular datasets finds evolution-based methods and random search as top performers.
AutoGluon-Tabular achieves superior accuracy on tabular classification and regression by multi-layer model ensembling and stacking, outperforming other AutoML frameworks on 50 benchmarks and Kaggle competitions.
citing papers explorer
-
MacrOData: New Benchmarks of Thousands of Datasets for Tabular Outlier Detection
MacrOData supplies three large, curated benchmark suites totaling 2,446 datasets for tabular outlier detection, complete with standardized splits, metadata, and a public leaderboard.
-
Auto-FP: An Experimental Study of Automated Feature Preprocessing for Tabular Data
Experimental comparison of 15 HPO and NAS algorithms for automated feature preprocessing on 45 tabular datasets finds evolution-based methods and random search as top performers.
-
AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data
AutoGluon-Tabular achieves superior accuracy on tabular classification and regression by multi-layer model ensembling and stacking, outperforming other AutoML frameworks on 50 benchmarks and Kaggle competitions.