pith. sign in

arxiv: 1309.5047 · v1 · pith:CQUZDOI2new · submitted 2013-09-19 · 💻 cs.LG · q-bio.GN· stat.ML

A Comparative Analysis of Ensemble Classifiers: Case Studies in Genomics

classification 💻 cs.LG q-bio.GNstat.ML
keywords ensemblemethodsanalysisclassifiersgenomicsmeta-learningcasecomparative
0
0 comments X
read the original abstract

The combination of multiple classifiers using ensemble methods is increasingly important for making progress in a variety of difficult prediction problems. We present a comparative analysis of several ensemble methods through two case studies in genomics, namely the prediction of genetic interactions and protein functions, to demonstrate their efficacy on real-world datasets and draw useful conclusions about their behavior. These methods include simple aggregation, meta-learning, cluster-based meta-learning, and ensemble selection using heterogeneous classifiers trained on resampled data to improve the diversity of their predictions. We present a detailed analysis of these methods across 4 genomics datasets and find the best of these methods offer statistically significant improvements over the state of the art in their respective domains. In addition, we establish a novel connection between ensemble selection and meta-learning, demonstrating how both of these disparate methods establish a balance between ensemble diversity and performance.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.