pith. sign in

arxiv: 1508.04409 · v2 · pith:JC5EKJW4new · submitted 2015-08-18 · 📊 stat.ML · stat.CO

ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R

classification 📊 stat.ML stat.CO
keywords implementationdataforestsrandomrangerdimensionalfastfeatures
0
0 comments X p. Extension
pith:JC5EKJW4 Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{JC5EKJW4}

Prints a linked pith:JC5EKJW4 badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

We introduce the C++ application and R package ranger. The software is a fast implementation of random forests for high dimensional data. Ensembles of classification, regression and survival trees are supported. We describe the implementation, provide examples, validate the package with a reference implementation, and compare runtime and memory usage with other implementations. The new software proves to scale best with the number of features, samples, trees, and features tried for splitting. Finally, we show that ranger is the fastest and most memory efficient implementation of random forests to analyze data on the scale of a genome-wide association study.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.