Making Sense of Random Forest Probabilities: a Kernel Perspective

Abraham J. Wyner; Matthew A. Olson

arXiv: 1812.05792 · v1 · pith:K2BEF2B4new · submitted 2018-12-14 · stat.ML · cs.LG



keywords: forest, random, kernel, estimation, probabilities, probability, accomplished, certain


A random forest is a popular tool for estimating probabilities in machine learning classification tasks. However, the means by which this is accomplished is unprincipled: one simply counts the fraction of trees in a forest that vote for a certain class. In this paper, we forge a connection between random forests and kernel regression. This places random forest probability estimation on more sound statistical footing. As part of our investigation, we develop a model for the proximity kernel and relate it to the geometry and sparsity of the estimation problem. We also provide intuition and recommendations for tuning a random forest to improve its probability estimates.
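The two views mentioned in the abstract can be compared on a toy problem. The sketch below is illustrative only and is not taken from the paper: it assumes a hypothetical "forest" of one-dimensional stumps with random split points, and all function names (`make_stump_forest`, `vote_fraction`, `proximity`, `kernel_estimate`) are invented for this example. It computes the vote-counting estimate, and then a kernel-regression-style estimate in which the weight of each training point is its proximity to the query, taken here to be the fraction of trees in which the two points share a leaf.

```python
import random

def make_stump_forest(xs, n_trees, seed=0):
    """Toy 'forest': one random split threshold per tree (assumed setup)."""
    rng = random.Random(seed)
    lo, hi = min(xs), max(xs)
    return [rng.uniform(lo, hi) for _ in range(n_trees)]

def leaf(threshold, x):
    """Leaf index of x in a stump: 0 = at or left of the split, 1 = right."""
    return 0 if x <= threshold else 1

def vote_fraction(forest, xs, ys, x):
    """Fraction of trees whose leaf containing x has majority label 1."""
    votes = 0
    for t in forest:
        labels = [y for xi, y in zip(xs, ys) if leaf(t, xi) == leaf(t, x)]
        # An empty leaf or a tie is counted as a vote for class 0.
        if labels and 2 * sum(labels) > len(labels):
            votes += 1
    return votes / len(forest)

def proximity(forest, a, b):
    """Fraction of trees in which a and b fall in the same leaf."""
    return sum(leaf(t, a) == leaf(t, b) for t in forest) / len(forest)

def kernel_estimate(forest, xs, ys, x):
    """Proximity-weighted average of the training labels at query x."""
    weights = [proximity(forest, x, xi) for xi in xs]
    total = sum(weights)
    if total == 0:
        return sum(ys) / len(ys)  # fall back to the base rate
    return sum(w * y for w, y in zip(weights, ys)) / total

# Toy training set: class 0 on the left, class 1 on the right.
xs = [0.0, 0.1, 0.2, 0.3, 0.7, 0.8, 0.9, 1.0]
ys = [0, 0, 0, 0, 1, 1, 1, 1]
forest = make_stump_forest(xs, n_trees=200, seed=0)

for query in (0.05, 0.5, 0.95):
    print(query,
          vote_fraction(forest, xs, ys, query),
          kernel_estimate(forest, xs, ys, query))
```

Both estimates lie in [0, 1] and increase as the query moves toward the class-1 region, but the proximity-weighted version averages over labels rather than over hard votes, so it tends to be less extreme near the boundaries of this toy data set.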
