pith. sign in

arxiv: 1706.08359 · v1 · pith:4EYLQFSRnew · submitted 2017-06-26 · 📊 stat.ML · cs.DC· cs.LG

GPU-acceleration for Large-scale Tree Boosting

classification 📊 stat.ML cs.DCcs.LG
keywords treehistogramalgorithmbuildingconstructiondecisionboostingfaster
0
0 comments X p. Extension
pith:4EYLQFSR Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{4EYLQFSR}

Prints a linked pith:4EYLQFSR badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

In this paper, we present a novel massively parallel algorithm for accelerating the decision tree building procedure on GPUs (Graphics Processing Units), which is a crucial step in Gradient Boosted Decision Tree (GBDT) and random forests training. Previous GPU based tree building algorithms are based on parallel multi-scan or radix sort to find the exact tree split, and thus suffer from scalability and performance issues. We show that using a histogram based algorithm to approximately find the best split is more efficient and scalable on GPU. By identifying the difference between classical GPU-based image histogram construction and the feature histogram construction in decision tree training, we develop a fast feature histogram building kernel on GPU with carefully designed computational and memory access sequence to reduce atomic update conflict and maximize GPU utilization. Our algorithm can be used as a drop-in replacement for histogram construction in popular tree boosting systems to improve their scalability. As an example, to train GBDT on epsilon dataset, our method using a main-stream GPU is 7-8 times faster than histogram based algorithm on CPU in LightGBM and 25 times faster than the exact-split finding algorithm in XGBoost on a dual-socket 28-core Xeon server, while achieving similar prediction accuracy.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Technique-agnostic exoplanet demography for the Roman era -- I. Testing a demography retrieval framework using simulated Kepler-like transit datasets

    astro-ph.EP 2025-09 unverdicted novelty 6.0

    A forward-modeling framework called TAED combines multi-technique exoplanet samples over Galactic scales and recovers demographic parameters accurately from simulated Kepler transit data using differential evolution o...