pith. sign in

arxiv: 1704.05194 · v1 · pith:SZVFIWS5new · submitted 2017-04-18 · 📊 stat.ML · cs.LG

Learning Piece-wise Linear Models from Large Scale Data for Ad Click Prediction

classification 📊 stat.ML cs.LG
keywords modeldatalargelearningls-plmpredictionproblemscale
0
0 comments X
read the original abstract

CTR prediction in real-world business is a difficult machine learning problem with large scale nonlinear sparse data. In this paper, we introduce an industrial strength solution with model named Large Scale Piece-wise Linear Model (LS-PLM). We formulate the learning problem with $L_1$ and $L_{2,1}$ regularizers, leading to a non-convex and non-smooth optimization problem. Then, we propose a novel algorithm to solve it efficiently, based on directional derivatives and quasi-Newton method. In addition, we design a distributed system which can run on hundreds of machines parallel and provides us with the industrial scalability. LS-PLM model can capture nonlinear patterns from massive sparse data, saving us from heavy feature engineering jobs. Since 2012, LS-PLM has become the main CTR prediction model in Alibaba's online display advertising system, serving hundreds of millions users every day.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Deep Personalized Re-targeting

    cs.IR 2019-07 unverdicted novelty 3.0

    Hybrid deep-shallow neural embeddings integrated into gradient boosting trees yield a 7% prediction performance lift in offline tests for vacation rental re-targeting.