LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions

Jiayi Liu; Jinghe Hu; Jun Hao; Mantian Li; Weipeng P. Yan; Yang He; Yu Wang; Yuxiang Liu

arxiv: 1708.05565 · v2 · pith:PMPQ6P7Bnew · submitted 2017-08-18 · 💻 cs.LG · cs.AI· cs.CL· cs.GT

LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions

Yu Wang , Jiayi Liu , Yuxiang Liu , Jun Hao , Yang He , Jinghe Hu , Weipeng P. Yan , Mantian Li This is my paper

classification 💻 cs.LG cs.AIcs.CLcs.GT

keywords agentbiddingonlinereal-timeauctionsdeepinformationinputs

0 comments

read the original abstract

We present LADDER, the first deep reinforcement learning agent that can successfully learn control policies for large-scale real-world problems directly from raw inputs composed of high-level semantic information. The agent is based on an asynchronous stochastic variant of DQN (Deep Q Network) named DASQN. The inputs of the agent are plain-text descriptions of states of a game of incomplete information, i.e. real-time large scale online auctions, and the rewards are auction profits of very large scale. We apply the agent to an essential portion of JD's online RTB (real-time bidding) advertising business and find that it easily beats the former state-of-the-art bidding policy that had been carefully engineered and calibrated by human experts: during JD.com's June 18th anniversary sale, the agent increased the company's ads revenue from the portion by more than 50%, while the advertisers' ROI (return on investment) also improved significantly.

This paper has not been read by Pith yet.

LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions

discussion (0)