Learning to Run with Actor-Critic Ensemble

BoEr Zhuang; Shuchang Zhou; Xinyu Zhou; Zhewei Huang

arxiv: 1712.08987 · v1 · pith:GWEGRSCYnew · submitted 2017-12-25 · 💻 cs.LG

Zhewei Huang, Shuchang Zhou, BoEr Zhuang, Xinyu Zhou

classification 💻 cs.LG

keywords: ensemble, method, actor-critic, deterministic, learning, action, actions, actors

We introduce an Actor-Critic Ensemble (ACE) method for improving the performance of the Deep Deterministic Policy Gradient (DDPG) algorithm. At inference time, our method uses a critic ensemble to select the best action from the proposals of multiple actors running in parallel. Because the candidate set is larger, our method can avoid actions that have fatal consequences while remaining deterministic. Using ACE, we won 2nd place in the NIPS'17 Learning to Run competition under the name "Megvii-hzwer".
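
The inference-time selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `select_action`, the toy actors and critics, and the choice to combine critic scores by taking their mean are all assumptions made for illustration. In practice the actors and critics would be trained DDPG networks.

```python
import numpy as np

def select_action(state, actors, critics):
    """Pick the candidate action with the highest mean critic score.

    `actors` are callables state -> action; `critics` are callables
    (state, action) -> scalar Q estimate. Averaging over critics is an
    assumption of this sketch, not a detail taken from the abstract.
    """
    # Each actor deterministically proposes one candidate action.
    candidates = [actor(state) for actor in actors]
    # Score every candidate with every critic, then average over critics.
    scores = [
        float(np.mean([critic(state, action) for critic in critics]))
        for action in candidates
    ]
    best = int(np.argmax(scores))
    return candidates[best], best, scores

# Hypothetical stand-ins for trained networks: linear actors and
# quadratic critics that prefer actions close to a scaled state.
def make_actor(gain):
    return lambda s: gain * s

def make_critic(target_gain):
    return lambda s, a: -float(np.sum((a - target_gain * s) ** 2))

actors = [make_actor(0.5), make_actor(-1.0), make_actor(0.0)]
critics = [make_critic(1.0), make_critic(0.8)]
state = np.array([1.0, -1.0])

action, index, scores = select_action(state, actors, critics)
print(index, action, scores)
```

Since every actor and critic here is a deterministic function and the final choice is an argmax, the same state always yields the same action, which mirrors the abstract's point that enlarging the candidate set does not make the policy stochastic.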
