Deep Reinforcement Learning with Surrogate Agent-Environment Interface

Song Wang; Yu Jing

arxiv: 1709.03942 · v3 · pith:5RIMF4KLnew · submitted 2017-09-12 · 💻 cs.LG

classification 💻 cs.LG

keywords: surrogate, agent-environment, interface, action, learning, policy, probability, algorithm

In this paper, we propose the surrogate agent-environment interface (SAEI) for reinforcement learning. We also state that learning on a probability surrogate agent-environment interface yields an optimal policy for the task agent-environment interface. We introduce the surrogate probability action and, based on SAEI, develop the probability surrogate action deterministic policy gradient (PSADPG) algorithm, which enables continuous control of discrete actions. Experiments show that PSADPG matches the performance of DQN on certain tasks, while exhibiting the nature of a stochastic optimal policy in the initial training stage.
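As a rough illustration of the idea of a probability surrogate action, the sketch below wraps a discrete-action task environment so that the agent emits a continuous probability vector and the wrapper samples the discrete action. This is not the paper's implementation: the class names `ProbabilitySurrogateEnv` and `TwoArmedBandit`, the `step` signature, and the softmax actor output are all assumptions made for illustration.

```python
import math
import random


def softmax(logits):
    """Map real-valued actor outputs to a probability vector (assumed actor head)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


class TwoArmedBandit:
    """Hypothetical toy task environment: action 1 pays 1.0, action 0 pays 0.0."""

    def step(self, action):
        return 1.0 if action == 1 else 0.0


class ProbabilitySurrogateEnv:
    """Hypothetical surrogate interface: takes a probability vector as the
    (continuous) action, samples a discrete action from it, and forwards
    that discrete action to the underlying task environment."""

    def __init__(self, task_env, seed=0):
        self.task_env = task_env
        self.rng = random.Random(seed)
        self.last_discrete_action = None

    def step(self, prob_action):
        if min(prob_action) < 0 or abs(sum(prob_action) - 1.0) > 1e-6:
            raise ValueError("prob_action must be a probability vector")
        indices = range(len(prob_action))
        action = self.rng.choices(indices, weights=prob_action, k=1)[0]
        self.last_discrete_action = action
        return self.task_env.step(action)


env = ProbabilitySurrogateEnv(TwoArmedBandit())
probs = softmax([0.0, 2.0])  # a deterministic actor would output these logits
reward = env.step(probs)     # the wrapper, not the agent, picks the discrete arm
```

Under this reading, a deterministic policy gradient method can treat the probability vector as an ordinary continuous action, while the randomness of discrete action selection lives inside the surrogate interface.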
