Cross-Entropic Learning of a Machine for the Decision in a Partially Observable Universe

Frederic Dambreville (DGA/CTA/DT/GIP)

arxiv: math/0605498 · v1 · submitted 2006-05-18 · 🧮 math.OC · cs.AI· cs.LG· cs.NE· cs.RO· math.ST· stat.TH

Cross-Entropic Learning of a Machine for the Decision in a Partially Observable Universe

Frederic Dambreville (DGA/CTA/DT/GIP) This is my paper

classification 🧮 math.OC cs.AIcs.LGcs.NEcs.ROmath.STstat.TH

keywords observablepartiallyuniversecross-entropicdecisionlearningmachinemarkov

0 comments

read the original abstract

Revision of the paper previously entitled "Learning a Machine for the Decision in a Partially Observable Markov Universe" In this paper, we are interested in optimal decisions in a partially observable universe. Our approach is to directly approximate an optimal strategic tree depending on the observation. This approximation is made by means of a parameterized probabilistic law. A particular family of hidden Markov models, with input \emph{and} output, is considered as a model of policy. A method for optimizing the parameters of these HMMs is proposed and applied. This optimization is based on the cross-entropic principle for rare events simulation developed by Rubinstein.

This paper has not been read by Pith yet.

Cross-Entropic Learning of a Machine for the Decision in a Partially Observable Universe

discussion (0)