pith. sign in

arxiv: math/0605498 · v1 · submitted 2006-05-18 · 🧮 math.OC · cs.AI· cs.LG· cs.NE· cs.RO· math.ST· stat.TH

Cross-Entropic Learning of a Machine for the Decision in a Partially Observable Universe

classification 🧮 math.OC cs.AIcs.LGcs.NEcs.ROmath.STstat.TH
keywords observablepartiallyuniversecross-entropicdecisionlearningmachinemarkov
0
0 comments X
read the original abstract

Revision of the paper previously entitled "Learning a Machine for the Decision in a Partially Observable Markov Universe" In this paper, we are interested in optimal decisions in a partially observable universe. Our approach is to directly approximate an optimal strategic tree depending on the observation. This approximation is made by means of a parameterized probabilistic law. A particular family of hidden Markov models, with input \emph{and} output, is considered as a model of policy. A method for optimizing the parameters of these HMMs is proposed and applied. This optimization is based on the cross-entropic principle for rare events simulation developed by Rubinstein.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.