pith. machine review for the scientific record.

arxiv: 1301.6720 · v1 · submitted 2013-01-23 · 💻 cs.AI

Recognition: unknown

Solving POMDPs by Searching the Space of Finite Policies

classification 💻 cs.AI
keywords optimal, policies, finding, policy, finite, intractable, method, pomdps

Solving partially observable Markov decision processes (POMDPs) is highly intractable in general, at least in part because the optimal policy may be infinitely large. In this paper, we explore the problem of finding the optimal policy from a restricted set of policies, represented as finite state automata of a given size. This problem is also intractable, but we show that the complexity can be greatly reduced when the POMDP and/or policy are further constrained. We demonstrate good empirical results with a branch-and-bound method for finding globally optimal deterministic policies, and a gradient-ascent method for finding locally optimal stochastic policies.
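To make the idea of a policy "represented as a finite state automaton of a given size" concrete, here is a minimal sketch in Python. It is not the paper's method: the abstract describes branch-and-bound and gradient ascent, whereas this sketch substitutes plain exhaustive enumeration over deterministic controllers, which is only feasible for very small sizes. The function names (`evaluate_fsc`, `best_fsc`), the array layout, and the two-state toy POMDP are all assumptions made for illustration.

```python
from itertools import product

def evaluate_fsc(T, O, R, gamma, actions, succ, iters=200):
    """Iteratively evaluate a deterministic finite-state controller.

    T[s][a][s2]: transition probability; O[a][s2][o]: observation probability;
    R[s][a]: immediate reward; actions[n]: action taken in node n;
    succ[n][o]: next node after observing o in node n.
    Returns V[n][s], the expected discounted return from node n in state s.
    """
    n_nodes, n_states = len(actions), len(T)
    n_obs = len(O[0][0])
    V = [[0.0] * n_states for _ in range(n_nodes)]
    for _ in range(iters):  # fixed-point iteration; converges since gamma < 1
        V = [[R[s][actions[n]] + gamma * sum(
                  T[s][actions[n]][s2] * O[actions[n]][s2][o] * V[succ[n][o]][s2]
                  for s2 in range(n_states) for o in range(n_obs))
              for s in range(n_states)]
             for n in range(n_nodes)]
    return V

def best_fsc(T, O, R, gamma, b0, n_nodes):
    """Exhaustive search (NOT branch-and-bound) over deterministic controllers
    with n_nodes nodes; the controller always starts in node 0."""
    n_actions, n_obs = len(T[0]), len(O[0][0])
    best = (float("-inf"), None, None)
    for actions in product(range(n_actions), repeat=n_nodes):
        for flat in product(range(n_nodes), repeat=n_nodes * n_obs):
            succ = [flat[n * n_obs:(n + 1) * n_obs] for n in range(n_nodes)]
            V = evaluate_fsc(T, O, R, gamma, actions, succ)
            value = sum(b0[s] * V[0][s] for s in range(len(b0)))
            if value > best[0]:
                best = (value, actions, succ)
    return best

# Hypothetical toy POMDP: two hidden states, action 0 = stay, action 1 = switch.
T = [[[1.0, 0.0], [0.0, 1.0]],   # from state 0: stay -> 0, switch -> 1
     [[0.0, 1.0], [1.0, 0.0]]]   # from state 1: stay -> 1, switch -> 0
O = [[[0.8, 0.2], [0.2, 0.8]]] * 2  # observation matches the new state w.p. 0.8
R = [[1.0, 0.0], [0.0, 0.0]]        # reward only for staying in state 0
b0 = [0.5, 0.5]                     # uniform initial belief

value_1, _, _ = best_fsc(T, O, R, 0.9, b0, n_nodes=1)
value_2, actions_2, succ_2 = best_fsc(T, O, R, 0.9, b0, n_nodes=2)
print(value_1, value_2, actions_2, succ_2)
```

The number of deterministic controllers grows as |A|^N · N^(N·|O|) for N nodes, which is one way to see why restricting the policy size still leaves a hard search problem and why pruning or local search becomes necessary beyond toy sizes.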

This paper has not been read by Pith yet.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Discrete Diffusion for Codebook-Based Beam Candidate Generation

    eess.SP 2026-04 unverdicted novelty 6.0

    A discrete denoising diffusion model learns from probing histories to generate promising beam candidates, yielding better SNR, lower beam-miss probability, and reduced probe regret than baselines under tight probing budgets.