pith. sign in

Information-Theoretic Bounded Rationality

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it
abstract

Bounded rationality, that is, decision-making and planning under resource limitations, is widely regarded as an important open problem in artificial intelligence, reinforcement learning, computational neuroscience and economics. This paper offers a consolidated presentation of a theory of bounded rationality based on information-theoretic ideas. We provide a conceptual justification for using the free energy functional as the objective function for characterizing bounded-rational decisions. This functional possesses three crucial properties: it controls the size of the solution space; it has Monte Carlo planners that are exact, yet bypass the need for exhaustive search; and it captures model uncertainty arising from lack of evidence or from interacting with other agents having unknown intentions. We discuss the single-step decision-making case, and show how to extend it to sequential decisions using equivalence transformations. This extension yields a very general class of decision problems that encompass classical decision rules (e.g. EXPECTIMAX and MINIMAX) as limit cases, as well as trust- and risk-sensitive planning.

fields

cs.AI 1 cs.LG 1

years

2026 1 2024 1

verdicts

UNVERDICTED 2

representative citing papers

Bounded-Rationality, Hedging, and Generalization

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

Generalization is a testable hedging property of the learner's response law, recovered via f-divergence regularizers that induce information-geometric curves between training loss and sample dependence.

Principles of frugal inference and control

cs.AI · 2024-06-20 · unverdicted · novelty 6.0

Introduces a resource-constrained POMDP framework and derives three principles of frugal inference and control that generalize to nonlinear tasks like pole balancing.

citing papers explorer

Showing 2 of 2 citing papers.

  • Bounded-Rationality, Hedging, and Generalization cs.LG · 2026-05-14 · unverdicted · none · ref 7 · internal anchor

    Generalization is a testable hedging property of the learner's response law, recovered via f-divergence regularizers that induce information-geometric curves between training loss and sample dependence.

  • Principles of frugal inference and control cs.AI · 2024-06-20 · unverdicted · none · ref 34 · internal anchor

    Introduces a resource-constrained POMDP framework and derives three principles of frugal inference and control that generalize to nonlinear tasks like pole balancing.