Behavioural Macroeconomic Policy: New perspectives on time inconsistency
Pith reviewed 2026-05-24 19:39 UTC · model grok-4.3
The pith
Embedding hyperbolic discounting into game-theoretic models of monetary policy narrows the enforceable range of inflation targets.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Substituting hyperbolic or quasihyperbolic discounting into the game-theoretic representation of temptation versus enforcement produces an encompassing model in which the enforceable range of inflation targets is narrowed, implying limits to the effectiveness of monetary targets under certain conditions.
What carries the argument
Encompassing model that nests combinations of time-consistent and time-inconsistent preferences by embedding behavioural discount functions into the temptation-enforcement game.
If this is right
- Monetary targets have limited effectiveness under behavioral discounting.
- The set of sustainable inflation rates contracts when agents use hyperbolic or quasihyperbolic discounting.
- Policy credibility problems intensify for targets that would have been enforceable under exponential discounting.
- The same framework can examine mixtures of consistent and inconsistent preferences.
Where Pith is reading between the lines
- Fiscal or regulatory rules might show analogous narrowing if behavioral discounting is introduced into their enforcement games.
- Central banks could test the model by measuring whether announced targets correlate with observed population discount rates across countries.
- Communication strategies that alter perceived future payoffs might expand the enforceable range even under hyperbolic discounting.
Load-bearing premise
The game-theoretic representation of temptation versus enforcement still correctly captures the strategic interactions that determine enforceable targets once behavioral discount functions replace standard ones.
What would settle it
Data showing that inflation targets outside the narrowed range remain credibly enforced in economies where population discount functions are measurably hyperbolic would contradict the central result.
read the original abstract
This paper brings together divergent approaches to time inconsistency from macroeconomic policy and behavioural economics. Behavioural discount functions from behavioural microeconomics are embedded into a game-theoretic analysis of temptation versus enforcement to construct an encompassing model, nesting combinations of time consistent and time inconsistent preferences. The analysis presented in this paper shows that, with hyperbolic/quasihyperbolic discounting, the enforceable range of inflation targets is narrowed. This suggests limits to the effectiveness of monetary targets, under certain conditions. The paper concludes with a discussion of monetary policy implications, explored specifically in the light of current macroeconomic policy debates.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper embeds hyperbolic and quasihyperbolic discount functions from behavioral economics into a game-theoretic model of temptation versus enforcement in monetary policy, nesting time-consistent and time-inconsistent preferences, and concludes that this narrows the enforceable range of inflation targets relative to standard exponential discounting, with implications for the effectiveness of monetary targets.
Significance. If the narrowing result is correctly derived, the paper would usefully connect behavioral micro-foundations to macro policy analysis, highlighting potential limits on inflation targeting under time-inconsistent preferences and informing current policy debates.
major comments (2)
- [Abstract] Abstract: the central claim that hyperbolic/quasihyperbolic discounting narrows the enforceable range of inflation targets is stated without any derivation steps, explicit equations, or robustness checks. This makes it impossible to assess whether the result follows from the model construction or from particular functional forms and parameter values chosen for the discount function.
- [Model/analysis (as described in abstract)] The game-theoretic construction: direct substitution of (β,δ) preferences into standard repeated-game incentive constraints (temptation versus enforcement) assumes the usual subgame-perfect trigger strategies and one-shot deviation principle remain valid. For time-inconsistent players this requires re-derivation under sophisticated or naive hyperbolic equilibrium concepts, because continuation values become time-dependent and effective discounting on punishment phases changes with the evaluation date; without this adjustment the set of enforceable targets may be mis-stated.
Simulated Author's Rebuttal
We thank the referee for the careful and constructive report. The comments raise important issues about the clarity of the abstract and the handling of equilibrium concepts under time-inconsistent preferences. We address each major comment below and will revise the manuscript to strengthen the presentation.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central claim that hyperbolic/quasihyperbolic discounting narrows the enforceable range of inflation targets is stated without any derivation steps, explicit equations, or robustness checks. This makes it impossible to assess whether the result follows from the model construction or from particular functional forms and parameter values chosen for the discount function.
Authors: The abstract is intended as a concise overview. The full derivation of the narrowing result, including the explicit incentive constraints under exponential, hyperbolic, and quasihyperbolic discounting, appears in the main text together with numerical checks across parameter values. To address the concern, we will revise the abstract to include the central incentive-compatibility condition and a note on robustness to the choice of discount parameters. revision: yes
-
Referee: [Model/analysis (as described in abstract)] The game-theoretic construction: direct substitution of (β,δ) preferences into standard repeated-game incentive constraints (temptation versus enforcement) assumes the usual subgame-perfect trigger strategies and one-shot deviation principle remain valid. For time-inconsistent players this requires re-derivation under sophisticated or naive hyperbolic equilibrium concepts, because continuation values become time-dependent and effective discounting on punishment phases changes with the evaluation date; without this adjustment the set of enforceable targets may be mis-stated.
Authors: We model sophisticated agents who correctly anticipate their future selves. The incentive constraints are written from the current self’s perspective, incorporating the time-dependent continuation values. We nevertheless agree that an explicit re-derivation of the one-shot deviation principle and subgame-perfect conditions under (β,δ) preferences will improve rigor. We will add this derivation in a revised section or appendix and confirm that the narrowing result continues to hold. revision: yes
Circularity Check
No circularity: substitution of external discount functions yields independent narrowing result
full rationale
The paper substitutes standard hyperbolic and quasihyperbolic discount functions (imported from behavioural microeconomics) into an existing repeated-game model of monetary-policy temptation versus enforcement. The claimed narrowing of the enforceable inflation-target range is the direct mathematical consequence of that substitution into the incentive-compatibility conditions; it is not obtained by fitting parameters to the target data, by self-defining the outcome, or by a load-bearing self-citation chain. No equation is shown to equal its own input by construction, and the equilibrium concept remains the standard subgame-perfect one. The derivation is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Game-theoretic representation of policymaker-private agent interaction accurately models temptation and enforcement
- domain assumption Hyperbolic and quasihyperbolic discount functions correctly capture time-inconsistent preferences
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Enforcement = βδ · (1/2)(b̄²)/a (eq. 12); when β<1 the present value of future punishment falls and the temptation to cheat rises, narrowing the enforceable inflation-target range.
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Barro-Gordon objective z_t = (a/2)π_t² − b_t(π_t − π^e_t) and the grim-trigger reputation construction under exponential vs. quasihyperbolic discounting.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
The hyperbolic consumption model: Calibration, simulation and empirical ev alu- ation
GM Angeletos, D Laibson, A Repetto, J Tobacman, and S Weinberg . The hyperbolic consumption model: Calibration, simulation and empirical ev alu- ation. Journal of Economic Perspectives , 15(3):47–68, 2001
work page 2001
-
[2]
R Axelrod. The Evolution of Co-operation . Basic Books, 1984
work page 1984
-
[3]
R Axelrod and WD Hamilton. The evolution of cooperation. Science, 211:1390–96, 1981
work page 1981
-
[4]
Behavioural macroeconomics: Time, optimism and an imal spirits
MC Baddeley. Behavioural macroeconomics: Time, optimism and an imal spirits. In Routledge Handbook of Behavioral Economics, pages 266–79. Abing- don: Routledge, 2016
work page 2016
-
[5]
Behavioural Economics and Finance - 2nd edition
MC Baddeley. Behavioural Economics and Finance - 2nd edition . Abingdon: Routledge, 2019
work page 2019
-
[6]
A positive theory of monetary policy in a n atural rate model
RJ Barro and DB Gordon. A positive theory of monetary policy in a n atural rate model. Journal of Political Economy , 91(4):589–610, 1983
work page 1983
-
[7]
Rules, discretion and reputation in a mod el of monetary policy
RJ Barro and DB Gordon. Rules, discretion and reputation in a mod el of monetary policy. Journal of Monetary Economics , 12(1):101–121, 1983
work page 1983
- [8]
-
[9]
S. Frederick, G. Loewenstein, and T. O’Donoghue. Time discount ing and time preference: A critical review. Journal of Economic Literature , XL:351– 401, 2002
work page 2002
-
[10]
A non-cooperative equilibrium for supergames
JW Friedman. A non-cooperative equilibrium for supergames. Review of Economic Studies , 38(1):112, 1971
work page 1971
-
[11]
C. Harris and D Laibson. Dynamic choices of hyperbolic consumer s. Econo- metrica, 69(4):935–957, 2002
work page 2002
-
[12]
Rules rather than discretion: The inc onsistency of optimal plans
F Kydland and E Prescott. Rules rather than discretion: The inc onsistency of optimal plans. Journal of Political Economy , 85(3):473–492, 1977
work page 1977
-
[13]
Golden eggs and hyperbolic discounting
D Laibson. Golden eggs and hyperbolic discounting. Quarterly Journal of Economics, 112(2):443–477, 1997
work page 1997
-
[14]
R.E. Lucas. Expectations and the neutrality of money. Journal of Economic Theory, 4:103–124, 1972
work page 1972
-
[15]
R.E. Lucas. Econometric policy evaluation: A critique. Carnegie-Rochester Conference Series on Public Policy , 1, 1976
work page 1976
-
[16]
T O’Donoghue and M Rabin. Doing it now or later. American Economic Review, 89(1):103–124, 1999
work page 1999
-
[17]
T O’Donoghue and M Rabin. Choice and procrastination. The Quarterly Journal of Economics , 116(1):121–160, 2001
work page 2001
-
[18]
Present bias: Lessons learned and to be learned
T O’Donoghue and M Rabin. Present bias: Lessons learned and to be learned. American Economic Review: Papers & Proceedings , 105(2):273–279, 2015
work page 2015
-
[19]
T. Sargent and N. Wallace. ’rational’ expectations, the optimal monetary instrument, and the optimal money supply rule. Journal of Political Economy, 83(2):241254, 1975
work page 1975
-
[20]
T. Sargent and N. Wallace. Rational expectations and the theo ry of eco- nomic policy. Journal of Monetary Economics , 2(2):169183, 1976
work page 1976
-
[21]
L.J. Savage. The Foundations of Statistics . Wiley, 1954
work page 1954
-
[22]
Myopia and inconsistency in dynamic utility maximization
RH Strotz. Myopia and inconsistency in dynamic utility maximization . The Review of Economic Studies , 23(3):165–180, 1955
work page 1955
-
[23]
J. von Neumann and O. Morgenstern. Theory of Games and Economic Behavior (3rd edition) . Princeton University Press, 1944/1953. 12
work page 1944
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.