Incomplete Reputation Information and Punishment in Indirect Reciprocity
Pith reviewed 2026-05-18 18:15 UTC · model grok-4.3
The pith
Incomplete observation leaves cooperation conditions unchanged in indirect reciprocity while reputation fading raises the required benefit-to-cost ratio and punishment norms expand the viable range.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Under incomplete observation the conditions for cooperation are unchanged because less frequent updates are exactly offset by higher reputational stakes. Reputation fading requires higher benefit-to-cost ratios as identification probability decreases. Norms incorporating punishment sustain cooperation across broader parameter ranges without reducing efficiency in the reputation fading model.
What carries the argument
Analytical frameworks for public assessment that compute long-run frequencies of strategies and reputations under probabilistic observation or identification.
If this is right
- Cooperation evolves under incomplete observation at the same benefit-to-cost ratios as under full information.
- Lower identification probabilities in reputation fading demand proportionally higher benefit-to-cost ratios to maintain cooperation.
- Punishment-inclusive norms support stable cooperation in reputation-fading models without efficiency loss.
- Punishment effectiveness hinges on the concrete type of information incompleteness rather than incompleteness in general.
Where Pith is reading between the lines
- Real-world reputation systems may need distinct design rules for observation noise versus label uncertainty rather than generic fixes.
- Experimental tests with human subjects under controlled probabilistic observation or fading labels could directly check the predicted thresholds.
- Extending the frameworks to include private assessments could uncover additional instabilities not visible in public-assessment limits.
Load-bearing premise
The analytical frameworks for public assessment capture the long-run population dynamics exactly with no hidden dependence on specific initial conditions, finite-population corrections, or private-assessment effects.
What would settle it
A finite-population simulation or private-assessment analysis that shifts the reported cooperation thresholds away from the values predicted by the public-assessment models would falsify the claim that conditions remain unchanged or require only higher benefit-to-cost ratios.
Figures
read the original abstract
Indirect reciprocity promotes cooperation by allowing individuals to help others based on reputation rather than direct reciprocation. Because it relies on accurate reputation information, its effectiveness can be undermined by information gaps. We examine two forms of incomplete information: incomplete observation, in which donor actions are observed only probabilistically, and reputation fading, in which recipient reputations are sometimes classified as "Unknown". Using analytical frameworks for public assessment, we show that these seemingly similar models yield qualitatively different outcomes. Under incomplete observation, the conditions for cooperation are unchanged, because less frequent updates are exactly offset by higher reputational stakes. In contrast, reputation fading hinders cooperation, requiring higher benefit-to-cost ratios as the identification probability decreases. We then evaluate costly punishment as a third action alongside cooperation and defection. Norms incorporating punishment can sustain cooperation across broader parameter ranges without reducing efficiency in the reputation fading model. This contrasts with previous work, which found punishment ineffective under a different type of information limitation, and highlights the importance of distinguishing between types of information constraints. Finally, we review past studies to identify when punishment is effective and when it is not in indirect reciprocity.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper analyzes two forms of incomplete information in indirect reciprocity—probabilistic observation of donor actions and reputation fading (recipients classified as 'Unknown')—using analytical frameworks for public assessment. It claims that incomplete observation leaves cooperation thresholds unchanged because reduced update frequency is exactly offset by higher reputational stakes, while reputation fading strictly increases the required benefit-to-cost ratio as identification probability falls. The work then extends the models to include costly punishment as a third action and reports that punishment-inclusive norms sustain cooperation over wider parameter ranges without efficiency loss in the reputation-fading case, contrasting with earlier findings under other information constraints; a review of prior studies identifies conditions for punishment effectiveness.
Significance. If the analytical thresholds and qualitative distinctions hold, the manuscript usefully separates two information-incompleteness regimes that prior literature often conflates and clarifies when punishment augments rather than undermines reputation-based cooperation. The synthesis of past results on punishment effectiveness is a constructive contribution to the indirect-reciprocity literature.
major comments (2)
- [Abstract / analytical frameworks paragraph] Abstract and the paragraph introducing the analytical frameworks: the headline invariance result under incomplete observation (and the contrasting increase in required b/c under reputation fading) rests on the unverified claim that the public-assessment models yield exact long-run stationary thresholds; no derivations, finite-N corrections, or initial-condition robustness checks are supplied to confirm the offset is parameter-independent and free of hidden dependence on private-assessment residuals.
- [Punishment-extension analysis] Section on the punishment extension: the result that norms incorporating punishment sustain cooperation across broader ranges without efficiency loss in the reputation-fading model inherits the same exact-dynamics assumption; any finite-population drift or initial-reputation sensitivity would alter the reported parameter ranges and the claimed contrast with previous work.
minor comments (1)
- [Abstract] The abstract could briefly name the specific assessment rules (e.g., image scoring, standing, etc.) employed in the public-assessment frameworks to aid immediate readability.
Simulated Author's Rebuttal
We thank the referee for their constructive report and recommendation for major revision. We address the two major comments in detail below, clarifying the analytical basis of our results and committing to enhancements in the revised manuscript.
read point-by-point responses
-
Referee: [Abstract / analytical frameworks paragraph] Abstract and the paragraph introducing the analytical frameworks: the headline invariance result under incomplete observation (and the contrasting increase in required b/c under reputation fading) rests on the unverified claim that the public-assessment models yield exact long-run stationary thresholds; no derivations, finite-N corrections, or initial-condition robustness checks are supplied to confirm the offset is parameter-independent and free of hidden dependence on private-assessment residuals.
Authors: We appreciate the referee's attention to the foundational assumptions. The public-assessment frameworks permit exact derivation of long-run stationary reputation distributions by solving the linear balance equations of the Markov chain for reputation updates under the assessment rules. The invariance under incomplete observation follows because the observation probability scales both update frequency and the reputational weight of each observation proportionally, leaving the stationary probability of a good reputation (and thus the b/c threshold) unchanged; the contrasting increase under reputation fading arises because unknown classifications dilute the effective information without a compensating stake adjustment. We acknowledge these intermediate steps were not shown explicitly. In the revision we will add the full derivations of the stationary thresholds to an appendix. The analysis is performed in the infinite-population mean-field limit, where finite-N drift is negligible by construction and the unique stationary state is reached independently of initial conditions. revision: yes
-
Referee: [Punishment-extension analysis] Section on the punishment extension: the result that norms incorporating punishment sustain cooperation across broader ranges without efficiency loss in the reputation-fading model inherits the same exact-dynamics assumption; any finite-population drift or initial-reputation sensitivity would alter the reported parameter ranges and the claimed contrast with previous work.
Authors: We agree that the punishment extension rests on the same public-assessment framework, now augmented with a third action whose reputation consequences are incorporated into the stationary balance equations. The expanded cooperation region without efficiency loss is obtained by comparing stationary expected payoffs across norms. We will include the explicit stationary-state calculations for the punishment-inclusive case in the revised appendix. The contrast with earlier findings remains valid because reputation fading affects identification probability differently from the information constraints examined in those studies. Finite-population drift and initial-condition sensitivity lie outside the current mean-field scope; we can add a brief discussion of these modeling assumptions and note them as directions for future stochastic analyses. revision: yes
Circularity Check
No significant circularity; derivations rest on standard public-assessment frameworks without reduction to inputs by construction
full rationale
The paper applies established analytical frameworks for public assessment in indirect reciprocity to compare incomplete observation and reputation fading. The central result that incomplete observation leaves cooperation thresholds unchanged (via exact offset between update frequency and reputational stakes) follows from the model's update rules and payoff structure rather than being defined circularly in terms of the identification probability. Reputation fading's stricter b/c requirement is likewise derived from the altered reputation dynamics, not presupposed. No self-definitional equations, fitted parameters relabeled as predictions, or load-bearing self-citations appear in the provided abstract or description; prior-work contrasts supply external benchmarks instead of closing a self-referential loop. The assumption that the frameworks yield exact long-run thresholds is an explicit modeling choice, not a hidden tautology, leaving the derivation chain self-contained.
Axiom & Free-Parameter Ledger
free parameters (2)
- observation probability
- identification probability
axioms (2)
- domain assumption Public assessment rules map observed actions to reputation labels in a deterministic or probabilistic but commonly known way.
- standard math Infinite or well-mixed population limit with weak selection.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
less frequent updates are exactly offset by higher reputational stakes... reputation fading requires higher benefit-to-cost ratios
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
analytical frameworks for public assessment... capture the long-run population dynamics exactly
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
the donor decides the action using S(G) with probability qa donor G donor
-
[2]
the donor’s reputation is updated by R(G, C) 2’. the donor’s reputation is randomly assigned with probability 1-qa assessment error G/B C G donor recipient G with probability qi donor
-
[3]
the donor’s reputation is updated by R(G, C) with probability 1-qi D donor recipient G donor
-
[4]
the intended cooperation is not executed
the intended cooperation is executed 1. the intended cooperation is not executed. Defection is executed instead
-
[5]
the donor’s reputation is updated by R(G, D) implementation error G BG D G donor recipient G with probability qp donor
-
[6]
the donor’s reputation is updated by R(G, D) with probability 1-qp donor recipient G donor
-
[7]
the defection is misperceived as cooperation
the defection is correctly perceived 1. the defection is misperceived as cooperation
-
[8]
(Upper left) In the model with assessment error, the donor’s reputation is not accurately updated
the donor’s reputation is updated by R(G, C) perception error G D B C G Figure 3.Comparison of the three models with errors: assessment error, implementation error, and perception error. (Upper left) In the model with assessment error, the donor’s reputation is not accurately updated. With probabilityqa, the donor’s reputation is correctly updated to G or...
-
[9]
Melis, A. P. & Semmann, D. How is human cooperation different?Philos. Transactions Royal Soc. B365, 2663–2674 (2010). 3.Nowak, M. A. & Sigmund, K. Evolution of indirect reciprocity.Nature437, 1291–1298 (2005)
work page 2010
-
[10]
Wedekind, C. & Braithwaite, V . A. The long-term benefits of human generosity in indirect reciprocity.Curr. biology12, 1012–1015 (2002). 5.Sigmund, K. Moral assessment in indirect reciprocity.J. theoretical biology299, 25–30 (2012)
work page 2002
-
[11]
Santos, F. P., Pacheco, J. M. & Santos, F. C. The complexity of human cooperation under indirect reciprocity.Philos. Transactions Royal Soc. B376, 20200291 (2021). 7.Okada, I. A review of theoretical studies on indirect reciprocity.Games11, 27 (2020). 8.Nowak, M. A. & Sigmund, K. Evolution of indirect reciprocity by image scoring.Nature393, 573 (1998)
work page 2021
-
[12]
Leimar, O. & Hammerstein, P. Evolution of cooperation through indirect reciprocity.Proc. R. Soc. B268, 745–753 (2001). 9/16 Figure 4.Comparison of the five models with different types of information inaccuracy: (a) incomplete observation, (b) reputation fading, (c) assessment error, (d) implementation error, and (e) perception error. The horizontal axis r...
work page 2001
- [13]
-
[14]
Ohtsuki, H. & Iwasa, Y . How should we define goodness? – reputation dynamics in indirect reciprocity.J. Theor . Biol. 231, 107–120 (2004)
work page 2004
-
[15]
Ohtsuki, H. & Iwasa, Y . The leading eight: social norms that can maintain cooperation by indirect reciprocity.J. Theor . Biol.239, 435–444 (2006)
work page 2006
-
[16]
Ohtsuki, H. & Iwasa, Y . Global analyses of evolutionary dynamics and exhaustive search for social norms that maintain cooperation by reputation.J. Theor . Biol.244, 518–531 (2007)
work page 2007
-
[17]
Ohtsuki, H., Iwasa, Y . & Nowak, M. A. Indirect reciprocity provides only a narrow margin of efficiency for costly punishment.Nature457, 79 (2009)
work page 2009
-
[18]
Pacheco, J. M., Santos, F. C. & Chalub, F. A. C. Stern-judging: A simple, successful norm which promotes cooperation under indirect reciprocity.PLoS computational biology2, e178 (2006)
work page 2006
-
[19]
Santos, F. P., Santos, F. C. & Pacheco, J. M. Social norms of cooperation in small-scale societies.PLoS Comput. Biol.12, e1004709 (2016)
work page 2016
-
[20]
Santos, F. P., Santos, F. C. & Pacheco, J. M. Social norm complexity and past reputations in the evolution of cooperation. Nature555, 242–245 (2018). 10/16 Table 3.Summary of the impact of different types of information inaccuracy on cooperation and the effectiveness of costly punishment. In the left column, we list the type of noise affecting the reputat...
work page 2018
-
[21]
Ingroup favoritism and intergroup cooperation under indirect reciprocity based on group reputation.J
Masuda, N. Ingroup favoritism and intergroup cooperation under indirect reciprocity based on group reputation.J. Theor. Biol.311, 8–18 (2012). 19.Berger, U. Learning to cooperate via indirect reciprocity.Games Econ. Behav.72, 30–37 (2011)
work page 2012
-
[22]
Clark, D., Fudenberg, D. & Wolitzky, A. Indirect reciprocity with simple records.Proc. Natl. Acad. Sci. USA117, 11344–11349 (2020)
work page 2020
-
[23]
Hilbe, C., Schmid, L., Tkadlec, J., Chatterjee, K. & Nowak, M. A. Indirect reciprocity with private, noisy, and incomplete information.Proc. Natl. Acad. Sci. USA115, 12241–12246 (2018). 22.Uchida, S. Effect of private information on indirect reciprocity.Phys. Rev. E82, 036111 (2010)
work page 2018
-
[24]
Kawakatsu, M., Kessinger, T. A. & Plotkin, J. B. A mechanistic model of gossip, reputations, and cooperation.Proc. Natl. Acad. Sci.121, e2400689121 (2024)
work page 2024
-
[25]
Kessinger, T. A., Tarnita, C. E. & Plotkin, J. B. Evolution of norms for judging social behavior.Proc. Natl. Acad. Sci.120, e2219480120 (2023)
work page 2023
-
[26]
Fujimoto, Y . & Ohtsuki, H. Who is a leader in the leading eight? indirect reciprocity under private assessment.PRX Life2, 023009 (2024)
work page 2024
-
[27]
Schmid, L., Ekbatani, F., Hilbe, C. & Chatterjee, K. Quantitative assessment can stabilize indirect reciprocity under imperfect information.Nat. Commun.14, 2086 (2023)
work page 2086
-
[28]
Michel-Mata, S.et al.The evolution of private reputations in information-abundant landscapes.Nature634, 883–889 (2024)
work page 2024
-
[29]
Murase, Y . & Hilbe, C. Computational evolution of social norms in well-mixed and group-structured populations.Proc. Natl. Acad. Sci.121, e2406885121 (2024)
work page 2024
-
[30]
Murase, Y . & Hilbe, C. Indirect reciprocity under opinion synchronization.Proc. Natl. Acad. Sci.121, e2418364121 (2024)
work page 2024
-
[31]
Murase, Y . & Hilbe, C. Indirect reciprocity with stochastic and dual reputation updates.PLOS Comput. Biol.19, e1011271 (2023)
work page 2023
-
[32]
Schmid, L., Chatterjee, K., Hilbe, C. & Nowak, M. A. A unified framework of direct and indirect reciprocity.Nat. Hum. Behav.5, 1292 (2021). 11/16
work page 2021
-
[33]
Costly punishment sustains indirect reciprocity under low defection detectability.J
Murase, Y . Costly punishment sustains indirect reciprocity under low defection detectability.J. Theor. Biol.600, 112043 (2025)
work page 2025
-
[34]
Glynatsi, N. E., Hilbe, C. & Murase, Y . Exact conditions for evolutionary stability in indirect reciprocity under noise (2025). ArXiv:2509.08006 [q-bio.PE], 2509.08006
-
[35]
Masuda, N. & Ohtsuki, H. Tag-based indirect reciprocity by incomplete social information.Proc. Royal Soc. B: Biol. Sci. 274, 689–695 (2007)
work page 2007
-
[36]
Nakamura, M. & Masuda, N. Indirect reciprocity under incomplete observation.PLoS Comput. Biol.7, e1002113 (2011). 36.Suzuki, S. & Kimura, H. Indirect reciprocity is sensitive to costs of information transfer.Sci. reports3, 1435 (2013)
work page 2011
-
[37]
Yoeli, E., Hoffman, M., Rand, D. G. & Nowak, M. A. Powering up with indirect reciprocity in a large-scale field experiment. Proc. Natl. Acad. Sci.110, 10424–10429 (2013)
work page 2013
-
[38]
Boyd, R. & Richerson, P. J. Punishment allows the evolution of cooperation (or anything else) in sizable groups.Ethol. sociobiology13, 171–195 (1992)
work page 1992
- [39]
-
[40]
Boyd, R., Gintis, H., Bowles, S. & Richerson, P. J. The evolution of altruistic punishment.Proc. Natl. Acad. Sci.100, 3531–3535 (2003)
work page 2003
-
[41]
Brandt, H., Hauert, C. & Sigmund, K. Punishing and abstaining for public goods.Proc. Natl. Acad. Sci. United States Am. 103, 495 (2006)
work page 2006
- [42]
-
[43]
Reciprocity: Weak or strong? what punishment experiments do (and do not) demonstrate.Behav
Guala, F. Reciprocity: Weak or strong? what punishment experiments do (and do not) demonstrate.Behav. brain sciences 35, 1–15 (2012). 44.Raihani, N. J. & Bshary, R. The reputation of punishers.Trends ecology & evolution30, 98–103 (2015)
work page 2012
-
[44]
Raihani, N. J. & Bshary, R. Third-party punishers are rewarded, but third-party helpers even more so.Evolution69, 993–1003 (2015). 46.Raihani, N. J. & Bshary, R. Punishment: one tool, many uses.Evol. Hum. Sci.1, e12 (2019)
work page 2015
- [45]
- [46]
-
[47]
Lee, S., Murase, Y . & Baek, S. K. A second-order stability analysis for the continuous model of indirect reciprocity.J. Theor. Biol.548, 111202 (2022). Acknowledgement The authors would like to thank C. Hilbe and S. K. Baek for insightful discussions and feedback on this work. HK acknowledges support by Basic Science Research Program through the National...
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.