We both think you did wrong -- How agreement shapes and is shaped by indirect reciprocity
Pith reviewed 2026-05-24 09:22 UTC · model grok-4.3
The pith
Certain moral judgment norms produce high agreement among private assessors even without shared information, and this agreement can be predicted analytically for any observation rate while also shaping overall cooperation levels.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Even when every individual assesses actions privately, particular moral judgment norms generate high levels of agreement on who is good or bad; these agreement levels admit an exact analytical prediction that holds for arbitrary observation rates and does not require agent-based simulations; the agreement in turn modulates reputations and therefore the equilibrium level of cooperation.
What carries the argument
The analytical derivation that computes population-wide agreement directly from the structure of a moral judgment norm and the observation rate, by tracking how private assessments of the same actions align or diverge under the norm's assignment rules.
If this is right
- Agreement produced by a norm directly determines the average reputation in the population.
- Norms that increase agreement can raise average reputations and therefore raise the level of cooperation.
- Norms that decrease agreement can lower average reputations and therefore lower the level of cooperation.
- The relationship between norm structure, agreement, and cooperation holds independently of how often individuals observe actions.
Where Pith is reading between the lines
- The same analytical approach could be applied to other reputation systems, such as online review platforms, to predict when private ratings will converge without central coordination.
- Selecting norms that maximize agreement might offer a way to sustain cooperation in large groups where public reputation sharing is costly or impossible.
- The method opens the possibility of classifying entire families of norms by their predicted agreement and cooperation effects before any simulation is run.
Load-bearing premise
Moral judgment norms can be formalized precisely enough for their induced agreement levels to follow an exact analytical formula that remains valid at every observation rate.
What would settle it
Compare the analytically predicted agreement values against measured agreement in agent-based simulations that use the same norms at several different observation rates; systematic mismatch at any rate would disprove the derivation.
Figures
read the original abstract
Humans judge each other's actions, which at least partly functions to detect and deter cheating and to enable helpfulness in an indirect reciprocity fashion. However, most forms of judging do not only concern the action itself, but also the moral status of the receiving individual (to deter cheating it must be morally acceptable to withhold help from cheaters). This is a problem, when not everybody agrees who is good and who is bad. Although it has been widely acknowledged that disagreement may exist and that it can be detrimental for indirect reciprocity, the details of this crucial feature of moral judgments have never been studied in depth. We show, that even when everybody assesses individually (aka privately), some moral judgement systems (aka norms) can lead to high levels of agreement. We give a detailed account of the mechanisms which cause it and we show how to predict agreement analytically without requiring agent-based simulations, and for any observation rate. Finally, we show that agreement may increase or decrease reputations and therefore how much helpfulness (aka cooperation) occurs.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper examines moral judgment norms in indirect reciprocity, showing that some norms produce high agreement levels even under fully private (individual) assessments. It details the mechanisms driving agreement, derives an analytical prediction for agreement that holds for arbitrary observation rates without needing agent-based simulations, and demonstrates that agreement can raise or lower reputations and thereby modulate cooperation levels.
Significance. If the analytical derivation is exact and independent of simulation outputs, the work would advance indirect reciprocity theory by supplying a closed-form route to agreement levels across norms and observation rates. This would reduce reliance on computational checks and clarify how private moral judgments affect reputation dynamics and helpfulness.
major comments (2)
- [§4] §4 (Analytical derivation of agreement): the claim that agreement admits an exact analytical expression valid for any observation rate must be supported by an explicit formula whose independence from simulation outputs is demonstrated. If the derivation invokes mean-field closures or independence assumptions between private assessments, these must be shown to remain valid when observation rates induce correlations; otherwise the 'without requiring agent-based simulations' guarantee fails for the full parameter range.
- [Results] Results section comparing analytical predictions to simulations: the manuscript must include direct numerical checks of the closed-form agreement expression against agent-based runs at both low and high observation rates (e.g., p_obs = 0.1 and p_obs = 0.9) for at least two distinct norms. Without these checks the central claim that the prediction holds exactly remains unverified.
minor comments (2)
- [Abstract] Abstract: the phrase 'for any observation rate' should be qualified by the range actually covered by the derivation (e.g., 0 < p_obs ≤ 1) to avoid overstatement.
- [Model] Notation: define the observation rate symbol (p_obs or equivalent) at first use and ensure it is used consistently in all equations and figure captions.
Simulated Author's Rebuttal
We thank the referee for the careful and constructive report. The two major comments identify areas where the presentation of the analytical result can be strengthened. We address each point below and will revise the manuscript to incorporate the requested clarifications and verifications.
read point-by-point responses
-
Referee: [§4] §4 (Analytical derivation of agreement): the claim that agreement admits an exact analytical expression valid for any observation rate must be supported by an explicit formula whose independence from simulation outputs is demonstrated. If the derivation invokes mean-field closures or independence assumptions between private assessments, these must be shown to remain valid when observation rates induce correlations; otherwise the 'without requiring agent-based simulations' guarantee fails for the full parameter range.
Authors: The derivation in §4 solves the stationary distribution of the finite-state Markov chain whose states are the possible reputation configurations under private assessments. Because the transition matrix is constructed directly from the observation process (each observer draws an independent Bernoulli trial with success probability p_obs), the resulting expression for agreement is closed-form and exact for every p_obs in [0,1]; no mean-field closure or extra independence assumption is introduced. We will insert the explicit formula together with a short derivation appendix that makes this independence from simulation outputs explicit. revision: yes
-
Referee: [Results] Results section comparing analytical predictions to simulations: the manuscript must include direct numerical checks of the closed-form agreement expression against agent-based runs at both low and high observation rates (e.g., p_obs = 0.1 and p_obs = 0.9) for at least two distinct norms. Without these checks the central claim that the prediction holds exactly remains unverified.
Authors: We agree that explicit verification at the extremes strengthens the claim. In the revised Results section we will add side-by-side comparisons of the analytical formula against agent-based simulations for p_obs = 0.1 and p_obs = 0.9, using the stern-judging and shunning norms. The new panels will report both the analytical value and the simulation mean with 95 % confidence intervals, confirming agreement within sampling error. revision: yes
Circularity Check
No significant circularity; analytical predictions derive independently from norm formalization
full rationale
The paper formalizes moral judgment norms and derives closed-form expressions for agreement levels as functions of observation rate and norm parameters. These expressions are obtained directly from the probabilistic structure of private assessments and do not reduce to fitted quantities, self-citations, or ansatzes imported from prior work by the same authors. The claim of prediction without simulations is supported by explicit derivation steps that remain valid under the stated independence assumptions; no load-bearing step collapses to a self-referential definition or renaming of an empirical pattern. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Albert, R., Jeong, H., and Barab \' a si, A.-L. (1999). Diameter of the World-Wide Web . Nature , 401(6749):130--131
work page 1999
-
[2]
Andras, P., Esterle, L., Guckert, M., Han, T. A., Lewis, P. R., Milanovic, K., Payne, T., Perret, C., Pitt, J., Powers, S. T., Urquhart, N., and Wells, S. (2018). Trusting Intelligent Machines: Deepening Trust Within Socio-Technical Systems . IEEE Technology and Society Magazine , 37(4):76--83
work page 2018
-
[3]
Barabasi, A.-L. (2014). Linked-how Everything is Connected to Everything Else and what it Means F . Perseus Books Group
work page 2014
-
[4]
Barab \' a si, A.-L. (2016). Network Science . Cambridge University Press
work page 2016
-
[5]
Barab \' a si, A.-L. and Albert, R. (1999). Emergence of scaling in random networks . science , 286(5439):509--512
work page 1999
-
[6]
Barrat, A. and Pastor-Satorras, R. (2005). Rate equation approach for correlations in growing network models . Physical Review E , 71(3):36127
work page 2005
- [7]
-
[8]
Cimpeanu, T., Di Stefano , A., Perret, C., and Han, T. A. (2023). Social diversity reduces the complexity and cost of fostering fairness. Chaos, Solitons & Fractals , 167:113051
work page 2023
-
[9]
Cimpeanu, T. and Han, T. A. (2020). Making an example: signalling threat in the evolution of cooperation. In 2020 IEEE Congress on Evolutionary Computation (CEC) , pages 1--8. IEEE
work page 2020
-
[10]
Cimpeanu, T., Perret, C., and Han, T. A. (2021). Cost-efficient interventions for promoting fairness in the ultimatum game . Knowledge-Based Systems , 233:107545
work page 2021
-
[11]
Dafoe, A., Bachrach, Y., Hadfield, G., Horvitz, E., Larson, K., Graepel, T., et al. (2021). Cooperative ai: machines must learn to find common ground. Nature , 593(7857):33--36
work page 2021
- [12]
-
[13]
Duong, M. H. and Han, T. A. (2021). Cost efficiency of institutional incentives in finite populations . Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences
work page 2021
-
[14]
G \' o is, A. R., Santos, F. P., Pacheco, J. M., and Santos, F. C. (2019). Reward and punishment in climate change dilemmas . Sci. Rep. , 9(1):1--9
work page 2019
-
[15]
Han, T. A. (2022). Institutional incentives for the evolution of committed cooperation: ensuring participation is as important as enhancing compliance. Journal of the Royal Society Interface , 19(188):20220036
work page 2022
-
[16]
A., Lynch, S., Tran-Thanh, L., and Santos, F
Han, T. A., Lynch, S., Tran-Thanh, L., and Santos, F. C. (2018). Fostering Cooperation in Structured Populations Through Local and Global Interference Strategies . In IJCAI-ECAI'2018 , pages 289--295. AAAI Press
work page 2018
-
[17]
Han, T. A., Pereira, L. M., and Lenaerts, T. (2019). Modelling and Influencing the AI Bidding War: A Research Agenda . In Proceedings of the AAAI/ACM conference AI, Ethics and Society , pages 5--11
work page 2019
-
[18]
Han, T. A., Perret, C., and Powers, S. T. (2021). When to (or not to) trust intelligent machines: Insights from an evolutionary game theory analysis of trust in repeated games. Cognitive Systems Research , 68:111--124
work page 2021
-
[19]
Han, T. A., Santos, F. C., Lenaerts, T., and Pereira, L. M. (2015). Synergy between intention recognition and commitments in cooperation dilemmas . Scientific reports , 5(9312)
work page 2015
-
[20]
Han, T. A. and Tran-Thanh, L. (2018). Cost-effective external interference for promoting the evolution of cooperation . Scientific reports , 8(15997)
work page 2018
-
[21]
Hilbe, C., Traulsen, A., R \" o hl, T., and Milinski, M. (2014). Democratic decisions establish stable authorities that overcome the paradox of second-order punishment . PNAS , 111(2):752--756
work page 2014
-
[22]
Hofbauer, J. and Sigmund, K. (1998). Evolutionary Games and Population Dynamics . Cambridge University Press
work page 1998
-
[23]
Levin, S. A. (2000). Multiple scales and the maintenance of biodiversity . Ecosystems , 3(6):498--506
work page 2000
-
[24]
Maynard Smith , J. (1982). Evolution and the Theory of Games . Cambridge University Press, Cambridge
work page 1982
-
[25]
Newman, M. (2018). Networks, 2nd edition, . Oxford university press
work page 2018
-
[26]
Nowak, M. A. and May, R. M. (1992). Evolutionary games and spatial chaos . Nature , 359(6398):826--829
work page 1992
-
[27]
Paiva, A., Santos, F. P. F. C., and Santos, F. P. F. C. (2018). Engineering pro-sociality with autonomous agents . In Proceedings of the AAAI Conference on Artificial Intelligence , volume 32, pages 7994--7999
work page 2018
-
[28]
Penn, A. S., Watson, R. A., Kraaijeveld, A., and Webb, J. (2010). Systems Aikido-A Novel Approach to Managing Natural Systems. In in Proc. of the ALIFE XII Conference , pages 577--580. MIT press
work page 2010
-
[29]
Perc, M. (2012). Sustainable institutionalized punishment requires elimination of second-order free-riders. Scientific reports , 2(1):1--6
work page 2012
-
[30]
Perc, M., Jordan, J. J., Rand, D. G., Wang, Z., Boccaletti, S., and Szolnoki, A. (2017). Statistical physics of human cooperation . Physics Reports , 687:1--51
work page 2017
-
[31]
Raghunandan, M. A. and Subramanian, C. A. (2012). Sustaining cooperation on networks: an analytical study based on evolutionary game theory . In AAMAS'12 , volume 12, pages 913--920. Citeseer
work page 2012
-
[32]
Rand, D. G. and Nowak, M. A. (2013). Human cooperation . Trends in Cognitive Sciences , 17(8):413--425
work page 2013
-
[33]
Rand, D. G., Nowak, M. A., Fowler, J. H., and Christakis, N. A. (2014). Static network structure can stabilize human cooperation . Proc Natl Acad Sci USA , 111(48):17093--17098
work page 2014
-
[34]
Rand, D. G., Tarnita, C. E., Ohtsuki, H., and Nowak, M. A. (2013). Evolution of fairness in the one-shot anonymous Ultimatum Game . Proceedings of the National Academy of Sciences , 110(7):2581--2586
work page 2013
-
[35]
Santos, F. C. and Pacheco, J. M. (2005). Scale-free networks provide a unifying framework for the emergence of cooperation. Phys. Rev. Lett. , 95:98104
work page 2005
-
[36]
Santos, F. C., Pacheco, J. M., and Lenaerts, T. (2006). Evolutionary Dynamics of Social Dilemmas in Structured Heterogeneous Populations . Proceedings of the National Academy of Sciences of the United States of America , 103:3490--3494
work page 2006
-
[37]
Santos, F. C., Santos, M. D., and Pacheco, J. M. (2008). Social diversity promotes the emergence of cooperation in public goods games . Nature , 454:214--216
work page 2008
-
[38]
Sasaki, T., Okada, I., Uchida, S., and Chen, X. (2015). Commitment to cooperation and peer punishment: Its evolution. Games , 6(4):574--587
work page 2015
-
[39]
Sigmund, K. (2010). The Calculus of Selfishness . Princeton University Press
work page 2010
-
[40]
Sigmund, K., Hauert, C., and Nowak, M. (2001). Reward and punishment . Proceedings of the National Academy of Sciences , 98(19):10757--10762
work page 2001
-
[41]
D., Traulsen, A., and Hauert, C
Sigmund, K., Silva, H. D., Traulsen, A., and Hauert, C. (2010). Social learning promotes institutions for governing the commons . Nature , 466:7308
work page 2010
-
[42]
Su, J., Sharma, A., and Goel, S. (2016). The effect of recommendations on network structure . 25th International World Wide Web Conference, WWW 2016 , pages 1157--1167
work page 2016
-
[43]
Szab \' o , G. and F \' a th, G. (2007). Evolutionary games on graphs . Physics Reports , 446(4-6):97--216
work page 2007
-
[44]
Traulsen, A., Nowak, M. A., and Pacheco, J. M. (2006). Stochastic Dynamics of Invasion and Fixation . Phys. Rev. E , 74:11909
work page 2006
-
[45]
Wang, S., Chen, X., and Szolnoki, A. (2019). Exploring optimal institutional incentives for public cooperation . Communications in Nonlinear Science and Numerical Simulation , 79:104914
work page 2019
-
[46]
Zisis, I., Guida, S. D., Han, T. A., Kirchsteiger, G., and Lenaerts, T. (2015). Generosity motivated by acceptance - evolutionary analysis of an anticipation games . Scientific reports , 5(18076)
work page 2015
-
[47]
Brandt, H. and Sigmund, K. (2004). The logic of reprobation: Assessment and action rules for indirect reciprocation . Journal of Theoretical Biology , 231(4):475--486
work page 2004
-
[48]
Fujimoto, Y. and Ohtsuki, H. (2022). Reputation structure in indirect reciprocity under noisy and private assessment . Scientific Reports , 12(1):1--13
work page 2022
-
[49]
Hilbe, C., Schmid, L., Tkadlec, J., Chatterjee, K., and Nowak, M. A. (2018). Indirect reciprocity with private, noisy, and incomplete information . Proceedings of the National Academy of Sciences , page 201810565
work page 2018
-
[50]
Krellner, M. and Han, T. A. (2020). Putting oneself in everybody's shoes - Pleasing enables indirect reciprocity under private assessments . In The 2020 Conference on Artificial Life , pages 402--410, Cambridge, MA. MIT Press
work page 2020
-
[51]
Krellner, M. and Han, T. A. (2021). Pleasing Enhances Indirect Reciprocity-Based Cooperation Under Private Assessment . Artificial Life , pages 1--31
work page 2021
-
[52]
Krellner, M. and Han, T. A. (2022). The Last One Standing? - Recent Findings on the Feasibility of Indirect Reciprocity under Private Assessment . In The 2022 Conference on Artificial Life , volume 1, Cambridge, MA. MIT Press
work page 2022
-
[53]
Leimar, O. and Hammerstein, P. (2001). Evolution of cooperation through indirect reciprocity
work page 2001
-
[54]
Nowak, M. A. and Sigmund, K. (1998a). Evolution of indirect reciprocity by image scoring . Nature , 393(6685):573--577
-
[55]
Nowak, M. A. and Sigmund, K. (1998b). The dynamics of indirect reciprocity . Journal of Theoretical Biology , 194(4):561--574
-
[56]
Nowak, M. A. and Sigmund, K. (2005). Evolution of indirect reciprocity . Nature , 437(7063):1291--1298
work page 2005
-
[57]
Ohtsuki, H. (2004). Reactive strategies in indirect reciprocity . Journal of Theoretical Biology , 227(3):299--314
work page 2004
-
[58]
Ohtsuki, H. and Iwasa, Y. (2004). How should we define goodness? - Reputation dynamics in indirect reciprocity . Journal of Theoretical Biology , 231(1):107--120
work page 2004
-
[59]
Ohtsuki, H. and Iwasa, Y. (2006). The leading eight: Social norms that can maintain cooperation by indirect reciprocity . Journal of Theoretical Biology , 239(4):435--444
work page 2006
-
[60]
Okada, I. (2020a). A Review of Theoretical Studies on Indirect Reciprocity . Games , 11(3):27
-
[61]
Okada, I. (2020b). Two ways to overcome the three social dilemmas of indirect reciprocity . Scientific Reports , 10(1)
-
[62]
Okada, I., Sasaki, T., and Nakai, Y. (2017). Tolerant indirect reciprocity can boost social welfare through solidarity with unconditional cooperators in private monitoring . Scientific Reports , 7(1):9737
work page 2017
-
[63]
Okada, I., Sasaki, T., and Nakai, Y. (2018). A solution for private assessment in indirect reciprocity using solitary observation . Journal of Theoretical Biology , 455:7--15
work page 2018
-
[64]
Panchanathan, K. and Boyd, R. (2003). A tale of two defectors: The importance of standing for evolution of indirect reciprocity . Journal of Theoretical Biology , 224(1):115--126
work page 2003
-
[65]
Perret, C., Krellner, M., and Han, T. A. (2021). The evolution of moral rules in a model of indirect reciprocity with private assessment - DRAFT . Scientific Reports , 11(1)
work page 2021
-
[66]
Santos, F. P., Pacheco, J. M., and Santos, F. C. (2021). The complexity of human cooperation under indirect reciprocity . Philosophical Transactions of the Royal Society B: Biological Sciences , 376(1838)
work page 2021
-
[67]
Sasaki, T., Okada, I., and Nakai, Y. (2017). The evolution of conditional moral assessment in indirect reciprocity . Scientific Reports , 7(1):41870
work page 2017
-
[68]
Schmid, L., Chatterjee, K., Hilbe, C., and Nowak, M. A. (2021a). A unified framework of direct and indirect reciprocity . Nature Human Behaviour , 5(10):1292--1302
-
[69]
Schmid, L., Hilbe, C., Chatterjee, K., and Nowak, M. A. (2022). Direct reciprocity between individuals that use different strategy spaces . PLOS Computational Biology , 18(6):e1010149
work page 2022
-
[70]
Schmid, L., Shati, P., Hilbe, C., and Chatterjee, K. (2021b). The evolution of indirect reciprocity under action and assessment generosity . Scientific Reports , 11(1):1--14
-
[71]
Sigmund, K. (2016). The calculus of selflessness . Princeton University Press
work page 2016
-
[72]
Sudgen, R. (1986). The Economics of Rights, Cooperation and Welfare . Basic Blackwell
work page 1986
-
[73]
Uchida, S. (2010). Effect of private information on indirect reciprocity . Physical Review E , 82(3):036111
work page 2010
-
[74]
Uchida, S. and Sasaki, T. (2013). Effect of assessment error and private information on stern-judging in indirect reciprocity . Chaos, Solitons & Fractals , 56:175--180
work page 2013
-
[75]
Nowak, M. A. and Sigmund, K. (1992). Tit for tat in heterogeneous populations . Nature , 355:250--252
work page 1992
-
[76]
Nowak, M. A. and Sigmund, K. (1998). Evolution of indirect reciprocity by image scoring . Nature , 393(6685):573--577
work page 1998
-
[77]
Okada, I. (2020). Two ways to overcome the three social dilemmas of indirect reciprocity . Scientific Reports , 10(1)
work page 2020
-
[78]
Schmid, L., Chatterjee, K., Hilbe, C., and Nowak, M. A. (2021). A unified framework of direct and indirect reciprocity . Nature Human Behaviour , 5(10):1292--1302
work page 2021
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.