pith. sign in

arxiv: 2304.14826 · v1 · submitted 2023-04-22 · ⚛️ physics.soc-ph · cs.GT

We both think you did wrong -- How agreement shapes and is shaped by indirect reciprocity

Pith reviewed 2026-05-24 09:22 UTC · model grok-4.3

classification ⚛️ physics.soc-ph cs.GT
keywords indirect reciprocitymoral normsagreementreputationcooperationprivate assessmentanalytical predictionobservation rate
0
0 comments X

The pith

Certain moral judgment norms produce high agreement among private assessors even without shared information, and this agreement can be predicted analytically for any observation rate while also shaping overall cooperation levels.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper investigates moral judgments in indirect reciprocity, where individuals decide whether to help others based on reputations built from observed actions. It establishes that some norms for assigning good or bad status lead to substantial agreement across independent private assessments. The authors explain the mechanisms behind this convergence and provide an analytical method to compute agreement levels exactly, without needing simulations, that works at any frequency of observing others. They also demonstrate that the resulting agreement can raise or lower average reputations, which in turn increases or decreases the amount of helpful behavior that occurs.

Core claim

Even when every individual assesses actions privately, particular moral judgment norms generate high levels of agreement on who is good or bad; these agreement levels admit an exact analytical prediction that holds for arbitrary observation rates and does not require agent-based simulations; the agreement in turn modulates reputations and therefore the equilibrium level of cooperation.

What carries the argument

The analytical derivation that computes population-wide agreement directly from the structure of a moral judgment norm and the observation rate, by tracking how private assessments of the same actions align or diverge under the norm's assignment rules.

If this is right

  • Agreement produced by a norm directly determines the average reputation in the population.
  • Norms that increase agreement can raise average reputations and therefore raise the level of cooperation.
  • Norms that decrease agreement can lower average reputations and therefore lower the level of cooperation.
  • The relationship between norm structure, agreement, and cooperation holds independently of how often individuals observe actions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same analytical approach could be applied to other reputation systems, such as online review platforms, to predict when private ratings will converge without central coordination.
  • Selecting norms that maximize agreement might offer a way to sustain cooperation in large groups where public reputation sharing is costly or impossible.
  • The method opens the possibility of classifying entire families of norms by their predicted agreement and cooperation effects before any simulation is run.

Load-bearing premise

Moral judgment norms can be formalized precisely enough for their induced agreement levels to follow an exact analytical formula that remains valid at every observation rate.

What would settle it

Compare the analytically predicted agreement values against measured agreement in agent-based simulations that use the same norms at several different observation rates; systematic mismatch at any rate would disprove the derivation.

Figures

Figures reproduced from arXiv: 2304.14826 by Marcus Krellner, The Anh Han.

Figure 1
Figure 1. Figure 1: a) Two dimensions of global opinion state. Graph shows agreement-reputation space, with the solid lines a = 1 and a = ba indicating outer borders of possible agreement values in grey. (Subsequently, values above a = ba + 0.05 will be considered as significant additional agreement, since values very close to ba could conceivably be the result of noise or finite properties of the simulations; or be just too … view at source ↗
read the original abstract

Humans judge each other's actions, which at least partly functions to detect and deter cheating and to enable helpfulness in an indirect reciprocity fashion. However, most forms of judging do not only concern the action itself, but also the moral status of the receiving individual (to deter cheating it must be morally acceptable to withhold help from cheaters). This is a problem, when not everybody agrees who is good and who is bad. Although it has been widely acknowledged that disagreement may exist and that it can be detrimental for indirect reciprocity, the details of this crucial feature of moral judgments have never been studied in depth. We show, that even when everybody assesses individually (aka privately), some moral judgement systems (aka norms) can lead to high levels of agreement. We give a detailed account of the mechanisms which cause it and we show how to predict agreement analytically without requiring agent-based simulations, and for any observation rate. Finally, we show that agreement may increase or decrease reputations and therefore how much helpfulness (aka cooperation) occurs.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper examines moral judgment norms in indirect reciprocity, showing that some norms produce high agreement levels even under fully private (individual) assessments. It details the mechanisms driving agreement, derives an analytical prediction for agreement that holds for arbitrary observation rates without needing agent-based simulations, and demonstrates that agreement can raise or lower reputations and thereby modulate cooperation levels.

Significance. If the analytical derivation is exact and independent of simulation outputs, the work would advance indirect reciprocity theory by supplying a closed-form route to agreement levels across norms and observation rates. This would reduce reliance on computational checks and clarify how private moral judgments affect reputation dynamics and helpfulness.

major comments (2)
  1. [§4] §4 (Analytical derivation of agreement): the claim that agreement admits an exact analytical expression valid for any observation rate must be supported by an explicit formula whose independence from simulation outputs is demonstrated. If the derivation invokes mean-field closures or independence assumptions between private assessments, these must be shown to remain valid when observation rates induce correlations; otherwise the 'without requiring agent-based simulations' guarantee fails for the full parameter range.
  2. [Results] Results section comparing analytical predictions to simulations: the manuscript must include direct numerical checks of the closed-form agreement expression against agent-based runs at both low and high observation rates (e.g., p_obs = 0.1 and p_obs = 0.9) for at least two distinct norms. Without these checks the central claim that the prediction holds exactly remains unverified.
minor comments (2)
  1. [Abstract] Abstract: the phrase 'for any observation rate' should be qualified by the range actually covered by the derivation (e.g., 0 < p_obs ≤ 1) to avoid overstatement.
  2. [Model] Notation: define the observation rate symbol (p_obs or equivalent) at first use and ensure it is used consistently in all equations and figure captions.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful and constructive report. The two major comments identify areas where the presentation of the analytical result can be strengthened. We address each point below and will revise the manuscript to incorporate the requested clarifications and verifications.

read point-by-point responses
  1. Referee: [§4] §4 (Analytical derivation of agreement): the claim that agreement admits an exact analytical expression valid for any observation rate must be supported by an explicit formula whose independence from simulation outputs is demonstrated. If the derivation invokes mean-field closures or independence assumptions between private assessments, these must be shown to remain valid when observation rates induce correlations; otherwise the 'without requiring agent-based simulations' guarantee fails for the full parameter range.

    Authors: The derivation in §4 solves the stationary distribution of the finite-state Markov chain whose states are the possible reputation configurations under private assessments. Because the transition matrix is constructed directly from the observation process (each observer draws an independent Bernoulli trial with success probability p_obs), the resulting expression for agreement is closed-form and exact for every p_obs in [0,1]; no mean-field closure or extra independence assumption is introduced. We will insert the explicit formula together with a short derivation appendix that makes this independence from simulation outputs explicit. revision: yes

  2. Referee: [Results] Results section comparing analytical predictions to simulations: the manuscript must include direct numerical checks of the closed-form agreement expression against agent-based runs at both low and high observation rates (e.g., p_obs = 0.1 and p_obs = 0.9) for at least two distinct norms. Without these checks the central claim that the prediction holds exactly remains unverified.

    Authors: We agree that explicit verification at the extremes strengthens the claim. In the revised Results section we will add side-by-side comparisons of the analytical formula against agent-based simulations for p_obs = 0.1 and p_obs = 0.9, using the stern-judging and shunning norms. The new panels will report both the analytical value and the simulation mean with 95 % confidence intervals, confirming agreement within sampling error. revision: yes

Circularity Check

0 steps flagged

No significant circularity; analytical predictions derive independently from norm formalization

full rationale

The paper formalizes moral judgment norms and derives closed-form expressions for agreement levels as functions of observation rate and norm parameters. These expressions are obtained directly from the probabilistic structure of private assessments and do not reduce to fitted quantities, self-citations, or ansatzes imported from prior work by the same authors. The claim of prediction without simulations is supported by explicit derivation steps that remain valid under the stated independence assumptions; no load-bearing step collapses to a self-referential definition or renaming of an empirical pattern. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no specific free parameters, axioms, or invented entities are described in the provided text.

pith-pipeline@v0.9.0 · 5708 in / 1110 out tokens · 30909 ms · 2026-05-24T09:22:27.414993+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

78 extracted references · 78 canonical work pages

  1. [1]

    Albert, R., Jeong, H., and Barab \' a si, A.-L. (1999). Diameter of the World-Wide Web . Nature , 401(6749):130--131

  2. [2]

    A., Lewis, P

    Andras, P., Esterle, L., Guckert, M., Han, T. A., Lewis, P. R., Milanovic, K., Payne, T., Perret, C., Pitt, J., Powers, S. T., Urquhart, N., and Wells, S. (2018). Trusting Intelligent Machines: Deepening Trust Within Socio-Technical Systems . IEEE Technology and Society Magazine , 37(4):76--83

  3. [3]

    Barabasi, A.-L. (2014). Linked-how Everything is Connected to Everything Else and what it Means F . Perseus Books Group

  4. [4]

    Barab \' a si, A.-L. (2016). Network Science . Cambridge University Press

  5. [5]

    and Albert, R

    Barab \' a si, A.-L. and Albert, R. (1999). Emergence of scaling in random networks . science , 286(5439):509--512

  6. [6]

    and Pastor-Satorras, R

    Barrat, A. and Pastor-Satorras, R. (2005). Rate equation approach for correlations in growing network models . Physical Review E , 71(3):36127

  7. [7]

    a nnstr \

    Chen, X., Sasaki, T., Br \" a nnstr \" o m, ., and Dieckmann, U. (2015). First carrot, then stick: how the adaptive hybridization of incentives promotes cooperation . Journal of the royal society interface , 12(102):20140935

  8. [8]

    Cimpeanu, T., Di Stefano , A., Perret, C., and Han, T. A. (2023). Social diversity reduces the complexity and cost of fostering fairness. Chaos, Solitons & Fractals , 167:113051

  9. [9]

    and Han, T

    Cimpeanu, T. and Han, T. A. (2020). Making an example: signalling threat in the evolution of cooperation. In 2020 IEEE Congress on Evolutionary Computation (CEC) , pages 1--8. IEEE

  10. [10]

    Cimpeanu, T., Perret, C., and Han, T. A. (2021). Cost-efficient interventions for promoting fairness in the ultimatum game . Knowledge-Based Systems , 233:107545

  11. [11]

    Dafoe, A., Bachrach, Y., Hadfield, G., Horvitz, E., Larson, K., Graepel, T., et al. (2021). Cooperative ai: machines must learn to find common ground. Nature , 593(7857):33--36

  12. [12]

    Duong, M., Durbac, C., and Han, T. (2022). Cost optimisation of hybrid institutional incentives for promoting cooperation in finite populations. arXiv preprint arXiv:2212.08823

  13. [13]

    Duong, M. H. and Han, T. A. (2021). Cost efficiency of institutional incentives in finite populations . Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences

  14. [14]

    R., Santos, F

    G \' o is, A. R., Santos, F. P., Pacheco, J. M., and Santos, F. C. (2019). Reward and punishment in climate change dilemmas . Sci. Rep. , 9(1):1--9

  15. [15]

    Han, T. A. (2022). Institutional incentives for the evolution of committed cooperation: ensuring participation is as important as enhancing compliance. Journal of the Royal Society Interface , 19(188):20220036

  16. [16]

    A., Lynch, S., Tran-Thanh, L., and Santos, F

    Han, T. A., Lynch, S., Tran-Thanh, L., and Santos, F. C. (2018). Fostering Cooperation in Structured Populations Through Local and Global Interference Strategies . In IJCAI-ECAI'2018 , pages 289--295. AAAI Press

  17. [17]

    A., Pereira, L

    Han, T. A., Pereira, L. M., and Lenaerts, T. (2019). Modelling and Influencing the AI Bidding War: A Research Agenda . In Proceedings of the AAAI/ACM conference AI, Ethics and Society , pages 5--11

  18. [18]

    A., Perret, C., and Powers, S

    Han, T. A., Perret, C., and Powers, S. T. (2021). When to (or not to) trust intelligent machines: Insights from an evolutionary game theory analysis of trust in repeated games. Cognitive Systems Research , 68:111--124

  19. [19]

    A., Santos, F

    Han, T. A., Santos, F. C., Lenaerts, T., and Pereira, L. M. (2015). Synergy between intention recognition and commitments in cooperation dilemmas . Scientific reports , 5(9312)

  20. [20]

    Han, T. A. and Tran-Thanh, L. (2018). Cost-effective external interference for promoting the evolution of cooperation . Scientific reports , 8(15997)

  21. [21]

    Hilbe, C., Traulsen, A., R \" o hl, T., and Milinski, M. (2014). Democratic decisions establish stable authorities that overcome the paradox of second-order punishment . PNAS , 111(2):752--756

  22. [22]

    and Sigmund, K

    Hofbauer, J. and Sigmund, K. (1998). Evolutionary Games and Population Dynamics . Cambridge University Press

  23. [23]

    Levin, S. A. (2000). Multiple scales and the maintenance of biodiversity . Ecosystems , 3(6):498--506

  24. [24]

    Maynard Smith , J. (1982). Evolution and the Theory of Games . Cambridge University Press, Cambridge

  25. [25]

    Newman, M. (2018). Networks, 2nd edition, . Oxford university press

  26. [26]

    Nowak, M. A. and May, R. M. (1992). Evolutionary games and spatial chaos . Nature , 359(6398):826--829

  27. [27]

    Paiva, A., Santos, F. P. F. C., and Santos, F. P. F. C. (2018). Engineering pro-sociality with autonomous agents . In Proceedings of the AAAI Conference on Artificial Intelligence , volume 32, pages 7994--7999

  28. [28]

    S., Watson, R

    Penn, A. S., Watson, R. A., Kraaijeveld, A., and Webb, J. (2010). Systems Aikido-A Novel Approach to Managing Natural Systems. In in Proc. of the ALIFE XII Conference , pages 577--580. MIT press

  29. [29]

    Perc, M. (2012). Sustainable institutionalized punishment requires elimination of second-order free-riders. Scientific reports , 2(1):1--6

  30. [30]

    J., Rand, D

    Perc, M., Jordan, J. J., Rand, D. G., Wang, Z., Boccaletti, S., and Szolnoki, A. (2017). Statistical physics of human cooperation . Physics Reports , 687:1--51

  31. [31]

    Raghunandan, M. A. and Subramanian, C. A. (2012). Sustaining cooperation on networks: an analytical study based on evolutionary game theory . In AAMAS'12 , volume 12, pages 913--920. Citeseer

  32. [32]

    Rand, D. G. and Nowak, M. A. (2013). Human cooperation . Trends in Cognitive Sciences , 17(8):413--425

  33. [33]

    G., Nowak, M

    Rand, D. G., Nowak, M. A., Fowler, J. H., and Christakis, N. A. (2014). Static network structure can stabilize human cooperation . Proc Natl Acad Sci USA , 111(48):17093--17098

  34. [34]

    G., Tarnita, C

    Rand, D. G., Tarnita, C. E., Ohtsuki, H., and Nowak, M. A. (2013). Evolution of fairness in the one-shot anonymous Ultimatum Game . Proceedings of the National Academy of Sciences , 110(7):2581--2586

  35. [35]

    Santos, F. C. and Pacheco, J. M. (2005). Scale-free networks provide a unifying framework for the emergence of cooperation. Phys. Rev. Lett. , 95:98104

  36. [36]

    C., Pacheco, J

    Santos, F. C., Pacheco, J. M., and Lenaerts, T. (2006). Evolutionary Dynamics of Social Dilemmas in Structured Heterogeneous Populations . Proceedings of the National Academy of Sciences of the United States of America , 103:3490--3494

  37. [37]

    C., Santos, M

    Santos, F. C., Santos, M. D., and Pacheco, J. M. (2008). Social diversity promotes the emergence of cooperation in public goods games . Nature , 454:214--216

  38. [38]

    Sasaki, T., Okada, I., Uchida, S., and Chen, X. (2015). Commitment to cooperation and peer punishment: Its evolution. Games , 6(4):574--587

  39. [39]

    Sigmund, K. (2010). The Calculus of Selfishness . Princeton University Press

  40. [40]

    Sigmund, K., Hauert, C., and Nowak, M. (2001). Reward and punishment . Proceedings of the National Academy of Sciences , 98(19):10757--10762

  41. [41]

    D., Traulsen, A., and Hauert, C

    Sigmund, K., Silva, H. D., Traulsen, A., and Hauert, C. (2010). Social learning promotes institutions for governing the commons . Nature , 466:7308

  42. [42]

    Su, J., Sharma, A., and Goel, S. (2016). The effect of recommendations on network structure . 25th International World Wide Web Conference, WWW 2016 , pages 1157--1167

  43. [43]

    and F \' a th, G

    Szab \' o , G. and F \' a th, G. (2007). Evolutionary games on graphs . Physics Reports , 446(4-6):97--216

  44. [44]

    A., and Pacheco, J

    Traulsen, A., Nowak, M. A., and Pacheco, J. M. (2006). Stochastic Dynamics of Invasion and Fixation . Phys. Rev. E , 74:11909

  45. [45]

    Wang, S., Chen, X., and Szolnoki, A. (2019). Exploring optimal institutional incentives for public cooperation . Communications in Nonlinear Science and Numerical Simulation , 79:104914

  46. [46]

    D., Han, T

    Zisis, I., Guida, S. D., Han, T. A., Kirchsteiger, G., and Lenaerts, T. (2015). Generosity motivated by acceptance - evolutionary analysis of an anticipation games . Scientific reports , 5(18076)

  47. [47]

    and Sigmund, K

    Brandt, H. and Sigmund, K. (2004). The logic of reprobation: Assessment and action rules for indirect reciprocation . Journal of Theoretical Biology , 231(4):475--486

  48. [48]

    and Ohtsuki, H

    Fujimoto, Y. and Ohtsuki, H. (2022). Reputation structure in indirect reciprocity under noisy and private assessment . Scientific Reports , 12(1):1--13

  49. [49]

    Hilbe, C., Schmid, L., Tkadlec, J., Chatterjee, K., and Nowak, M. A. (2018). Indirect reciprocity with private, noisy, and incomplete information . Proceedings of the National Academy of Sciences , page 201810565

  50. [50]

    and Han, T

    Krellner, M. and Han, T. A. (2020). Putting oneself in everybody's shoes - Pleasing enables indirect reciprocity under private assessments . In The 2020 Conference on Artificial Life , pages 402--410, Cambridge, MA. MIT Press

  51. [51]

    and Han, T

    Krellner, M. and Han, T. A. (2021). Pleasing Enhances Indirect Reciprocity-Based Cooperation Under Private Assessment . Artificial Life , pages 1--31

  52. [52]

    and Han, T

    Krellner, M. and Han, T. A. (2022). The Last One Standing? - Recent Findings on the Feasibility of Indirect Reciprocity under Private Assessment . In The 2022 Conference on Artificial Life , volume 1, Cambridge, MA. MIT Press

  53. [53]

    and Hammerstein, P

    Leimar, O. and Hammerstein, P. (2001). Evolution of cooperation through indirect reciprocity

  54. [54]

    Nowak, M. A. and Sigmund, K. (1998a). Evolution of indirect reciprocity by image scoring . Nature , 393(6685):573--577

  55. [55]

    Nowak, M. A. and Sigmund, K. (1998b). The dynamics of indirect reciprocity . Journal of Theoretical Biology , 194(4):561--574

  56. [56]

    Nowak, M. A. and Sigmund, K. (2005). Evolution of indirect reciprocity . Nature , 437(7063):1291--1298

  57. [57]

    Ohtsuki, H. (2004). Reactive strategies in indirect reciprocity . Journal of Theoretical Biology , 227(3):299--314

  58. [58]

    and Iwasa, Y

    Ohtsuki, H. and Iwasa, Y. (2004). How should we define goodness? - Reputation dynamics in indirect reciprocity . Journal of Theoretical Biology , 231(1):107--120

  59. [59]

    and Iwasa, Y

    Ohtsuki, H. and Iwasa, Y. (2006). The leading eight: Social norms that can maintain cooperation by indirect reciprocity . Journal of Theoretical Biology , 239(4):435--444

  60. [60]

    Okada, I. (2020a). A Review of Theoretical Studies on Indirect Reciprocity . Games , 11(3):27

  61. [61]

    Okada, I. (2020b). Two ways to overcome the three social dilemmas of indirect reciprocity . Scientific Reports , 10(1)

  62. [62]

    Okada, I., Sasaki, T., and Nakai, Y. (2017). Tolerant indirect reciprocity can boost social welfare through solidarity with unconditional cooperators in private monitoring . Scientific Reports , 7(1):9737

  63. [63]

    Okada, I., Sasaki, T., and Nakai, Y. (2018). A solution for private assessment in indirect reciprocity using solitary observation . Journal of Theoretical Biology , 455:7--15

  64. [64]

    and Boyd, R

    Panchanathan, K. and Boyd, R. (2003). A tale of two defectors: The importance of standing for evolution of indirect reciprocity . Journal of Theoretical Biology , 224(1):115--126

  65. [65]

    Perret, C., Krellner, M., and Han, T. A. (2021). The evolution of moral rules in a model of indirect reciprocity with private assessment - DRAFT . Scientific Reports , 11(1)

  66. [66]

    P., Pacheco, J

    Santos, F. P., Pacheco, J. M., and Santos, F. C. (2021). The complexity of human cooperation under indirect reciprocity . Philosophical Transactions of the Royal Society B: Biological Sciences , 376(1838)

  67. [67]

    Sasaki, T., Okada, I., and Nakai, Y. (2017). The evolution of conditional moral assessment in indirect reciprocity . Scientific Reports , 7(1):41870

  68. [68]

    Schmid, L., Chatterjee, K., Hilbe, C., and Nowak, M. A. (2021a). A unified framework of direct and indirect reciprocity . Nature Human Behaviour , 5(10):1292--1302

  69. [69]

    Schmid, L., Hilbe, C., Chatterjee, K., and Nowak, M. A. (2022). Direct reciprocity between individuals that use different strategy spaces . PLOS Computational Biology , 18(6):e1010149

  70. [70]

    Schmid, L., Shati, P., Hilbe, C., and Chatterjee, K. (2021b). The evolution of indirect reciprocity under action and assessment generosity . Scientific Reports , 11(1):1--14

  71. [71]

    Sigmund, K. (2016). The calculus of selflessness . Princeton University Press

  72. [72]

    Sudgen, R. (1986). The Economics of Rights, Cooperation and Welfare . Basic Blackwell

  73. [73]

    Uchida, S. (2010). Effect of private information on indirect reciprocity . Physical Review E , 82(3):036111

  74. [74]

    and Sasaki, T

    Uchida, S. and Sasaki, T. (2013). Effect of assessment error and private information on stern-judging in indirect reciprocity . Chaos, Solitons & Fractals , 56:175--180

  75. [75]

    Nowak, M. A. and Sigmund, K. (1992). Tit for tat in heterogeneous populations . Nature , 355:250--252

  76. [76]

    Nowak, M. A. and Sigmund, K. (1998). Evolution of indirect reciprocity by image scoring . Nature , 393(6685):573--577

  77. [77]

    Okada, I. (2020). Two ways to overcome the three social dilemmas of indirect reciprocity . Scientific Reports , 10(1)

  78. [78]

    Schmid, L., Chatterjee, K., Hilbe, C., and Nowak, M. A. (2021). A unified framework of direct and indirect reciprocity . Nature Human Behaviour , 5(10):1292--1302