How Human Feedback Shapes AI-generated Community Notes

Isaac Slaughter; Jiawei Guo; Jiayuan Yan; Martin Saveski; Qiao-Yun Cheng; Soham De; Sruti Banerjee

arxiv: 2606.30905 · v1 · pith:AH2SZT7Znew · submitted 2026-06-29 · 💻 cs.CY · cs.AI

How Human Feedback Shapes AI-generated Community Notes

Soham De , Isaac Slaughter , Jiawei Guo , Qiao-Yun Cheng , Jiayuan Yan , Sruti Banerjee , Martin Saveski This is my paper

Pith reviewed 2026-07-01 01:13 UTC · model grok-4.3

classification 💻 cs.CY cs.AI

keywords Community Noteshuman feedbackAI-generated notesfact-checkingcrowdsourced moderationsocial mediaLLM refinementhelpful ratings

0 comments

The pith

Human feedback improves AI-generated Community Notes, with the largest gains from suggestions that challenge the main claim in prior drafts.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper analyzes the full set of collaborative notes on X, where an LLM first drafts a note and humans then suggest revisions. It shows that feedback involving factual corrections or added context is most often adopted, while subjective judgments are rarely incorporated. Feedback that challenges the central claim produces the clearest rise in helpfulness scores, and suggestions from more active contributors carry more weight. Yet collaborative notes still reach the platform's helpful threshold and appear to users at lower rates than purely human or purely AI notes, mainly because few people participate. These notes fill a distinct niche by addressing posts that attract neither human-only nor AI-only notes.

Core claim

Human feedback on LLM-drafted Community Notes produces more helpful versions, driven especially by suggestions that contest the main claim of the previous draft and come from frequent contributors. Although scores rise, the notes achieve helpful status and visibility less often than human-only or AI-only notes, with low participation acting as the main constraint. Collaborative notes do not replace the other two types but instead cover a largely separate set of posts.

What carries the argument

Iterative refinement loop in which humans submit categorized suggestions to LLM-drafted notes, with selective incorporation of those suggestions into later versions.

If this is right

Suggestions that challenge the main claim produce larger helpfulness increases than other feedback types.
Feedback from more active contributors has greater effect than feedback from less active ones.
Limited human participation prevents collaborative notes from matching the display rates of human-only or AI-only notes.
Collaborative notes address posts that receive neither human-only nor AI-only notes.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Increasing participation rates could raise the overall share of posts that receive any note.
Design changes that surface challenging suggestions might accelerate quality gains in future iterations.
The same feedback taxonomy could be applied to test whether similar patterns appear on other platforms using Community Notes.

Load-bearing premise

The platform's internal helpfulness metric and visibility rules measure note quality independently of whether the note was produced through collaborative drafting.

What would settle it

A controlled comparison that holds the post fixed and randomly assigns either human feedback iterations or no feedback, then measures the resulting helpfulness ratings and display rates, would test whether feedback itself drives the observed gains.

Figures

Figures reproduced from arXiv: 2606.30905 by Isaac Slaughter, Jiawei Guo, Jiayuan Yan, Martin Saveski, Qiao-Yun Cheng, Soham De, Sruti Banerjee.

**Figure 1.** Figure 1: Lifecycle of a Collaborative Note on X. ❶ When a potentially misleading post is flagged, an LLM generates a draft note which is then rated by human contributors. ❷ Raters assess helpfulness and may optionally submit revision suggestions; a matrix factorization model scores the note based on these ratings and assigns it a status of Currently Rated Helpful, Needs More Ratings, or Currently Rated Not Helpful.… view at source ↗

**Figure 2.** Figure 2: Frequency and success rates for suggestions with [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 3.** Figure 3: Characteristics of users who rated collaborative vs. [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Evolution of collaborative notes across versions. [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗

**Figure 7.** Figure 7: Performance of collaborative notes relative to non [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗

**Figure 6.** Figure 6: Associations between note improvement in an iter [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗

**Figure 8.** Figure 8: Speed of rating accumulation for different note types, number of raters for different note types, and rater retention for [PITH_FULL_IMAGE:figures/full_fig_p008_8.png] view at source ↗

**Figure 9.** Figure 9: Collaborative notes currently under-perform non [PITH_FULL_IMAGE:figures/full_fig_p011_9.png] view at source ↗

read the original abstract

Community Notes, a bridging-based crowd-sourced fact-checking system, has emerged as a new mechanism for moderating misleading information on social media and has been adopted by major platforms including X, Facebook, Instagram, Threads, and TikTok. Since its introduction, there has been an open question about what role AI could play in scaling and optimizing the system. Recently, X extended its Community Notes system by introducing Collaborative Notes: notes initially drafted by an LLM and iteratively refined based on feedback from human contributors. In this work, we systematically analyze the complete corpus of 19,146 collaborative notes and 211,850 instances of human feedback. First, we develop a taxonomy of human suggestions for improving AI-generated note drafts and find that suggestions involving factual corrections and additional context are most likely to be incorporated, while subjective policy judgments rarely are. Second, we examine changes in helpfulness across versions of collaborative notes and find that human feedback leads to more helpful notes, with the greatest impact coming from suggestions that challenge the main claim in the previous draft, particularly when submitted by more active contributors. Finally, we find that although collaborative notes improve through human feedback, they reach helpful status and are shown on the platform at lower rates than human-only or AI-only notes, with limited human participation emerging as a key bottleneck. Nevertheless, rather than serving as a weaker substitute, collaborative notes tend to play a complementary role, predominantly targeting posts that do not attract human-only or AI-only notes. Our analysis provides an initial description of efforts to use AI to improve crowdsourced content moderation in a real-world moderation system and outlines pathways for future improvements to such features.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

First large-scale look at human edits to AI-drafted Community Notes on X, with clear descriptive patterns but open causal questions.

read the letter

This paper gives the first systematic view of X's Collaborative Notes, where LLMs draft and humans iterate. They tracked 19k notes and 211k feedback instances, built a taxonomy of suggestion types, measured version-to-version helpfulness shifts, and compared adoption rates against human-only and AI-only notes.

The descriptive work is solid. Factual corrections and added context get incorporated far more than policy opinions. Notes improve after feedback, with bigger gains from challenging suggestions by active contributors. Collaborative notes still lag in reaching helpful status and visibility, but they cover posts that other note types miss. The scale and the taxonomy are new.

The soft spot is the causal link. The abstract attributes helpfulness gains to specific feedback content, yet the data is observational. Selection into which drafts receive feedback from active users, or platform scoring rules that might treat collaborative drafts differently, could drive the patterns. The stress-test note flags this correctly, and the abstract gives no sign of matching or controls that would separate those factors.

This is for researchers working on crowd-sourced moderation, AI-assisted fact-checking, and platform design. Anyone tracking how these systems actually behave will find the coverage and feedback taxonomy useful.

Send it for peer review. The dataset is original and the questions are practical; referees can tighten the identification and ask for robustness details.

Referee Report

2 major / 2 minor

Summary. The manuscript analyzes the full corpus of 19,146 collaborative notes and 211,850 human feedback instances on X's Community Notes platform. It first develops a taxonomy of suggestion types, then tracks version-to-version changes in helpfulness ratings to claim that human feedback improves notes (with largest gains from claim-challenging suggestions by active contributors), and finally compares success rates and post-targeting patterns to show that collaborative notes underperform human-only and AI-only notes in reaching visibility but serve a complementary role by addressing posts that attract neither.

Significance. If the reported patterns survive controls for selection and metric dependence, the work supplies the first large-scale empirical description of LLM-initiated notes refined by human feedback inside a live bridging-based moderation system, quantifying which feedback types are incorporated, identifying participation bottlenecks, and documenting complementarity rather than substitution.

major comments (2)

[Abstract / Helpfulness analysis] Abstract and the section on helpfulness changes across versions: the claim that feedback (especially challenging suggestions from active contributors) causes increases in helpfulness is presented without any reported controls, matching, fixed effects, or robustness checks that would separate the effect from selection into which drafts receive iteration or from platform rating rules that may treat collaborative drafts differently. This identification gap is load-bearing for the central causal attribution.
[Comparison of success rates] Section comparing rates to human-only and AI-only notes: the lower helpful-status and visibility rates for collaborative notes are reported without evidence that the platform's internal helpfulness metric and visibility rules are independent of the collaborative drafting process itself, leaving open the possibility that the metric is partly endogenous to the presence of human-AI iteration.

minor comments (2)

[Taxonomy development] The taxonomy construction and inter-annotator agreement statistics are not described in sufficient detail to allow replication of the classification of suggestion types.
[Data and methods] The manuscript would benefit from an explicit statement of the time window and any filtering criteria applied to the 19,146 collaborative notes.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the detailed and constructive report. The comments highlight important limitations in our observational analysis regarding causal identification and potential metric endogeneity. We address each major comment below and outline planned revisions.

read point-by-point responses

Referee: [Abstract / Helpfulness analysis] Abstract and the section on helpfulness changes across versions: the claim that feedback (especially challenging suggestions from active contributors) causes increases in helpfulness is presented without any reported controls, matching, fixed effects, or robustness checks that would separate the effect from selection into which drafts receive iteration or from platform rating rules that may treat collaborative drafts differently. This identification gap is load-bearing for the central causal attribution.

Authors: We agree that the manuscript presents version-to-version changes without fixed effects, matching, or explicit robustness checks to isolate feedback effects from selection into iteration or platform rating differences. The analysis is observational, and while within-note changes provide suggestive evidence, we cannot fully rule out confounding. We will revise the abstract and main text to replace causal phrasing ('leads to', 'causes') with associative language ('associated with increases in'). We will add a limitations subsection explicitly discussing selection biases and the absence of controls. If data permit, we will include supplementary robustness checks (e.g., stratification by contributor activity). This constitutes a partial revision. revision: partial
Referee: [Comparison of success rates] Section comparing rates to human-only and AI-only notes: the lower helpful-status and visibility rates for collaborative notes are reported without evidence that the platform's internal helpfulness metric and visibility rules are independent of the collaborative drafting process itself, leaving open the possibility that the metric is partly endogenous to the presence of human-AI iteration.

Authors: We acknowledge that the paper reports observed rates without direct evidence that X's internal metrics are independent of the collaborative process. Endogeneity is a plausible concern we cannot directly test. We will add an explicit discussion of this limitation in the relevant section. The complementarity result relies primarily on post-targeting patterns rather than the helpfulness metric itself, which may be less sensitive to this issue. We plan a partial revision to incorporate this caveat. revision: partial

standing simulated objections not resolved

Direct evidence on whether X's internal helpfulness algorithm or visibility rules treat collaborative notes differently, as this would require access to proprietary platform code and data not available to external researchers.

Circularity Check

0 steps flagged

No circularity: observational analysis of platform data with no self-referential derivations

full rationale

The paper conducts empirical analysis on a corpus of 19,146 collaborative notes and 211,850 feedback instances. It develops a taxonomy of suggestions, tracks version-to-version changes in platform helpfulness scores, and compares rates of reaching helpful status. No equations, fitted models, or parameters are defined in terms of the target outcomes; no predictions reduce to inputs by construction; and no self-citations serve as load-bearing uniqueness theorems or ansatzes. The central claims rest on direct data patterns rather than any closed derivation loop.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The work is purely empirical and relies on the platform's existing definitions of note helpfulness and visibility; no new free parameters, axioms, or invented entities are introduced.

pith-pipeline@v0.9.1-grok · 5839 in / 1245 out tokens · 34038 ms · 2026-07-01T01:13:21.433367+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

64 extracted references · 22 canonical work pages · 7 internal anchors

[1]

Proceedings of the ACM on Web Conference 2025 , pages=

Supernotes: Driving consensus in crowd-sourced fact-checking , author=. Proceedings of the ACM on Web Conference 2025 , pages=

2025
[2]

arXiv preprint arXiv:2509.11052 , year=

Commenotes: Synthesizing organic comments to support community-based fact-checking , author=. arXiv preprint arXiv:2509.11052 , year=

work page arXiv
[3]

Proceedings of the 31st annual ACM symposium on user interface software and technology , pages=

Believe it or not: designing a human-ai partnership for mixed-initiative fact-checking , author=. Proceedings of the 31st annual ACM symposium on user interface software and technology , pages=
[4]

Birdwatch: Crowd wisdom and bridging algorithms can inform understanding and reduce the spread of misin- formation,

Birdwatch: Crowd wisdom and bridging algorithms can inform understanding and reduce the spread of misinformation , author=. arXiv preprint arXiv:2210.15723 , year=

work page arXiv
[5]

Information Processing & Management , volume=

Crowdsourced fact-checking: Does it actually work? , author=. Information Processing & Management , volume=. 2024 , publisher=

2024
[6]

Proceedings of the National Academy of Sciences , volume=

Community notes reduce engagement with and diffusion of false information online , author=. Proceedings of the National Academy of Sciences , volume=. 2025 , publisher=

2025
[7]

Nature Communications , volume=

Community-based fact-checking reduces the spread of misleading posts on X (formerly Twitter) , author=. Nature Communications , volume=. 2026 , publisher=

2026
[8]

Althea: Human-AI Collaboration for Fact-Checking and Critical Reasoning

Althea: Human-AI Collaboration for Fact-Checking and Critical Reasoning , author=. arXiv preprint arXiv:2602.11161 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[9]

arXiv: 2306.17176 , year=

News Verifiers Showdown: A Comparative Performance Evaluation of ChatGPT 3.5, ChatGPT 4.0, Bing AI, and Bard in News Fact-Checking , author=. arXiv: 2306.17176 , year=

work page arXiv
[10]

arXiv preprint arXiv:2402.05904 , year=

FACT-GPT: Fact-Checking Augmentation via Claim Matching with LLMs , author=. arXiv preprint arXiv:2402.05904 , year=

work page arXiv
[11]

arXiv preprint arXiv:2312.13096 , year=

In Generative AI We Trust: Can Chatbots Effectively Verify Political Information? , author=. arXiv preprint arXiv:2312.13096 , year=

work page arXiv
[12]

arXiv preprint arXiv:2503.08404 , year=

Fact-Checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information , author=. arXiv preprint arXiv:2503.08404 , year=

work page arXiv
[13]

Journal of Medical Internet Research , year=

Use of Retrieval-Augmented Large Language Model for COVID-19 Fact-Checking: Development and Usability Study , author=. Journal of Medical Internet Research , year=
[14]

arXiv preprint arXiv:2505.18596 , year=

Debate-to-Detect: Reformulating Misinformation Detection as a Real-World Debate with Large Language Models , author=. arXiv preprint arXiv:2505.18596 , year=

work page arXiv
[15]

Proceedings of the ACM on Human-Computer Interaction , volume =

Liu, Houjiang and Das, Anubrata and Boltz, Alexander and Zhou, Didi and Pinaroc, Daisy and Lease, Matthew and Lee, Min Kyung , title =. Proceedings of the ACM on Human-Computer Interaction , volume =. 2024 , month =

2024
[16]

Perspectives on Psychological Science , volume=

Crowds can effectively identify misinformation at scale , author=. Perspectives on Psychological Science , volume=. 2024 , publisher=

2024
[17]

PNAS nexus , volume=

Community notes increase trust in fact-checking on social media , author=. PNAS nexus , volume=. 2024 , publisher=

2024
[18]

Quality-Sensitive Matrix Factorization for Community Notes: Towards Sample Efficiency and Manipulation Resistance

Quality-Sensitive Matrix Factorization for Community Notes: Towards Sample Efficiency and Manipulation Resistance , author=. arXiv preprint arXiv:2604.11224 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[19]

Beyond the Crowd: LLM-Augmented Community Notes for Governing Health Misinformation

Beyond the Crowd: LLM-Augmented Community Notes for Governing Health Misinformation , author=. arXiv preprint arXiv:2510.11423 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[20]

AI Fact-Checking in the Wild: A Field Evaluation of LLM-Written Community Notes on X

AI Fact-Checking in the Wild: A Field Evaluation of LLM-Written Community Notes on X , author=. arXiv preprint arXiv:2604.02592 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[21]

Journal of Online Trust and Safety , volume=

Scaling human judgment in community notes with llms , author=. Journal of Online Trust and Safety , volume=. doi:10.54501/jots.v3i1.255 , year=

work page doi:10.54501/jots.v3i1.255
[22]

arXiv preprint arXiv:2603.11120 , year=

The Laziness of the Crowd: Effort Aversion Among Raters Risks Undermining the Efficacy of X's Community Notes Program , author=. arXiv preprint arXiv:2603.11120 , year=

work page arXiv
[23]

Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , pages =

Hassan, Naeemul and Arslan, Fatma and Li, Chengkai and Tremayne, Mark , title =. Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , pages =. 2017 , publisher =

2017
[24]

Nature human behaviour , volume=

A consensus-based transparency checklist , author=. Nature human behaviour , volume=. 2020 , publisher=

2020
[25]

Communications of the ACM , volume=

Datasheets for datasets , author=. Communications of the ACM , volume=. 2021 , publisher=

2021
[26]

Centre for the Governance of AI

A guide to writing the NeurIPS impact statement , author=. Centre for the Governance of AI. URL: https://perma. cc/B5R8-2B9V , year=
[27]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Tutorial Abstracts , pages=

Understanding Ethics in NLP Authoring and Reviewing , author=. Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Tutorial Abstracts , pages=
[28]

NeurIPS 2021 Paper Checklist Guidelines

NeurIPS. NeurIPS 2021 Paper Checklist Guidelines

2021
[29]

The FAIR Data principles

FORCE11. The FAIR Data principles
[30]

arXiv preprint arXiv:2506.15168 , year=

Algorithmic resolution of crowd-sourced moderation on X in polarized settings across countries , author=. arXiv preprint arXiv:2506.15168 , year=

work page arXiv
[31]

Proceedings of the 29th International Conference on Computational Linguistics , pages=

Twitter topic classification , author=. Proceedings of the 29th International Conference on Computational Linguistics , pages=
[32]

Journal of Machine Learning Research , volume=

Beyond english-centric multilingual machine translation , author=. Journal of Machine Learning Research , volume=
[33]

Proceedings of the ACM Web Conference , year=

Hoaxy: A platform for tracking online misinformation , author=. Proceedings of the ACM Web Conference , year=
[34]

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , year=

Toward automated fact-checking: Detecting check-worthy factual claims by claimbuster , author=. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , year=
[35]

Proceedings of the National Academy of Sciences , year=

Fighting misinformation on social media using crowdsourced judgments of news source quality , author=. Proceedings of the National Academy of Sciences , year=
[36]

Poynter Institute

Most Republicans don’t trust fact-checkers, and most Americans don’t trust the media , author=. Poynter Institute. , year=
[37]

Behavioral Sciences , year=

Americans’ perspectives on online media warning labels , author=. Behavioral Sciences , year=
[38]

HKS Misinformation Review , year=

Leveraging volunteer fact checking to identify misinformation about COVID-19 in social media , author=. HKS Misinformation Review , year=
[39]

Frontiers in Artificial Intelligence , year=

The perils and promises of fact-checking with large language models , author=. Frontiers in Artificial Intelligence , year=
[40]

Science Advances , year=

Scaling up fact-checking using the wisdom of crowds , author=. Science Advances , year=
[41]

Collective Intelligence , year=

Searching for or reviewing evidence improves crowdworkers’ misinformation judgments and reduces partisan bias , author=. Collective Intelligence , year=
[42]

Proceedings of the Conference on Human Factors in Computing Systems , year=

Will the crowd game the algorithm? Using layperson judgments to combat misinformation on social media by downranking distrusted sources , author=. Proceedings of the Conference on Human Factors in Computing Systems , year=
[43]

IEEE international Conference on Big Data , year=

The role of the crowd in countering misinformation: A case study of the COVID-19 infodemic , author=. IEEE international Conference on Big Data , year=
[44]

Mediashift , year=

Crowdsourced fact-checking? What we learned from Truthsquad , author=. Mediashift , year=
[45]

Proceedings of the 15th International Symposium on Open Collaboration , year=

Do you have a source for that? Understanding the Challenges of Collaborative Evidence-based Journalism , author=. Proceedings of the 15th International Symposium on Open Collaboration , year=
[46]

Proceedings of the International AAAI Conference on Web and Social Media , year=

Community-based fact-checking on Twitter’s Birdwatch platform , author=. Proceedings of the International AAAI Conference on Web and Social Media , year=
[47]

Proceedings of the ACM on Human-Computer Interaction , year=

Diffusion of community fact-checked misinformation on Twitter , author=. Proceedings of the ACM on Human-Computer Interaction , year=
[48]

osf.io/preprints/osf/3a4fe , year=

Community notes reduce the spread of misleading posts on X , author=. osf.io/preprints/osf/3a4fe , year=
[49]

PNAS Nexus , year=

Community notes increase trust in fact-checking on social media , author=. PNAS Nexus , year=
[50]

Snoping: How the Crowd Selects Fact-Checking Targets on Social Media , author=

Community Notes vs. Snoping: How the Crowd Selects Fact-Checking Targets on Social Media , author=. Proceedings of the International AAAI Conference on Web and Social Media , year=
[51]

osf.io/preprints/psyarxiv/qnjkf , year=

Leveraging ChatGPT for efficient fact-checking , author=. osf.io/preprints/psyarxiv/qnjkf , year=
[52]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , year=

SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , year=
[53]

arXiv:2310.05253 , year=

Explainable claim verification via knowledge-grounded reasoning with large language models , author=. arXiv:2310.05253 , year=

work page arXiv
[54]

Transactions of the Association for Computational Linguistics , year=

JustiLM: Few-shot Justification Generation for Explainable Fact-Checking of Real-world Claims , author=. Transactions of the Association for Computational Linguistics , year=
[55]

arXiv:2403.11169 , year=

Correcting misinformation on social media with a large language model , author=. arXiv:2403.11169 , year=

work page arXiv
[56]

2016 , publisher=

Detecting rumors from microblogs with recurrent neural networks , author=. 2016 , publisher=

2016
[57]

Proceedings of the 2011 conference on empirical methods in natural language processing , year=

Rumor has it: Identifying misinformation in microblogs , author=. Proceedings of the 2011 conference on empirical methods in natural language processing , year=

2011
[58]

ACM SIGKDD explorations newsletter , year=

Misinformation in social media: definition, manipulation, and detection , author=. ACM SIGKDD explorations newsletter , year=
[59]

arXiv:2308.10800 , year=

Artificial intelligence is ineffective and potentially harmful for fact checking , author=. arXiv:2308.10800 , year=

work page arXiv
[60]

arXiv preprint arXiv:2511.02615 , year=

Community Notes are vulnerable to rater bias and manipulation , author=. arXiv preprint arXiv:2511.02615 , year=

work page arXiv
[61]

AI Feedback Enhances Community-Based Content Moderation through Engagement with Counterarguments

AI feedback enhances community-based content moderation through engagement with counterarguments , author=. arXiv preprint arXiv:2507.08110 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[62]

The Benefit of Collective Intelligence in Community-Based Content Moderation is Limited by Overt Political Signalling

The Benefit of Collective Intelligence in Community-Based Content Moderation is Limited by Overt Political Signalling , author=. arXiv preprint arXiv:2601.22201 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[63]

arXiv preprint arXiv:2602.08970 , year=

Hyperactive Minority Alters the Stability of Community Notes , author=. arXiv preprint arXiv:2602.08970 , year=

work page arXiv
[64]

Grok in the Wild: Characterizing the Roles and Uses of Large Language Models on Social Media

Grok in the wild: Characterizing the roles and uses of large language models on social media , author=. arXiv preprint arXiv:2602.11286 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[1] [1]

Proceedings of the ACM on Web Conference 2025 , pages=

Supernotes: Driving consensus in crowd-sourced fact-checking , author=. Proceedings of the ACM on Web Conference 2025 , pages=

2025

[2] [2]

arXiv preprint arXiv:2509.11052 , year=

Commenotes: Synthesizing organic comments to support community-based fact-checking , author=. arXiv preprint arXiv:2509.11052 , year=

work page arXiv

[3] [3]

Proceedings of the 31st annual ACM symposium on user interface software and technology , pages=

Believe it or not: designing a human-ai partnership for mixed-initiative fact-checking , author=. Proceedings of the 31st annual ACM symposium on user interface software and technology , pages=

[4] [4]

Birdwatch: Crowd wisdom and bridging algorithms can inform understanding and reduce the spread of misin- formation,

Birdwatch: Crowd wisdom and bridging algorithms can inform understanding and reduce the spread of misinformation , author=. arXiv preprint arXiv:2210.15723 , year=

work page arXiv

[5] [5]

Information Processing & Management , volume=

Crowdsourced fact-checking: Does it actually work? , author=. Information Processing & Management , volume=. 2024 , publisher=

2024

[6] [6]

Proceedings of the National Academy of Sciences , volume=

Community notes reduce engagement with and diffusion of false information online , author=. Proceedings of the National Academy of Sciences , volume=. 2025 , publisher=

2025

[7] [7]

Nature Communications , volume=

Community-based fact-checking reduces the spread of misleading posts on X (formerly Twitter) , author=. Nature Communications , volume=. 2026 , publisher=

2026

[8] [8]

Althea: Human-AI Collaboration for Fact-Checking and Critical Reasoning

Althea: Human-AI Collaboration for Fact-Checking and Critical Reasoning , author=. arXiv preprint arXiv:2602.11161 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[9] [9]

arXiv: 2306.17176 , year=

News Verifiers Showdown: A Comparative Performance Evaluation of ChatGPT 3.5, ChatGPT 4.0, Bing AI, and Bard in News Fact-Checking , author=. arXiv: 2306.17176 , year=

work page arXiv

[10] [10]

arXiv preprint arXiv:2402.05904 , year=

FACT-GPT: Fact-Checking Augmentation via Claim Matching with LLMs , author=. arXiv preprint arXiv:2402.05904 , year=

work page arXiv

[11] [11]

arXiv preprint arXiv:2312.13096 , year=

In Generative AI We Trust: Can Chatbots Effectively Verify Political Information? , author=. arXiv preprint arXiv:2312.13096 , year=

work page arXiv

[12] [12]

arXiv preprint arXiv:2503.08404 , year=

Fact-Checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information , author=. arXiv preprint arXiv:2503.08404 , year=

work page arXiv

[13] [13]

Journal of Medical Internet Research , year=

Use of Retrieval-Augmented Large Language Model for COVID-19 Fact-Checking: Development and Usability Study , author=. Journal of Medical Internet Research , year=

[14] [14]

arXiv preprint arXiv:2505.18596 , year=

Debate-to-Detect: Reformulating Misinformation Detection as a Real-World Debate with Large Language Models , author=. arXiv preprint arXiv:2505.18596 , year=

work page arXiv

[15] [15]

Proceedings of the ACM on Human-Computer Interaction , volume =

Liu, Houjiang and Das, Anubrata and Boltz, Alexander and Zhou, Didi and Pinaroc, Daisy and Lease, Matthew and Lee, Min Kyung , title =. Proceedings of the ACM on Human-Computer Interaction , volume =. 2024 , month =

2024

[16] [16]

Perspectives on Psychological Science , volume=

Crowds can effectively identify misinformation at scale , author=. Perspectives on Psychological Science , volume=. 2024 , publisher=

2024

[17] [17]

PNAS nexus , volume=

Community notes increase trust in fact-checking on social media , author=. PNAS nexus , volume=. 2024 , publisher=

2024

[18] [18]

Quality-Sensitive Matrix Factorization for Community Notes: Towards Sample Efficiency and Manipulation Resistance

Quality-Sensitive Matrix Factorization for Community Notes: Towards Sample Efficiency and Manipulation Resistance , author=. arXiv preprint arXiv:2604.11224 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[19] [19]

Beyond the Crowd: LLM-Augmented Community Notes for Governing Health Misinformation

Beyond the Crowd: LLM-Augmented Community Notes for Governing Health Misinformation , author=. arXiv preprint arXiv:2510.11423 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[20] [20]

AI Fact-Checking in the Wild: A Field Evaluation of LLM-Written Community Notes on X

AI Fact-Checking in the Wild: A Field Evaluation of LLM-Written Community Notes on X , author=. arXiv preprint arXiv:2604.02592 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[21] [21]

Journal of Online Trust and Safety , volume=

Scaling human judgment in community notes with llms , author=. Journal of Online Trust and Safety , volume=. doi:10.54501/jots.v3i1.255 , year=

work page doi:10.54501/jots.v3i1.255

[22] [22]

arXiv preprint arXiv:2603.11120 , year=

The Laziness of the Crowd: Effort Aversion Among Raters Risks Undermining the Efficacy of X's Community Notes Program , author=. arXiv preprint arXiv:2603.11120 , year=

work page arXiv

[23] [23]

Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , pages =

Hassan, Naeemul and Arslan, Fatma and Li, Chengkai and Tremayne, Mark , title =. Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , pages =. 2017 , publisher =

2017

[24] [24]

Nature human behaviour , volume=

A consensus-based transparency checklist , author=. Nature human behaviour , volume=. 2020 , publisher=

2020

[25] [25]

Communications of the ACM , volume=

Datasheets for datasets , author=. Communications of the ACM , volume=. 2021 , publisher=

2021

[26] [26]

Centre for the Governance of AI

A guide to writing the NeurIPS impact statement , author=. Centre for the Governance of AI. URL: https://perma. cc/B5R8-2B9V , year=

[27] [27]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Tutorial Abstracts , pages=

Understanding Ethics in NLP Authoring and Reviewing , author=. Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Tutorial Abstracts , pages=

[28] [28]

NeurIPS 2021 Paper Checklist Guidelines

NeurIPS. NeurIPS 2021 Paper Checklist Guidelines

2021

[29] [29]

The FAIR Data principles

FORCE11. The FAIR Data principles

[30] [30]

arXiv preprint arXiv:2506.15168 , year=

Algorithmic resolution of crowd-sourced moderation on X in polarized settings across countries , author=. arXiv preprint arXiv:2506.15168 , year=

work page arXiv

[31] [31]

Proceedings of the 29th International Conference on Computational Linguistics , pages=

Twitter topic classification , author=. Proceedings of the 29th International Conference on Computational Linguistics , pages=

[32] [32]

Journal of Machine Learning Research , volume=

Beyond english-centric multilingual machine translation , author=. Journal of Machine Learning Research , volume=

[33] [33]

Proceedings of the ACM Web Conference , year=

Hoaxy: A platform for tracking online misinformation , author=. Proceedings of the ACM Web Conference , year=

[34] [34]

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , year=

Toward automated fact-checking: Detecting check-worthy factual claims by claimbuster , author=. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , year=

[35] [35]

Proceedings of the National Academy of Sciences , year=

Fighting misinformation on social media using crowdsourced judgments of news source quality , author=. Proceedings of the National Academy of Sciences , year=

[36] [36]

Poynter Institute

Most Republicans don’t trust fact-checkers, and most Americans don’t trust the media , author=. Poynter Institute. , year=

[37] [37]

Behavioral Sciences , year=

Americans’ perspectives on online media warning labels , author=. Behavioral Sciences , year=

[38] [38]

HKS Misinformation Review , year=

Leveraging volunteer fact checking to identify misinformation about COVID-19 in social media , author=. HKS Misinformation Review , year=

[39] [39]

Frontiers in Artificial Intelligence , year=

The perils and promises of fact-checking with large language models , author=. Frontiers in Artificial Intelligence , year=

[40] [40]

Science Advances , year=

Scaling up fact-checking using the wisdom of crowds , author=. Science Advances , year=

[41] [41]

Collective Intelligence , year=

Searching for or reviewing evidence improves crowdworkers’ misinformation judgments and reduces partisan bias , author=. Collective Intelligence , year=

[42] [42]

Proceedings of the Conference on Human Factors in Computing Systems , year=

Will the crowd game the algorithm? Using layperson judgments to combat misinformation on social media by downranking distrusted sources , author=. Proceedings of the Conference on Human Factors in Computing Systems , year=

[43] [43]

IEEE international Conference on Big Data , year=

The role of the crowd in countering misinformation: A case study of the COVID-19 infodemic , author=. IEEE international Conference on Big Data , year=

[44] [44]

Mediashift , year=

Crowdsourced fact-checking? What we learned from Truthsquad , author=. Mediashift , year=

[45] [45]

Proceedings of the 15th International Symposium on Open Collaboration , year=

Do you have a source for that? Understanding the Challenges of Collaborative Evidence-based Journalism , author=. Proceedings of the 15th International Symposium on Open Collaboration , year=

[46] [46]

Proceedings of the International AAAI Conference on Web and Social Media , year=

Community-based fact-checking on Twitter’s Birdwatch platform , author=. Proceedings of the International AAAI Conference on Web and Social Media , year=

[47] [47]

Proceedings of the ACM on Human-Computer Interaction , year=

Diffusion of community fact-checked misinformation on Twitter , author=. Proceedings of the ACM on Human-Computer Interaction , year=

[48] [48]

osf.io/preprints/osf/3a4fe , year=

Community notes reduce the spread of misleading posts on X , author=. osf.io/preprints/osf/3a4fe , year=

[49] [49]

PNAS Nexus , year=

Community notes increase trust in fact-checking on social media , author=. PNAS Nexus , year=

[50] [50]

Snoping: How the Crowd Selects Fact-Checking Targets on Social Media , author=

Community Notes vs. Snoping: How the Crowd Selects Fact-Checking Targets on Social Media , author=. Proceedings of the International AAAI Conference on Web and Social Media , year=

[51] [51]

osf.io/preprints/psyarxiv/qnjkf , year=

Leveraging ChatGPT for efficient fact-checking , author=. osf.io/preprints/psyarxiv/qnjkf , year=

[52] [52]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , year=

SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , year=

[53] [53]

arXiv:2310.05253 , year=

Explainable claim verification via knowledge-grounded reasoning with large language models , author=. arXiv:2310.05253 , year=

work page arXiv

[54] [54]

Transactions of the Association for Computational Linguistics , year=

JustiLM: Few-shot Justification Generation for Explainable Fact-Checking of Real-world Claims , author=. Transactions of the Association for Computational Linguistics , year=

[55] [55]

arXiv:2403.11169 , year=

Correcting misinformation on social media with a large language model , author=. arXiv:2403.11169 , year=

work page arXiv

[56] [56]

2016 , publisher=

Detecting rumors from microblogs with recurrent neural networks , author=. 2016 , publisher=

2016

[57] [57]

Proceedings of the 2011 conference on empirical methods in natural language processing , year=

Rumor has it: Identifying misinformation in microblogs , author=. Proceedings of the 2011 conference on empirical methods in natural language processing , year=

2011

[58] [58]

ACM SIGKDD explorations newsletter , year=

Misinformation in social media: definition, manipulation, and detection , author=. ACM SIGKDD explorations newsletter , year=

[59] [59]

arXiv:2308.10800 , year=

Artificial intelligence is ineffective and potentially harmful for fact checking , author=. arXiv:2308.10800 , year=

work page arXiv

[60] [60]

arXiv preprint arXiv:2511.02615 , year=

Community Notes are vulnerable to rater bias and manipulation , author=. arXiv preprint arXiv:2511.02615 , year=

work page arXiv

[61] [61]

AI Feedback Enhances Community-Based Content Moderation through Engagement with Counterarguments

AI feedback enhances community-based content moderation through engagement with counterarguments , author=. arXiv preprint arXiv:2507.08110 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[62] [62]

The Benefit of Collective Intelligence in Community-Based Content Moderation is Limited by Overt Political Signalling

The Benefit of Collective Intelligence in Community-Based Content Moderation is Limited by Overt Political Signalling , author=. arXiv preprint arXiv:2601.22201 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[63] [63]

arXiv preprint arXiv:2602.08970 , year=

Hyperactive Minority Alters the Stability of Community Notes , author=. arXiv preprint arXiv:2602.08970 , year=

work page arXiv

[64] [64]

Grok in the Wild: Characterizing the Roles and Uses of Large Language Models on Social Media

Grok in the wild: Characterizing the roles and uses of large language models on social media , author=. arXiv preprint arXiv:2602.11286 , year=

work page internal anchor Pith review Pith/arXiv arXiv