{"work":{"id":"efb3082c-4f47-4d65-b49b-c56ba744fbbf","openalex_id":null,"doi":null,"arxiv_id":"2188.344592","raw_key":null,"title":"Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell","authors":null,"authors_text":"Bahar Taskesen, Jose Blanchet, Daniel Kuhn, and Viet Anh Nguyen","year":2021,"venue":null,"abstract":null,"external_url":"https://arxiv.org/abs/2188.344592","cited_by_count":null,"metadata_source":"arxiv_reference","metadata_fetched_at":"2026-06-30T06:04:20.875214+00:00","pith_arxiv_id":null,"created_at":"2026-05-09T19:30:37.469410+00:00","updated_at":"2026-06-30T06:04:20.875214+00:00","title_quality_ok":false,"display_title":"Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell","render_title":"Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell"},"hub":{"state":{"work_id":"efb3082c-4f47-4d65-b49b-c56ba744fbbf","tier":"super_hub","tier_reason":"100+ Pith inbound or 10,000+ external citations","pith_inbound_count":103,"external_cited_by_count":null,"distinct_field_count":15,"first_pith_cited_at":"2021-07-14T06:06:52+00:00","last_pith_cited_at":"2026-06-29T14:17:33+00:00","author_build_status":"needed","summary_status":"needed","contexts_status":"needed","graph_status":"needed","ask_index_status":"needed","reader_status":"not_needed","recognition_status":"not_needed","updated_at":"2026-07-01T06:21:04.038916+00:00","tier_text":"super_hub"},"tier":"super_hub","role_counts":[{"context_role":"background","n":23},{"context_role":"baseline","n":1},{"context_role":"other","n":1}],"polarity_counts":[{"context_polarity":"background","n":16},{"context_polarity":"support","n":5},{"context_polarity":"unclear","n":3},{"context_polarity":"baseline","n":1}],"runs":{"ask_index":{"job_type":"ask_index","status":"succeeded","result":{"title":"Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell","claims":[{"claim_text":"Journal of Quantitative Description 1 (2021), 1-23. [20] Google. 2026. AdWords for nonprofits - Google Ad Grants Programme Details. https://www.google.com/intl/en_au/grants/details.html. Retrieved 2026-01-13. [21] Google Ads Help. 2026. How to become a Google Partner or Premier Partner. https://support.google.com/google-ads/answer/9702452? sjid=14590807631916300459-NA Retrieved 2026-01-12. [22] Lelia Marie Hampton. 2021. Black Feminist Musings on Algorithmic Oppression. In Proceedings of the 202","claim_type":"background","confidence":0.95,"evidence_strength":"citation_context"},{"claim_text":"Eating Disorder Helpline Takes Down Chatbot After it Gave Weight Loss Advice. NPR (2023). Available at: https://www.npr.org/2023/06/08/1181131532/eating -disorder-helpline-takes-down-chatbot-after-it-gave-weight-loss-advice. (Accessed: 17 th December 2023) [17] Noble, S. U. Algorithms of Oppression: How Search Engines Reinforce Racism. (New York University Press, 2018). [18] Bender, E. M., Gebru, T., McMillan -Major, A. & Shmitchell, S. On the Dangers of Stochastic Parrots. Proceedings of the 20","claim_type":"background","confidence":0.9,"evidence_strength":"citation_context"},{"claim_text":"The extant literature on generative LMs has primarily examined bias via explicit identity prompting [2]. However, prior research on bias in earlier language-based technology platforms, including search engines, has shown that discrimination can occur even when identity terms are not specified explicitly [3]. Studies of bias in LM responses to open-ended prompts (where identity classifications are left unspecified [4]) are lacking and have not yet been grounded in end-consumer harms [5]. Here, we","claim_type":"background","confidence":0.9,"evidence_strength":"citation_context"},{"claim_text":"[161] Rachel Kornfield, David C. Mohr, Rachel Ranney, Emily G. Lattie, Jonah Meyerhoff, Joseph J. Williams, and Madhu Reddy. 2022. Involving Crowdworkers with Lived Experience in Content-Development for Push-Based Digital Mental Health Tools: Lessons Learned from Crowdsourcing Mental Health Messages.Proc. ACM Hum.-Comput. Interact.(2022). doi:10.1145/3512946 [162] Angelie Kraft and Eloïse Soulier. 2024. Knowledge-Enhanced Language Models Are Not Bias-Proof: Situated Knowledge and Epistemic Injus","claim_type":"other","confidence":0.9,"evidence_strength":"citation_context"},{"claim_text":"Recall Cue𝑞 𝑀 𝑡 None Subgraph + boundedness Analysis Post-recall𝑊 𝑞 None Read-only on𝑊 𝑞 and𝑀 𝑡 May 5, 2026 7 4.0.4 Identity and Uniqueness Constraints.DGMM enforces explicit identity semantics across node types. Element and Time nodes are subject to name-based uniqueness and are reused across memory ingestion: ∀𝑣𝑖, 𝑣 𝑗 ∈𝑉 Elem ∪𝑉 Time,name(𝑣 𝑖 )=name(𝑣 𝑗 ) ⇒𝑣 𝑖 =𝑣 𝑗 (2) These nodes function as canonical anchors for perceptual grounding and temporal alignment. In contrast, Concept nodes are inst","claim_type":"background","confidence":0.9,"evidence_strength":"citation_context"},{"claim_text":"06580, version 3. [3] J. Sourati, J. A. Evans, Accelerating Science with Human-Aware Artificial Intelligence, Nature Human Behaviour 7 (2023) 1682-1696. doi:10.1038/s41562-023-01648-z. [4] L. Messeri, M. J. Crockett, Artificial Intelligence and Illusions of Understanding in Scientific Research, Nature 627 (2024) 49-58. doi:10.1038/s41586-024-07146-0. [5] E. M. Bender, et al., On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?, in: Proceedings of the 2021 ACM Conference on Fair","claim_type":"background","confidence":0.9,"evidence_strength":"citation_context"}],"why_cited":"Pith tracks Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell because it crossed a citation-hub threshold. Current citing contexts most often use it as background evidence (23 contexts).","role_counts":[{"n":23,"context_role":"background"},{"n":1,"context_role":"baseline"},{"n":1,"context_role":"other"}]},"error":null,"updated_at":"2026-06-29T08:28:38.841325+00:00"},"author_expand":{"job_type":"author_expand","status":"succeeded","result":{"authors_linked":[{"id":"e65f1f2e-ad59-4645-8669-dd03fe61c5b5","orcid":null,"display_name":"Bender"},{"id":"03540121-af5c-44d8-9394-2ffdb2bf5d66","orcid":null,"display_name":"Emily M"}]},"error":null,"updated_at":"2026-06-29T08:28:38.132404+00:00"},"context_extract":{"job_type":"context_extract","status":"succeeded","result":{"enqueued_papers":25},"error":null,"updated_at":"2026-05-14T17:49:40.220653+00:00"},"graph_features":{"job_type":"graph_features","status":"succeeded","result":{"co_cited":[{"title":"Language Models are Few-Shot Learners","work_id":"214732c0-2edd-44a0-af9e-28184a2b8279","shared_citers":7},{"title":"Training Verifiers to Solve Math Word Problems","work_id":"acab1aa8-b4d6-40e0-a3ee-25341701dca2","shared_citers":6},{"title":"URLhttps://doi.org/10.48550/arXiv","work_id":"5c2060c6-427c-4321-be22-49ccae439d80","shared_citers":6},{"title":"Attention Is All You Need","work_id":"baafb5a2-5272-43bc-932f-09fa9ffe5316","shared_citers":5},{"title":"Evaluating Large Language Models Trained on Code","work_id":"042493e9-b26f-4b4e-bbde-382072ca9b08","shared_citers":5},{"title":"GPT-4 Technical Report","work_id":"b928e041-6991-4c08-8c81-0359e4097c7b","shared_citers":5},{"title":"On the Opportunities and Risks of Foundation Models","work_id":"a18039e9-928d-47c9-a836-32656a71bf71","shared_citers":5},{"title":"The Llama 3 Herd of Models","work_id":"1549a635-88af-4ac1-acfe-51ae7bb53345","shared_citers":5},{"title":"Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models","work_id":"bb63abb3-0d50-4362-b97c-b5e725b03b39","shared_citers":4},{"title":"Efficient Estimation of Word Representations in Vector Space","work_id":"59edaa01-a696-45b3-9a08-5eae777a799e","shared_citers":4},{"title":"Gemini: A Family of Highly Capable Multimodal Models","work_id":"83f7c85b-3f11-450f-ac0c-64d9745220b2","shared_citers":4},{"title":"RoBERTa: A Robustly Optimized BERT Pretraining Approach","work_id":"41fe12c4-e538-4890-a244-480650ed3078","shared_citers":4},{"title":"Adam: A Method for Stochastic Optimization","work_id":"1910796d-9b52-4683-bf5c-de9632c1028b","shared_citers":3},{"title":"and Rudinger, Rachel","work_id":"88f8dce9-17cc-41c1-a26a-e9b502ca0742","shared_citers":3},{"title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding","work_id":"ed240a10-5b19-406c-baa5-30803f465785","shared_citers":3},{"title":"Bradley Efron and Robert J Tibshirani.An introduction to the bootstrap, volume","work_id":"414143a1-235c-49ec-a733-5cff2faefa32","shared_citers":3},{"title":"Chain-of-Thought Prompting Elicits Reasoning in Large Language Models","work_id":"d1cf6693-a082-403c-ada9-dac7b96341f9","shared_citers":3},{"title":"C ommonsense QA : A question answering challenge targeting commonsense knowledge","work_id":"628930d3-897c-43a2-8d6d-589da959e066","shared_citers":3},{"title":"doi: 10.18653/v1/2020.findings-emnlp.301","work_id":"b0252c97-1b16-43d3-9417-9f8a72176a9f","shared_citers":3},{"title":"Ethical and social risks of harm from Language Models","work_id":"b4ce1c45-ef69-445a-a872-dbb785b485e9","shared_citers":3},{"title":"Gender bias in coreference resolution: Evaluation and debiasing methods","work_id":"debd34b6-f412-41f5-8776-18ca1afb906a","shared_citers":3},{"title":"Improving language models by retrieving from trillions of tokens.Preprint arXiv:2112.04426","work_id":"c8e7ce21-2f18-4c10-b0cf-16cd3574fe33","shared_citers":3},{"title":"LaMDA: Language Models for Dialog Applications","work_id":"1b66d0a5-f6ae-4332-8025-c662dc64b238","shared_citers":3},{"title":"Mistral 7B","work_id":"eb5e1305-ad11-4875-ad8d-ad8b8f697599","shared_citers":3}],"time_series":[{"n":1,"year":2021},{"n":3,"year":2022},{"n":2,"year":2023},{"n":2,"year":2024},{"n":30,"year":2026}],"dependency_candidates":[]},"error":null,"updated_at":"2026-05-14T17:49:54.104926+00:00"},"identity_refresh":{"job_type":"identity_refresh","status":"succeeded","result":{"items":[{"title":"Qwen3 Technical Report","outcome":"unchanged","work_id":"25a4e30c-1232-48e7-9925-02fa12ba7c9e","resolver":"local_arxiv","confidence":0.98,"old_work_id":"25a4e30c-1232-48e7-9925-02fa12ba7c9e"}],"counts":{"fixed":0,"merged":0,"unchanged":1,"quarantined":0,"needs_external_resolution":0},"errors":[],"attempted":1},"error":null,"updated_at":"2026-05-14T17:49:53.967110+00:00"},"role_polarity":{"job_type":"role_polarity","status":"succeeded","result":{"title":"Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell","claims":[{"claim_text":"Journal of Quantitative Description 1 (2021), 1-23. [20] Google. 2026. AdWords for nonprofits - Google Ad Grants Programme Details. https://www.google.com/intl/en_au/grants/details.html. Retrieved 2026-01-13. [21] Google Ads Help. 2026. How to become a Google Partner or Premier Partner. https://support.google.com/google-ads/answer/9702452? sjid=14590807631916300459-NA Retrieved 2026-01-12. [22] Lelia Marie Hampton. 2021. Black Feminist Musings on Algorithmic Oppression. In Proceedings of the 202","claim_type":"background","confidence":0.95,"evidence_strength":"citation_context"},{"claim_text":"Eating Disorder Helpline Takes Down Chatbot After it Gave Weight Loss Advice. NPR (2023). Available at: https://www.npr.org/2023/06/08/1181131532/eating -disorder-helpline-takes-down-chatbot-after-it-gave-weight-loss-advice. (Accessed: 17 th December 2023) [17] Noble, S. U. Algorithms of Oppression: How Search Engines Reinforce Racism. (New York University Press, 2018). [18] Bender, E. M., Gebru, T., McMillan -Major, A. & Shmitchell, S. On the Dangers of Stochastic Parrots. Proceedings of the 20","claim_type":"background","confidence":0.9,"evidence_strength":"citation_context"},{"claim_text":"The extant literature on generative LMs has primarily examined bias via explicit identity prompting [2]. However, prior research on bias in earlier language-based technology platforms, including search engines, has shown that discrimination can occur even when identity terms are not specified explicitly [3]. Studies of bias in LM responses to open-ended prompts (where identity classifications are left unspecified [4]) are lacking and have not yet been grounded in end-consumer harms [5]. Here, we","claim_type":"background","confidence":0.9,"evidence_strength":"citation_context"},{"claim_text":"[161] Rachel Kornfield, David C. Mohr, Rachel Ranney, Emily G. Lattie, Jonah Meyerhoff, Joseph J. Williams, and Madhu Reddy. 2022. Involving Crowdworkers with Lived Experience in Content-Development for Push-Based Digital Mental Health Tools: Lessons Learned from Crowdsourcing Mental Health Messages.Proc. ACM Hum.-Comput. Interact.(2022). doi:10.1145/3512946 [162] Angelie Kraft and Eloïse Soulier. 2024. Knowledge-Enhanced Language Models Are Not Bias-Proof: Situated Knowledge and Epistemic Injus","claim_type":"other","confidence":0.9,"evidence_strength":"citation_context"},{"claim_text":"Recall Cue𝑞 𝑀 𝑡 None Subgraph + boundedness Analysis Post-recall𝑊 𝑞 None Read-only on𝑊 𝑞 and𝑀 𝑡 May 5, 2026 7 4.0.4 Identity and Uniqueness Constraints.DGMM enforces explicit identity semantics across node types. Element and Time nodes are subject to name-based uniqueness and are reused across memory ingestion: ∀𝑣𝑖, 𝑣 𝑗 ∈𝑉 Elem ∪𝑉 Time,name(𝑣 𝑖 )=name(𝑣 𝑗 ) ⇒𝑣 𝑖 =𝑣 𝑗 (2) These nodes function as canonical anchors for perceptual grounding and temporal alignment. In contrast, Concept nodes are inst","claim_type":"background","confidence":0.9,"evidence_strength":"citation_context"},{"claim_text":"06580, version 3. [3] J. Sourati, J. A. Evans, Accelerating Science with Human-Aware Artificial Intelligence, Nature Human Behaviour 7 (2023) 1682-1696. doi:10.1038/s41562-023-01648-z. [4] L. Messeri, M. J. Crockett, Artificial Intelligence and Illusions of Understanding in Scientific Research, Nature 627 (2024) 49-58. doi:10.1038/s41586-024-07146-0. [5] E. M. Bender, et al., On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?, in: Proceedings of the 2021 ACM Conference on Fair","claim_type":"background","confidence":0.9,"evidence_strength":"citation_context"}],"why_cited":"Pith tracks Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell because it crossed a citation-hub threshold. Current citing contexts most often use it as background evidence (23 contexts).","role_counts":[{"n":23,"context_role":"background"},{"n":1,"context_role":"baseline"},{"n":1,"context_role":"other"}]},"error":null,"updated_at":"2026-06-29T08:28:38.839130+00:00"},"summary_claims":{"job_type":"summary_claims","status":"succeeded","result":{"title":"URL https://doi.org/10.1145/ 3442188.3445922","claims":[],"why_cited":"Pith tracks URL https://doi.org/10.1145/ 3442188.3445922 because it crossed a citation-hub threshold.","role_counts":[]},"error":null,"updated_at":"2026-05-14T17:49:57.374614+00:00"}},"summary":{"title":"URL https://doi.org/10.1145/ 3442188.3445922","claims":[],"why_cited":"Pith tracks URL https://doi.org/10.1145/ 3442188.3445922 because it crossed a citation-hub threshold.","role_counts":[]},"graph":{"co_cited":[{"title":"Language Models are Few-Shot Learners","work_id":"214732c0-2edd-44a0-af9e-28184a2b8279","shared_citers":7},{"title":"Training Verifiers to Solve Math Word Problems","work_id":"acab1aa8-b4d6-40e0-a3ee-25341701dca2","shared_citers":6},{"title":"URLhttps://doi.org/10.48550/arXiv","work_id":"5c2060c6-427c-4321-be22-49ccae439d80","shared_citers":6},{"title":"Attention Is All You Need","work_id":"baafb5a2-5272-43bc-932f-09fa9ffe5316","shared_citers":5},{"title":"Evaluating Large Language Models Trained on Code","work_id":"042493e9-b26f-4b4e-bbde-382072ca9b08","shared_citers":5},{"title":"GPT-4 Technical Report","work_id":"b928e041-6991-4c08-8c81-0359e4097c7b","shared_citers":5},{"title":"On the Opportunities and Risks of Foundation Models","work_id":"a18039e9-928d-47c9-a836-32656a71bf71","shared_citers":5},{"title":"The Llama 3 Herd of Models","work_id":"1549a635-88af-4ac1-acfe-51ae7bb53345","shared_citers":5},{"title":"Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models","work_id":"bb63abb3-0d50-4362-b97c-b5e725b03b39","shared_citers":4},{"title":"Efficient Estimation of Word Representations in Vector Space","work_id":"59edaa01-a696-45b3-9a08-5eae777a799e","shared_citers":4},{"title":"Gemini: A Family of Highly Capable Multimodal Models","work_id":"83f7c85b-3f11-450f-ac0c-64d9745220b2","shared_citers":4},{"title":"RoBERTa: A Robustly Optimized BERT Pretraining Approach","work_id":"41fe12c4-e538-4890-a244-480650ed3078","shared_citers":4},{"title":"Adam: A Method for Stochastic Optimization","work_id":"1910796d-9b52-4683-bf5c-de9632c1028b","shared_citers":3},{"title":"and Rudinger, Rachel","work_id":"88f8dce9-17cc-41c1-a26a-e9b502ca0742","shared_citers":3},{"title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding","work_id":"ed240a10-5b19-406c-baa5-30803f465785","shared_citers":3},{"title":"Bradley Efron and Robert J Tibshirani.An introduction to the bootstrap, volume","work_id":"414143a1-235c-49ec-a733-5cff2faefa32","shared_citers":3},{"title":"Chain-of-Thought Prompting Elicits Reasoning in Large Language Models","work_id":"d1cf6693-a082-403c-ada9-dac7b96341f9","shared_citers":3},{"title":"C ommonsense QA : A question answering challenge targeting commonsense knowledge","work_id":"628930d3-897c-43a2-8d6d-589da959e066","shared_citers":3},{"title":"doi: 10.18653/v1/2020.findings-emnlp.301","work_id":"b0252c97-1b16-43d3-9417-9f8a72176a9f","shared_citers":3},{"title":"Ethical and social risks of harm from Language Models","work_id":"b4ce1c45-ef69-445a-a872-dbb785b485e9","shared_citers":3},{"title":"Gender bias in coreference resolution: Evaluation and debiasing methods","work_id":"debd34b6-f412-41f5-8776-18ca1afb906a","shared_citers":3},{"title":"Improving language models by retrieving from trillions of tokens.Preprint arXiv:2112.04426","work_id":"c8e7ce21-2f18-4c10-b0cf-16cd3574fe33","shared_citers":3},{"title":"LaMDA: Language Models for Dialog Applications","work_id":"1b66d0a5-f6ae-4332-8025-c662dc64b238","shared_citers":3},{"title":"Mistral 7B","work_id":"eb5e1305-ad11-4875-ad8d-ad8b8f697599","shared_citers":3}],"time_series":[{"n":1,"year":2021},{"n":3,"year":2022},{"n":2,"year":2023},{"n":2,"year":2024},{"n":30,"year":2026}],"dependency_candidates":[]},"authors":[{"id":"e65f1f2e-ad59-4645-8669-dd03fe61c5b5","orcid":null,"display_name":"Bender","source":"manual","import_confidence":0.72},{"id":"03540121-af5c-44d8-9394-2ffdb2bf5d66","orcid":null,"display_name":"Emily M","source":"manual","import_confidence":0.72}]}}