WorldBench: Quantifying geographic disparities in LLM factual recall

Birhane, Abeba, Dehdashtian, Sepehr, Prabhu, Vinay, Boddeti, Vishnu , year = · 2024 · arXiv 0106.365896

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

read on arXiv browse 6 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Unmasking LAION-5B: Age, Gender, Race, and Emotion Biases in Large-Scale Image Datasets

cs.CV · 2026-06-22 · unverdicted · novelty 6.0

Empirical audit of LAION-2B-en and LAION-2B-multi finds overrepresentation of young adults, White people, and males plus stereotypical emotion associations across two attribute classifiers.

Predictable Confabulations: Factual Recall by LLMs Scales with Model Size and Topic Frequency

cs.CL · 2026-05-18 · unverdicted · novelty 6.0

Factual recall quality in LLMs follows a sigmoid scaling law in the log-linear combination of model parameter count and topic frequency in training data, explaining 60% of variance across models and up to 94% within families.

Talking to a Know-It-All GPT or a Second-Guesser Claude? How Repair reveals unreliable Multi-Turn Behavior in LLMs

cs.CL · 2026-04-21 · unverdicted · novelty 6.0

Each tested LLM shows its own characteristic unreliability when engaging in repair during extended math-question dialogues.

AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Values from Casual Conversations

cs.HC · 2026-01-30 · unverdicted · novelty 6.0

13 participants became convinced AI understands human values after chatbot interactions evaluated with the VAPT toolkit.

SafeScreen: A Safety-First Screening Framework for Personalized Video Retrieval for Vulnerable Users

cs.CV · 2026-03-12 · unverdicted · novelty 5.0

SafeScreen enforces individualized safety constraints as a prerequisite for video retrieval by using profile extraction, adaptive VideoRAG analysis, and LLM decision-making to approve content for vulnerable users.

Understanding AI Trustworthiness: A Scoping Review of AIES & FAccT Articles

cs.AI · 2025-10-24 · unverdicted · novelty 3.0

A scoping review of AIES and FAccT literature concludes that AI trustworthiness research prioritizes technical precision over social, ethical, and institutional factors, leaving the sociotechnical nature of AI systems underexplored.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.

WorldBench: Quantifying geographic disparities in LLM factual recall

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer