A new evaluation framework shows that even the best tested LLM only reliably adjusts response complexity in the intended direction 46% of the time across 98 scientific queries.
Epstein, Nicole B
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 4roles
background 2polarities
background 2representative citing papers
Users entangle their lived experiences with AI predictions in menstrual tracking apps, leading to self-fulfilling prophecies, limited critical awareness from UI, and isolation for non-normative users.
Researchers created a stigma-aware WhatsApp chatbot for menstrual health education in Pakistan through co-design workshops and a two-week deployment, yielding insights on its use for challenging taboos alongside tensions around trust and cultural explanations.
SafeScreen enforces individualized safety constraints as a prerequisite for video retrieval by using profile extraction, adaptive VideoRAG analysis, and LLM decision-making to approve content for vulnerable users.
citing papers explorer
-
Explain Like I'm 5 or Whatever I Choose: Evaluating the Interactive Potential of Language Model Responses
A new evaluation framework shows that even the best tested LLM only reliably adjusts response complexity in the intended direction 46% of the time across 98 scientific queries.
-
"It became a self-fulfilling prophecy": How Lived Experiences are Entangled with AI Predictions in Menstrual Cycle Tracking Apps
Users entangle their lived experiences with AI predictions in menstrual tracking apps, leading to self-fulfilling prophecies, limited critical awareness from UI, and isolation for non-normative users.
-
Designing Around Stigma: Human-Centered LLMs for Menstrual Health
Researchers created a stigma-aware WhatsApp chatbot for menstrual health education in Pakistan through co-design workshops and a two-week deployment, yielding insights on its use for challenging taboos alongside tensions around trust and cultural explanations.
-
SafeScreen: A Safety-First Screening Framework for Personalized Video Retrieval for Vulnerable Users
SafeScreen enforces individualized safety constraints as a prerequisite for video retrieval by using profile extraction, adaptive VideoRAG analysis, and LLM decision-making to approve content for vulnerable users.