hub Canonical reference

Ong, and Nick Haber

Jared Moore, Declan Grabb, William Agnew, Kevin Klyman, Stevie Chancellor, Desmond C · 2025 · arXiv 5275.373203

Canonical reference. 83% of citing Pith papers cite this work as background.

14 Pith papers citing it

Background 83% of classified citations

read on arXiv browse 14 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 6

citation-polarity summary

background 5 support 1

representative citing papers

Using LLM-as-a-Judge/Jury to Advance Scalable, Clinically-Validated Safety Evaluations of Model Responses to Users Demonstrating Psychosis

cs.CL · 2026-03-20 · conditional · novelty 7.0

Seven clinician-informed safety criteria enable LLM-as-a-Judge to reach substantial agreement with human consensus (Cohen's κ up to 0.75) on evaluating LLM responses to users demonstrating psychosis.

Direction-Flipped Influence Audits Reveal Hidden Structure in Moral Choices of LLMs

cs.LG · 2026-02-26 · conditional · novelty 7.0

Direction-flipped influence audits show contextual cues shift LLM moral choices by 12-18 points on average across multiple benchmarks, revealing asymmetries, backfires, and inconsistencies in 40% of conditions.

Style Amnesia: Investigating Speaking Style Degradation and Mitigation in Multi-Turn Spoken Language Models

cs.CL · 2025-12-29 · accept · novelty 7.0

Spoken language models exhibit style amnesia and fail to maintain instructed paralinguistic styles across multi-turn conversations, with explicit recall offering partial mitigation.

Gradual Voluntary Participation: A Framework for Participatory AI Governance in Journalism

cs.HC · 2026-04-23 · unverdicted · novelty 6.0

The study proposes the Gradual Voluntary Participation (GVP) framework to reconceptualize participatory AI governance in journalism as a gradual and voluntary process using a bidimensional matrix.

Chaplains' Reflections on the Design and Usage of AI for Conversational Care

cs.HC · 2026-02-03 · unverdicted · novelty 6.0

Chaplains view AI chatbots as unable to provide attuned pastoral care for non-clinical emotional needs, based on themes of listening, connecting, carrying, and wanting.

AI at the Front Lines of Platform Governance: Using LLMs to Support Illegal Content Reporting under the Digital Services Act

cs.HC · 2026-05-22 · unverdicted · novelty 5.0

EvalAI providing pro/con arguments improves provision-level accuracy and reduces misclassification distance in DSA illegal content reporting under AI error conditions versus conventional XAI.

The Quiet Path from Seemingly Minor Design Errors to Workplace AI Incidents

cs.HC · 2026-05-20 · unverdicted · novelty 5.0

Empirical analysis of 1,524 AI incident reports shows 83% arise from worker-AI trait misalignments, with 74% of those traceable to developers prioritizing efficiency over precision or personalization.

Strengthening Human-Centric Chain-of-Thought Reasoning Integrity in LLMs via a Structured Prompt Framework

cs.CR · 2026-04-06 · unverdicted · novelty 5.0

A 16-factor structured prompt framework strengthens CoT reasoning in LLMs for security analysis, yielding up to 40% reasoning gains in smaller models and stable accuracy improvements validated by human raters with Cohen's k > 0.80.

Breakdowns in Conversational AI: Interactional Failures in Emotionally and Ethically Sensitive Contexts

cs.CL · 2026-04-03 · unverdicted · novelty 5.0

Mainstream conversational models show escalating affective misalignments and ethical guidance failures during staged emotional trajectories, organized into a taxonomy of interactional breakdowns.

Talking to a Human as an Attitudinal Barrier: A Mixed Methods Evaluation of Stigma, Access, and the Appeal of AI Mental Health Support

cs.HC · 2026-02-24 · unverdicted · novelty 5.0

Shame/stigma and access barriers to therapy predict higher perceived helpfulness of AI mental health support, especially for therapy-experienced users, while access and cost barriers predict greater usage intensity.

The Consensus Trap: Dissecting Subjectivity and the "Ground Truth" Illusion in Data Annotation

cs.AI · 2026-02-11 · unverdicted · novelty 5.0

A literature review concludes that pursuing consensus in data annotation creates biased AI by dismissing subjective disagreements and enforcing geographic hegemony, and proposes mapping diversity instead.

The Pitfalls of KV Cache Compression

cs.LG · 2025-09-30 · conditional · novelty 5.0

KV cache compression causes certain instructions to degrade rapidly and be ignored in multi-instruction prompting, with system prompt leakage worsened by method choice, instruction order, and eviction bias; simple policy changes can mitigate this.

Opportunities and Risks of Generative AI through the Health Information Journey

cs.CY · 2026-05-21 · unverdicted · novelty 4.0

Authors propose a four-stage framework to analyze opportunities and risks of generative AI across the health information journey from public sources to clinical care.

AI and Suicide Prevention: A Cross-Sector Primer

cs.CY · 2026-05-05 · unverdicted · novelty 3.0

A cross-sector primer on AI chatbots as de facto mental health support, mapping challenges in suicide and NSSI response and identifying priority areas for industry alignment on standards and oversight.

citing papers explorer

Showing 14 of 14 citing papers.

Using LLM-as-a-Judge/Jury to Advance Scalable, Clinically-Validated Safety Evaluations of Model Responses to Users Demonstrating Psychosis cs.CL · 2026-03-20 · conditional · none · ref 45
Seven clinician-informed safety criteria enable LLM-as-a-Judge to reach substantial agreement with human consensus (Cohen's κ up to 0.75) on evaluating LLM responses to users demonstrating psychosis.
Direction-Flipped Influence Audits Reveal Hidden Structure in Moral Choices of LLMs cs.LG · 2026-02-26 · conditional · none · ref 4
Direction-flipped influence audits show contextual cues shift LLM moral choices by 12-18 points on average across multiple benchmarks, revealing asymmetries, backfires, and inconsistencies in 40% of conditions.
Style Amnesia: Investigating Speaking Style Degradation and Mitigation in Multi-Turn Spoken Language Models cs.CL · 2025-12-29 · accept · none · ref 32
Spoken language models exhibit style amnesia and fail to maintain instructed paralinguistic styles across multi-turn conversations, with explicit recall offering partial mitigation.
Gradual Voluntary Participation: A Framework for Participatory AI Governance in Journalism cs.HC · 2026-04-23 · unverdicted · none · ref 68
The study proposes the Gradual Voluntary Participation (GVP) framework to reconceptualize participatory AI governance in journalism as a gradual and voluntary process using a bidimensional matrix.
Chaplains' Reflections on the Design and Usage of AI for Conversational Care cs.HC · 2026-02-03 · unverdicted · none · ref 85
Chaplains view AI chatbots as unable to provide attuned pastoral care for non-clinical emotional needs, based on themes of listening, connecting, carrying, and wanting.
AI at the Front Lines of Platform Governance: Using LLMs to Support Illegal Content Reporting under the Digital Services Act cs.HC · 2026-05-22 · unverdicted · none · ref 124
EvalAI providing pro/con arguments improves provision-level accuracy and reduces misclassification distance in DSA illegal content reporting under AI error conditions versus conventional XAI.
The Quiet Path from Seemingly Minor Design Errors to Workplace AI Incidents cs.HC · 2026-05-20 · unverdicted · none · ref 65
Empirical analysis of 1,524 AI incident reports shows 83% arise from worker-AI trait misalignments, with 74% of those traceable to developers prioritizing efficiency over precision or personalization.
Strengthening Human-Centric Chain-of-Thought Reasoning Integrity in LLMs via a Structured Prompt Framework cs.CR · 2026-04-06 · unverdicted · none · ref 44
A 16-factor structured prompt framework strengthens CoT reasoning in LLMs for security analysis, yielding up to 40% reasoning gains in smaller models and stable accuracy improvements validated by human raters with Cohen's k > 0.80.
Breakdowns in Conversational AI: Interactional Failures in Emotionally and Ethically Sensitive Contexts cs.CL · 2026-04-03 · unverdicted · none · ref 24
Mainstream conversational models show escalating affective misalignments and ethical guidance failures during staged emotional trajectories, organized into a taxonomy of interactional breakdowns.
Talking to a Human as an Attitudinal Barrier: A Mixed Methods Evaluation of Stigma, Access, and the Appeal of AI Mental Health Support cs.HC · 2026-02-24 · unverdicted · none · ref 13
Shame/stigma and access barriers to therapy predict higher perceived helpfulness of AI mental health support, especially for therapy-experienced users, while access and cost barriers predict greater usage intensity.
The Consensus Trap: Dissecting Subjectivity and the "Ground Truth" Illusion in Data Annotation cs.AI · 2026-02-11 · unverdicted · none · ref 146
A literature review concludes that pursuing consensus in data annotation creates biased AI by dismissing subjective disagreements and enforcing geographic hegemony, and proposes mapping diversity instead.
The Pitfalls of KV Cache Compression cs.LG · 2025-09-30 · conditional · none · ref 8
KV cache compression causes certain instructions to degrade rapidly and be ignored in multi-instruction prompting, with system prompt leakage worsened by method choice, instruction order, and eviction bias; simple policy changes can mitigate this.
Opportunities and Risks of Generative AI through the Health Information Journey cs.CY · 2026-05-21 · unverdicted · none · ref 102
Authors propose a four-stage framework to analyze opportunities and risks of generative AI across the health information journey from public sources to clinical care.
AI and Suicide Prevention: A Cross-Sector Primer cs.CY · 2026-05-05 · unverdicted · none · ref 4
A cross-sector primer on AI chatbots as de facto mental health support, mapping challenges in suicide and NSSI response and identifying priority areas for industry alignment on standards and oversight.

Ong, and Nick Haber

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer