Characterizing LLM abstention behavior in science QA with context perturbations. arXiv preprint arXiv:2404.12452

Bingbing Wen, Bill Howe, Lucy Lu Wang · 2024 · arXiv 2404.12452

3 Pith papers cite this work. Polarity classification is still being indexed.

representative citing papers

Causal Evidence that Language Models use Confidence to Drive Behavior

cs.LG · 2026-03-23 · unverdicted · novelty 6.0

Language models deploy multidimensional internal confidence representations and threshold-based policies to control abstention behavior, with causal support from activation steering experiments.

ERA: Evidence-based Reliability Alignment for Honest Retrieval-Augmented Generation

cs.IR · 2026-02-24 · unverdicted · novelty 6.0

ERA models internal and external knowledge as independent Dirichlet belief masses and uses Dempster-Shafer Theory to quantify conflicts, enabling better abstention decisions in RAG systems.

Textual Bayes: Quantifying Prompt Uncertainty in LLM-Based Systems

cs.LG · 2025-06-11 · unverdicted · novelty 6.0

Introduces a Bayesian framework that treats LLM prompts as textual parameters and proposes MHLP, a novel MCMC algorithm using LLM proposals, to perform inference and improve both accuracy and uncertainty quantification on benchmarks.

citing papers explorer

Showing 3 of 3 citing papers.

Causal Evidence that Language Models use Confidence to Drive Behavior

cs.LG · 2026-03-23 · unverdicted · none · ref 23

Language models deploy multidimensional internal confidence representations and threshold-based policies to control abstention behavior, with causal support from activation steering experiments.

ERA: Evidence-based Reliability Alignment for Honest Retrieval-Augmented Generation

cs.IR · 2026-02-24 · unverdicted · none · ref 40

ERA models internal and external knowledge as independent Dirichlet belief masses and uses Dempster-Shafer Theory to quantify conflicts, enabling better abstention decisions in RAG systems.

Textual Bayes: Quantifying Prompt Uncertainty in LLM-Based Systems

cs.LG · 2025-06-11 · unverdicted · none · ref 67

Introduces a Bayesian framework that treats LLM prompts as textual parameters and proposes MHLP, a novel MCMC algorithm using LLM proposals, to perform inference and improve both accuracy and uncertainty quantification on benchmarks.
