Addressing uncertainty in llms to enhance reliability in generative ai

Ramneet Kaur, Colin Samplawski, Adam D Cobb, Anirban Roy, Brian Matejek, Manoj Acharya, Daniel Elenius, Alexander M Berenbeim, John A Pavlik, Nathaniel D Bastian, et al · 2024 · arXiv 2411.02381

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

Principled Detection of Hallucinations in Large Language Models via Multiple Testing

cs.CL · 2025-08-25 · unverdicted · novelty 6.0

The method aggregates multiple hallucination evaluation scores via conformal p-values to enable calibrated detection with controlled false alarm rates across LLMs and datasets.

From Actions to Understanding: Conformal Interpretability of Temporal Concepts in LLM Agents

cs.AI · 2026-03-27 · unverdicted · novelty 5.0

A conformal interpretability method labels LLM agent states step-by-step and extracts linearly separable temporal concept directions aligned with task success on ScienceWorld and AlfWorld.

citing papers explorer

Showing 2 of 2 citing papers.

Principled Detection of Hallucinations in Large Language Models via Multiple Testing cs.CL · 2025-08-25 · unverdicted · none · ref 7
The method aggregates multiple hallucination evaluation scores via conformal p-values to enable calibrated detection with controlled false alarm rates across LLMs and datasets.
From Actions to Understanding: Conformal Interpretability of Temporal Concepts in LLM Agents cs.AI · 2026-03-27 · unverdicted · none · ref 19
A conformal interpretability method labels LLM agent states step-by-step and extracts linearly separable temporal concept directions aligned with task success on ScienceWorld and AlfWorld.

Addressing uncertainty in llms to enhance reliability in generative ai

fields

years

verdicts

representative citing papers

citing papers explorer