The Illusion of Progress: Re-evaluating Hallucination Detection in LLM s

Janiak, Denis, Binkowski, Jakub, Sawczyn, Albert, Gabrys, Bogdan, Shwartz-Ziv, Ravid, Kajdanowicz, Tomasz Jan · 2025 · DOI 10.18653/v1/2025.emnlp-main.1761

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

Automatic Layer Selection for Hallucination Detection

cs.AI · 2026-05-25 · unverdicted · novelty 6.0

FEPoID automatically selects optimal or near-optimal intermediate layers for hallucination detection across LLM architectures and tasks, outperforming prior criteria and baselines, with an added truncation step that further improves performance.

CORTEX: Token-Level Hallucination Detection in RAG via Comparative Internal Representations

cs.CL · 2026-06-30 · unverdicted · novelty 5.0

CORTEX detects token-level hallucinations in RAG via comparative internal representations, information propagation, and smoothing, reporting gains on two benchmarks with three LLMs.

From Signals to Transfer: A Factorised Study of Probe-Based Uncertainty Estimation in Large Language Models

cs.CL · 2026-06-26 · conditional · novelty 5.0

A factorized study finds raw hidden states and attention features hard to beat in-domain for LLM uncertainty probes, but structured compressed features are more robust under distribution shift, with pretrained probes transferring to open-ended generation.

citing papers explorer

Showing 2 of 2 citing papers after filters.

CORTEX: Token-Level Hallucination Detection in RAG via Comparative Internal Representations cs.CL · 2026-06-30 · unverdicted · none · ref 35
CORTEX detects token-level hallucinations in RAG via comparative internal representations, information propagation, and smoothing, reporting gains on two benchmarks with three LLMs.
From Signals to Transfer: A Factorised Study of Probe-Based Uncertainty Estimation in Large Language Models cs.CL · 2026-06-26 · conditional · none · ref 25
A factorized study finds raw hidden states and attention features hard to beat in-domain for LLM uncertainty probes, but structured compressed features are more robust under distribution shift, with pretrained probes transferring to open-ended generation.

The Illusion of Progress: Re-evaluating Hallucination Detection in LLM s

fields

years

verdicts

representative citing papers

citing papers explorer