SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs

Erchin Serpedin; Hasan Kurban; Parichit Sharma; Rachad Atat; Samir Abdaljalil

arxiv: 2503.05980 · v1 · pith:CGDCJ3U3new · submitted 2025-03-07 · 💻 cs.CL · cs.AI

SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs

Samir Abdaljalil , Hasan Kurban , Parichit Sharma , Erchin Serpedin , Rachad Atat This is my paper

classification 💻 cs.CL cs.AI

keywords llmsdetectionhallucinationacrossclusteringframeworkinconsistencysemantic

0 comments

read the original abstract

Large language models (LLMs) are increasingly deployed across diverse domains, yet they are prone to generating factually incorrect outputs - commonly known as "hallucinations." Among existing mitigation strategies, uncertainty-based methods are particularly attractive due to their ease of implementation, independence from external data, and compatibility with standard LLMs. In this work, we introduce a novel and scalable uncertainty-based semantic clustering framework for automated hallucination detection. Our approach leverages sentence embeddings and hierarchical clustering alongside a newly proposed inconsistency measure, SINdex, to yield more homogeneous clusters and more accurate detection of hallucination phenomena across various LLMs. Evaluations on prominent open- and closed-book QA datasets demonstrate that our method achieves AUROC improvements of up to 9.3% over state-of-the-art techniques. Extensive ablation studies further validate the effectiveness of each component in our framework.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Systematic Evaluation of Black-Box Uncertainty Estimation Methods for Large Language Models
cs.AI 2026-06 unverdicted novelty 7.0

A unified benchmark of 24 black-box UE methods for LLMs finds no universal winner but favors methods that reason over answer candidates and hybrid combinations of signals.