BEACON: Behavioral Entropy Aggregation for Cross-Model Hallucination Detection in Large Language Models

Hansika Aredla; Jinal Prashant Thakkar; Kondaguduru Abhiram; Naveen Bera; Pulijala Sai Nikhila; Shaik Gayaz Ali; Shaik Mohammed Omar; Shalmali Ayachit; Shoaib Sadiq Salehmohamed

arxiv: 2606.07528 · v1 · pith:HOFCZN66new · submitted 2026-04-20 · 💻 cs.CL · cs.AI· cs.LG

BEACON: Behavioral Entropy Aggregation for Cross-Model Hallucination Detection in Large Language Models

Naveen Bera , Pulijala Sai Nikhila , Kondaguduru Abhiram , Shaik Gayaz Ali , Shoaib Sadiq Salehmohamed , Shaik Mohammed Omar , Jinal Prashant Thakkar , Hansika Aredla

show 1 more author

Shalmali Ayachit

This is my paper

classification 💻 cs.CL cs.AIcs.LG

keywords hallucinationentropybeacondetectionachievesacrossaggregationauroc

0 comments

read the original abstract

Hallucination in large language models (LLMs), defined as the generation of factually incorrect or unsupported content, remains a critical barrier to reliable deployment. We present BEACON (Behavioral Entropy Aggregation for Cross-model hallucination detectiON), a black-box hallucination detection framework that operates purely on model outputs without requiring access to internal representations or external knowledge bases. BEACON extracts a 31-dimensional feature vector from structured multi-pass generation, integrating NLI-based semantic entropy, embedding geometry, chain-of-thought consistency, and paraphrase stability signals. A gradient-boosted classifier trained on 7,617 labeled examples across seven benchmarks achieves 0.8123 +/- 0.0102 AUROC (95% CI: 0.7632-0.8251), outperforming standalone semantic entropy (+0.2298) and SelfCheckGPT-style consistency baselines (+0.2457). Feature importance analysis shows that hallucination is inherently multi-dimensional, requiring combined uncertainty signals. An efficient 5-call variant achieves 0.7795 AUROC, enabling practical deployment across black-box LLM APIs.

This paper has not been read by Pith yet.

BEACON: Behavioral Entropy Aggregation for Cross-Model Hallucination Detection in Large Language Models

discussion (0)