Siren’s song in the ai ocean: A survey on hallucination in large language models

Yue Zhang, Yafu Li, Leyang Cui, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Chen Xu, Yulong Chen, Longyue Wang, Anh Tuan Luu, Wei Bi, Freda Shi, Shuming Shi · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

EnvSimBench: A Benchmark for Evaluating and Improving LLM-Based Environment Simulation

cs.AI · 2026-05-08 · unverdicted · novelty 6.0

EnvSimBench reveals that state-of-the-art LLMs exhibit a universal state change cliff in environment simulation, with a new constraint-driven pipeline raising synthesis yield by 6.8% and cutting costs over 90%.

citing papers explorer

Showing 1 of 1 citing paper.

EnvSimBench: A Benchmark for Evaluating and Improving LLM-Based Environment Simulation cs.AI · 2026-05-08 · unverdicted · none · ref 16
EnvSimBench reveals that state-of-the-art LLMs exhibit a universal state change cliff in environment simulation, with a new constraint-driven pipeline raising synthesis yield by 6.8% and cutting costs over 90%.

Siren’s song in the ai ocean: A survey on hallucination in large language models

fields

years

verdicts

representative citing papers

citing papers explorer