pith. sign in

← back to paper

Review history

arxiv: 2605.31483 · 2 revisions

BenHalluEval: A Multi-Task Hallucination Evaluation Framework for Large Language Models on Bengali

  1. 2026-06-30 UNVERDICTED LOW v0.9.1-grok novelty 7.0
    33752 ms 5826 in 1149 out 2026-06-30T10:49:22.094717+00:00
  2. 2026-06-28 UNVERDICTED LOW v0.9.1-grok novelty 8.0
    27799 ms 5826 in 1366 out 2026-06-28T22:40:52.427309+00:00