arXiv preprint arXiv:2512.20949

3 Pith papers cite this work. Polarity classification is still indexing.

fields: cs.CL (3)

years: 2026 (3)

verdicts: unverdicted (3)

representative citing papers

Sparse Reward Subsystem in Large Language Models

cs.CL · 2026-02-01 · unverdicted · novelty 6.0

LLM hidden states contain a sparse reward subsystem consisting of value neurons that predict state value and dopamine neurons that encode step-level temporal difference errors.

MultiHaluDet: Multilingual Hallucination Detection via LLM Hidden State Probing

cs.CL · 2026-05-24 · unverdicted · novelty 5.0

MultiHaluDet uses multi-layer hidden-state probing, multi-scale attention, and a calibrated classifier ensemble to detect multilingual hallucinations, reporting up to 98.55% AUROC on English benchmarks and strong cross-lingual transfer to French, Bangla, and Amharic.
