pith. machine review for the scientific record. sign in

Language models are hid- 41 den reasoners: Unlocking latent reasoning capabilities via self-rewarding

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

citation-role summary

background 1 other 1

citation-polarity summary

fields

cs.AI 1 cs.IR 1

years

2026 1 2025 1

verdicts

UNVERDICTED 2

polarities

background 1 unclear 1

representative citing papers

LASAR: Latent Adaptive Semantic Aligned Reasoning for Generative Recommendation

cs.IR · 2026-05-11 · unverdicted · novelty 6.0

LASAR uses two-stage supervised training plus reinforcement learning to ground semantic IDs, align latent reasoning trajectories to CoT hidden states via KL divergence, and adaptively choose reasoning depth, halving average steps while improving quality on three datasets.

citing papers explorer

Showing 2 of 2 citing papers.