pith. sign in

(2) ShanghaiTech University

Identifiers

No identifiers captured yet.

Papers (1)

  1. The Two-Stage Decision-Sampling Hypothesis: Understanding the Emergence of Self-Reflection in RL-Trained LLMs cs.LG · 2026 · author #5

Mentions

No mention provenance yet.

Frequent Coauthors