DeepSeek-R1 Incentivizes Reasoning in LLMs Through Reinforcement Learning.Nature, 645 (8081):633–638

Daya Guo, Dejian Yang, Haowei Zhang, Junxiao Song, Peiyi Wang, Qihao Zhu, Runxin Xu, Ruoyu Zhang, Shirong Ma, Xiao Bi, et al · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Automated Conjecture Resolution with Formal Verification

cs.LG · 2026-04-04

citing papers explorer

Showing 1 of 1 citing paper.

Automated Conjecture Resolution with Formal Verification cs.LG · 2026-04-04 · unreviewed · ref 22