Open-Reasoner-Zero: An open source approach to scaling up reinforcement learning on the base model, 2025

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.LG 1

years

2025 1

verdicts

CONDITIONAL 1

representative citing papers

Skywork Open Reasoner 1 Technical Report

cs.LG · 2025-05-28 · conditional · novelty 4.0

Skywork-OR1 uses RL on distilled CoT models to lift math and coding benchmark accuracy by 13-15 points while open-sourcing everything.

citing papers explorer

Showing 1 of 1 citing paper.

  • Skywork Open Reasoner 1 Technical Report cs.LG · 2025-05-28 · conditional · none · ref 8
