Tacler: Tailored curriculum reinforcement learning for efficient reasoning

Huiyuan Lai, Malvina Nissim · 2026 · arXiv 2601.21711

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Curriculum Reinforcement Learning Can Incentivize Reasoning Capacity in LLMs Beyond the Base Model

cs.LG · 2026-06-21 · unverdicted · novelty 6.0

Boundary-aware Curriculum RL raises average pass@256 by 9.8 points over base models and 10.3 points over vanilla RLVR on Qwen, Llama, and DeepSeek families.

citing papers explorer

Showing 1 of 1 citing paper.

Curriculum Reinforcement Learning Can Incentivize Reasoning Capacity in LLMs Beyond the Base Model cs.LG · 2026-06-21 · unverdicted · none · ref 17
Boundary-aware Curriculum RL raises average pass@256 by 9.8 points over base models and 10.3 points over vanilla RLVR on Qwen, Llama, and DeepSeek families.

Tacler: Tailored curriculum reinforcement learning for efficient reasoning

fields

years

verdicts

representative citing papers

citing papers explorer