DRP combines teacher-guided pruning of chain-of-thought steps with distillation to cut token usage in reasoning models on GSM8K and AIME while maintaining or improving accuracy.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
DRP: Distilled Reasoning Pruning with Skill-aware Step Decomposition for Efficient Large Reasoning Models
DRP combines teacher-guided pruning of chain-of-thought steps with distillation to cut token usage in reasoning models on GSM8K and AIME while maintaining or improving accuracy.