pith. sign in

← back to paper

Review history

arxiv: 2605.08738 · 2 revisions

SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training

  1. 2026-05-20 UNVERDICTED LOW v0.9.0 novelty 5.0
    25978 ms 5822 in 1290 out 2026-05-20T23:18:25.084291+00:00
  2. 2026-05-12 UNVERDICTED LOW v0.9.0 novelty 6.0
    45631 ms 5591 in 1247 out 2026-05-12T03:24:19.975833+00:00