Deepseek-v4: Towards highly efficient million-token context intelligence

6 Pith papers cite this work. Polarity classification is still indexing.

years: 2026 (6)

verdicts: unverdicted (6)

representative citing papers

DOPD: Dual On-policy Distillation

cs.AI · 2026-06-29 · unverdicted · novelty 5.0

DOPD is an advantage-aware dual distillation method for on-policy distillation. It dynamically assigns token-level supervision from either a privileged teacher or the student itself, transferring capability while mitigating the information asymmetry that the student cannot replicate.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • DOPD: Dual On-policy Distillation cs.AI · 2026-06-29 · unverdicted · none · ref 44
