Metis-specs: Decoupling multimodal learning via self-distilled preference-based cold start.arXiv preprint arXiv:2510.25801,

Chen, K · arXiv 2510.25801

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Flexible Entropy Control in RLVR with a Gradient-Preserving Perspective

cs.LG · 2026-02-10 · unverdicted · novelty 6.0

Dynamic clipping strategies based on importance sampling regions enable precise entropy management in RLVR, mitigating collapse and improving benchmark performance.

citing papers explorer

Showing 1 of 1 citing paper.

Flexible Entropy Control in RLVR with a Gradient-Preserving Perspective cs.LG · 2026-02-10 · unverdicted · none · ref 3
Dynamic clipping strategies based on importance sampling regions enable precise entropy management in RLVR, mitigating collapse and improving benchmark performance.

Metis-specs: Decoupling multimodal learning via self-distilled preference-based cold start.arXiv preprint arXiv:2510.25801,

fields

years

verdicts

representative citing papers

citing papers explorer