Benchmarks and Metrics We evaluate the models on a suite of mathematical reasoning benchmarks

Table 3 · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Flexible Entropy Control in RLVR with a Gradient-Preserving Perspective

cs.LG · 2026-02-10 · unverdicted · novelty 6.0

Dynamic clipping strategies based on importance sampling regions enable precise entropy management in RLVR, mitigating collapse and improving benchmark performance.

citing papers explorer

Showing 1 of 1 citing paper.

Flexible Entropy Control in RLVR with a Gradient-Preserving Perspective cs.LG · 2026-02-10 · unverdicted · none · ref 29
Dynamic clipping strategies based on importance sampling regions enable precise entropy management in RLVR, mitigating collapse and improving benchmark performance.

Benchmarks and Metrics We evaluate the models on a suite of mathematical reasoning benchmarks

fields

years

verdicts

representative citing papers

citing papers explorer