Title resolution pending

for mathematical tasks · 2022

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

When Importance Sampling Misallocates Credit: Asymmetric Ratios for Outcome-Supervised RL

cs.CL · 2025-10-07 · unverdicted · novelty 6.0

The paper identifies that importance sampling ratios in outcome-supervised RL misallocate credit by creating unbalanced token updates, and introduces ASPO to correct the asymmetry for positive-advantage tokens.

citing papers explorer

Showing 1 of 1 citing paper.

When Importance Sampling Misallocates Credit: Asymmetric Ratios for Outcome-Supervised RL cs.CL · 2025-10-07 · unverdicted · none · ref 20
The paper identifies that importance sampling ratios in outcome-supervised RL misallocate credit by creating unbalanced token updates, and introduces ASPO to correct the asymmetry for positive-advantage tokens.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer