Title resolution pending

Yong Yi Bay, Kathleen A · 2024 · arXiv 2403.01621

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

GRPO, Dr. GRPO, and DAPO Are Three Operations on One Number: The Group-Standard-Deviation Identity

cs.LG · 2026-06-30 · unverdicted · novelty 7.0

GRPO, Dr. GRPO, and DAPO are three settings of one dial on the group standard deviation of binary rewards, unified by the group-standard-deviation identity where disagreement equals update magnitude.

When More Sampling Hurts: The Modal Ceiling and Correlation Ceiling of Test-Time Scaling

cs.LG · 2026-06-27 · unverdicted · novelty 6.0

Test-time sampling improves coverage but stalls at modal and correlation ceilings for answer selection, with the effective number of samples as the practical limit.

citing papers explorer

Showing 2 of 2 citing papers after filters.

GRPO, Dr. GRPO, and DAPO Are Three Operations on One Number: The Group-Standard-Deviation Identity cs.LG · 2026-06-30 · unverdicted · none · ref 21
GRPO, Dr. GRPO, and DAPO are three settings of one dial on the group standard deviation of binary rewards, unified by the group-standard-deviation identity where disagreement equals update magnitude.
When More Sampling Hurts: The Modal Ceiling and Correlation Ceiling of Test-Time Scaling cs.LG · 2026-06-27 · unverdicted · none · ref 25
Test-time sampling improves coverage but stalls at modal and correlation ceilings for answer selection, with the effective number of samples as the practical limit.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer