Once incremental gains per additional curated million pairs fall below a user-defined threshold (e.g., ¡0.3 points on average benchmark score), it is reasonable to stop spending

Validation, stopping criteria

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

cs.CL · 2025-07-02 · conditional · novelty 6.0

Skywork-Reward-V2 models trained on 26 million human-AI curated preference pairs set new state-of-the-art results on seven major reward model benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy cs.CL · 2025-07-02 · conditional · none · ref 11
Skywork-Reward-V2 models trained on 26 million human-AI curated preference pairs set new state-of-the-art results on seven major reward model benchmarks.

Once incremental gains per additional curated million pairs fall below a user-defined threshold (e.g., ¡0.3 points on average benchmark score), it is reasonable to stop spending

fields

years

verdicts

representative citing papers

citing papers explorer