Online learning from strategic human feedback in LLM fine-tuning

1 Pith paper cites this work. Polarity classification is still indexing.

fields

cs.GT 1

years

2025 1

verdicts

UNVERDICTED 1

representative citing papers

Incentivizing High-Quality Human Annotations with Golden Questions

cs.GT · 2025-05-25 · unverdicted · novelty 7.0

The paper derives a Θ(1/√(n log n)) hypothesis testing rate under strategic annotator behavior and shows that high-certainty, format-similar golden questions better reveal annotation quality than standard checks.

citing papers explorer

Showing 1 of 1 citing paper.

  • Incentivizing High-Quality Human Annotations with Golden Questions cs.GT · 2025-05-25 · unverdicted · none · ref 19