Deep reinforcement learning from human preferences. Advances in neural information processing systems, 30

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

years: 2026 (2 papers)

representative citing papers

Heterogeneous Judge-Aware Ranking with Sensitivity, Disagreement, and Confidence

stat.ME · 2026-05-06 · unverdicted · novelty 6.0

HJA ranking separates consensus ranking, judge sensitivity, and residual disagreement as distinct inferential targets with identifiability conditions and an anchored alternating algorithm, yielding better recovery and uncertainty calibration than pooled baselines on synthetic and real data.
