arXiv preprint arXiv:2411.04991 , year=

Sun, H · 2024 · arXiv 2411.04991

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Curated Synthetic Data Doesn't Have to Collapse: A Theoretical Study of Generative Retraining with Pluralistic Preferences

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

Recursive generative retraining with heterogeneous rewards converges to a stable distribution satisfying a weighted Nash bargaining solution, preserving diversity under stated conditions.

How Humans Help LLMs: Assessing and Incentivizing Human Preference Annotators

cs.LG · 2025-02-10 · unverdicted · novelty 6.0

Develops self-consistency monitoring for preference annotators and derives sample-complexity bounds showing linear contracts achieve near-ideal performance faster than binary ones under continuous actions.

Large Language Models Should Learn Personalized Rather Than Aggregated Human Preferences

cs.LG · 2026-05-30 · unverdicted · novelty 3.0

Position paper advocating personalized preference learning in LLMs over aggregated approaches, grounded in social choice theory and demographic variation.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Curated Synthetic Data Doesn't Have to Collapse: A Theoretical Study of Generative Retraining with Pluralistic Preferences cs.LG · 2026-05-08 · unverdicted · none · ref 81
Recursive generative retraining with heterogeneous rewards converges to a stable distribution satisfying a weighted Nash bargaining solution, preserving diversity under stated conditions.
How Humans Help LLMs: Assessing and Incentivizing Human Preference Annotators cs.LG · 2025-02-10 · unverdicted · none · ref 83
Develops self-consistency monitoring for preference annotators and derives sample-complexity bounds showing linear contracts achieve near-ideal performance faster than binary ones under continuous actions.
Large Language Models Should Learn Personalized Rather Than Aggregated Human Preferences cs.LG · 2026-05-30 · unverdicted · none · ref 27
Position paper advocating personalized preference learning in LLMs over aggregated approaches, grounded in social choice theory and demographic variation.

arXiv preprint arXiv:2411.04991 , year=

fields

years

verdicts

representative citing papers

citing papers explorer