A survey on personalized and pluralistic preference alignment in large language models

Xie, Z · 2025 · arXiv 2504.07070

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

read on arXiv browse 7 citing papers

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

ProactBench: Beyond What The User Asked For

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

ProactBench measures LLM conversational proactivity in three phases using 198 multi-agent dialogues and finds recovery behavior hard to predict from existing benchmarks.

Preference-Aware Rubric Learning for Personalized Evaluation

cs.CL · 2026-05-29 · unverdicted · novelty 6.0

PARL formulates personalized LLM evaluation as a learning problem that induces preference-aware rubrics from raw user histories via discriminative RL and self-validation.

Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization

cs.AI · 2026-04-13 · unverdicted · novelty 6.0

TIPO applies preference-intensity weighting and padding gating to stabilize preference optimization for privacy personalization in mobile GUI agents, yielding higher alignment and distinction metrics than prior methods.

Personalization Meets Safety:Mechanisms,Risks,and Mitigations in Personalized LLMs

cs.AI · 2026-06-08 · unverdicted · novelty 5.0

A survey that maps safety risks in personalized LLMs, introduces a unified taxonomy, and highlights three structural inadequacies in existing research on user-invariant safety, isolated techniques, and short-term evaluations.

When to Ask a Question: Understanding Communication Strategies in Generative AI Tools

cs.GT · 2026-05-11 · unverdicted · novelty 5.0

A tradeoff model shows generative AI can reduce bias against diverse preferences by strategically eliciting information instead of always inferring from majority patterns.

POPI: Personalizing LLMs via Optimized Natural Language Preference Inference

cs.CL · 2025-10-17 · unverdicted · novelty 5.0

POPI distills user preferences into reusable natural-language summaries via a shared inference model and conditions a generator on them, trained jointly with RL to improve personalization quality while cutting context length by up to 10x on benchmarks.

Toward Human-Centered Multi-Agent Systems: Integrating Cognition, Culture, Values, and Cooperation in AI Agents

cs.MA · 2026-06-06 · unverdicted · novelty 3.0

A literature survey across cognitive science, sociolinguistics, and AI alignment that identifies the absence of unified frameworks for embedding cognition, culture, values, and cooperation into multi-agent LLM systems and outlines future directions.

citing papers explorer

Showing 7 of 7 citing papers after filters.

ProactBench: Beyond What The User Asked For cs.LG · 2026-05-09 · unverdicted · none · ref 72
ProactBench measures LLM conversational proactivity in three phases using 198 multi-agent dialogues and finds recovery behavior hard to predict from existing benchmarks.
Preference-Aware Rubric Learning for Personalized Evaluation cs.CL · 2026-05-29 · unverdicted · none · ref 29
PARL formulates personalized LLM evaluation as a learning problem that induces preference-aware rubrics from raw user histories via discriminative RL and self-validation.
Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization cs.AI · 2026-04-13 · unverdicted · none · ref 35
TIPO applies preference-intensity weighting and padding gating to stabilize preference optimization for privacy personalization in mobile GUI agents, yielding higher alignment and distinction metrics than prior methods.
Personalization Meets Safety:Mechanisms,Risks,and Mitigations in Personalized LLMs cs.AI · 2026-06-08 · unverdicted · none · ref 35
A survey that maps safety risks in personalized LLMs, introduces a unified taxonomy, and highlights three structural inadequacies in existing research on user-invariant safety, isolated techniques, and short-term evaluations.
When to Ask a Question: Understanding Communication Strategies in Generative AI Tools cs.GT · 2026-05-11 · unverdicted · none · ref 58
A tradeoff model shows generative AI can reduce bias against diverse preferences by strategically eliciting information instead of always inferring from majority patterns.
POPI: Personalizing LLMs via Optimized Natural Language Preference Inference cs.CL · 2025-10-17 · unverdicted · none · ref 46
POPI distills user preferences into reusable natural-language summaries via a shared inference model and conditions a generator on them, trained jointly with RL to improve personalization quality while cutting context length by up to 10x on benchmarks.
Toward Human-Centered Multi-Agent Systems: Integrating Cognition, Culture, Values, and Cooperation in AI Agents cs.MA · 2026-06-06 · unverdicted · none · ref 35
A literature survey across cognitive science, sociolinguistics, and AI alignment that identifies the absence of unified frameworks for embedding cognition, culture, values, and cooperation into multi-agent LLM systems and outlines future directions.

A survey on personalized and pluralistic preference alignment in large language models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer