Personalisation within bounds: A risk taxonomy and policy framework for the alignment of large language models with personalised feedback

Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A · 2023 · arXiv 2303.05453

7 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

"Label from Somewhere": Reflexive Annotating for Situated AI Alignment

cs.HC · 2026-01-25 · unverdicted · novelty 6.0

Reflexive annotating elicits intersectional and positional metadata from crowd workers to make AI alignment annotations more situated and less assumed-neutral.

Simple synthetic data reduces sycophancy in large language models

cs.CL · 2023-08-07 · unverdicted · novelty 6.0

Scaling and instruction tuning increase sycophancy in LLMs on opinion and fact tasks, but a synthetic data fine-tuning intervention reduces it on held-out prompts.

AI-Augmented Surveys: Leveraging Large Language Models and Surveys for Opinion Prediction

cs.CL · 2023-05-16 · unverdicted · novelty 6.0

LLM embeddings enable strong retrodiction of masked GSS opinions via cross-validation and external validation but only modest performance on entirely unasked opinions.

When to Ask a Question: Understanding Communication Strategies in Generative AI Tools

cs.GT · 2026-05-11 · unverdicted · novelty 5.0

A tradeoff model shows generative AI can reduce bias against diverse preferences by strategically eliciting information instead of always inferring from majority patterns.

AI Alignment From Social Choice Perspectives

cs.AI · 2026-06-19 · unverdicted · novelty 3.0

This survey examines applications of social choice theory to aggregating human feedback in AI alignment, identifying failure modes and expanding design options for disagreement.

Large Language Models Should Learn Personalized Rather Than Aggregated Human Preferences

cs.LG · 2026-05-30 · unverdicted · novelty 3.0

Position paper advocating personalized preference learning in LLMs over aggregated approaches, grounded in social choice theory and demographic variation.

Opportunities and Challenges of Large Language Models for Low-Resource Languages in Humanities Research

cs.CL · 2024-11-30 · unverdicted · novelty 2.0

This survey paper identifies opportunities for LLMs in low-resource language humanities research along with challenges in data accessibility, model adaptability, and cultural sensitivity.

citing papers explorer

Showing 7 of 7 citing papers.

"Label from Somewhere": Reflexive Annotating for Situated AI Alignment
cs.HC · 2026-01-25 · unverdicted · none · ref 62
Reflexive annotating elicits intersectional and positional metadata from crowd workers to make AI alignment annotations more situated and less assumed-neutral.

Simple synthetic data reduces sycophancy in large language models
cs.CL · 2023-08-07 · unverdicted · none · ref 17
Scaling and instruction tuning increase sycophancy in LLMs on opinion and fact tasks, but a synthetic data fine-tuning intervention reduces it on held-out prompts.

AI-Augmented Surveys: Leveraging Large Language Models and Surveys for Opinion Prediction
cs.CL · 2023-05-16 · unverdicted · none · ref 60
LLM embeddings enable strong retrodiction of masked GSS opinions via cross-validation and external validation but only modest performance on entirely unasked opinions.

When to Ask a Question: Understanding Communication Strategies in Generative AI Tools
cs.GT · 2026-05-11 · unverdicted · none · ref 27
A tradeoff model shows generative AI can reduce bias against diverse preferences by strategically eliciting information instead of always inferring from majority patterns.

AI Alignment From Social Choice Perspectives
cs.AI · 2026-06-19 · unverdicted · none · ref 124
This survey examines applications of social choice theory to aggregating human feedback in AI alignment, identifying failure modes and expanding design options for disagreement.

Large Language Models Should Learn Personalized Rather Than Aggregated Human Preferences
cs.LG · 2026-05-30 · unverdicted · none · ref 16
Position paper advocating personalized preference learning in LLMs over aggregated approaches, grounded in social choice theory and demographic variation.

Opportunities and Challenges of Large Language Models for Low-Resource Languages in Humanities Research
cs.CL · 2024-11-30 · unverdicted · none · ref 69
This survey paper identifies opportunities for LLMs in low-resource language humanities research along with challenges in data accessibility, model adaptability, and cultural sensitivity.
