Liang, Ronan Le Bras, Katharina Reinecke, and Maarten Sap

Sebastin Santy, Jenny T Liang, Ronan Le Bras, Katharina Reinecke, Maarten Sap · 2023 · arXiv 2306.01943

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

support 1

representative citing papers

A Roadmap to Pluralistic Alignment

cs.AI · 2024-02-07 · unverdicted · novelty 6.0

The paper formalizes three types of pluralistic AI models and three benchmark classes, arguing that current alignment techniques may reduce rather than increase distributional pluralism.

Framing an AI with Values Reduces AI Reliance in AI-supported Writing Tasks

cs.HC · 2026-05-19 · unverdicted · novelty 4.0

An online experiment finds that showing users an overview of an AI's values reduces reliance on AI suggestions during writing tasks.

Inertia in Moral and Value Judgments of Large Language Models

cs.CL · 2024-08-16 · unverdicted · novelty 4.0

LLMs exhibit persistent inertia in value orientations, with harm avoidance and fairness remaining skewed across persona prompts.

citing papers explorer

Showing 2 of 2 citing papers after filters.

A Roadmap to Pluralistic Alignment cs.AI · 2024-02-07 · unverdicted · none · ref 284
The paper formalizes three types of pluralistic AI models and three benchmark classes, arguing that current alignment techniques may reduce rather than increase distributional pluralism.
Inertia in Moral and Value Judgments of Large Language Models cs.CL · 2024-08-16 · unverdicted · none · ref 38
LLMs exhibit persistent inertia in value orientations, with harm avoidance and fairness remaining skewed across persona prompts.

Liang, Ronan Le Bras, Katharina Reinecke, and Maarten Sap

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer