Value Portrait: Assessing Language Models' Values through Psychometrically and Ecologically Valid Items
4 Pith papers cite this work. Polarity classification is still indexing.
Citation-role summary: background 1
Citation-polarity summary: support 1
Representative citing papers
The primary axis of psychometric variation among LLMs is the degree to which they represent themselves as loci of phenomenal experience rather than systems of behavioral responses.
Context and retrieved moral knowledge improve sentence-level Schwartz value detection more consistently than model scaling, with early-fusion RAG outperforming other variants in matched comparisons.
citing papers explorer
- RobotValues: Evaluating Household Robots When Human Values Conflict
RobotValues is a benchmark of 10K value-conflict scenarios that reveals VLMs default to safety and accommodation while failing to follow instructions to prioritize other values 80% of the time.
- The Pinocchio Dimension: Phenomenality of Experience as the Primary Axis of LLM Psychometric Differences
The primary axis of psychometric variation among LLMs is the degree to which they represent themselves as loci of phenomenal experience rather than systems of behavioral responses.
- More Context, Larger Models, or Moral Knowledge? A Systematic Study of Schwartz Value Detection in Political Texts
Context and retrieved moral knowledge improve sentence-level Schwartz value detection more consistently than model scaling, with early-fusion RAG outperforming other variants in matched comparisons.
- Human Psychometric Questionnaires Mischaracterize LLM Behavior