pith. sign in

Towards evaluating AI systems for moral status using self-reports

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

fields

cs.CL 3

years

2026 2 2024 1

verdicts

UNVERDICTED 3

roles

background 1

polarities

support 1

representative citing papers

LLM Evaluators Recognize and Favor Their Own Generations

cs.CL · 2024-04-15 · unverdicted · novelty 6.0

LLMs show measurable self-recognition that linearly correlates with self-preference bias in evaluations, supported by fine-tuning experiments and controls for confounders.

citing papers explorer

Showing 3 of 3 citing papers.