Language models prefer what they know: Relative confidence estimation via confidence preferences.arXiv preprint arXiv:2502.01126

Vaishnavi Shrivastava, Ananya Kumar, Percy Liang · 2025 · arXiv 2502.01126

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Quantifying Consistency in LLM Logical Reasoning via Structural Uncertainty

cs.AI · 2026-06-15 · unverdicted · novelty 7.0

Structural uncertainty from self-preference-induced rankings of LLM reasoning paths complements answer dispersion for identifying unreliable instances on logical tasks while collapsing on factual retrieval.

CLSGen: A Dual-Head Fine-Tuning Framework for Joint Probabilistic Classification and Verbalized Explanation

cs.CL · 2026-04-13 · unverdicted · novelty 6.0

CLSGen is a dual-head LLM fine-tuning framework that enables joint probabilistic classification and verbalized explanation generation without catastrophic forgetting of generative capabilities.

Inertia in Moral and Value Judgments of Large Language Models

cs.CL · 2024-08-16 · unverdicted · novelty 4.0

LLMs exhibit persistent inertia in value orientations, with harm avoidance and fairness remaining skewed across persona prompts.

citing papers explorer

Showing 3 of 3 citing papers.

Quantifying Consistency in LLM Logical Reasoning via Structural Uncertainty cs.AI · 2026-06-15 · unverdicted · none · ref 12
Structural uncertainty from self-preference-induced rankings of LLM reasoning paths complements answer dispersion for identifying unreliable instances on logical tasks while collapsing on factual retrieval.
CLSGen: A Dual-Head Fine-Tuning Framework for Joint Probabilistic Classification and Verbalized Explanation cs.CL · 2026-04-13 · unverdicted · none · ref 9
CLSGen is a dual-head LLM fine-tuning framework that enables joint probabilistic classification and verbalized explanation generation without catastrophic forgetting of generative capabilities.
Inertia in Moral and Value Judgments of Large Language Models cs.CL · 2024-08-16 · unverdicted · none · ref 45
LLMs exhibit persistent inertia in value orientations, with harm avoidance and fairness remaining skewed across persona prompts.

Language models prefer what they know: Relative confidence estimation via confidence preferences.arXiv preprint arXiv:2502.01126

fields

years

verdicts

representative citing papers

citing papers explorer