Navigating the grey area: Expressions of overconfidence and uncertainty in language models

Kaitlyn Zhou, Dan Jurafsky, Tatsunori Hashimoto · 2023 · arXiv 2302.13439

3 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

cs.CL · 2023-05-19 · unverdicted · novelty 6.0

CRITIC improves LLM outputs on question answering, mathematical program synthesis, and toxicity reduction by having the model interact with external tools to critique and revise its initial generations.

Calibrating Model-Based Evaluation Metrics for Summarization

cs.CL · 2026-04-19 · unverdicted · novelty 5.0

A reference-free proxy scoring framework combined with GIRB calibration produces better-aligned evaluation metrics for summarization and outperforms baselines across seven datasets.

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

cs.AI · 2023-08-10 · accept · novelty 5.0

Survey organizes LLM trustworthiness into seven categories and 29 sub-categories, measures eight sub-categories on popular models, and finds that more aligned models generally score higher but with varying effectiveness.

citing papers explorer

Showing 3 of 3 citing papers.

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing cs.CL · 2023-05-19 · unverdicted · none · ref 5
Calibrating Model-Based Evaluation Metrics for Summarization cs.CL · 2026-04-19 · unverdicted · none · ref 101
Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment cs.AI · 2023-08-10 · accept · none · ref 93
