Adapts multi-layer token-level Mahalanobis distance with supervised linear regression to yield improved uncertainty scores for LLM truthfulness tasks.
Title resolution pending
4 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 4representative citing papers
Introduces a unified framework integrating uncertainty estimation, calibration, and tool-based abstention for reliable code predictions in language models.
Proposes a two-stage on-the-fly input adaptation framework to reduce mispredictions in code language models across understanding tasks without retraining or additional supervision.
LLMs show improved accuracy on gastroenterology questions but remain overconfident in self-reported certainty across commercial, open-source, and quantized variants.
citing papers explorer
-
When to Answer and When to Defer: A Decision Framework for Reliable Code Predictions
Introduces a unified framework integrating uncertainty estimation, calibration, and tool-based abstention for reliable code predictions in language models.
-
On-the-Fly Input Adaptation for Reliable Code Intelligence
Proposes a two-stage on-the-fly input adaptation framework to reduce mispredictions in code language models across understanding tasks without retraining or additional supervision.