Learning Confidence for Out-of-Distribution Detection in Neural Networks

2018 · stat.ML · arXiv 1802.04865

12 Pith papers cite this work. Polarity classification is still indexing.

abstract

Modern neural networks are very powerful predictive models, but they are often incapable of recognizing when their predictions may be wrong. Closely related to this is the task of out-of-distribution detection, where a network must determine whether or not an input is outside of the set on which it is expected to safely perform. To jointly address these issues, we propose a method of learning confidence estimates for neural networks that is simple to implement and produces intuitively interpretable outputs. We demonstrate that on the task of out-of-distribution detection, our technique surpasses recently proposed techniques which construct confidence based on the network's output distribution, without requiring any additional labels or access to out-of-distribution examples. Additionally, we address the problem of calibrating out-of-distribution detectors, where we demonstrate that misclassified in-distribution examples can be used as a proxy for out-of-distribution examples.
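The calibration idea in the last sentence of the abstract can be illustrated with a minimal sketch. It assumes each held-out in-distribution example already has a scalar confidence score and a flag saying whether the classifier got it right; misclassified examples then stand in for out-of-distribution inputs when choosing a detection threshold. The function name `choose_threshold` and the balanced detection-error criterion are assumptions made for illustration, not details taken from this page.

```python
from typing import Sequence, Tuple


def choose_threshold(
    confidences: Sequence[float], correct: Sequence[bool]
) -> Tuple[float, float]:
    """Pick a confidence threshold using misclassified examples as an OOD proxy.

    Inputs with confidence >= threshold are accepted as in-distribution.
    Returns (threshold, detection_error), where detection_error is the mean of
    the rejection rate on correctly classified examples and the acceptance
    rate on misclassified ones (an assumed criterion for this sketch).
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")

    # Correctly classified examples play the role of in-distribution data;
    # misclassified examples play the role of out-of-distribution data.
    accepted_pool = [c for c, ok in zip(confidences, correct) if ok]
    proxy_ood_pool = [c for c, ok in zip(confidences, correct) if not ok]
    if not accepted_pool or not proxy_ood_pool:
        raise ValueError("need at least one correct and one misclassified example")

    best_threshold = None
    best_error = float("inf")
    # Only observed confidence values can change the decision, so sweep those.
    for threshold in sorted(set(confidences)):
        false_reject = sum(c < threshold for c in accepted_pool) / len(accepted_pool)
        false_accept = sum(c >= threshold for c in proxy_ood_pool) / len(proxy_ood_pool)
        error = 0.5 * (false_reject + false_accept)
        if error < best_error:
            best_threshold, best_error = threshold, error
    return best_threshold, best_error


if __name__ == "__main__":
    # Toy scores: three correct predictions with high confidence, two errors with low.
    scores = [0.9, 0.8, 0.7, 0.3, 0.2]
    is_correct = [True, True, True, False, False]
    print(choose_threshold(scores, is_correct))
```

The threshold chosen this way needs no out-of-distribution data at all, which is the point of the proxy; how well it transfers to real out-of-distribution inputs is an empirical question that the sketch does not answer.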

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

GEODE: Angle-Adaptive OOD Detection with Universal Scorer Compatibility

cs.LG · 2026-05-01 · unverdicted · novelty 7.0

GEODE uses per-sample cosine-similarity scaling in a norm loss to preserve feature geometry for universal scorer-compatible OOD detection, matching or exceeding OE performance on CIFAR benchmarks.

Knowing when to trust machine-learned interatomic potentials

cs.LG · 2026-05-01 · unverdicted · novelty 7.0

PROBE recasts MLIP uncertainty quantification as selective classification by training a compact discriminative classifier on frozen per-atom backbone embeddings, yielding a reliability probability that tracks actual error better than ensemble disagreement.

A Clinical Point Cloud Paradigm for In-Hospital Mortality Prediction from Multi-Level Incomplete Multimodal EHRs

cs.LG · 2026-04-06 · unverdicted · novelty 7.0

HealthPoint represents clinical events as points in a 4D space (content, time, modality, case) and applies low-rank relational attention to achieve state-of-the-art mortality prediction from multi-level incomplete multimodal EHRs.

Exploiting Local Flatness for Efficient Out-of-Distribution Detection

cs.LG · 2026-06-29 · unverdicted · novelty 6.0

Fold is a post-hoc OOD detector that exploits larger feature-Hessian curvature on OOD inputs together with partial feature normalization and a self-supervised AutoFold calibration scheme.

Invascal: Inverse-Vacuity Self-Calibration for Uncertainty-Aware LiDAR Range-View Semantic Segmentation

cs.RO · 2026-05-20 · unverdicted · novelty 6.0

Introduces an architecture-agnostic Adapter Head and Invascal self-calibration objective to produce calibrated evidential uncertainty estimates for LiDAR range-view semantic segmentation while preserving accuracy.

PVLM: Parsing-Aware Vision Language Model with Dynamic Contrastive Learning for Zero-Shot Deepfake Attribution

cs.CV · 2025-04-19 · unverdicted · novelty 6.0

PVLM combines parsing-aware vision-language modeling with dynamic contrastive learning to enable fine-grained zero-shot attribution of deepfakes to unseen generators and outperforms prior methods on a new benchmark.

Component-Based Out-of-Distribution Detection

cs.CV · 2026-04-23 · unverdicted · novelty 6.0

CoOD decomposes inputs into components and applies Component Shift Score plus Compositional Consistency Score to improve detection of both standard and compositional out-of-distribution data.

Rethinking Uncertainty in Segmentation: From Estimation to Decision

cs.CV · 2026-04-14 · unverdicted · novelty 6.0

Uncertainty optimization alone misses most safety gains; a decision-stage deferral policy removes up to 80% of segmentation errors at 25% pixel deferral with cross-dataset robustness, while calibration does not improve decision quality.

RankOOD -- Class Ranking-based Out-of-Distribution Detection

cs.LG · 2025-11-25 · unverdicted · novelty 5.0

RankOOD detects out-of-distribution samples by training a model to predict fixed class-specific ranking permutations via the Plackett-Luce loss, achieving a 4.3% FPR95 reduction on near-OOD TinyImageNet.

A Systematic Analysis of Out-of-Distribution Detection Under Representation and Training Paradigm Shifts

cs.LG · 2025-11-14 · unverdicted · novelty 5.0

Benchmark across architectures and shift regimes finds OOD detector rankings shift with representation collapse; proposes NC-based shortlist predictor and PCA filter without extra OOD data.

A deep learning pipeline for PAM50 subtype classification using histopathology images and multi-objective patch selection

cs.CV · 2026-04-02 · unverdicted · novelty 5.0

An optimization-based deep learning pipeline selects informative patches from H&E whole-slide images to classify breast cancer into PAM50 subtypes, achieving F1 scores of 0.88 internally and 0.80 externally.

Trust-Aware Predictive Emissions Monitoring for Gas Turbine Fleets with Limited Labelled Data

cs.LG · 2026-06-04 · unverdicted · novelty 4.0

A multi-head RNN framework with learned confidence, ensemble uncertainty, auxiliary predictions, distance analysis, and diagnostics produces calibrated trust scores for NOx prediction, reducing MAE from 0.202 to 0.070 on the top 10% confidence subset.

citing papers explorer

Showing 12 of 12 citing papers.

GEODE: Angle-Adaptive OOD Detection with Universal Scorer Compatibility
cs.LG · 2026-05-01 · unverdicted · none · ref 20
GEODE uses per-sample cosine-similarity scaling in a norm loss to preserve feature geometry for universal scorer-compatible OOD detection, matching or exceeding OE performance on CIFAR benchmarks.

Knowing when to trust machine-learned interatomic potentials
cs.LG · 2026-05-01 · unverdicted · none · ref 51
PROBE recasts MLIP uncertainty quantification as selective classification by training a compact discriminative classifier on frozen per-atom backbone embeddings, yielding a reliability probability that tracks actual error better than ensemble disagreement.

A Clinical Point Cloud Paradigm for In-Hospital Mortality Prediction from Multi-Level Incomplete Multimodal EHRs
cs.LG · 2026-04-06 · unverdicted · none · ref 34
HealthPoint represents clinical events as points in a 4D space (content, time, modality, case) and applies low-rank relational attention to achieve state-of-the-art mortality prediction from multi-level incomplete multimodal EHRs.

Exploiting Local Flatness for Efficient Out-of-Distribution Detection
cs.LG · 2026-06-29 · unverdicted · none · ref 12 · internal anchor
Fold is a post-hoc OOD detector that exploits larger feature-Hessian curvature on OOD inputs together with partial feature normalization and a self-supervised AutoFold calibration scheme.

Invascal: Inverse-Vacuity Self-Calibration for Uncertainty-Aware LiDAR Range-View Semantic Segmentation
cs.RO · 2026-05-20 · unverdicted · none · ref 22 · internal anchor
Introduces an architecture-agnostic Adapter Head and Invascal self-calibration objective to produce calibrated evidential uncertainty estimates for LiDAR range-view semantic segmentation while preserving accuracy.

PVLM: Parsing-Aware Vision Language Model with Dynamic Contrastive Learning for Zero-Shot Deepfake Attribution
cs.CV · 2025-04-19 · unverdicted · none · ref 37 · internal anchor
PVLM combines parsing-aware vision-language modeling with dynamic contrastive learning to enable fine-grained zero-shot attribution of deepfakes to unseen generators and outperforms prior methods on a new benchmark.

Component-Based Out-of-Distribution Detection
cs.CV · 2026-04-23 · unverdicted · none · ref 3
CoOD decomposes inputs into components and applies Component Shift Score plus Compositional Consistency Score to improve detection of both standard and compositional out-of-distribution data.

Rethinking Uncertainty in Segmentation: From Estimation to Decision
cs.CV · 2026-04-14 · unverdicted · none · ref 2
Uncertainty optimization alone misses most safety gains; a decision-stage deferral policy removes up to 80% of segmentation errors at 25% pixel deferral with cross-dataset robustness, while calibration does not improve decision quality.

RankOOD -- Class Ranking-based Out-of-Distribution Detection
cs.LG · 2025-11-25 · unverdicted · none · ref 6 · internal anchor
RankOOD detects out-of-distribution samples by training a model to predict fixed class-specific ranking permutations via the Plackett-Luce loss, achieving a 4.3% FPR95 reduction on near-OOD TinyImageNet.

A Systematic Analysis of Out-of-Distribution Detection Under Representation and Training Paradigm Shifts
cs.LG · 2025-11-14 · unverdicted · none · ref 2 · internal anchor
Benchmark across architectures and shift regimes finds OOD detector rankings shift with representation collapse; proposes NC-based shortlist predictor and PCA filter without extra OOD data.

A deep learning pipeline for PAM50 subtype classification using histopathology images and multi-objective patch selection
cs.CV · 2026-04-02 · unverdicted · none · ref 36
An optimization-based deep learning pipeline selects informative patches from H&E whole-slide images to classify breast cancer into PAM50 subtypes, achieving F1 scores of 0.88 internally and 0.80 externally.

Trust-Aware Predictive Emissions Monitoring for Gas Turbine Fleets with Limited Labelled Data
cs.LG · 2026-06-04 · unverdicted · none · ref 3 · internal anchor
A multi-head RNN framework with learned confidence, ensemble uncertainty, auxiliary predictions, distance analysis, and diagnostics produces calibrated trust scores for NOx prediction, reducing MAE from 0.202 to 0.070 on the top 10% confidence subset.
