Title resolution pending

Chuan Guo, Geoff Pleiss, Yu Sun, Kilian Q · 2017

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

When Accuracy Is Not Enough: Uncertainty Collapse between Noisy Label Learning and Out-of-Distribution Detection

cs.LG · 2026-05-18 · unverdicted · novelty 6.0

High accuracy in noisy-label learning does not guarantee OOD detection reliability due to uncertainty collapse, and Virtual Margin Regularization offers partial mitigation.

NeuroState-Bench: A Human-Calibrated Benchmark for Commitment Integrity in LLM Agent Profiles

cs.AI · 2026-05-03 · unverdicted · novelty 6.0 · 2 refs

NeuroState-Bench supplies human-calibrated tasks and probes that measure commitment integrity in LLM agents and shows this measure diverges from ordinary task success.

citing papers explorer

Showing 2 of 2 citing papers.

When Accuracy Is Not Enough: Uncertainty Collapse between Noisy Label Learning and Out-of-Distribution Detection cs.LG · 2026-05-18 · unverdicted · none · ref 22
High accuracy in noisy-label learning does not guarantee OOD detection reliability due to uncertainty collapse, and Virtual Margin Regularization offers partial mitigation.
NeuroState-Bench: A Human-Calibrated Benchmark for Commitment Integrity in LLM Agent Profiles cs.AI · 2026-05-03 · unverdicted · none · ref 8 · 2 links
NeuroState-Bench supplies human-calibrated tasks and probes that measure commitment integrity in LLM agents and shows this measure diverges from ordinary task success.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer