DeGroot and Stephen E

Morris H · 1983 · DOI 10.2307/2987588

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

open at publisher browse 5 citing papers

representative citing papers

When Individually Calibrated Models Become Collectively Miscalibrated

cs.LG · 2026-05-14 · conditional · novelty 7.0

Individually calibrated predictors become collectively miscalibrated under Brier-optimal strategic responses with positive belief correlations, but VCG aggregation restores dominant-strategy incentive compatibility and near-optimal performance.

RubricRefine: Improving Tool-Use Agent Reliability with Training-Free Pre-Execution Refinement

cs.LG · 2026-05-10 · unverdicted · novelty 7.0 · 3 refs

RubricRefine is a training-free pre-execution method that creates rubrics to score and fix inter-tool contract violations in agent code, reaching 0.86 average on M3ToolEval across seven models with zero executions and lower latency.

Too Sharp, Too Sure: When Calibration Follows Curvature

cs.LG · 2026-04-22 · unverdicted · novelty 7.0

Calibration error tracks curvature via shared margin-dependent exponential tails; a margin-aware objective improves out-of-sample calibration across optimizers.

The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?

cs.AI · 2026-01-30 · unverdicted · novelty 6.0

AI model failures on complex tasks become increasingly incoherent with longer reasoning chains, making consistent misalignment less likely than chaotic errors as capabilities scale.

Do Small Language Models Know When They're Wrong? Confidence-Based Cascade Scoring for Educational Assessment

cs.CY · 2026-03-29 · unverdicted · novelty 4.0

Verbalized confidence from small LMs enables cost-effective cascade routing for automated educational scoring, matching large-model accuracy at 76% lower cost when discrimination is strong.

citing papers explorer

Showing 5 of 5 citing papers.

When Individually Calibrated Models Become Collectively Miscalibrated cs.LG · 2026-05-14 · conditional · none · ref 54
Individually calibrated predictors become collectively miscalibrated under Brier-optimal strategic responses with positive belief correlations, but VCG aggregation restores dominant-strategy incentive compatibility and near-optimal performance.
RubricRefine: Improving Tool-Use Agent Reliability with Training-Free Pre-Execution Refinement cs.LG · 2026-05-10 · unverdicted · none · ref 33 · 3 links
RubricRefine is a training-free pre-execution method that creates rubrics to score and fix inter-tool contract violations in agent code, reaching 0.86 average on M3ToolEval across seven models with zero executions and lower latency.
Too Sharp, Too Sure: When Calibration Follows Curvature cs.LG · 2026-04-22 · unverdicted · none · ref 2
Calibration error tracks curvature via shared margin-dependent exponential tails; a margin-aware objective improves out-of-sample calibration across optimizers.
The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity? cs.AI · 2026-01-30 · unverdicted · none · ref 5
AI model failures on complex tasks become increasingly incoherent with longer reasoning chains, making consistent misalignment less likely than chaotic errors as capabilities scale.
Do Small Language Models Know When They're Wrong? Confidence-Based Cascade Scoring for Educational Assessment cs.CY · 2026-03-29 · unverdicted · none · ref 30
Verbalized confidence from small LMs enables cost-effective cascade routing for automated educational scoring, matching large-model accuracy at 76% lower cost when discrimination is strong.

DeGroot and Stephen E

fields

years

verdicts

representative citing papers

citing papers explorer