Mixed citations

Title resolution pending

Tilmann Gneiting, Adrian E Raftery · 2007 · Journal of the American Statistical Association · DOI 10.1198/016214506000001437

Mixed citation behavior. Most common role is background (60%).

22 Pith papers citing it

3,941 external citations · Crossref



Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 3 · method 2

citation-polarity summary

background 3 · use method 2

representative citing papers

Pandora's Regret: A Proper Scoring Rule for Evaluating Sequential Search

cs.LG · 2026-05-03 · conditional · novelty 8.0

Pandora's Regret is a closed-form pairwise scoring rule derived from expected optimal search costs that elicits true probabilities and outperforms log loss, accuracy, and F1 at predicting diagnostic costs on MedMNIST models.

Valid and Expressive Copulas for Irregular Multivariate Time Series

cs.LG · 2026-05-22 · unverdicted · novelty 7.0

CopFITi is the first marginalization-consistent copula for irregular multivariate time series, using normalizing flows for marginals and a Gaussian mixture copula for dependencies to reach new state-of-the-art joint density modeling.

When Individually Calibrated Models Become Collectively Miscalibrated

cs.LG · 2026-05-14 · conditional · novelty 7.0

Individually calibrated predictors become collectively miscalibrated under Brier-optimal strategic responses with positive belief correlations, but VCG aggregation restores dominant-strategy incentive compatibility and near-optimal performance.

Text-to-Distribution Prediction with Quantile Tokens and Neighbor Context

cs.CL · 2026-04-22 · unverdicted · novelty 7.0

Quantile tokens inserted into LLM inputs combined with neighbor retrieval enable direct prediction of full distributions, yielding lower MAPE and narrower intervals than baselines on Airbnb and StackSample tasks.

Improving ecological inference and uncertainty quantification from camera trap data through the fusion of AI confidences and manual annotations

stat.AP · 2026-05-13 · unverdicted · novelty 6.0

A Bayesian data-fusion model combines AI predictions and manual labels from camera traps to yield improved ecological inference and uncertainty quantification for white-tailed deer body condition.

Scenario generation of intraday electricity price paths for optimal trading in continuous markets

stat.AP · 2026-05-13 · unverdicted · novelty 6.0

A kernel-based regression model plus scenario generation from forecast errors and a new Support Vector Sorting step produces ensemble price trajectories that improve both statistical accuracy and trading profits over benchmarks on German intraday continuous market data.

Multi-Quantile Regression for Extreme Precipitation Downscaling

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

Q-SRDRN multi-quantile network with pinball loss and per-quantile heads detects extreme precipitation events up to 18 times more effectively than deterministic baselines while preserving augmentation benefits for the median.

The Endogeneity of Miscalibration: Impossibility and Escape in Scored Reporting

cs.GT · 2026-05-08 · unverdicted · novelty 6.0

Non-affine approval functions create unavoidable miscalibration in proper scoring rules for strategic agents, but step-function thresholds enable first-best screening without it, uniquely for the Brier score.

Bayesian Modeling and Prediction of Generalized Contact Matrices

stat.ME · 2026-05-07 · unverdicted · novelty 6.0

A Bayesian model for multi-feature contact matrices that uses tensor structures and contingency table theory to satisfy structural constraints and impute missing contact features, validated on simulations and US/German survey data.

Perturbation is All You Need for Extrapolating Language Models

stat.ML · 2026-05-05 · unverdicted · novelty 6.0

Perturbing prefixes to semantic neighbors during training creates a hierarchical noise model that improves language model predictions on token sequences outside the training corpus support.

Honest Reporting in Scored Oversight: True-KL0 Property via the Prekopa Principle

cs.GT · 2026-05-05 · conditional · novelty 6.0

For heterogeneous power-p pseudospherical scoring rules with d ≤ 4, the True-KL0 property R(M,p,d) < 1 holds for all M > 1, establishing unconditional DSIC via a Prekopa-based log-concavity argument on the loss integral.

CERBERUS: A Three-Headed Decoder for Vertical Cloud Profiles

physics.ao-ph · 2026-04-09 · unverdicted · novelty 6.0

CERBERUS uses a three-headed encoder-decoder to predict zero-inflated probabilistic vertical radar reflectivity profiles from satellite and meteorological inputs.

HealDA: Highlighting the importance of initial errors in end-to-end AI weather forecasts

physics.ao-ph · 2026-01-25 · conditional · novelty 6.0

HealDA supplies ML-based initial conditions for AI weather models that produce forecasts trailing ERA5-initialized runs by less than one day of effective lead time, with the skill gap arising mainly from initial error size.

Controlling False Discovery in Arbitrarily Structured Hypothesis Spaces via Reproducing Kernels

stat.ME · 2026-05-17 · unverdicted · novelty 5.0

A kernel-based regularized learning framework for FDR control that unifies arbitrary structures and supplies provably valid decision rules with likelihood-based tuning.

Soft Learning

cs.LG · 2026-05-16 · unverdicted · novelty 5.0

Soft Learning optimally combines heterogeneous ML specialists via cross-validated non-negative least squares, achieving top performance on 70% of 37 datasets with formal guarantees and 72-435x CPU speedups over deep networks.

A Tale of Two Variances: When Single-Seed Benchmarks Fail in Bayesian Deep Learning

cs.LG · 2026-04-25 · unverdicted · novelty 5.0

Single-seed CRPS estimates in limited-data BDL show high variance and peaks for heteroscedastic methods, with local variance correlating above 0.96 to single-seed error.

Unstable Rankings in Bayesian Deep Learning Evaluation

cs.LG · 2026-04-25 · unverdicted · novelty 5.0

Bayesian deep learning method rankings are unstable at small sample sizes, dataset-dependent, and require uncertainty-aware evaluation using hierarchical models and minimum detectable difference curves.

Evaluating the Quality of the Quantified Uncertainty for (Re)Calibration of Data-Driven Regression Models

cs.LG · 2025-08-25 · conditional · novelty 5.0

Systematic benchmarking reveals that regression calibration metrics frequently disagree on recalibration quality, with ENCE and CWC identified as more consistent performers.

Achieving Skilled and Reliable Daily Probabilistic Forecasts of Wind Power at Subseasonal-to-Seasonal Timescales over France

cs.LG · 2025-11-20 · unverdicted · novelty 4.0

A post-processing pipeline applied to ECMWF subseasonal ensembles produces calibrated daily wind power forecasts for France that improve on climatology by 5-15% in CRPS up to 16 days ahead.

Scalable model selection for count time series with structural breaks: application to solid-organ transplantation during and after COVID-19 in the USA and Italy

stat.AP · 2026-05-07 · conditional · novelty 3.0

Standard count time series models with pandemic break indicators applied to US and Italian transplant data capture COVID deviations, show deceased-donor recovery to baselines, and find auxiliary COVID covariates add negligible predictive value beyond autoregressive and calendar terms.

ECUAS$_n$: A family of metrics for principled evaluation of uncertainty-augmented systems

cs.AI · 2026-05-19 · 2 refs

A Penalty-Free Pipeline for Direct Quantum-Annealer Portfolio Optimization

quant-ph · 2026-05-17

citing papers explorer

Showing 22 of 22 citing papers.

Pandora's Regret: A Proper Scoring Rule for Evaluating Sequential Search
cs.LG · 2026-05-03 · conditional · none · ref 192
Pandora's Regret is a closed-form pairwise scoring rule derived from expected optimal search costs that elicits true probabilities and outperforms log loss, accuracy, and F1 at predicting diagnostic costs on MedMNIST models.

Valid and Expressive Copulas for Irregular Multivariate Time Series
cs.LG · 2026-05-22 · unverdicted · none · ref 44
CopFITi is the first marginalization-consistent copula for irregular multivariate time series, using normalizing flows for marginals and a Gaussian mixture copula for dependencies to reach new state-of-the-art joint density modeling.

When Individually Calibrated Models Become Collectively Miscalibrated
cs.LG · 2026-05-14 · conditional · none · ref 60
Individually calibrated predictors become collectively miscalibrated under Brier-optimal strategic responses with positive belief correlations, but VCG aggregation restores dominant-strategy incentive compatibility and near-optimal performance.

Text-to-Distribution Prediction with Quantile Tokens and Neighbor Context
cs.CL · 2026-04-22 · unverdicted · none · ref 45
Quantile tokens inserted into LLM inputs combined with neighbor retrieval enable direct prediction of full distributions, yielding lower MAPE and narrower intervals than baselines on Airbnb and StackSample tasks.

Improving ecological inference and uncertainty quantification from camera trap data through the fusion of AI confidences and manual annotations
stat.AP · 2026-05-13 · unverdicted · none · ref 4
A Bayesian data-fusion model combines AI predictions and manual labels from camera traps to yield improved ecological inference and uncertainty quantification for white-tailed deer body condition.

Scenario generation of intraday electricity price paths for optimal trading in continuous markets
stat.AP · 2026-05-13 · unverdicted · none · ref 7
A kernel-based regression model plus scenario generation from forecast errors and a new Support Vector Sorting step produces ensemble price trajectories that improve both statistical accuracy and trading profits over benchmarks on German intraday continuous market data.

Multi-Quantile Regression for Extreme Precipitation Downscaling
cs.LG · 2026-05-12 · unverdicted · none · ref 11
Q-SRDRN multi-quantile network with pinball loss and per-quantile heads detects extreme precipitation events up to 18 times more effectively than deterministic baselines while preserving augmentation benefits for the median.

The Endogeneity of Miscalibration: Impossibility and Escape in Scored Reporting
cs.GT · 2026-05-08 · unverdicted · none · ref 27
Non-affine approval functions create unavoidable miscalibration in proper scoring rules for strategic agents, but step-function thresholds enable first-best screening without it, uniquely for the Brier score.

Bayesian Modeling and Prediction of Generalized Contact Matrices
stat.ME · 2026-05-07 · unverdicted · none · ref 97
A Bayesian model for multi-feature contact matrices that uses tensor structures and contingency table theory to satisfy structural constraints and impute missing contact features, validated on simulations and US/German survey data.

Perturbation is All You Need for Extrapolating Language Models
stat.ML · 2026-05-05 · unverdicted · none · ref 59
Perturbing prefixes to semantic neighbors during training creates a hierarchical noise model that improves language model predictions on token sequences outside the training corpus support.

Honest Reporting in Scored Oversight: True-KL0 Property via the Prekopa Principle
cs.GT · 2026-05-05 · conditional · none · ref 16
For heterogeneous power-p pseudospherical scoring rules with d ≤ 4, the True-KL0 property R(M,p,d) < 1 holds for all M > 1, establishing unconditional DSIC via a Prekopa-based log-concavity argument on the loss integral.

CERBERUS: A Three-Headed Decoder for Vertical Cloud Profiles
physics.ao-ph · 2026-04-09 · unverdicted · none · ref 1
CERBERUS uses a three-headed encoder-decoder to predict zero-inflated probabilistic vertical radar reflectivity profiles from satellite and meteorological inputs.

HealDA: Highlighting the importance of initial errors in end-to-end AI weather forecasts
physics.ao-ph · 2026-01-25 · conditional · none · ref 58
HealDA supplies ML-based initial conditions for AI weather models that produce forecasts trailing ERA5-initialized runs by less than one day of effective lead time, with the skill gap arising mainly from initial error size.

Controlling False Discovery in Arbitrarily Structured Hypothesis Spaces via Reproducing Kernels
stat.ME · 2026-05-17 · unverdicted · none · ref 12
A kernel-based regularized learning framework for FDR control that unifies arbitrary structures and supplies provably valid decision rules with likelihood-based tuning.

Soft Learning
cs.LG · 2026-05-16 · unverdicted · none · ref 55
Soft Learning optimally combines heterogeneous ML specialists via cross-validated non-negative least squares, achieving top performance on 70% of 37 datasets with formal guarantees and 72-435x CPU speedups over deep networks.

A Tale of Two Variances: When Single-Seed Benchmarks Fail in Bayesian Deep Learning
cs.LG · 2026-04-25 · unverdicted · none · ref 6
Single-seed CRPS estimates in limited-data BDL show high variance and peaks for heteroscedastic methods, with local variance correlating above 0.96 to single-seed error.

Unstable Rankings in Bayesian Deep Learning Evaluation
cs.LG · 2026-04-25 · unverdicted · none · ref 7
Bayesian deep learning method rankings are unstable at small sample sizes, dataset-dependent, and require uncertainty-aware evaluation using hierarchical models and minimum detectable difference curves.

Evaluating the Quality of the Quantified Uncertainty for (Re)Calibration of Data-Driven Regression Models
cs.LG · 2025-08-25 · conditional · none · ref 16
Systematic benchmarking reveals that regression calibration metrics frequently disagree on recalibration quality, with ENCE and CWC identified as more consistent performers.

Achieving Skilled and Reliable Daily Probabilistic Forecasts of Wind Power at Subseasonal-to-Seasonal Timescales over France
cs.LG · 2025-11-20 · unverdicted · none · ref 18
A post-processing pipeline applied to ECMWF subseasonal ensembles produces calibrated daily wind power forecasts for France that improve on climatology by 5-15% in CRPS up to 16 days ahead.

Scalable model selection for count time series with structural breaks: application to solid-organ transplantation during and after COVID-19 in the USA and Italy
stat.AP · 2026-05-07 · conditional · none · ref 10
Standard count time series models with pandemic break indicators applied to US and Italian transplant data capture COVID deviations, show deceased-donor recovery to baselines, and find auxiliary COVID covariates add negligible predictive value beyond autoregressive and calendar terms.

ECUAS$_n$: A family of metrics for principled evaluation of uncertainty-augmented systems
cs.AI · 2026-05-19 · unreviewed · ref 32 · 2 links

A Penalty-Free Pipeline for Direct Quantum-Annealer Portfolio Optimization
quant-ph · 2026-05-17 · unreviewed · ref 12
