Pandora's Regret is a closed-form pairwise scoring rule derived from expected optimal search costs that elicits true probabilities and outperforms log loss, accuracy, and F1 at predicting diagnostic costs on MedMNIST models.
Journal of the Royal Statistical Society Series B , author=
23 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
representative citing papers
A nudged-system optimization method recovers parameters in the Lorenz-63 system from partial noisy observations, with theoretical guarantees on synchronization and identifiability.
A distributional regression network acts as a backward operator to produce uncertainty-quantified, multivariate Gaussian retrievals of cloud properties from six solar channels for data assimilation.
Generalized conformal predictive systems are extended to non-exchangeable settings under distributional shifts via permutation weights and robust weight-uncertainty boxes with finite-sample or asymptotic guarantees.
Neural network-parameterized regression splines enable joint optimization of forecast quality and stability in distribution-free probabilistic time series models by penalizing dissimilarities from forecast updates.
Stochastic Attention adds calibrated uncertainty to transformer foundation models through inference-time multinomial sampling of attention weights and univariate post-hoc tuning of a concentration parameter.
Finite ensemble sizes cause systematic slope attenuation in conditional reliability diagnostics for means, spreads, and probabilities; analytical expressions and practical estimators correct for this bias.
A Bayesian hierarchical model integrates coherence penalization and level-specific focus into forecasting estimation, yielding improved predictive accuracy on simulated and Australian tourism data.
A deep learning method amortizes probabilistic XCO2 retrieval from OCO-2 spectra via Laplace approximations and normalizing flows, trained on simulations with model errors to achieve faster inference and better-calibrated uncertainties than operational solvers.
PliableBVS is a new Bayesian hierarchical spike-and-slab model for simultaneous selection of high-dimensional main effects and interactions under an asymmetric weak hierarchical constraint, shown to outperform pliable lasso in simulations.
For binary LLM judge validation, Pearson's r, Spearman's ρ, Kendall's τ_b, phi, and Matthews correlation all equal a single number on non-degenerate data, Cohen's κ supplies the extra signal on label-rate drift, and a reporting checklist is provided.
Rashomon-seeded annealing repurposes Rashomon sets as warm starts for annealed importance sampling to enable full posterior inference in factorial designs without exhaustive enumeration.
Proposes adaptive multiple importance sampling for robust Bayesian model evidence estimation under parameter non-identifiability, shown to outperform deterministic methods on ecological case studies while being cheaper than MCMC.
HCM estimates uncertainty in neural network outputs by quantifying violation of a unit hypersphere constraint on the normalized direction vector.
Embedding selection mechanisms into generative simulators enables amortized Bayesian inference to produce debiased, well-calibrated posteriors without tractable likelihoods.
Zero-shot TSFMs conditioned on leakage-safe covariates from Google Trends and an institutional index forecast commencing enrolments competitively with classical methods under data sparsity.
EnScale emulates high-resolution regional climate model outputs from global circulation models for multiple variables using a two-step generative process with sparse local stochastic layers and energy score optimization, including a temporally consistent variant.
A new error-damping estimator for compositional score matching enables stable amortized inference on hierarchical Bayesian models with over 750,000 parameters using fewer than one full model simulation on large problems.
A model-agnostic conformal selection method reformulates CATE-based beneficiary identification as multiple testing with RCT-calibrated p-values and FDR control, allowing external data for model training.
Proposes PcovRnnp method enabling simultaneous dimension reduction and regularized coefficient estimation via nuclear norm penalty in high-dimensional settings.
Develops a restricted MCAR model via reparameterization to measure and control informativeness in multivariate spatial modeling of health events across subgroups.
A cost-sensitive trigger using specification debt for deciding when to re-specify forecasting model forms, shown on M4 data to match full-update accuracy at 28% of the compute cost.
rcosmo is an R package offering functions to convert and analyze geographic, point pattern, and star-shaped spherical data in HEALPix format with ready-to-use code examples.
citing papers explorer
-
EnScale: Temporally-consistent multivariate generative downscaling via proper scoring rules
EnScale emulates high-resolution regional climate model outputs from global circulation models for multiple variables using a two-step generative process with sparse local stochastic layers and energy score optimization, including a temporally consistent variant.
-
Compositional amortized inference for large-scale hierarchical Bayesian models
A new error-damping estimator for compositional score matching enables stable amortized inference on hierarchical Bayesian models with over 750,000 parameters using fewer than one full model simulation on large problems.