Journal of Business and Economic Statistics , volume =

James E · 2018 · arXiv 0015.2016

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Life After Benchmark Saturation: A Case Study of CORE-Bench

cs.AI · 2026-06-23 · unverdicted · novelty 6.0

Using CORE-Bench as a case study, the paper shows that saturated benchmarks can still deliver insights on efficiency, reliability, model-scaffold differences, and human collaboration even after accuracy plateaus, and introduces improved benchmark versions plus a small randomized experiment demonstra

Foundation Models for Credit Risk Prediction: A Game Changer?

cs.LG · 2026-05-18 · unverdicted · novelty 5.0

Tabular foundation models outperform standard methods in credit risk PD and LGD tasks, with larger gains on smaller datasets when used out-of-the-box.

Which Small-Sample Correction Should Be Used When Analyzing Stepped-Wedge Designs with Time-Varying Treatment Effects?

stat.ME · 2026-04-20 · unverdicted · novelty 4.0

Simulations recommend the Mancl-DeRouen correction with t-distribution for continuous outcomes and the Morel-Bokossa-Neerchal estimator for binary outcomes in ETI models for SW-CRTs, while long-term effect estimates remain unstable.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Life After Benchmark Saturation: A Case Study of CORE-Bench cs.AI · 2026-06-23 · unverdicted · none · ref 48
Using CORE-Bench as a case study, the paper shows that saturated benchmarks can still deliver insights on efficiency, reliability, model-scaffold differences, and human collaboration even after accuracy plateaus, and introduces improved benchmark versions plus a small randomized experiment demonstra
Foundation Models for Credit Risk Prediction: A Game Changer? cs.LG · 2026-05-18 · unverdicted · none · ref 128
Tabular foundation models outperform standard methods in credit risk PD and LGD tasks, with larger gains on smaller datasets when used out-of-the-box.
Which Small-Sample Correction Should Be Used When Analyzing Stepped-Wedge Designs with Time-Varying Treatment Effects? stat.ME · 2026-04-20 · unverdicted · none · ref 37
Simulations recommend the Mancl-DeRouen correction with t-distribution for continuous outcomes and the Morel-Bokossa-Neerchal estimator for binary outcomes in ETI models for SW-CRTs, while long-term effect estimates remain unstable.

Journal of Business and Economic Statistics , volume =

fields

years

verdicts

representative citing papers

citing papers explorer