A statistically significant result (i.e., a small p-value) implies that we can reject the null hypothesis that the data follows a normal distribution

is a standard statistical procedure that assesses normality by quantifying deviations in sample skewness, kurtosis from that of a normal distribution · 2007

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Reward Auditor: Inference on Reward Modeling Suitability in Real-World Perturbed Scenarios

cs.CL · 2025-11-30 · unverdicted · novelty 6.0 · 2 refs

Reward Auditor is a statistical framework that audits reward models for suitability by testing for significant degradation in preference perception confidence distributions under real-world perturbations.

citing papers explorer

Showing 1 of 1 citing paper.

Reward Auditor: Inference on Reward Modeling Suitability in Real-World Perturbed Scenarios cs.CL · 2025-11-30 · unverdicted · none · ref 1 · 2 links
Reward Auditor is a statistical framework that audits reward models for suitability by testing for significant degradation in preference perception confidence distributions under real-world perturbations.

A statistically significant result (i.e., a small p-value) implies that we can reject the null hypothesis that the data follows a normal distribution

fields

years

verdicts

representative citing papers

citing papers explorer