pith. sign in

Symmetrization of Loss Functions for Robust Training of Neural Networks in the Presence of Noisy Labels

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

Labeling a training set is often expensive and susceptible to errors, making the design of robust loss functions for label noise an important problem. The symmetry condition provides theoretical guarantees for robustness to such noise. In this work, we study a symmetrization method arising from the unique decomposition of any multi-class loss function into a symmetric component and a class-insensitive term. In particular, symmetrizing the cross-entropy loss leads to a linear multi-class extension of the unhinged loss. Unlike in the binary case, the multi-class version must have specific coefficients in order to satisfy the symmetry condition. Under suitable assumptions, we show that this multi-class unhinged loss is the unique convex multi-class symmetric loss. We also show that it has a fundamental local role: the linear approximation of any symmetric loss around score vectors with equal components is equivalent to the multi-class unhinged loss. We then introduce SGCE and alpha-MAE, two loss functions that interpolate between the multi-class unhinged loss and the Mean Absolute Error while allowing control of the beta-smoothness of the loss. Experiments on standard noisy-label benchmarks show competitive performance compared with existing robust loss functions.

fields

cs.LG 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Smoothness-Based Derandomization of PAC-Bayes Bounds

cs.LG · 2026-06-17 · unverdicted · novelty 6.0

Derives smoothness-based PAC-Bayes derandomization bounds for deterministic predictors using Rademacher complexity of the Jensen gap class, yielding Jacobian/Hessian flatness terms and a practical regularizer tested on CIFAR-10.

citing papers explorer

Showing 1 of 1 citing paper.

  • Smoothness-Based Derandomization of PAC-Bayes Bounds cs.LG · 2026-06-17 · unverdicted · none · ref 11 · internal anchor

    Derives smoothness-based PAC-Bayes derandomization bounds for deterministic predictors using Rademacher complexity of the Jensen gap class, yielding Jacobian/Hessian flatness terms and a practical regularizer tested on CIFAR-10.