A General Theory for Structured Prediction with Smooth Convex Surrogates
read the original abstract
In this work we provide a theoretical framework for structured prediction that generalizes the existing theory of surrogate methods for binary and multiclass classification based on estimating conditional probabilities with smooth convex surrogates (e.g. logistic regression). The theory relies on a natural characterization of structural properties of the task loss and allows to derive statistical guarantees for many widely used methods in the context of multilabeling, ranking, ordinal regression and graph matching. In particular, we characterize the smooth convex surrogates compatible with a given task loss in terms of a suitable Bregman divergence composed with a link function. This allows to derive tight bounds for the calibration function and to obtain novel results on existing surrogate frameworks for structured prediction such as conditional random fields and quadratic surrogates.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Generalization Bounds of Surrogate Policies for Combinatorial Optimization Problems
Derives generalization bounds for perturbed surrogate policies in combinatorial optimization by decomposing excess risk into perturbation bias controlled by fan-crossing probability, statistical estimation error, and ...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.