Non-Convex SGD Learns Halfspaces with Adversarial Label Noise

Christos Tzamos; Ilias Diakonikolas; Nikos Zarifis; Vasilis Kontonis

arxiv: 2006.06742 · v1 · pith:YTOBHRPXnew · submitted 2020-06-11 · 💻 cs.LG · stat.ML

Non-Convex SGD Learns Halfspaces with Adversarial Label Noise

Ilias Diakonikolas , Vasilis Kontonis , Christos Tzamos , Nikos Zarifis This is my paper

classification 💻 cs.LG stat.ML

keywords errormisclassificationdistributionshalfspacesnon-convexadversarialagnosticallybest-fitting

0 comments

read the original abstract

We study the problem of agnostically learning homogeneous halfspaces in the distribution-specific PAC model. For a broad family of structured distributions, including log-concave distributions, we show that non-convex SGD efficiently converges to a solution with misclassification error $O(\opt)+\eps$, where $\opt$ is the misclassification error of the best-fitting halfspace. In sharp contrast, we show that optimizing any convex surrogate inherently leads to misclassification error of $\omega(\opt)$, even under Gaussian marginals.

This paper has not been read by Pith yet.

Non-Convex SGD Learns Halfspaces with Adversarial Label Noise

discussion (0)