pith. sign in

arxiv: 1811.12290 · v2 · pith:FQCV5WIEnew · submitted 2018-11-29 · 📡 eess.AS · cs.LG· cs.SD· stat.ML

Tuplemax Loss for Language Identification

classification 📡 eess.AS cs.LGcs.SDstat.ML
keywords lossidentificationlanguagelanguagestuplemaxerrorfunctionrate
0
0 comments X
read the original abstract

In many scenarios of a language identification task, the user will specify a small set of languages which he/she can speak instead of a large set of all possible languages. We want to model such prior knowledge into the way we train our neural networks, by replacing the commonly used softmax loss function with a novel loss function named tuplemax loss. As a matter of fact, a typical language identification system launched in North America has about 95% users who could speak no more than two languages. Using the tuplemax loss, our system achieved a 2.33% error rate, which is a relative 39.4% improvement over the 3.85% error rate of standard softmax loss method.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.