Explaining Predictions of Non-Linear Classifiers in NLP

Franziska Horn; Gr\'egoire Montavon; Klaus-Robert M\"uller; Leila Arras; Wojciech Samek

arxiv: 1606.07298 · v1 · pith:42JJE54Snew · submitted 2016-06-23 · 💻 cs.CL · cs.IR· cs.LG· cs.NE· stat.ML

Explaining Predictions of Non-Linear Classifiers in NLP

Leila Arras , Franziska Horn , Gr\'egoire Montavon , Klaus-Robert M\"uller , Wojciech Samek This is my paper

classification 💻 cs.CL cs.IRcs.LGcs.NEstat.ML

keywords predictionsanalysisexplainingclassifiersnon-lineartechniqueapplycategorization

0 comments

read the original abstract

Layer-wise relevance propagation (LRP) is a recently proposed technique for explaining predictions of complex non-linear classifiers in terms of input variables. In this paper, we apply LRP for the first time to natural language processing (NLP). More precisely, we use it to explain the predictions of a convolutional neural network (CNN) trained on a topic categorization task. Our analysis highlights which words are relevant for a specific prediction of the CNN. We compare our technique to standard sensitivity analysis, both qualitatively and quantitatively, using a "word deleting" perturbation experiment, a PCA analysis, and various visualizations. All experiments validate the suitability of LRP for explaining the CNN predictions, which is also in line with results reported in recent image classification studies.

This paper has not been read by Pith yet.

Explaining Predictions of Non-Linear Classifiers in NLP

discussion (0)