Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections

Armando Solar-Lezama; Rishabh Singh; Xin Zhang

arxiv: 1802.07384 · v2 · pith:AM7BIZD5new · submitted 2018-02-21 · 💻 cs.LG · cs.AI· stat.ML

Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections

Xin Zhang , Armando Solar-Lezama , Rishabh Singh This is my paper

classification 💻 cs.LG cs.AIstat.ML

keywords networkneuraloutputwhetheralgorithmcorrectioncorrectionsdrawing

0 comments

read the original abstract

We present a new algorithm to generate minimal, stable, and symbolic corrections to an input that will cause a neural network with ReLU activations to change its output. We argue that such a correction is a useful way to provide feedback to a user when the network's output is different from a desired output. Our algorithm generates such a correction by solving a series of linear constraint satisfaction problems. The technique is evaluated on three neural network models: one predicting whether an applicant will pay a mortgage, one predicting whether a first-order theorem can be proved efficiently by a solver using certain heuristics, and the final one judging whether a drawing is an accurate rendition of a canonical drawing of a cat.

This paper has not been read by Pith yet.

Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections

discussion (0)