NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks

Avery Nortonsmith; D. Anthony Bau; Fahim Dalvi; Hassan Sajjad; James Glass; Nadir Durrani; Yonatan Belinkov

arXiv: 1812.09359 · v1 · submitted 2018-12-21 · cs.CL

keywords: model · neurons · toolkit · neural · them · understanding · ablate · accuracy

We present a toolkit to facilitate the interpretation and understanding of neural network models. The toolkit provides several methods to identify salient neurons with respect to the model itself or an external task. A user can visualize selected neurons, ablate them to measure their effect on model accuracy, and manipulate them to control the behavior of the model at test time. Such analysis has the potential to serve as a springboard for various research directions, such as understanding the model, making better architectural choices, model distillation, and controlling data biases.
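
To make the ablation idea in the abstract concrete, here is a minimal sketch in plain Python. It is not the NeuroX API: the function names, the toy linear classifier, and the data are all hypothetical, and zeroing a neuron's activation is only one assumed way to ablate it, since the abstract does not say how ablation is carried out.

```python
from typing import List, Sequence


def predict(activations: Sequence[float], weights: Sequence[Sequence[float]]) -> int:
    """Return the index of the highest-scoring class for one example."""
    scores = [sum(w * a for w, a in zip(row, activations)) for row in weights]
    return max(range(len(scores)), key=scores.__getitem__)


def ablate(activations: Sequence[float], neurons: Sequence[int]) -> List[float]:
    """Return a copy of the activations with the chosen neurons set to zero.

    Zeroing is an assumption made for this sketch, not a documented method.
    """
    masked = list(activations)
    for i in neurons:
        masked[i] = 0.0
    return masked


def accuracy(data, labels, weights, ablated: Sequence[int] = ()) -> float:
    """Fraction of examples classified correctly after ablating `ablated`."""
    correct = sum(
        predict(ablate(x, ablated), weights) == y for x, y in zip(data, labels)
    )
    return correct / len(labels)


# Hypothetical toy setup: 3 "neurons" feeding a 2-class linear classifier.
weights = [
    [1.0, 0.0, 0.1],  # class 0 relies mostly on neuron 0
    [0.0, 1.0, 0.1],  # class 1 relies mostly on neuron 1
]
data = [
    [2.0, 0.5, 1.0],
    [0.3, 1.5, 1.0],
    [1.8, 0.2, 0.5],
    [0.1, 2.2, 0.7],
]
labels = [0, 1, 0, 1]

baseline = accuracy(data, labels, weights)
without_neuron0 = accuracy(data, labels, weights, ablated=[0])
without_neuron2 = accuracy(data, labels, weights, ablated=[2])

print(f"baseline accuracy:          {baseline:.2f}")
print(f"accuracy without neuron 0:  {without_neuron0:.2f}")
print(f"accuracy without neuron 2:  {without_neuron2:.2f}")
```

In this toy setup, removing neuron 0 lowers accuracy while removing neuron 2 leaves it unchanged. That is the kind of comparison one would use to judge a neuron's salience with respect to a task.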
