NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks

Fahim Dalvi; Avery Nortonsmith; Anthony Bau; Yonatan Belinkov; Hassan Sajjad; Nadir Durrani; James Glass

doi:10.1609/aaai.v33i01.33019851

Authors

Fahim Dalvi, Hamad Bin Khalifa University
Avery Nortonsmith, Massachusetts Institute of Technology
Anthony Bau, Massachusetts Institute of Technology
Yonatan Belinkov, Massachusetts Institute of Technology
Hassan Sajjad, Hamad Bin Khalifa University
Nadir Durrani, Hamad Bin Khalifa University
James Glass, Massachusetts Institute of Technology

Abstract

We present a toolkit to facilitate the interpretation and understanding of neural network models. The toolkit provides several methods to identify salient neurons with respect to the model itself or an external task. A user can visualize selected neurons, ablate them to measure their effect on model accuracy, and manipulate them to control the behavior of the model at test time. Such analysis has the potential to serve as a springboard for various research directions, such as understanding the model, making better architectural choices, distilling models, and controlling data biases. The toolkit is available for download.
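
To make the ablation idea in the abstract concrete, the following is a minimal sketch and not the NeuroX API. It assumes a toy linear classifier over three "neuron" activations, treats ablation as removing a neuron's contribution to the score, and ranks neurons by the accuracy drop that their ablation causes. All names here (predict, accuracy, rank_neurons_by_ablation, DATA, LABELS, WEIGHTS) are hypothetical and chosen only for illustration.

```python
from typing import FrozenSet, List, Sequence

# Hypothetical toy setup: each row holds the activations of 3 neurons.
# Neuron 0 carries the label signal, neuron 1 is weak noise, neuron 2 is unused.
DATA: List[List[float]] = [
    [2.0, 0.5, 1.0],
    [1.5, -0.5, -1.0],
    [-2.0, 0.5, 1.0],
    [-1.5, -0.5, -1.0],
]
LABELS: List[int] = [1, 1, 0, 0]
WEIGHTS: List[float] = [1.0, 0.1, 0.0]


def predict(activations: Sequence[float], weights: Sequence[float],
            ablated: FrozenSet[int] = frozenset()) -> int:
    """Classify one example, skipping the contribution of ablated neurons."""
    score = sum(w * a
                for i, (w, a) in enumerate(zip(weights, activations))
                if i not in ablated)
    return 1 if score > 0 else 0


def accuracy(data: Sequence[Sequence[float]], labels: Sequence[int],
             weights: Sequence[float],
             ablated: FrozenSet[int] = frozenset()) -> float:
    """Fraction of examples classified correctly under the given ablation."""
    correct = sum(predict(x, weights, ablated) == y
                  for x, y in zip(data, labels))
    return correct / len(labels)


def rank_neurons_by_ablation(data: Sequence[Sequence[float]],
                             labels: Sequence[int],
                             weights: Sequence[float]) -> List[int]:
    """Order neurons from largest to smallest accuracy drop when ablated alone."""
    base = accuracy(data, labels, weights)
    drops = [(base - accuracy(data, labels, weights, frozenset({i})), i)
             for i in range(len(weights))]
    # Sort by descending drop; ties are broken by neuron index.
    return [i for _, i in sorted(drops, key=lambda t: (-t[0], t[1]))]


if __name__ == "__main__":
    print("baseline accuracy:", accuracy(DATA, LABELS, WEIGHTS))
    print("neuron 0 ablated:", accuracy(DATA, LABELS, WEIGHTS, frozenset({0})))
    print("ranking:", rank_neurons_by_ablation(DATA, LABELS, WEIGHTS))
```

In this toy setting, removing the single informative neuron reduces accuracy to chance while removing either of the others leaves it unchanged, which is the kind of contrast an ablation study is meant to surface. A real analysis would ablate neurons inside a trained network rather than in a hand-built linear scorer.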

Published

2019-07-17

How to Cite

Dalvi, F., Nortonsmith, A., Bau, A., Belinkov, Y., Sajjad, H., Durrani, N., & Glass, J. (2019). NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 9851-9852. https://doi.org/10.1609/aaai.v33i01.33019851

Issue

Vol. 33 No. 01: AAAI-19, IAAI-19, EAAI-20

Section

Demonstration Track
