[Demo] NeuroX Library for Neuron Analysis of Deep NLP Models

Fahim Dalvi, Hassan Sajjad, Nadir Durrani

Demo: Interpretability and Analysis of Models for NLP (demo) Demo Paper

Demo Session 5: Interpretability and Analysis of Models for NLP (demo) (Poster)
Conference Room: Frontenac Ballroom and Queen's Quay
Conference Time: July 11, 16:15-17:45 (EDT) (America/Toronto)
Global Time: July 11, Demo Session 5 (20:15-21:45 UTC)
Abstract: Neuron analysis provides insights into how knowledge is structured in representations and reveals the role of individual neurons in the network. Beyond improving our understanding of models, neuron analysis enables applications such as debiasing, domain adaptation, and architectural search. We present NeuroX, a comprehensive open-source toolkit for conducting neuron analysis of natural language processing models. It implements various interpretation methods under a unified API and provides a framework for data processing and evaluation, making it easier for researchers and practitioners to perform neuron analysis. The Python toolkit is available at https://www.github.com/fdalvi/NeuroX. A demo video is available at https://youtu.be/mLhs2YMx4u8
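
To make the idea of neuron analysis concrete, the following is a minimal, standard-library-only sketch of one simple approach: ranking neurons by how strongly their activations correlate with a labeled property. It does not use or reproduce the NeuroX API; the function names (`pearson`, `rank_neurons`) and the toy data are assumptions made purely for illustration, and correlation ranking is only one of many possible interpretation methods.

```python
import random
from statistics import mean, pstdev


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = mean(xs), mean(ys)
    sx, sy = pstdev(xs), pstdev(ys)
    if sx == 0 or sy == 0:
        return 0.0
    cov = mean((x - mx) * (y - my) for x, y in zip(xs, ys))
    return cov / (sx * sy)


def rank_neurons(activations, labels):
    """Return neuron indices sorted from most to least correlated with the labels.

    activations: list of rows, one row of per-neuron values for each example.
    labels: list of numeric labels (here 0/1), one per example.
    """
    n_neurons = len(activations[0])
    scores = []
    for j in range(n_neurons):
        column = [row[j] for row in activations]
        scores.append((abs(pearson(column, labels)), j))
    return [j for _, j in sorted(scores, reverse=True)]


# Hypothetical toy data: 4 "neurons", of which only neuron 2 tracks the label.
rng = random.Random(0)
labels = [rng.randint(0, 1) for _ in range(200)]
activations = [
    [rng.gauss(0, 1), rng.gauss(0, 1), 2.0 * y + rng.gauss(0, 0.1), rng.gauss(0, 1)]
    for y in labels
]

ranking = rank_neurons(activations, labels)
print("Neurons ranked by relevance to the label:", ranking)
```

In practice the activations would be extracted from a trained NLP model rather than sampled at random, and a toolkit such as NeuroX would handle extraction, probing, and evaluation; this sketch only illustrates the ranking step in isolation.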