Interpreting Recurrent and Attention-Based Neural Models: a Case Study on Natural Language Inference

Abstract

Deep learning models have achieved remarkable success in natural languageinference (NLI) tasks. While these models are widely explored, they are hard tointerpret and it is often unclear how and why they actually work. In thispaper, we take a step toward explaining such deep learning based models througha case study on a popular neural model for NLI. In particular, we propose tointerpret the intermediate layers of NLI models by visualizing the saliency ofattention and LSTM gating signals. We present several examples for which ourmethods are able to reveal interesting insights and identify the criticalinformation contributing to the model decisions.

Quick Read (beta)

loading the full paper ...