Abstract
An advantage of seq2seq abstractive summarization models is that they generate text in a free-form manner, but this flexibility makes it difficult to interpret model behavior. In this work, we analyze summarization decoders in both blackbox and whitebox ways by studying the entropy, or uncertainty, of the model's token-level predictions. For two strong pre-trained models, PEGASUS and BART, on two summarization datasets, we find a strong correlation between low prediction entropy and where the model copies tokens rather than generating novel text. The decoder's uncertainty also connects to factors like sentence position and syntactic distance between adjacent pairs of tokens, giving a sense of what factors make a context particularly selective for the model's next output token. Finally, we study the relationship between decoder uncertainty and attention behavior to understand how attention gives rise to these observed effects in the model. We show that uncertainty is a useful perspective for analyzing summarization and text generation models more broadly.
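As a minimal sketch of the quantity analyzed here, the snippet below computes the entropy of a single next-token distribution, assuming the standard Shannon definition H(p) = -Σ p_i log p_i. The helper names `softmax` and `token_entropy` and the toy logits are hypothetical illustrations, not the authors' code or any model's real outputs.

```python
import math

def softmax(logits):
    """Turn raw decoder logits into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def token_entropy(probs):
    """Shannon entropy (in nats) of one next-token distribution."""
    # Zero-probability entries contribute nothing and are skipped.
    return -sum(p * math.log(p) for p in probs if p > 0.0)

# Toy 4-token vocabulary (assumed values, for illustration only).
peaked = softmax([10.0, 0.0, 0.0, 0.0])  # one token dominates
flat = softmax([0.0, 0.0, 0.0, 0.0])     # all tokens equally likely

print(token_entropy(peaked) < token_entropy(flat))
```

A sharply peaked distribution has low entropy and a uniform one has the maximum, log of the vocabulary size. The copying behavior described above is associated with the low-entropy end of this range.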