Abstract
Deep learning methods have recently achieved great empirical success on machine translation, dialogue response generation, summarization, and other text generation tasks. At a high level, the technique has been to train end-to-end neural network models consisting of an encoder model to produce a hidden representation of the source text, followed by a decoder model to generate the target. While such models have significantly fewer pieces than earlier systems, significant tuning is still required to achieve good performance. For text generation models in particular, the decoder can behave in undesired ways, such as by generating truncated or repetitive outputs, outputting bland and generic responses, or in some cases producing ungrammatical gibberish. This paper is intended as a practical guide for resolving such undesired behavior in text generation models, with the aim of helping enable real-world applications.