Unsupervised Neural Text Simplification

Abstract

The paper presents a first attempt towards unsupervised neural textsimplification that relies only on unlabeled text corpora. The core frameworkis composed of a shared encoder and a pair of attentional-decoders and gainsknowledge of simplification through discrimination based-losses and denoising.The framework is trained using unlabeled text collected from en-Wikipedia dump.Our analysis (both quantitative and qualitative involving human evaluators) ona public test data shows that the proposed model can performtext-simplification at both lexical and syntactic levels, competitive toexisting supervised methods. Addition of a few labelled pairs also improves theperformance further. We open source our implementation for academic use.

Quick Read (beta)

loading the full paper ...