Unsupervised Neural Text Simplification

  • 2019-01-10 11:43:46
  • Sai Surya, Abhijit Mishra, Anirban Laha, Parag Jain, Karthik Sankaranarayanan
The paper presents a first attempt towards unsupervised neural text simplification that relies only on unlabeled text corpora. The core framework is composed of a shared encoder and a pair of attentional decoders, and gains knowledge of simplification through discrimination-based losses and denoising. The framework is trained using unlabeled text collected from an en-Wikipedia dump. Our analysis (both quantitative and qualitative, involving human evaluators) on public test data shows that the proposed model can perform text simplification at both lexical and syntactic levels, competitive with existing supervised methods. Adding a few labelled pairs improves performance further. We open-source our implementation for academic use.
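
To make the denoising idea concrete, here is a minimal sketch of the kind of input corruption a denoising objective typically uses: the decoder is trained to reconstruct the original sentence from a corrupted copy. The abstract does not specify the noise function, so the word dropout and local shuffling below, along with the name `add_noise` and the parameters `drop_prob` and `max_shift`, are illustrative assumptions rather than the authors' implementation.

```python
import random


def add_noise(tokens, drop_prob=0.1, max_shift=3, seed=None):
    """Corrupt a token list for a denoising objective (hypothetical helper).

    Assumed noise model, not taken from the paper:
      1. drop each token independently with probability `drop_prob`;
      2. shuffle the survivors locally by sorting on index + random offset.
    """
    rng = random.Random(seed)

    # Step 1: word dropout. Keep at least one token for non-empty input.
    kept = [t for t in tokens if rng.random() >= drop_prob]
    if not kept and tokens:
        kept = [tokens[0]]

    # Step 2: local shuffle. Each token's sort key is its position plus a
    # random offset in [0, max_shift], so tokens only move short distances.
    keys = [i + rng.uniform(0, max_shift) for i in range(len(kept))]
    order = sorted(range(len(kept)), key=lambda i: keys[i])
    return [kept[i] for i in order]


if __name__ == "__main__":
    sentence = "the framework is trained using unlabeled text".split()
    noisy = add_noise(sentence, seed=0)
    # A denoising autoencoder would be trained to map `noisy` back to `sentence`.
    print(noisy)
```

In such a setup the training pair would be (`noisy`, `sentence`), so the model cannot succeed by copying its input and has to learn something about sentence structure instead.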


