Abstract
The dominant approach to unsupervised "style transfer" in text is based on the idea of learning a latent representation, which is independent of the attributes specifying its "style". In this paper, we show that this condition is not necessary and is not always met in practice, even with domain adversarial training that explicitly aims at learning such disentangled representations. We thus propose a new model that controls several factors of variation in textual data where this condition on disentanglement is replaced with a simpler mechanism based on back-translation. Our method allows control over multiple attributes, like gender, sentiment, product type, etc., and a more fine-grained control on the trade-off between content preservation and change of style with a pooling operator in the latent space. Our experiments demonstrate that the fully entangled model produces better generations, even when tested on new and more challenging benchmarks comprising reviews with multiple sentences and multiple attributes.