Learning Dynamic Author Representations with Temporal Language Models

  • 2019-09-11 11:51:43
  • Edouard Delasalles, Sylvain Lamprier, Ludovic Denoyer
  • 4

Abstract

Language models are at the heart of numerous works, notably in the textmining and information retrieval communities. These statistical models aim atextracting word distributions, from simple unigram models to recurrentapproaches with latent variables that capture subtle dependencies in texts.However, those models are learned from word sequences only, and authors'identities, as well as publication dates, are seldom considered. We propose aneural model, based on recurrent language modeling, which aims at capturinglanguage diffusion tendencies in author communities through time. Byconditioning language models with author and temporal vector states, we areable to leverage the latent dependencies between the text contexts. This allowsus to beat several temporal and non-temporal language baselines on tworeal-world corpora, and to learn meaningful author representations that varythrough time.

 

Quick Read (beta)

loading the full paper ...