Predicting Twitter User Socioeconomic Attributes with Network and Language Information

  • 2018-04-11 17:00:27
  • Nikolaos Aletras, Benjamin Paul Chamberlain
  • 7

Abstract

Inferring socioeconomic attributes of social media users such as occupation and income is an important problem in computational social science. Automated inference of such characteristics has applications in personalised recommender systems, targeted computational advertising and online political campaigning. While previous work has shown that language features can reliably predict socioeconomic attributes on Twitter, employing information coming from users' social networks has not yet been explored for such complex user characteristics. In this paper, we describe a method for predicting the occupational class and the income of Twitter users given information extracted from their extended networks by learning a low-dimensional vector representation of users, i.e. graph embeddings. We use this representation to train predictive models for occupational class and income. Results on two publicly available datasets show that our method consistently outperforms the state-of-the-art methods in both tasks. We also obtain further significant improvements when we combine graph embeddings with textual features, demonstrating that social network and language information are complementary.
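
As a rough illustration of combining graph embeddings with textual features, the sketch below concatenates a per-user graph embedding with a per-user text feature vector and fits a simple classifier on the result. The abstract does not say which predictive model or feature set the authors use, so the nearest-centroid classifier here is a stand-in, and every name and number (`combine`, `fit_centroids`, `predict`, `class_a`, `class_b`, the toy vectors) is hypothetical.

```python
from math import dist

def combine(graph_vec, text_vec):
    """Concatenate a user's graph embedding with their text feature vector."""
    return list(graph_vec) + list(text_vec)

def fit_centroids(features, labels):
    """Compute one mean vector (centroid) per class label."""
    sums, counts = {}, {}
    for vec, label in zip(features, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
            counts[label] = 0
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {label: [s / counts[label] for s in total]
            for label, total in sums.items()}

def predict(centroids, vec):
    """Return the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: dist(centroids[label], vec))

# Toy, made-up data: (graph embedding, text features, occupational class).
users = [
    ([0.9, 0.1], [1.0, 0.0], "class_a"),
    ([0.8, 0.2], [0.9, 0.1], "class_a"),
    ([0.1, 0.9], [0.0, 1.0], "class_b"),
    ([0.2, 0.8], [0.1, 0.9], "class_b"),
]
X = [combine(g, t) for g, t, _ in users]
y = [label for _, _, label in users]

centroids = fit_centroids(X, y)
prediction = predict(centroids, combine([0.85, 0.15], [0.95, 0.05]))
```

Concatenation is only one simple way to merge the two information sources; it treats network and language signals as independent blocks of one feature vector, which is enough to show why the two can be complementary inputs to a single model.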

 
