Twitter Corpus of the #BlackLivesMatter Movement And Counter Protests: 2013 to 2020

  • 2020-09-28 16:20:16
  • Salvatore Giorgi, Sharath Chandra Guntuku, Muhammad Rahman, McKenzie Himelein-Wachowiak, Amy Kwarteng, Brenda Curtis
  • 0

Abstract

Black Lives Matter (BLM) is a grassroots movement protesting violence towardsBlack individuals and communities with a focus on police brutality. Themovement has gained significant media and political attention following thekillings of Ahmaud Arbery, Breonna Taylor, and George Floyd and the shooting ofJacob Blake in 2020. Due to its decentralized nature, the #BlackLivesMattersocial media hashtag has come to both represent the movement and been used as acall to action. Similar hashtags have appeared to counter the BLM movement,such as #AllLivesMatter and #BlueLivesMatter. We introduce a data set of 41.8million tweets from 10 million users which contain one of the followingkeywords: BlackLivesMatter, AllLivesMatter and BlueLivesMatter. This data setcontains all currently available tweets from the beginning of the BLM movementin 2013 to June 2020. We summarize the data set and show temporal trends in useof both the BlackLivesMatter keyword and keywords associated with countermovements. In the past, similarly themed, though much smaller in scope, BLMdata sets have been used for studying discourse in protest and counter protestmovements, predicting retweets, examining the role of social media in protestmovements and exploring narrative agency. This paper open-sources a large-scaledata set to facilitate research in the areas of computational social science,communications, political science, natural language processing, and machinelearning.

 

Quick Read (beta)

loading the full paper ...