Abstract
Digital media enables fast sharing not only of information, but also of disinformation. One prominent case of an event leading to the circulation of disinformation on social media is the MH17 plane crash. Studies analysing the spread of information about this event on Twitter have focused on small, manually annotated datasets, or used proxies for data annotation. In this work, we examine to what extent text classifiers can be used to label data for subsequent content analysis; in particular, we focus on predicting pro-Russian and pro-Ukrainian Twitter content related to the MH17 plane crash. Even though we find that a neural classifier improves over a hashtag-based baseline, labeling pro-Russian and pro-Ukrainian content with high precision remains a challenging problem. We provide an error analysis underlining the difficulty of the task and identify factors that might help improve classification in future work. Finally, we show how the classifier can facilitate the annotation task for human annotators.