IIT Home Page CNR Home Page

Extremist Propaganda Tweet Classification with Deep Learning in Realistic Scenarios

In this work, we tackled the problem of the automatic classification of the extremist propaganda on Twitter, focusing on the Islamic State of Iraq and al-Sham (ISIS). We built and published several datasets, obtained by mixing 15,684 ISIS propaganda tweets with a variable number of neutral tweets, related to ISIS, and random ones, accounting for imbalances up to 1%. We considered three state-of-the-art, deep learning techniques, representative of the main current approaches to text classification, and two strong linear machine learning baselines. We compared their performance when varying the composition of the training and test sets, in order to explore different training strategies, and to evaluate the results when approaching realistic conditions. We demonstrated that a Recurrent-Convolutional Neural Network, based on pre-trained word embeddings, can reach an excellent F1 score of 0.9 on the most challenging test condition (1% imbalance).

Proceedings of the 11th International ACM Conference on Web Science (WebSci'19), Boston, USA, 2019

External authors: Marco Avvenuti (University of Pisa)
IIT authors:

Type: Contributo in atti di convegno
Field of reference: Computer Science & Engineering

File: Nizzoli, 2019, Extremist Propaganda Tweet Classification with Deep Learning in Realistic Scenarios.pdf

Activity: Social Media Analysis