Publications – Information Sciences Institute

Detecting Troll Behavior via Inverse Reinforcement Learning: A Case Study of Russian Trolls in the 2016 US Election

Abstract

Since the 2016 US Presidential election, social media abuse has been eliciting massive concern in the academic community and beyond. Preventing and limiting the malicious activity of users, such as trolls and bots, in their manipulation campaigns is of paramount importance for the integrity of democracy, public health, and more. However, the automated detection of troll accounts is an open challenge. In this work, we propose an approach based on Inverse Reinforcement Learning (IRL) to capture troll behavior and identify troll accounts. We employ IRL to infer a set of online incentives that may steer user behavior, which in turn highlights behavioral differences between troll and non-troll accounts, enabling their accurate classification. As a study case, we consider the troll accounts identified by the US Congress during the investigation of Russian meddling in the 2016 US Presidential election. We report promising results: the IRL-based approach is able to accurately detect troll accounts (AUC= 89.1%). The differences in the predictive features between the two classes of accounts enables a principled understanding of the distinctive behaviors reflecting the incentives trolls and non-trolls respond to.

Metadata

publication: Proceedings of the 2020 International Conference of Web and Social Media (ICWSM), 2020
year: 2020
publication date: 2020/6/8
authors: Luca Luceri, Silvia Giordano, Emilio Ferrara
link: https://aaai.org/ojs/index.php/ICWSM/article/view/7311
resource_link: https://aaai.org/ojs/index.php/ICWSM/article/download/7311/7165
journal: Proceedings of the 2020 International Conference of Web and Social Media (ICWSM)