Detecting Bots on Russian Political Twitter
Abstract
Automated and semiautomated Twitter accounts, bots, have recently gained significant public attention due to their potential interference in the political realm. In this study, we develop a methodology for detecting bots on Twitter using an ensemble of classifiers and apply it to study bot activity within political discussions in the Russian Twittersphere. We focus on the interval from February 2014 to December 2015, an especially consequential period in Russian politics. Among accounts actively Tweeting about Russian politics, we find that on the majority of days, the proportion of Tweets produced by bots exceeds 50%. We reveal bot characteristics that distinguish them from humans in this corpus, and find that the software platform used for Tweeting is among the best predictors of bots. Finally, we find suggestive evidence that one prominent activity that bots were involved in on Russian political Twitter is the spread of news stories and promotion of media who produce them.
Get full access to this article
View all available purchase options and get full access to this article.
References
1.
Howard P, Hussain M. Democracy's fourth wave? Digital media and the Arab Spring. Oxford, UK: Oxford University Press, 2013.
2.
Barber P. How social media reduces mass political polarization. Evidence from Germany, Spain, and the U.S. In: Presented at the Annual Meeting of the American Political Science Association, San Francisco, CA, 2015.
3.
Thompson A. Journalists and Trump voters live in separate online bubbles, MIT analysis shows. Vice News, December 2016.
4.
Kollanyi B, Howard P, Woolley S. Bots and automation over Twitter during the U.S. election. COMPROP Data Memo, November 2016.
5.
Bessi A, Ferrara E. Social bots distort the 2016 U.S. Presidential election online discussion. First Monday. 2016;21. Available online at: http://firstmonday.org/article/view/7090/5653
6.
Oentaryo R, Murdopo A, Prasetyo P, Lim E-P. On profiling bots in social media. Soc Informat. 2016;10046:92–109.
7.
Ford H, Dubois E, Puschmann C. Keeping Ottawa honest one Tweet at a time? Politicians, Journalists, Wikipedians and their Twitter bots. Int J Commun. 2016;10:4891–4914.
8.
Sanovich S, Stukal D, Tucker J. Turning the virtual tables: Government strategies for addressing online opposition with an application to Russia. Comparative Politics 2018.
9.
Woolley S. Automating power: Social bot interference in global politics. First Monday. 2016;21. Available online at: http://firstmonday.org/article/view/6161/5300
10.
Guriev S, Treisman D. How modern dictators survive: Cooptation, censorship, propaganda, and repression. 2015.
11.
Munger K, Bonneau R, Jost J, et al. Elites Tweet to get feet off the streets: Measuring regime social media strategies during protest. 2017.
12.
Sanovich S. Computational propaganda in Russia: The origins of digital misinformation. 2017.
13.
Lasswell H, Lerner D, Hans S. Propaganda and communication in world history: Volume I. The symbolic instrument in early times. Honolulu, HI: The University Press of Hawaii, 1979.
14.
Lasswell H, Lerner D, Hans S. Propaganda and communication in world history: Volume II. Emergence of public opinion in the west. Honolulu, HI: The University Press of Hawaii, 1980.
15.
Kallis A. Nazi propaganda and the Second World War. London: Palgrave Macmillan, 2005.
16.
Berkhoff K. Motherland in danger: Soviet propaganda during World War II. Cambridge: Harvard University Press, 2012.
17.
Welch D. Nazi propaganda: The power and the limitations. London: Routledge, 2014.
18.
Chen A. The agency. The New York Times, June 2, 2015.
19.
Snegovaya M. Putin's Information Warfare in Ukraine: Soviet Origins of Russia's Hybrid Warfare. Washington, DC: Institute for the Study of War, 2015.
20.
Pomerantsev P. Inside the Kremlin's hall of mirrors. The Guardian, April 9, 2015.
21.
Applebaum A, Lucas E. The danger of Russian disinformation. The Washington Post, May 6, 2016.
22.
Ananyev M, Sobolev A. Fantastic beasts and whether they matter: Do Internet “Trolls” influence political conversations in Russia? In: Presented at Midwest Political Science Association, Chicago, IL, April 2017.
23.
Remnick D, Yaffa J, Osnos E. Trump, Putin, and the New Cold War. The New Yorker (Annals of Diplomacy), March 2017.
24.
Davis C, Varol O, Ferrara E, et al. BotOrNot: A system to evaluate social bots. In: Proceedings of the 25th International Conference on World Wide Web Companion, Montréal, Québec, Canada, April 11–15, 2016. pp. 273–274.
25.
Nimmo B. #BotSpot: Twelve Ways to Spot a Bot. DFRLab. 2017. Available online at: https://medium.com/dfrlab/botspot-twelve-ways-to-spot-a-bot-aedc7d9c110c
26.
Zeller T Jr. Gaming the search engine, in a political season. The New York Times, November 6, 2006.
27.
Dickerson J, Kagan V, Subrahmanian VS. Using sentiment to detect bots on Twitter: Are humans more opinionated than bots? In: Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), Beijing, China, August 17–20, 2014. pp. 620–627.
28.
Cresci S, Pietro RD, Petrocchi M, et al. The paradigm-shift of social spambots: Evidence, theories, and tools for the arms race. In: Proceedings of the 26th International Conference on World Wide Web Companion, Perth, Australia, April 3–7, 2017. pp. 963–972.
29.
Subrahmanian VS, Azaria A, Durst S, et al. The DARPA Twitter bot challenge. Computer. 2016;49:38–46.
30.
Cresci S, Pietro RD, Petrocchi M, et al. DNA-inspired online behavioral modeling and its application to spambot detection. IEEE Intell Syst. 2016;31:58–64.
31.
Chavoshi N, Hamooni H, Mueen A. Identifying correlated bots in Twitter. Soc Informat. 2016;10047:14–21.
32.
Ratkiewicz J, Conover M, Meiss M, et al. Detecting and tracking political abuse in social media. In: Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media (CWSM), July 17–21, 2011, Barcelona, Spain, pp. 297–304.
33.
Chu Z, Gianvecchio S, Wang H, Jajodia S. Detecting automation of Twitter accounts: Are you a human, bot, or cyborg? IEEE Trans Depend Sec Comput. 2012;9:811–824.
34.
Warwick K, Shah H. Human misidentification in Turing tests. J Exp Theor Artif Intell. 2015;27:123–135.
35.
Choi S-S, Cha S-H, Tappert C. A survey of binary similarity and distance measures. J Syst Cybern Inform. 2010;8:43–48.
36.
Weiss G. Mining with rarity: A unifying framework. ACM SIGKDD Explor Newslett. 2004;6:7–19.
37.
Weiss G, Provost F. Learning when training data are costly: The effect of class distribution on tree induction. J Artif Intell Res. 2003;19:315–354.
38.
Sun Y, Wong A, Kamel M. Classification of imbalanced data: A review. Int J Pattern Recogn Artif Intell. 2009;23:687–719.
39.
Kuncheva L. A theoretical study on six classifier fusion strategies. IEEE Trans Pattern Anal Mach Intell. 2002;24:281–286.
40.
van Erp M, Vuurpijl L, Schomaker L. An overview and comparison of voting methods for pattern recognition. In: Proceedings of the Eighth International Workshop on Frontiers in Handwriting Recognition (IWFHR), Ontario, Canada, August 6–8, 2002. pp. 195–200.
41.
Lin X, Yacoub S, Burns J, Simske S. Performance analysis of pattern classifier combination by plurality voting. Pattern Recogn Lett. 2003;24:1959–1969.
42.
Lam L, Suen C. Application of majority voting to pattern recognition: An analysis of its behavior and performance. IEEE Trans Syst Man Cybernet A. 1997;27:55-3.
43.
Kuncheva L, Rodrguez J. A weighted voting framework for classifiers ensembles. Knowl Inf Syst. 2014;38:259–275.
44.
Zhu J, Hastie T, Rosset S, Zou H. Multi-class AdaBoost. Stat Interface. 2009;2:349–360.
References
Cite this article as: Stukal D, Sanovich S, Bonneau R, Tucker JA (2017) Detecting bots on Russian political Twitter. Big Data 5:4, 310–324, DOI: 10.1089/big.2017.0038.
Information & Authors
Information
Published In

Copyright
Copyright 2017, Mary Ann Liebert, Inc.
History
Published in print: December 2017
Published online: 1 December 2017
Topics
Authors
Authors' Contributions
Stukal, Sanovich, Bonneau, and Tucker designed the study. Stukal developed the code for all the analyses and prepared the first draft of the article. Stukal recruited and Sanovich trained human coders. Both Sanovich and Stukal supervised human coders. Bonneau and Tucker oversaw the data collection process. All the authors participated in revising and editing of the article.
Author Disclosure Statement
No competing financial interests exist.
Metrics & Citations
Metrics
Citations
Export Citation
Export citation
Select the format you want to export the citations of this publication.
View Options
Access content
To read the fulltext, please use one of the options below to sign in or purchase access.⚠ Society Access
If you are a member of a society that has access to this content please log in via your society website and then return to this publication.