Experiments on Detecting Fake News using Machine Learning Algorithms
AUTHORS
Harika Kudarvalli,Department of Computer Science, Lakehead University, Canada
Jinan Fiaidhi*,Department of Computer Science, Lakehead University, Canada
ABSTRACT
Spreading fake news has become a serious issue in the current social media world. It is broadcasted with dishonest intentions to mislead people. This has caused many unfortunate incidents in different countries. The most recent one was the latest presidential elections where the voters were misleading to support a leader. Twitter is a popular social media platform where it represents the gateway for real time news. We extracted real time data on multiple domains through twitter and performed analysis. The dataset was preprocessed and user verified column played a vital role. Multiple machine algorithms were then performed on the extracted features from preprocessed dataset. Logistic Regression and Support Vector Machine had promising results with both above 92% accuracy. Naive Bayes and Long-Short Term memory didn’t achieve desired accuracies. The model can also be applied to images and videos for better detection of fake news.
KEYWORDS
Fake news, Twitter, Machine learning, Detection system
REFERENCES
[1] Meyer Robinson, “The grim conclusions of the largest-ever study of fake news,” The Atlantic 8 (2018)
[2] Joshkelliott, “One hantavirus death in China sparks ‘hysteria’ over old disease,” Retrieved from https://globalnews.ca/news/6724399/hantavirus-china-death-coronavirus/, (2020)
[3] CDC – Hantavirus, https://www.cdc.gov/hantavirus/index.html, (2019)
[4] M. Aldwairi and A. Alwahedi, “Detecting fake news in social media networks,” Procedia Comput. Sci., vol.141, pp.215–222, (2018)
[5] Abdullah-All-Tanvir, E. M. Mahir, S. Akhter, and M. R. Huq, “Detecting fake news using machine learning and deep learning algorithms,” 2019 7th International Conference on Smart Computing & Communications (ICSCC), Sarawak, Malaysia, Malaysia, pp.1-5, (2019)
[6] http://www.fakenewschallenge.org/
[7] D. Katsaros, G. Stavropoulos, and D. Papakostas, “Which machine learning paradigm for fake news detection?” 2019 IEEE/WIC/ACM International Conference on Web Intelligence (WI), Thessaloniki, Greece, pp.383-387, (2019)
[8] H. Ahmed, I. Traore, and S. Saad, “Detection of online fake news using n-gram analysis and machine learning techniques,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), (2017)
[9] Google Trends. 2020. Google Trends. https://trends.google.com/trends/?geo=US, (2020)
[10] Help.twitter.com, About Verified Accounts. https://help.twitter.com/en/managing-your-account/about-twitter-verified-accounts, (2020)
[11] O. Aborisade and M. Anwar, “Classification for authorship of tweets by comparing logistic regression and naive bayes classifiers,” 2018 IEEE International Conference on Information Reuse and Integration (IRI), Salt Lake City, UT, pp.269-276, (2018)
[12] H. Rashkin, E. Choi, J. Y. Jang, S. Volkova, and Y. Choi, “Truth of varying shades: Analyzing language in fake news and political fact-checking,” EMNLP 2017 - Conf. Empir. Methods Nat. Lang. Process. Proc., pp.2931–2937, (2017)
[13] G. Cauwenberghs and T. Poggio, “Incremental and decrementai support vector machine learning,” Adv. Neural Inf. Process. Syst., no.x, (2001)