To search, Click below search items.


All Published Papers Search Service


An Ensemble Approach to Detect Fake News Spreaders on Twitter


Muhammad Nabeel Sarwar, Riaz UlAmin and Sidra Jabeen


Vol. 22  No. 5  pp. 294-302


Detection of fake news is a complex and a challenging task. Generation of fake news is very ?hard to stop, only steps to control its circulation may help in minimizing its impacts. Humans tend to ?believe in misleading false information. Researcher started ?with social media sites to categorize in terms of ?real or fake news. False information misleads any individual or an organization that may cause of big ?failure and any financial loss. Automatic system for detection of false information circulating on social ?media is an emerging area of research. It is gaining attention of both industry and academia since US ?presidential elections 2016. Fake news has negative and severe effects on individuals and organizations ?elongating its hostile effects on the society. Prediction of fake news in timely manner is important. This ?research focuses on detection of fake news spreaders. In this context, overall, 6 models are developed ?during this research, trained and tested with dataset of PAN 2020. Four approaches N-gram based; user ?statistics-based models are trained with different values of hyper parameters. Extensive grid search with ?cross validation is applied in each machine learning model. In N-gram based models, out of numerous ?machine learning models this research focused on better results yielding algorithms, assessed by deep ?reading of state-of-the-art related work in the field. For better accuracy, author aimed at developing ?models using Random Forest, Logistic Regression, SVM, and XGBoost. All four machine learning ?algorithms were trained with cross validated grid search hyper parameters. Advantages of this research ?over previous work is user statistics-based model and then ensemble learning model. Which were designed ?in a way to help classifying Twitter users as fake news spreader or not with highest reliability. User ?statistical model used 17 features, on the basis of which it categorized a Twitter user as malicious. New ?dataset based on predictions of machine learning models was constructed. And then Three techniques of ?simple mean, logistic regression and random forest in combination with ensemble model is applied. Logistic ?regression combined in ensemble model gave best training and testing results, achieving an accuracy of ??72%.?


Fake news detection, Fake news Spreaders, Ensemble learning, Statistical Model, Feature Analysis of fake news spreaders, identification of twitter users.