Detecting Fake News about Covid-19 on Small Datasets with Machine Learning Algorithms

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Nowadays the problem of fake news in social media is dramatically increasing, especially when it refers to fake news about Covid-19, as it is a recent and global problem. Because of this fact, it is important to have the ability to detect and delete such news immediately. In our research we concentrate our efforts on detecting fake news about Coronavirus on small datasets, using the Constraint-2021 corpus: the full dataset (10,700 messages) and the limited dataset (1,000 messages). We compare classical Machine Learning Algorithms (4 algorithms: Logistic Regression, Support Vectors Machine, Gradient Boosting, Random Forest) - algorithms of classification from the Scikit-learn library, GMDH-Shell tool (2 algorithms: Combi and Neuro), and Deep Neural Network (LSTM model). The results show that GMDH algorithms outperform traditional Machine Learning Algorithms and are comparable with Neural Networks model's results on the limited dataset.

Original languageEnglish
Title of host publicationProceedings of the 30th Conference of Open Innovations Association FRUCT, FRUCT 2021
EditorsJuha Roning, Tatiana Shatalova
PublisherIEEE Computer Society
Pages253-258
Number of pages6
ISBN (Electronic)9789526924465
DOIs
Publication statusPublished - 2021
Event30th Conference of Open Innovations Association FRUCT, FRUCT 2021 - Oulu, Finland
Duration: 27 Oct 202129 Oct 2021

Publication series

NameConference of Open Innovation Association, FRUCT
Volume2021-October
ISSN (Print)2305-7254

Conference

Conference30th Conference of Open Innovations Association FRUCT, FRUCT 2021
Country/TerritoryFinland
CityOulu
Period27/10/2129/10/21

Fingerprint

Dive into the research topics of 'Detecting Fake News about Covid-19 on Small Datasets with Machine Learning Algorithms'. Together they form a unique fingerprint.

Cite this