Duridi, Tasneem, Eleyan, Derar, Eleyan, Amna ORCID: https://orcid.org/0000-0002-2025-3027 and Bejaoui, Tarek (2024) Arabic Fake News Detection Using Machine Learning Approach. In: 2024 International Symposium on Networks, Computers and Communications (ISNCC), 22 October 2024 - 25 October 2024, Washington DC, USA.
|
Accepted Version
Available under License In Copyright. Download (202kB) | Preview |
Abstract
In the contemporary digital realm, the widespread dissemination of false Arabic news is a significant social concern laden with various risks. Recognizing the seriousness of this issue, our research utilizes cutting-edge technologies, specifically Machine Learning, to distinguish between authentic Arabic news and deceptive counterparts. The consequences of propagating misinformation go beyond compromising social cohesion; they extend to the erosion of digital information's credibility, fostering an atmosphere of mistrust and deception that undermines the very foundations of societal bonds. Through the application of contemporary technologies, this study aims to identify and underscore Arabic news that embodies such risks. The intricate characteristics of Arabic language morphology, marked by words carrying multiple meanings based on inflectional forms and the prevalence of numerous diacritical marks, intensify the challenges of text classification. Despite contending with these linguistic intricacies, modern natural language processing approaches offer practical solutions. Notably, our methodology relies on the preprocessing of two available datasets for training and testing, a crucial step for seamlessly integrating a range of Machine Learning techniques, including Random Forest (RF), Logistic Regression (LR), Decision Tree (DT), and Multinomial Naive Bayes (NB). Importantly, the Logistic Regression technique emerged as the most effective, achieving an accuracy of 95.92% in the SANAD dataset for discerning nuanced Arabic news
Impact and Reach
Statistics
Additional statistics for this dataset are available via IRStats2.