Runsewe, Ife, Latifi, Majid ORCID: https://orcid.org/0000-0002-2671-0516, Ahsan, Mominul ORCID: https://orcid.org/0000-0002-7300-506X and Haider, Julfikar ORCID: https://orcid.org/0000-0001-7010-8285 (2024) Machine learning for predicting key factors to identify misinformation in football transfer news. Computers, 13 (6). 127.
|
Published Version
Available under License Creative Commons Attribution. Download (5MB) | Preview |
Abstract
The spread of misinformation in football transfer news has become a growing concern. To address this challenge, this study introduces a novel approach by employing ensemble learning techniques to identify key factors for predicting such misinformation. The performance of three ensemble learning models, namely Random Forest, AdaBoost, and XGBoost, was analyzed on a dataset of transfer rumours. Natural language processing (NLP) techniques were employed to extract structured data from the text, and the veracity of each rumor was verified using factual transfer data. The study also investigated the relationships between specific features and rumor veracity. Key predictive features such as a player’s market value, age, and timing of the transfer window were identified. The Random Forest model outperformed the other two models, achieving a cross-validated accuracy of 95.54%. The top features identified by the model were a player’s market value, time to the start/end of the transfer window, and age. The study revealed weak negative relationships between a player’s age, time to the start/end of the transfer window, and rumor veracity, suggesting that for older players and times further from the transfer window, rumors are slightly less likely to be true. In contrast, a player’s market value did not have a statistically significant relationship with rumor veracity. This study contributes to the existing knowledge of misinformation detection and ensemble learning techniques. Despite some limitations, this study has significant implications for media agencies, football clubs, and fans. By discerning the credibility of transfer news, stakeholders can make informed decisions, reduce the spread of misinformation, and foster a more transparent transfer market.
Impact and Reach
Statistics
Additional statistics for this dataset are available via IRStats2.