e-space
Manchester Metropolitan University's Research Repository

Sentiment analysis for Urdu online reviews using deep learning models

Safder, Iqra, Mehmood, Zainab, Sarwar, Raheem, Hassan, Saeed‐Ul, Aljohani, Naif Radi and Nawaz, Raheel (ORCID: https://orcid.org/0000-0001-9588-0052) (2021) Sentiment analysis for Urdu online reviews using deep learning models. Expert Systems, 38 (8). e12751. ISSN 0266-4720

Accepted Version

Abstract

Most existing sentiment analysis studies focus on widely used languages such as English, Spanish, Chinese, and Japanese; limited attention has been paid to Urdu, despite its more than 60 million native speakers. In this paper, we develop a deep learning model for analysing the sentiments expressed in this under-resourced language. We develop an open-source corpus of 10,008 reviews from 566 online threads on the topics of sports, food, software, politics, and entertainment. The objectives of this work are twofold: (a) the creation of a human-annotated corpus for sentiment analysis research in Urdu; and (b) measurement of up-to-date model performance on this corpus. For this assessment, we performed binary and ternary classification experiments using rule-based, N-gram, support vector machine (SVM), convolutional neural network (CNN), long short-term memory (LSTM), and recurrent convolutional neural network (RCNN) models. The RCNN model surpasses the standard models, with 84.98% accuracy for binary classification and 68.56% accuracy for ternary classification. To facilitate other researchers working in the same domain, we have open-sourced the corpus and the code developed for this research.
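
To make the difference between the binary and ternary settings concrete, the following is a minimal sketch of a lexicon-based (rule-based) baseline and an accuracy measure. It is not the authors' implementation: the four-word lexicon, the function names, and the tie-breaking rule in the binary case are all illustrative assumptions, and a real system would need a far larger lexicon and proper Urdu tokenisation.

```python
# Hypothetical toy lexicon, chosen only for illustration; it is not
# taken from the paper or its released corpus.
POSITIVE = {"اچھا", "بہترین"}   # "good", "excellent"
NEGATIVE = {"برا", "خراب"}      # "bad", "poor"


def polarity_score(review):
    """Count positive minus negative lexicon hits in a whitespace-tokenised review."""
    tokens = review.split()
    positive_hits = sum(token in POSITIVE for token in tokens)
    negative_hits = sum(token in NEGATIVE for token in tokens)
    return positive_hits - negative_hits


def classify_binary(review):
    """Two classes. Ties default to 'positive' (an arbitrary assumption)."""
    return "positive" if polarity_score(review) >= 0 else "negative"


def classify_ternary(review):
    """Three classes. A zero score is treated as 'neutral'."""
    score = polarity_score(review)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def accuracy(predictions, gold_labels):
    """Fraction of predictions that match the human-annotated labels."""
    correct = sum(p == g for p, g in zip(predictions, gold_labels))
    return correct / len(gold_labels)
```

The ternary task adds a neutral class that the classifier must also separate from the other two, which is consistent with the lower ternary accuracy reported above.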
