e-space
Manchester Metropolitan University's Research Repository

    Deep sentiments in Roman Urdu text using Recurrent Convolutional Neural Network model

    Mahmood, Zainab, Safder, Iqra, Nawab, Rao Muhammad Adeel, Bukhari, Faisal, Nawaz, Raheel ORCID logoORCID: https://orcid.org/0000-0001-9588-0052, Alfakeeh, Ahmed S, Aljohani, Naif Radi and Hassan, Saeed-Ul (2020) Deep sentiments in Roman Urdu text using Recurrent Convolutional Neural Network model. Information Processing & Management, 57 (4). p. 102233. ISSN 0306-4573

    [img]
    Preview
    Accepted Version
    Download (4MB) | Preview

    Abstract

    Although over 64 million people worldwide speak Urdu language and are well aware of its Roman script, limited research and efforts have been made to carry out sentiment analysis and build language resources for the Roman Urdu language. This article proposes a deep learning model to mine the emotions and attitudes of people expressed in Roman Urdu - consisting of 10,021 sentences from 566 online threads belonging to the following genres: Sports; Software; Food & Recipes; Drama; and Politics. The objectives of this research are twofold: (1) to develop a human-annotated benchmark corpus for the under-resourced Roman Urdu language for the sentiment analysis; and (2) to evaluate sentiment analysis techniques using the Rule-based, N-gram, and Recurrent Convolutional Neural Network (RCNN) models. Using Corpus, annotated by three experts to be positive, negative, and neutral with 0.557 Cohen's Kappa score, we run two sets of tests, i.e., binary classification (positive and negative) and tertiary classification (positive, negative and neutral). Finally, the results of the RCNN model are analyzed by comparing it with the outcome of the Rule-based and N-gram models. We show that the RCNN model outperforms baseline models in terms of accuracy of 0.652 for binary classification and 0.572 for tertiary classification.

    Impact and Reach

    Statistics

    Activity Overview
    6 month trend
    355Downloads
    6 month trend
    260Hits

    Additional statistics for this dataset are available via IRStats2.

    Altmetric

    Repository staff only

    Edit record Edit record