e-space
Manchester Metropolitan University's Research Repository

    Personality detection using kernel-based ensemble model for leveraging social psychology in online networks

    Kumar, Akshi ORCID logoORCID: https://orcid.org/0000-0003-4263-7168, Beniwal, Rohit and Jain, Dipika (2023) Personality detection using kernel-based ensemble model for leveraging social psychology in online networks. ACM Transactions on Asian and Low-Resource Language Information Processing, 22 (5). pp. 1-20. ISSN 2375-4699

    [img]
    Preview
    Accepted Version
    Download (854kB) | Preview

    Abstract

    The Asian social networking market dominates the world landscape with the highest consumer penetration rate. Businesses and investors often look for winning strategies to attract consumers to increase revenues from sales, advertisements, and other services offered on social media platforms. Social media engagement and online relational cohesion have often been defined within the frameworks of social psychology and personality identification is a possible way in which social psychology can inform, engage, and learn from social media. Personality profiling has many real-world applications, including preference-based recommendation systems, relationship building, and career counseling. This research puts forward a novel kernel-based soft-voting ensemble model for personality detection from natural language, KBSVE-P. The KBSVE-P model is built by firstly evaluating the performance of various Support Vector Machine (SVM) kernels, namely radial basis function (RBF), linear, sigmoidal, and polynomial, to find the best-suited kernel for automatic personality detection in natural language text. Next, an ensemble of SVM kernels is implemented with a variety of voting techniques, such as soft voting, hard voting, and weighted hard voting. The model is evaluated on the publicly available Kaggle_MBTI dataset and a novel South Asian, Indian, low-resource Hindi language विशेष चरित्र_MBTI (pronounced as vishesh charitr, meaning personality in Hindi) dataset for detecting a user's personality across four personality traits, namely introvert/extrovert (IE), thinking/feeling (TF), sensing/intuitive (SI), and judging/perceiving (JP). The proposed kernel-based ensemble with soft voting, KBSVE-P, outperforms the existing models on English Kaggle-MBTI dataset with an average F-score of 85.677 and achieves an accuracy of 66.89 for the Hindi विशेष चरित्र_MBTI dataset.

    Impact and Reach

    Statistics

    Activity Overview
    6 month trend
    442Downloads
    6 month trend
    60Hits

    Additional statistics for this dataset are available via IRStats2.

    Altmetric

    Repository staff only

    Edit record Edit record