Sharma, A, Sharma, K and Kumar, A ORCID: https://orcid.org/0000-0003-4263-7168 (2022) Real-time emotional health detection using fine-tuned transfer networks with multimodal fusion. Neural Computing and Applications. ISSN 0941-0643
Accepted Version. Available under License In Copyright.
Abstract
Recognizing and regulating human emotions, or a wave of rising emotions, is a vital life skill, as it plays an important role in how a person thinks, behaves and acts. Accurate real-time emotion detection could revolutionize the human–computer interaction industry and has the potential to provide a proactive approach to mental health care. Several untapped sources of data, including social media data (psycholinguistic markers) and multimodal data (audio and video signals), combined with sensor-based psychophysiological and brain signals, help to comprehend affective states and emotional experiences. In this work, we propose a model that utilizes three modalities, i.e., visual (facial expressions and body gestures), audio (speech) and text (spoken content), to classify emotion into discrete categories based on Ekman’s model, with an additional category for the ‘neutral’ state. Transfer learning with multistage fine-tuning is used for each modality, instead of training on a single dataset, to make the model generalizable. The use of multiple modalities allows effective integration of heterogeneous data from different sources. The results of the three modalities are combined at the decision level using a weighted fusion technique. The proposed EmoHD model compares favorably with state-of-the-art techniques on two benchmark datasets, MELD and IEMOCAP.
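The decision-level weighted fusion described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: the class set is Ekman's six basic emotions plus 'neutral', and the modality weights and per-modality probabilities below are invented for demonstration (the paper's actual weighting scheme is not given here).

```python
import numpy as np

# Ekman's six basic emotions plus the additional 'neutral' category
EMOTIONS = ["anger", "disgust", "fear", "joy", "sadness", "surprise", "neutral"]

def fuse_decisions(probs_by_modality, weights):
    """Weighted decision-level fusion: combine per-modality class
    probability vectors into a single fused prediction."""
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()  # normalize weights to sum to 1
    stacked = np.stack([np.asarray(p, dtype=float) for p in probs_by_modality])
    fused = weights @ stacked  # (modalities,) x (modalities, classes) -> (classes,)
    return EMOTIONS[int(np.argmax(fused))], fused

# Hypothetical softmax outputs from the visual, audio and text branches
visual = [0.05, 0.02, 0.03, 0.60, 0.05, 0.05, 0.20]
audio  = [0.10, 0.05, 0.05, 0.40, 0.10, 0.10, 0.20]
text   = [0.05, 0.05, 0.05, 0.55, 0.10, 0.05, 0.15]

# Hypothetical weights favoring the visual modality
label, fused = fuse_decisions([visual, audio, text], weights=[0.4, 0.3, 0.3])
# label -> "joy"
```

Because each modality contributes a full probability distribution, a confident prediction from one branch can outvote weaker, conflicting predictions from the others, which is the usual motivation for decision-level rather than feature-level fusion.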