Praveen, Arockia, Noorwali, Abdulfattah, Samiayya, Duraimurugan, Khan, Mohammad Zubair, Vincent, PMDR, Bashir, Ali Kashif ORCID: https://orcid.org/0000-0001-7595-2522 and Alagupandi, Vinoth (2021) ResMem-Net: memory based deep CNN for image memorability estimation. PeerJ Computer Science, 7, e767. ISSN 2376-5992
Published Version. Available under License Creative Commons Attribution.
Abstract
Image memorability is a hard problem in image processing because of its subjective nature. However, with the introduction of deep learning and the wide availability of data and GPUs, great strides have been made in predicting the memorability of an image. In this paper, we propose a novel deep learning architecture called ResMem-Net, a hybrid of an LSTM and a CNN that uses information from the hidden layers of the CNN to compute the memorability score of an image. The intermediate layers are important for predicting the output because they contain information about the intrinsic properties of the image. The proposed architecture automatically learns visual emotions and saliency, as shown by the heatmaps generated using the Grad-CAM technique. We also use the heatmaps and results to analyze and answer one of the most important questions in image memorability: "What makes an image memorable?". The model is trained and evaluated on the publicly available Large-scale Image Memorability (LaMem) dataset from MIT. The results show that the model achieves a rank correlation of 0.679 and a mean squared error of 0.011, which is better than current state-of-the-art models and close to human consistency (ρ = 0.68). The proposed architecture also has significantly fewer parameters than state-of-the-art architectures, making it memory efficient and suitable for production.
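To make the CNN–LSTM hybrid concrete, the sketch below shows one plausible way such a model could be wired in PyTorch: intermediate feature maps from a ResNet backbone are pooled, projected to a common width, and fed as a short sequence into an LSTM whose final state regresses the memorability score. The choice of backbone, the four tapped stages, the hidden size, and the sigmoid head are illustrative assumptions, not the authors' exact ResMem-Net configuration.

```python
# Minimal sketch of a hidden-layer-driven CNN+LSTM memorability regressor.
# Backbone stages, hidden width, and output head are assumptions for
# demonstration only, not the published ResMem-Net architecture.
import torch
import torch.nn as nn
from torchvision import models

class MemorabilityNet(nn.Module):
    def __init__(self, hidden_size=512):
        super().__init__()
        resnet = models.resnet50(weights=None)
        # Intermediate CNN stages whose activations feed the LSTM.
        self.stages = nn.ModuleList([
            nn.Sequential(resnet.conv1, resnet.bn1, resnet.relu,
                          resnet.maxpool, resnet.layer1),
            resnet.layer2, resnet.layer3, resnet.layer4,
        ])
        self.pool = nn.AdaptiveAvgPool2d(1)
        # Project each stage's pooled features (256/512/1024/2048 channels
        # for ResNet-50) to a common size so they form an LSTM sequence.
        self.proj = nn.ModuleList(
            [nn.Linear(d, hidden_size) for d in (256, 512, 1024, 2048)])
        self.lstm = nn.LSTM(hidden_size, hidden_size, batch_first=True)
        self.head = nn.Sequential(nn.Linear(hidden_size, 1), nn.Sigmoid())

    def forward(self, x):
        seq = []
        for stage, proj in zip(self.stages, self.proj):
            x = stage(x)
            seq.append(proj(self.pool(x).flatten(1)))  # (B, hidden)
        seq = torch.stack(seq, dim=1)                  # (B, n_stages, hidden)
        _, (h, _) = self.lstm(seq)
        return self.head(h[-1]).squeeze(1)             # score in [0, 1]

model = MemorabilityNet()
scores = model(torch.randn(2, 3, 224, 224))            # two sample images
```

Evaluation against LaMem ground truth would then compare predicted and annotated scores with the Spearman rank correlation (e.g. scipy.stats.spearmanr) and the mean squared error, the two metrics reported in the abstract.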