Explainable YouTube video identification using sufficient input subsets

Afandi, Waleed, Bukhari, Syed Muhammad Ammar Hassan, Khan, Muhammad US, Maqsood, Tahir, Fayyaz, Muhammad AB, Ansari, Ali R and Nawaz, Raheel (2023) Explainable YouTube video identification using sufficient input subsets. IEEE Access, 11. pp. 33178-33188. ISSN 2169-3536

Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Official URL: https://doi.org/10.1109/ACCESS.2023.3261562

Abstract

Neural network models are black boxes by nature, and the mechanics behind them are practically unexplainable. Insight into the patterns these algorithms identify can help unravel important properties of the subject under study. Such artificial-intelligence-based algorithms are used for prediction across many domains. This research focuses on patterns in network traffic that can be leveraged to identify videos streaming over the network. The proposed work applies a sufficient input subset (SIS) model to two separate video identification techniques to understand and explain the patterns they detect. The first technique creates fingerprints of videos using a period-based algorithm that handles variable-bitrate inconsistencies; these fingerprints are passed to a convolutional neural network (CNN) for pattern recognition. The second technique is based on traffic pattern plot identification: it plots packet size against time for each stream and passes the plot to a CNN as an image. For model explainability, the SIS model identifies features that are sufficient for the model to reach the same prediction at a given confidence threshold. The SIS generated for each input sample are clustered using DBSCAN, K-Means, and cosine-based hierarchical clustering. The clustered SIS highlight the common patterns for each class. The SIS patterns learnt by each model for three individual videos are discussed. Furthermore, these patterns are used to investigate misclassifications and to provide a rationale that justifies the classifier's behaviour.
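
To make the SIS idea in the abstract concrete, the following is a minimal sketch of a backward-selection procedure in the spirit of sufficient input subsets. It is not the authors' implementation: the function name `sufficient_input_subset`, the zero `mask_value`, and the toy scoring function `toy_predict` are all assumptions introduced for illustration. In the paper's setting, the scoring function would be a CNN's confidence for the predicted video class.

```python
import numpy as np

def sufficient_input_subset(predict, x, threshold, mask_value=0.0):
    """Return sorted indices of one sufficient subset of x, or None.

    predict: callable mapping a 1-D feature vector to a confidence score.
    Hypothetical helper; masking with a constant is an assumption.
    """
    x = np.asarray(x, dtype=float)
    current = x.copy()
    remaining = list(range(x.size))
    removal_order = []

    # Backward selection: repeatedly mask the feature whose removal
    # leaves the model's confidence highest.
    while remaining:
        best_i, best_score = None, -np.inf
        for i in remaining:
            trial = current.copy()
            trial[i] = mask_value
            score = predict(trial)
            if score > best_score:
                best_i, best_score = i, score
        current[best_i] = mask_value
        remaining.remove(best_i)
        removal_order.append(best_i)

    # Rebuild: restore features, last removed first, until the
    # confidence reaches the threshold.
    candidate = np.full_like(x, mask_value)
    subset = []
    for i in reversed(removal_order):
        candidate[i] = x[i]
        subset.append(i)
        if predict(candidate) >= threshold:
            return sorted(subset)
    return None  # even the full input does not reach the threshold

# Toy stand-in for a classifier: a logistic score over weighted features.
weights = np.array([0.0, 3.0, 0.0, 2.0, 0.1])

def toy_predict(z):
    return 1.0 / (1.0 + np.exp(-(np.dot(weights, z) - 2.0)))

x = np.ones(5)
print(sufficient_input_subset(toy_predict, x, threshold=0.9))  # [1, 3]
```

In this toy case only features 1 and 3 are needed to keep the score above 0.9, so they form the subset. Subsets obtained this way for many samples could then be grouped with a clustering method such as DBSCAN or K-Means to surface patterns shared within a class, as the abstract describes.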

Item Type:	Article
Peer-reviewed:	Yes
Date Deposited:	31 Aug 2023 15:07
Publisher:	IEEE
Additional Information:	This is an Open Access article which appeared in IEEE Access
Divisions:	Faculties > Business and Law
Subject terms:	08 Information and Computing Sciences, 09 Engineering, 10 Technology
URI:	https://e-space.mmu.ac.uk/id/eprint/632486
DOI:	https://doi.org/10.1109/ACCESS.2023.3261562
ISSN:	2169-3536
e-ISSN:	2169-3536