e-space
Manchester Metropolitan University's Research Repository

    Predicting primary sequence-based protein-protein interactions using a Mercer series representation of nonlinear support vector machine

    Chatrabgoun, Omid, Daneshkhah, Alireza, Esmaeilbeigi, Mohsen, Sohrabi Safa, Nader ORCID logoORCID: https://orcid.org/0000-0003-4897-0084, Alenezi, Ali H and Rahman, Arafatur (2022) Predicting primary sequence-based protein-protein interactions using a Mercer series representation of nonlinear support vector machine. IEEE Access, 10. pp. 124345-124354. ISSN 2169-3536

    [img]
    Preview
    Published Version
    Available under License Creative Commons Attribution.

    Download (2MB) | Preview

    Abstract

    The prediction of protein-protein interactions (PPIs) is essential to understand the cellular processes from a medical perspective. Among the various machine learning techniques, kernel-based Support Vector Machine (SVM) has been commonly employed to discriminate between interacting and non-interacting protein pairs. The main drawback of employing the kernel-based SVM to datasets with many features, such as the primary sequence-based protein-protein dataset, is the significant increase in computational time of training stage. This increase in computational time is mainly due to the presence of the kernel in solving the quadratic optimisation problem (QOP) involved in nonlinear SVM. In order to fix this issue, we propose a novel and efficient computational algorithm by approximating the kernel-based SVM using a low-rank truncated Mercer series as well as desired. As a result, the QOP for the approximated kernel-based SVM will be very tractable in the sense that there is a significant reduction in computational time of training and validating stages. We illustrate the novelty of the proposed method by predicting the PPIs of “S. Cerevisiae” where the protein features extracted using the multiscale local descriptor (MLD), and then we compare the predictive performance of the proposed low-rank approximation with the existing methods. Finally, the new method results in significant reduction in computational time for predicting PPIs with almost as accuracy as kernel-based SVM.

    Impact and Reach

    Statistics

    Activity Overview
    6 month trend
    95Downloads
    6 month trend
    67Hits

    Additional statistics for this dataset are available via IRStats2.

    Altmetric

    Repository staff only

    Edit record Edit record