e-space
Manchester Metropolitan University's Research Repository

    Provenance Graph Kernel

    Marzagao, DK, Huynh, TD, Helal, A (ORCID: https://orcid.org/0000-0001-7003-9945), Baccas, S and Moreau, L (2025) Provenance Graph Kernel. IEEE Transactions on Knowledge and Data Engineering. pp. 1-16. ISSN 1041-4347

    Accepted Version
    Available under License In Copyright.


    Abstract

    Provenance is a standardised record that describes how entities, activities, and agents have influenced a piece of data; it is commonly represented as a graph with labels on both nodes and edges. With the growing adoption of provenance across a wide range of application domains, users are increasingly confronted with an abundance of graph data, which can be challenging to process. Graph kernels, meanwhile, have been used successfully to analyse graphs efficiently. In this paper, we introduce a novel graph kernel, called the provenance kernel, which is inspired by and tailored for provenance data. We employ provenance kernels to classify provenance graphs from three application domains. Our evaluation shows that they perform well in terms of classification accuracy and yield competitive results when compared against existing graph kernel methods and the provenance network analytics method, while being more efficient in computing time. Moreover, the provenance types used by provenance kernels are symbolic representations of tree patterns, which can in turn be described using the domain-agnostic vocabulary of provenance. Provenance types thus allow for the creation of explanations of predictive models built on them.
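
The general idea of comparing graphs by counting labelled tree patterns rooted at their nodes can be illustrated with a small Python sketch. This is a simplified illustration under stated assumptions, not the definition given in the paper: the function names (`tree_types`, `type_counts`, `kernel`), the toy graphs, and the exact shape of the pattern are all hypothetical. Only the node and edge labels (`entity`, `activity`, `agent`, `used`, `wasGeneratedBy`, `wasAssociatedWith`) are taken from the standard PROV vocabulary.

```python
from collections import Counter

def tree_types(labels, edges, depth):
    """Map each node to a hashable tree pattern of the given depth.

    Assumed simplification: the depth-0 pattern is the node's label; a deeper
    pattern pairs that label with the set of (edge label, child pattern)
    branches found on the node's outgoing edges.
    """
    types = dict(labels)
    for _ in range(depth):
        new_types = {}
        for node in labels:
            branches = {(lab, types[dst]) for src, lab, dst in edges if src == node}
            # Sort by repr so mixed str/tuple patterns get a stable order.
            new_types[node] = (labels[node], tuple(sorted(branches, key=repr)))
        types = new_types
    return types

def type_counts(labels, edges, max_depth):
    """Count how often each pattern occurs, for every depth 0..max_depth."""
    counts = Counter()
    for h in range(max_depth + 1):
        for pattern in tree_types(labels, edges, h).values():
            counts[(h, pattern)] += 1
    return counts

def kernel(g1, g2, max_depth=1):
    """Dot product of the two graphs' pattern-count vectors."""
    c1 = type_counts(*g1, max_depth)
    c2 = type_counts(*g2, max_depth)
    return sum(c1[key] * c2[key] for key in c1.keys() & c2.keys())

# Hypothetical toy graphs: (node -> label, list of (source, edge label, target)).
g1 = (
    {"e1": "entity", "a1": "activity", "e2": "entity"},
    [("e2", "wasGeneratedBy", "a1"), ("a1", "used", "e1")],
)
g2 = (
    {"e1": "entity", "a1": "activity", "ag": "agent"},
    [("a1", "used", "e1"), ("a1", "wasAssociatedWith", "ag")],
)

similarity = kernel(g1, g2, max_depth=1)
```

Because each feature in this sketch is a readable pattern such as "an activity that used an entity", a model trained on the counts could in principle be explained in terms of those patterns, which is the kind of interpretability the abstract refers to.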
