e-space
Manchester Metropolitan University's Research Repository

    End user licence to open government data? A simulated penetration attack on two social survey datasets.

    Elliott, M, Mackey, E, O'Shea, SM, Tudor, C and Spicer, K (2016) End user licence to open government data? A simulated penetration attack on two social survey datasets. Journal of Official Statistics, 32 (2). pp. 329-348. ISSN 0282-423X

    [img]
    Preview

    Available under License Creative Commons Attribution Share Alike.

    Download (204kB) | Preview

    Abstract

    In the UK, the transparency agenda is forcing data stewardship organisations to review their dissemination policies and to consider whether to release data that is currently only available to a restricted community of researchers under licence as open data. Here we describe the results of a study providing evidence about the risks of such an approach via a simulated attack on two social survey datasets. This is also the first systematic attempt to simulate a jigsaw identification attack (one using a mashup of multiple data sources) on an anonymised dataset. The information that we draw on is collected from multiple online data sources and purchasable commercial data. The results indicate that such an attack against anonymised end user licence (EUL) datasets, if converted into open datasets, is possible and therefore we would recommend that penetration tests should be factored into any decision to make datasets (that are about people) open.

    Impact and Reach

    Statistics

    Activity Overview
    6 month trend
    284Downloads
    6 month trend
    328Hits

    Additional statistics for this dataset are available via IRStats2.

    Altmetric

    Repository staff only

    Edit record Edit record