Manchester Metropolitan University's Research Repository

    Mining the Context of Citations in Scientific Publications

    Hassan, Saeed-Ul, Iqbal, Sehrish, Imran, Mubashir, Aljohani, Naif Radi and Nawaz, Raheel (2018) Mining the Context of Citations in Scientific Publications. In: ICADL 2018 - The 20th International Conference on Asia-Pacific Digital Libraries, 19 November 2018 - 22 November 2018, University of Waikato, Hamilton, New Zealand.


    Download (1MB) | Preview


    Recent advancements in information retrieval systems significantly rely on the context-based features and semantic matching techniques to provide relevant information to users from ever-growing digital libraries. Scientific communities seek to understand the implications of research, its importance and its applicability for future research directions. To mine this information, absolute citations merely fail to measure the importance of scientific literature, as a citation may have a specific context in full text. Thus, a comprehensive contextual understanding of cited references is necessary. For this purpose, numerous techniques have been proposed that tap the power of artificial intelligence models to detect important or incidental (non-important) citations in full text scholarly publications. In this paper, we compare and build upon on four state-of-the-art models that detect important citations using 450 manually annotated citations by experts - randomly selected from 20,527 papers from the Association for Computational Linguistics corpus. Of the total 64 unique features proposed by the four selected state-of-the-art models, the top 29 were chosen using the Extra-Trees classifier. These were then fed it to our supervised machine learning based models: Random Forest (RF) and Support Vector Machine. The RF model outperforms existing selected systems by more than 10%, with 89% precision-recall curve. Finally, we qualitatively assessed important and non-important citations by employing and self-organizing maps. Overall, our research work supports information retrieval algorithms that detect and fetch scientific articles on the basis of both qualitative and quantitative indices in scholarly big data.

    Impact and Reach


    Activity Overview
    6 month trend
    6 month trend

    Additional statistics for this dataset are available via IRStats2.


    Repository staff only

    Edit record Edit record