A new benchmark dataset with production methodology for short text semantic similarity algorithms

O'shea, J, Bandar, Z and Crockett, K (2013) A new benchmark dataset with production methodology for short text semantic similarity algorithms. ACM Transactions on Speech and Language Processing, 10. ISSN 1550-4875

Preview

Available under License In Copyright.
Download (627kB) | Preview

Preview

Available under License In Copyright.
Download (808kB) | Preview

Abstract

This research presents a new benchmark dataset for evaluating Short Text Semantic Similarity (STSS) measurement algorithms and the methodology used for its creation. The power of the dataset is evaluated by using it to compare two established algorithms, STASIS and Latent Semantic Analysis. This dataset focuses on measures for use in Conversational Agents; other potential applications include email processing and data mining of social networks. Such applications involve integrating the STSS algorithm in a complex system, but STSS algorithms must be evaluated in their own right and compared with others for their effectiveness before systems integration. Semantic similarity is an artifact of human perception; therefore its evaluation is inherently empirical and requires benchmark datasets derived from human similarity ratings. The new dataset of 64 sentence pairs, STSS-131, has been designed to meet these requirements drawing on a range of resources from traditional grammar to cognitive neuroscience. The human ratings are obtained from a set of trials using new and improved experimental methods, with validated measures and statistics. The results illustrate the increased challenge and the potential longevity of the STSS-131 dataset as the Gold Standard for future STSS algorithm evaluation. © 2013 ACM 1550-4875/2013/12-ART17 15.00.

Item Type:	Article
Peer-reviewed:	No
Date Deposited:	05 Jul 2016 11:38
Publisher:	Association for Computing Machinery (ACM)
Additional Information:	© ACM, 2013. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in ACM Transactions on Speech and Language Processing, Vol.40, Iss.4, December 2013, http://doi.acm.org/10.1145/2537046
Divisions:	Organisation > Science and Engineering
URI:	https://e-space.mmu.ac.uk/id/eprint/615505
DOI:	https://doi.org/10.1145/2537046
ISSN	1550-4875

Impact and Reach

Statistics

DownloadsShow export options

Activity Overview

6 month trend

900Downloads

6 month trend

515Hits

Additional statistics for this dataset are available via IRStats2.

Altmetric

Repository staff only

Edit record