Crockett, Keeley ORCID: https://orcid.org/0000-0003-1941-6201, Adel, Naeemeh ORCID: https://orcid.org/0000-0003-4449-7410, O'Shea, James ORCID: https://orcid.org/0000-0001-5645-2370, Crispin, Alan, Chandran, David and Carvalho, João Paulo (2017) Application of Fuzzy Semantic Similarity Measures to Event Detection Within Tweets. In: IEEE International conference on Fuzzy Systems (Fuzz-IEEE 2017), 09 July 2017 - 12 July 2017, Naples, Italy.
|
Accepted Version
Available under License In Copyright. Download (807kB) | Preview |
Abstract
This paper examines the suitability of applying fuzzy semantic similarity measures (FSSM) to the task of detecting potential future events through the use of a group of prototypical event tweets. FSSM are ideal measures to be used to analyse the semantic textual content of tweets due to the ability to deal equally with not only nouns, verbs, adjectives and adverbs, but also perception based fuzzy words. The proposed methodology first creates a set of prototypical event related tweets and a control group of tweets from a data source, then calculates the semantic similarity against an event dataset compiled from tweets issued during the 2011 London riots. The dataset of tweets contained a proportion of tweets that the Guardian Newspaper publically released that were attributed to 200 influential Twitter users during the actual riot. The effects of changing the semantic similarity threshold are investigated in order to evaluate if Twitter tweets can be used in conjunction with fuzzy short text similarity measures and prototypical event related tweets to determine if an event is more likely to occur. By looking at the increase in frequency of tweets in the dataset, over a certain similarity threshold when matched with prototypical event tweets about riots, the results have shown that a potential future event can be detected.
Impact and Reach
Statistics
Additional statistics for this dataset are available via IRStats2.