Luciano, G and Shardlow, MJ (2018) Manchester Metropolitan at SemEval-2018 Task 2: Random Forest with an Ensemble of Features for Predicting Emoji in Tweets. In: 12th International Workshop on Semantic Evaluation (SemEval 2018), 05 June 2018 - 06 June 2018, New Orleans, USA.
Published Version
Available under a Creative Commons Attribution license.
Abstract
We present our submission to the SemEval 2018 task on emoji prediction. We used a random forest with an ensemble of bag-of-words, sentiment and psycholinguistic features. Although we performed well on the trial dataset (attaining a macro F-score of 63.185 for English and 81.381 for Spanish), our approach did not perform as well on the test data. We describe our features and classification protocol, as well as initial experiments, and conclude with a discussion of the discrepancy between our trial and test results.
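The macro F-score quoted in the abstract is the unweighted mean of the per-class F-scores, so rare emoji count as much as frequent ones. The following is a minimal sketch of that metric, not the authors' or the task organisers' scoring code. The function name `macro_f_score` and the toy emoji labels are hypothetical, and the sketch assumes the reported figures are on a 0 to 100 scale.

```python
from collections import Counter


def macro_f_score(gold, predicted):
    """Unweighted mean of per-class F1 over the labels that occur in gold."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # p was predicted but is wrong
            fn[g] += 1  # g was the true label and was missed
    labels = sorted(set(gold))
    if not labels:
        raise ValueError("gold must contain at least one label")
    scores = []
    for label in labels:
        # F1 = 2TP / (2TP + FP + FN); never zero-denominator for gold labels
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy data with hypothetical emoji labels (not taken from the task data).
gold = ["heart", "heart", "joy", "fire"]
predicted = ["heart", "joy", "joy", "fire"]
print(round(100 * macro_f_score(gold, predicted), 3))
```

Because every class is weighted equally, a system that does well on a few frequent emoji in one dataset can score quite differently on data with another label distribution.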