Conference papers

From Emoji Usage to Categorical Emoji Prediction

Abstract : Emoji usage has increased drastically in recent years, and emojis are becoming one of the most common ways to convey emotions and sentiments in social messaging applications. Several research works automatically recommend emojis so that users do not have to browse a library of thousands of emojis. To improve emoji recommendation, we present and distribute two resources: an emoji embedding model trained on real usage, and an emoji clustering based on these embeddings that automatically identifies groups of emojis. Assuming that emojis are part of written natural language and can be treated as words, we used only unsupervised learning methods to extract patterns and knowledge from real emoji usage in tweets. Emotion categories of face emojis were thereby obtained directly from text in a fully reproducible way. These resources and this methodology have multiple uses; for example, they could be used to improve our understanding of emojis or to enhance emoji recommendation.

Contributor : Gaël Guibon
Submitted on : Monday, September 10, 2018 - 11:35:59 AM
Last modification on : Thursday, July 14, 2022 - 4:08:20 AM
Long-term archiving on : Tuesday, December 11, 2018 - 2:05:22 PM


Files produced by the author(s)


  • HAL Id : hal-01871045, version 1



Gaël Guibon, Magalie Ochs, Patrice Bellot. From Emoji Usage to Categorical Emoji Prediction. 19th International Conference on Computational Linguistics and Intelligent Text Processing (CICLING 2018), Mar 2018, Hanoï, Vietnam. ⟨hal-01871045⟩


