From Emoji Usage to Categorical Emoji Prediction

Abstract : Emoji usage drastically increased recently, they are becoming some of the most common ways to convey emotions and sentiments in social messaging applications. Several research works automatically recommend emojis, so users do not have to go through a library of thousands of emojis. In order to improve emoji recommendation, we present and distribute two useful resources: an emoji embedding model from real usage, and emoji clustering based on these embeddings to automatically identify groups of emojis. Assuming that emojis are part of written natural language and can be considered as words, we only used unsu-pervised learning methods to extract patterns and knowledge from real emoji usage in tweets. Thereby, emotion categories of face emojis were obtained directly from text in a fully reproductible way. These resources and methodology have multiple usages; for example, they could be used to improve our understanding of emojis or enhance emoji recommendation .
Complete list of metadatas

Cited literature [16 references]  Display  Hide  Download

https://hal-amu.archives-ouvertes.fr/hal-01871045
Contributor : Gaël Guibon <>
Submitted on : Monday, September 10, 2018 - 11:35:59 AM
Last modification on : Monday, April 15, 2019 - 5:02:32 PM
Long-term archiving on : Tuesday, December 11, 2018 - 2:05:22 PM

File

cicling2-cameraready-fixed.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01871045, version 1

Collections

Citation

Gaël Guibon, Magalie Ochs, Patrice Bellot. From Emoji Usage to Categorical Emoji Prediction. 19th International Conference on Computational Linguistics and Intelligent Text Processing (CICLING 2018), Mar 2018, Hanoï, Vietnam. ⟨hal-01871045⟩

Share

Metrics

Record views

103

Files downloads

222