Loading…
Saturday May 24, 2025 12:00pm - 2:00pm EDT

Authors - Manideep Pendyala, Udit Goel, Jim Samuel, Pal Patel, Janki Kanakia, Alexander Pelaez, Neel Savalia, Tanya Khanna
Abstract - Emojis have become an integral part of modern digital communication. Despite their widespread use, most sentiment analysis methods and models disregard emojis during preprocessing, leading to the loss of vital emotional cues. This paper introduces a curated dataset of sentence pairs, with and without emojis, each annotated across three sentiment categories, to assess the impact of emoji inclusion on sentiment classification. We evaluate emoji-inclusive and emoji-exclusive strategies against our human-determined gold standard, using a range of approaches, including the traditional lexicon-dictionary based methods, and also artificial intelligence (AI) methods including pre-trained machine learning (ML) based classifiers, and large language models (LLMs). Results show that retaining emojis significantly enhances the performance of all the LLMs we tested, with models such as Qwen, Deepseek, Bert and Mistral achieving accuracy improvements of over 25%, over an emoji-exclusive strategy. These findings highlight that emojis carry meaningful semantic and affective signals. We emphasize the limitations of current approaches to emoji handling, where emojis are often ignored or treated as irrelevant noise. Instead, we advocate for more thoughtful methods that recognize emojis as meaningful components of communication and incorporate them as valuable sources of information.
Paper Presenter
avatar for Manideep Pendyala

Manideep Pendyala

United States of America
Saturday May 24, 2025 12:00pm - 2:00pm EDT
Virtual Room D New York, USA

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link