Classification and Comparative Evaluation of Text and Emoji-Based Tweets With Deep Neural Network Models

  • J. N. Chandra Sekhar
  • , M. Kiran Mayee
  • , Ranjana Nadagoudar
  • , N. Chinna Alluraiah
  • , C. Dhanamjayulu
  • , Ravikumar Chinthaginjala
  • , K. Ravi
  • , M. Praveenkumar
  • , Satyajit Mohanty
  • , Baseem Khan

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

Emojis have become increasingly prevalent in today’s digital world, allowing individuals to convey a wide range of emotions, from uncomplicated to intricate, to a greater extent than previously. Consequently, emojis are being utilized in sentiment analysis and tailored marketing strategies. The ongoing research on conducting emotion detection on both tweets and a symbolic expression dataset sourced from Kaggle. Given that tweets are largely commentaries, we utilized two end-to-end sentence embedding models, the DistilBERT, USE-Large3, and RoBERTa, which generate embeddings. These embeddings are further utilized for training with dense neural networks (NNs) and LSTM techniques. Remarkably, it was perceived that the text classification accuracy for both models was consistently high, hovering around 98%. However, when the validation set is constructed with that of symbolic expression or emojis not included in the training dataset, a significant drop in accuracy for both models, plummeting to 75%, had been observed. Additionally, a distributed training methodology is utilized as a substitute for the conventional single-threaded model to enhance scalability. This approach resulted in a roughly 17% reduction in the runtime while maintaining accuracy. Lastly, in pursuit of explainable AI, the SHAP and LIME algorithms are employed to elucidate the model’s behavior and assess any potential biases in the dataset. The creative use of advanced deep NN techniques customized for the delicate complexities of hybrid-data sentiment analysis indicates a significant leap forward. Our proposed work provides the critical gap in existing sentiment analysis methods, which primarily aimed at either text or emojis in isolation, thereby exploring more holistic understanding of sentiment in digital communications. Moreover, the application of explainable AI techniques, SHAP and LIME, to demystify model decisions emphasizes commitment to advancing transparent and trustworthy deep learning technologies in sentiment analysis.

Original languageEnglish
Article number9652424
JournalJournal of Electrical and Computer Engineering
Volume2024
Issue number1
DOIs
Publication statusPublished - 2024
Externally publishedYes

Keywords

  • artificial intelligence
  • emojis
  • sentiment analysis
  • social networks

ASJC Scopus subject areas

  • Signal Processing
  • General Computer Science
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Classification and Comparative Evaluation of Text and Emoji-Based Tweets With Deep Neural Network Models'. Together they form a unique fingerprint.

Cite this