TY - GEN
T1 - Slang-Based Text Sentiment Analysis in Instagram
AU - Aly, Elton Shah
AU - van der Haar, Dustin Terence
N1 - Publisher Copyright:
© Springer Nature Singapore Pte Ltd 2020.
PY - 2020
Y1 - 2020
N2 - A large amount of user-generated content on social media has led to the pursuit of quickly and accurately mining through data and gathering useful insights. Text sentiment analysis has become a necessary tool in classifying user opinions within Web generated content. Due to the various ways, opinions can be conveyed, performing text sentiment analysis in specific domains becomes a difficult task. With an even greater degree of difficulty added when slang or colloquialisms are used. There is a great deal of research into investigating various classifiers in a traditional natural language processing setting each with their own merits and demerits. In this paper, we present a slang-based dictionary classifier with the objective of determining the sentiment of Instagram comments within the context of fashion, or more specifically sports shoes, and compare it with the performance of other classifiers such as a Naive Bayes, J48, lexicon and random forest. The dataset used for the benchmark was created from popular fashion Instagram accounts. Overall, the random forest classifier yields the best results with an accuracy of 88%, precision of 84% and a recall of 88%.
AB - A large amount of user-generated content on social media has led to the pursuit of quickly and accurately mining through data and gathering useful insights. Text sentiment analysis has become a necessary tool in classifying user opinions within Web generated content. Due to the various ways, opinions can be conveyed, performing text sentiment analysis in specific domains becomes a difficult task. With an even greater degree of difficulty added when slang or colloquialisms are used. There is a great deal of research into investigating various classifiers in a traditional natural language processing setting each with their own merits and demerits. In this paper, we present a slang-based dictionary classifier with the objective of determining the sentiment of Instagram comments within the context of fashion, or more specifically sports shoes, and compare it with the performance of other classifiers such as a Naive Bayes, J48, lexicon and random forest. The dataset used for the benchmark was created from popular fashion Instagram accounts. Overall, the random forest classifier yields the best results with an accuracy of 88%, precision of 84% and a recall of 88%.
KW - Machine learning
KW - Natural language processing
KW - Sentiment recognition
UR - http://www.scopus.com/inward/record.url?scp=85078427189&partnerID=8YFLogxK
U2 - 10.1007/978-981-32-9343-4_25
DO - 10.1007/978-981-32-9343-4_25
M3 - Conference contribution
AN - SCOPUS:85078427189
SN - 9789813293427
T3 - Advances in Intelligent Systems and Computing
SP - 321
EP - 329
BT - 4th International Congress on Information and Communication Technology - ICICT 2019, London
A2 - Yang, Xin-She
A2 - Sherratt, Simon
A2 - Dey, Nilanjan
A2 - Joshi, Amit
PB - Springer
T2 - 4th International Congress on Information and Communication Technology, ICICT 2019
Y2 - 27 February 2019 through 28 February 2019
ER -