TY - GEN
T1 - Group Emotion Recognition in the Wild using Pose Estimation and LSTM Neural Networks
AU - Slogrove, Kayleigh
AU - Van Der Haar, Dustin
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Emotion recognition in the wild determines the individual's or a group's emotions within non-laboratory conditions. The accuracy of these systems is relatively low due to the various conditions that can occur in real-world scenarios. A method for increasing the accuracy of detecting a group's emotions in the wild is proposed in this paper. The approach focuses on deep learning methods using pose estimation as features of a captured video which is then passed to a Long Short Term Memory (LSTM) network for classification. In the wild, systems need to accommodate different illuminations, occlusions, blurring, and orientations as these systems will be used in real-world scenarios. Due to these accommodations, these systems' accuracy is relatively low compared to the systems used in controlled environments. The proposed system aims to increase this accuracy by focusing on the people's body language within the video to identify the group's emotions. The system achieved an accuracy, precision, recall and F1 score of 91% on the VGAF dataset. The metrics derived from the results show that the proposed system beats the current state of the art and is a valid method to detect group emotions in the wild.
AB - Emotion recognition in the wild determines the individual's or a group's emotions within non-laboratory conditions. The accuracy of these systems is relatively low due to the various conditions that can occur in real-world scenarios. A method for increasing the accuracy of detecting a group's emotions in the wild is proposed in this paper. The approach focuses on deep learning methods using pose estimation as features of a captured video which is then passed to a Long Short Term Memory (LSTM) network for classification. In the wild, systems need to accommodate different illuminations, occlusions, blurring, and orientations as these systems will be used in real-world scenarios. Due to these accommodations, these systems' accuracy is relatively low compared to the systems used in controlled environments. The proposed system aims to increase this accuracy by focusing on the people's body language within the video to identify the group's emotions. The system achieved an accuracy, precision, recall and F1 score of 91% on the VGAF dataset. The metrics derived from the results show that the proposed system beats the current state of the art and is a valid method to detect group emotions in the wild.
KW - Emotion Recognition
KW - Group Emotion Recognition
KW - In the Wild
KW - Pose Estimation
UR - http://www.scopus.com/inward/record.url?scp=85138001954&partnerID=8YFLogxK
U2 - 10.1109/icABCD54961.2022.9856227
DO - 10.1109/icABCD54961.2022.9856227
M3 - Conference contribution
AN - SCOPUS:85138001954
T3 - 5th International Conference on Artificial Intelligence, Big Data, Computing and Data Communication Systems, icABCD 2022 - Proceedings
BT - 5th International Conference on Artificial Intelligence, Big Data, Computing and Data Communication Systems, icABCD 2022 - Proceedings
A2 - Pudaruth, Sameerchand
A2 - Singh, Upasana
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th International Conference on Artificial Intelligence, Big Data, Computing and Data Communication Systems, icABCD 2022
Y2 - 4 August 2022 through 5 August 2022
ER -