TY - GEN
T1 - Scene Recognition Using AlexNet to Recognize Significant Events Within Cricket Game Footage
AU - Moodley, Tevin
AU - van der Haar, Dustin
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - In the last decade, special attention has been made toward the automated analysis of human activity and other related fields. Cricket, as a research field, has of late received more attention due to its increased popularity. The cricket domain currently lacks datasets, specifically relating to cricket strokes. The limited datasets restrict the amount of research within the environment. In the study, this research paper proposes a scene recognition model to recognize frames with a cricket batsman. Two different classes are addressed, namely; the gameplay class and the stroke class. Two pipelines were evaluated; the first pipeline proposes the Support Vector Machine (SVM) algorithm, which undergoes data capturing, feature extraction using histogram of oriented gradients and lastly classification. The Support Vector Machine (SVM) model yielded an accuracy of 95.441%. The second pipeline is the AlexNet Convolutional Neural Network (CNN) architecture, which underwent data capturing, data augmentation that includes rescaling and shear zoom followed by feature extraction and classification using AlexNet. The AlexNet architecture performed exceptionally well, producing a model accuracy of 96.661%. The AlexNet pipeline is preferred over the Support Vector Machine pipeline for the domain. By recognizing a significant event, that is when a stroke and none stoke (gameplay) scene is recognized. The model is able to filter only relevant footage from large volumes of data, which is then later used for analysis. The research proves there is value in exploring deep-learning methods for scene recognition.
AB - In the last decade, special attention has been made toward the automated analysis of human activity and other related fields. Cricket, as a research field, has of late received more attention due to its increased popularity. The cricket domain currently lacks datasets, specifically relating to cricket strokes. The limited datasets restrict the amount of research within the environment. In the study, this research paper proposes a scene recognition model to recognize frames with a cricket batsman. Two different classes are addressed, namely; the gameplay class and the stroke class. Two pipelines were evaluated; the first pipeline proposes the Support Vector Machine (SVM) algorithm, which undergoes data capturing, feature extraction using histogram of oriented gradients and lastly classification. The Support Vector Machine (SVM) model yielded an accuracy of 95.441%. The second pipeline is the AlexNet Convolutional Neural Network (CNN) architecture, which underwent data capturing, data augmentation that includes rescaling and shear zoom followed by feature extraction and classification using AlexNet. The AlexNet architecture performed exceptionally well, producing a model accuracy of 96.661%. The AlexNet pipeline is preferred over the Support Vector Machine pipeline for the domain. By recognizing a significant event, that is when a stroke and none stoke (gameplay) scene is recognized. The model is able to filter only relevant footage from large volumes of data, which is then later used for analysis. The research proves there is value in exploring deep-learning methods for scene recognition.
KW - AlexNet architecture
KW - Automation
KW - Cricket strokes
KW - Scene recognition
KW - Support vector machines
UR - http://www.scopus.com/inward/record.url?scp=85091329993&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-59006-2_9
DO - 10.1007/978-3-030-59006-2_9
M3 - Conference contribution
AN - SCOPUS:85091329993
SN - 9783030590055
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 98
EP - 109
BT - Computer Vision and Graphics - International Conference, ICCVG 2020, Proceedings
A2 - Chmielewski, Leszek J.
A2 - Kozera, Ryszard
A2 - Orlowski, Arkadiusz
PB - Springer Science and Business Media Deutschland GmbH
T2 - International Conference on Computer Vision and Graphics, ICCVG 2020
Y2 - 14 September 2020 through 16 September 2020
ER -