TY - GEN
T1 - I3D-AE-LSTM
T2 - 2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025
AU - Moodley, Tevin
AU - Van Der Haar, Dustin
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - In this study, we introduce UJ-AQA-CricketVision, a dataset comprising 8,540 video clips of cricket strokes, each annotated with detailed phase breakdowns. We develop a novel multi-variate approach for Action Quality Assessment (AQA) at a body level that leverages an Autoencoder for extracting sophisticated feature representations from video frames and pose estimated keypoints. These features are subsequently utilised by a multilayer perceptron regression-based model to accurately predict the quality of cricket actions in terms of their head, shoulder, hands, hips, and feet. Our approach is benchmarked against contemporary state-of-the-art AQA methods and achieves a Spearman Rank Correlation score of 0.84. The performance highlights the significance of integrating pose keypoint and frame data for the nuanced analysis of short and complex action sequences in sports such as cricket. This work aims to foster the development of accurate Action Quality Assessment methods on Cricket Video data. The dataset can be found here: https://github.com/dvanderhaar/uj-aqa-cricketvision.
AB - In this study, we introduce UJ-AQA-CricketVision, a dataset comprising 8,540 video clips of cricket strokes, each annotated with detailed phase breakdowns. We develop a novel multi-variate approach for Action Quality Assessment (AQA) at a body level that leverages an Autoencoder for extracting sophisticated feature representations from video frames and pose estimated keypoints. These features are subsequently utilised by a multilayer perceptron regression-based model to accurately predict the quality of cricket actions in terms of their head, shoulder, hands, hips, and feet. Our approach is benchmarked against contemporary state-of-the-art AQA methods and achieves a Spearman Rank Correlation score of 0.84. The performance highlights the significance of integrating pose keypoint and frame data for the nuanced analysis of short and complex action sequences in sports such as cricket. This work aims to foster the development of accurate Action Quality Assessment methods on Cricket Video data. The dataset can be found here: https://github.com/dvanderhaar/uj-aqa-cricketvision.
KW - action quality assessment
KW - aqa dataset
KW - autoencoders
KW - cricket ai
KW - pose estimation
KW - two stream autoencoder
UR - http://www.scopus.com/inward/record.url?scp=105003628103&partnerID=8YFLogxK
U2 - 10.1109/WACV61041.2025.00534
DO - 10.1109/WACV61041.2025.00534
M3 - Conference contribution
AN - SCOPUS:105003628103
T3 - Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025
SP - 5470
EP - 5478
BT - Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 28 February 2025 through 4 March 2025
ER -