I3D-AE-LSTM: A 2-Stream Autoencoder for Action Quality Assessment Using a Newly Created Cricket Batsman Video Dataset

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this study, we introduce UJ-AQA-CricketVision, a dataset comprising 8,540 video clips of cricket strokes, each annotated with detailed phase breakdowns. We develop a novel multi-variate approach for Action Quality Assessment (AQA) at a body level that leverages an Autoencoder for extracting sophisticated feature representations from video frames and pose estimated keypoints. These features are subsequently utilised by a multilayer perceptron regression-based model to accurately predict the quality of cricket actions in terms of their head, shoulder, hands, hips, and feet. Our approach is benchmarked against contemporary state-of-the-art AQA methods and achieves a Spearman Rank Correlation score of 0.84. The performance highlights the significance of integrating pose keypoint and frame data for the nuanced analysis of short and complex action sequences in sports such as cricket. This work aims to foster the development of accurate Action Quality Assessment methods on Cricket Video data. The dataset can be found here: https://github.com/dvanderhaar/uj-aqa-cricketvision.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages5470-5478
Number of pages9
ISBN (Electronic)9798331510831
DOIs
Publication statusPublished - 2025
Event2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025 - Tucson, United States
Duration: 28 Feb 20254 Mar 2025

Publication series

NameProceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025

Conference

Conference2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025
Country/TerritoryUnited States
CityTucson
Period28/02/254/03/25

Keywords

  • action quality assessment
  • aqa dataset
  • autoencoders
  • cricket ai
  • pose estimation
  • two stream autoencoder

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Human-Computer Interaction
  • Modeling and Simulation
  • Radiology, Nuclear Medicine and Imaging

Fingerprint

Dive into the research topics of 'I3D-AE-LSTM: A 2-Stream Autoencoder for Action Quality Assessment Using a Newly Created Cricket Batsman Video Dataset'. Together they form a unique fingerprint.

Cite this