Comparative Study of Machine Learning Classifiers for Modelling Road Traffic Accidents

Tebogo Bokaba, Wesley Doorsamy, Babu Sena Paul

Research output: Contribution to journalArticlepeer-review

45 Citations (Scopus)

Abstract

Road traffic accidents (RTAs) are a major cause of injuries and fatalities worldwide. In recent years, there has been a growing global interest in analysing RTAs, specifically concerned with analysing and modelling accident data to better understand and assess the causes and effects of accidents. This study analysed the performance of widely used machine learning classifiers using a real-life RTA dataset from Gauteng, South Africa. The study aimed to assess prediction model designs for RTAs to assist transport authorities and policymakers. It considered classifiers such as naïve Bayes, logistic regression, k-nearest neighbour, AdaBoost, support vector machine, random forest, and five missing data methods. These classifiers were evaluated using five evaluation metrics: accuracy, root-mean-square error, precision, recall, and receiver operating characteristic curves. Furthermore, the assessment involved parameter adjustment and incorporated dimensionality reduction techniques. The empirical results and analyses show that the RF classifier, combined with multiple imputations by chained equations, yielded the best performance when compared with the other combinations.

Original languageEnglish
Article number828
JournalApplied Sciences (Switzerland)
Volume12
Issue number2
DOIs
Publication statusPublished - 1 Jan 2022

Keywords

  • Data analysis
  • Dimensionality reduction
  • Machine learning
  • Missing data
  • Road traffic accidents

ASJC Scopus subject areas

  • General Materials Science
  • Instrumentation
  • General Engineering
  • Process Chemistry and Technology
  • Computer Science Applications
  • Fluid Flow and Transfer Processes

Fingerprint

Dive into the research topics of 'Comparative Study of Machine Learning Classifiers for Modelling Road Traffic Accidents'. Together they form a unique fingerprint.

Cite this