TY - JOUR
T1 - Predicting Software Perfection Through Advanced Models to Uncover and Prevent Defects
AU - Shahzad, Tariq
AU - Khan, Sunawar
AU - Mazhar, Tehseen
AU - Ahmad, Wasim
AU - Ouahada, Khmaies
AU - Hamam, Habib
N1 - Publisher Copyright:
Copyright © 2025 Tariq Shahzad et al. IET Software published by John Wiley & Sons Ltd.
PY - 2025
Y1 - 2025
N2 - Software defect prediction is a critical task in software engineering, enabling organizations to proactively identify and address potential issues in software systems, thereby improving quality and reducing costs. In this study, we evaluated and compared various machine learning models, including logistic regression (LR), random forest (RF), support vector machines (SVMs), convolutional neural networks (CNNs), and eXtreme Gradient Boosting (XGBoost), for software defect prediction using a combination of diverse datasets. The models were trained and tested on preprocessed and feature-selected data, followed by optimization through hyperparameter tuning. Performance evaluation metrics were employed to analyze the results comprehensively, including classification reports, confusion matrices, receiver operating characteristic–area under the curve (ROC-AUC) curves, precision–recall curves, and cumulative gain charts. The results revealed that XGBoost consistently outperformed other models, achieving the highest accuracy, precision, recall, and AUC scores across all metrics. This indicates its robustness and suitability for predicting software defects in real-world applications.
AB - Software defect prediction is a critical task in software engineering, enabling organizations to proactively identify and address potential issues in software systems, thereby improving quality and reducing costs. In this study, we evaluated and compared various machine learning models, including logistic regression (LR), random forest (RF), support vector machines (SVMs), convolutional neural networks (CNNs), and eXtreme Gradient Boosting (XGBoost), for software defect prediction using a combination of diverse datasets. The models were trained and tested on preprocessed and feature-selected data, followed by optimization through hyperparameter tuning. Performance evaluation metrics were employed to analyze the results comprehensively, including classification reports, confusion matrices, receiver operating characteristic–area under the curve (ROC-AUC) curves, precision–recall curves, and cumulative gain charts. The results revealed that XGBoost consistently outperformed other models, achieving the highest accuracy, precision, recall, and AUC scores across all metrics. This indicates its robustness and suitability for predicting software defects in real-world applications.
KW - ensemble learning
KW - hyperparameter optimization
KW - machine learning models
KW - NLP techniques
KW - software defect prediction
KW - text mining
UR - http://www.scopus.com/inward/record.url?scp=105006580359&partnerID=8YFLogxK
U2 - 10.1049/sfw2/8832164
DO - 10.1049/sfw2/8832164
M3 - Article
AN - SCOPUS:105006580359
SN - 1751-8806
VL - 2025
JO - IET Software
JF - IET Software
IS - 1
M1 - 8832164
ER -