TY - JOUR
T1 - Prediction of waste generation forecast and emission potential on the Erode City solid waste dump yards based on machine learning approach
AU - Priyanka, E. B.
AU - Vijayshanthy, S.
AU - Thangavel, S.
AU - Anand, R.
AU - Bhavana, G. B.
AU - Khan, Baseem
AU - Jeyanthi, K.
AU - Ambikapathy, A.
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Proposed research presents a data-driven framework for forecasting municipal solid waste (MSW) generation and emission dynamics in Erode City, India, by employing supervised machine learning algorithms. Leveraging a five-year dataset (2019–2024) comprising socio-economic variables, zonal waste typologies, and historical waste volumes, the model integrates Support Vector Machine (SVM), Random Forest (RF), and Naive Bayes (NB) classifiers. Feature selection and proximity ranking techniques were applied to identify high-impact variables, with plastic and organic waste emerging as dominant predictors. Data pre-processing included normalization, missing value imputation, and spatial zoning analysis. The model was validated through cross-validation with an 80:20 training-to-testing ratio. Among the tested models, SVM exhibited Superior performance, achieving a prediction accuracy of 96%, lowest mean squared error (MSE = 4860), and minimal computational latency (0.67 seconds), indicating suitability for real-time deployment. The integration of proximity matrix analysis and zonal feature clustering enhanced interpretability and robustness. The proposed framework demonstrates significant potential for scalable waste forecasting applications, enabling emission quantification and strategic decision-making. Future work includes the incorporation of real-time sensor data, temporal decomposition, and hybrid deep learning architectures to optimize waste handling and carbon mitigation strategies.
AB - Proposed research presents a data-driven framework for forecasting municipal solid waste (MSW) generation and emission dynamics in Erode City, India, by employing supervised machine learning algorithms. Leveraging a five-year dataset (2019–2024) comprising socio-economic variables, zonal waste typologies, and historical waste volumes, the model integrates Support Vector Machine (SVM), Random Forest (RF), and Naive Bayes (NB) classifiers. Feature selection and proximity ranking techniques were applied to identify high-impact variables, with plastic and organic waste emerging as dominant predictors. Data pre-processing included normalization, missing value imputation, and spatial zoning analysis. The model was validated through cross-validation with an 80:20 training-to-testing ratio. Among the tested models, SVM exhibited Superior performance, achieving a prediction accuracy of 96%, lowest mean squared error (MSE = 4860), and minimal computational latency (0.67 seconds), indicating suitability for real-time deployment. The integration of proximity matrix analysis and zonal feature clustering enhanced interpretability and robustness. The proposed framework demonstrates significant potential for scalable waste forecasting applications, enabling emission quantification and strategic decision-making. Future work includes the incorporation of real-time sensor data, temporal decomposition, and hybrid deep learning architectures to optimize waste handling and carbon mitigation strategies.
KW - Machine Learning
KW - NB
KW - RF
KW - SVM
KW - Solid Waste Management
UR - https://www.scopus.com/pages/publications/105019501767
U2 - 10.1038/s41598-025-19288-w
DO - 10.1038/s41598-025-19288-w
M3 - Article
AN - SCOPUS:105019501767
SN - 2045-2322
VL - 15
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 37021
ER -