TY - JOUR
T1 - Prediction of the monthly river water level by using ensemble decomposition modeling
AU - Pande, Chaitanya Baliram
AU - Sidek, Lariyah Mohd
AU - Halder, Bijay
AU - Katipoğlu, Okan Mert
AU - Rajput, Jitendra
AU - Alshehri, Fahad
AU - Chakrabortty, Rabin
AU - Pal, Subodh Chandra
AU - Dom, Norlida Mohd
AU - Scholz, Miklas
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - The decomposition, artificial intelligence (AI) and machine learning (ML) modeling have been important role in hydrological and river basin related prediction and forecasting to help the flood management and sustainable water resources development. In this paper, developed the hybrid modeling combined with complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), along with standalone models support vector machine (SVM-linear), and Random Forest (RF), Random Subspace (RS) for accurate prediction of monthly river water level in Sg Muar at Buloh Kasap, Johor station during 2014 to 2023. In this paper, two combinations of variables such as Lags and IMFS are used for development of different models for river water level prediction. Hence, these models are compared and measured the performance of models based on the various statistics metrics. Therefore hybrid models performance is measured based on the coefficient of determination (R2), hence all models results are shown the CEEMDAN-SVM-LINEAR (R2 = 0.87), CEEMDAN-SVM-RBF (R2 = 0.91), CEEMDAN-RF (R2 = 0.98), and CEEMDAN-RS (R2 = 0.88) in the second combination variables, while standalone models performance are shown SVM-Linear (R2 = 0.84), SVM-RBF (R2 = 0.87), RF (R2 = 0.97), and RS (R2 = 0.86) during the training phase stage in the first combination variables. Similarly, in the testing phase, the best two models performances are very well as a CEEMDAN-RF (R2:0.94) and CEEMDAN-RS (R2:0.90) in second combination variables, and the first combination variables based SVM- Linear (R2:0.93) and RF (R2:0.89) models are performance higher compared with other models. Finally, the CEEMDAN-RF hybrid model is best model based on the lowest observed errors of Root mean square error (RMSE): 0.13, Mean square error (MSE): 0.02 and high R2: 0.94, hence this model is appropriate for prediction of river water level. Hence, the best hybrid model has been concluded that the CEEMDAN data decomposition technique is very useful for improve performance of the prediction model, the complex river water level predictions by separating the data sets into various sub-frequencies, allowing a better understanding of trends, seasonality and fluctuations in the data. Therefore, the CEEMDAN based novel hybrid modeling is effective decomposition modeling for complex field utilized in the sustainable and optimized utilization of the water resources for sustainable development goal (SDG).
AB - The decomposition, artificial intelligence (AI) and machine learning (ML) modeling have been important role in hydrological and river basin related prediction and forecasting to help the flood management and sustainable water resources development. In this paper, developed the hybrid modeling combined with complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), along with standalone models support vector machine (SVM-linear), and Random Forest (RF), Random Subspace (RS) for accurate prediction of monthly river water level in Sg Muar at Buloh Kasap, Johor station during 2014 to 2023. In this paper, two combinations of variables such as Lags and IMFS are used for development of different models for river water level prediction. Hence, these models are compared and measured the performance of models based on the various statistics metrics. Therefore hybrid models performance is measured based on the coefficient of determination (R2), hence all models results are shown the CEEMDAN-SVM-LINEAR (R2 = 0.87), CEEMDAN-SVM-RBF (R2 = 0.91), CEEMDAN-RF (R2 = 0.98), and CEEMDAN-RS (R2 = 0.88) in the second combination variables, while standalone models performance are shown SVM-Linear (R2 = 0.84), SVM-RBF (R2 = 0.87), RF (R2 = 0.97), and RS (R2 = 0.86) during the training phase stage in the first combination variables. Similarly, in the testing phase, the best two models performances are very well as a CEEMDAN-RF (R2:0.94) and CEEMDAN-RS (R2:0.90) in second combination variables, and the first combination variables based SVM- Linear (R2:0.93) and RF (R2:0.89) models are performance higher compared with other models. Finally, the CEEMDAN-RF hybrid model is best model based on the lowest observed errors of Root mean square error (RMSE): 0.13, Mean square error (MSE): 0.02 and high R2: 0.94, hence this model is appropriate for prediction of river water level. Hence, the best hybrid model has been concluded that the CEEMDAN data decomposition technique is very useful for improve performance of the prediction model, the complex river water level predictions by separating the data sets into various sub-frequencies, allowing a better understanding of trends, seasonality and fluctuations in the data. Therefore, the CEEMDAN based novel hybrid modeling is effective decomposition modeling for complex field utilized in the sustainable and optimized utilization of the water resources for sustainable development goal (SDG).
KW - Artificial intelligence
KW - Data decomposition
KW - Energy
KW - Muar river
KW - Performance assessment
KW - River water level
KW - Sustainable development goal
UR - https://www.scopus.com/pages/publications/105011361494
U2 - 10.1038/s41598-025-10893-3
DO - 10.1038/s41598-025-10893-3
M3 - Article
AN - SCOPUS:105011361494
SN - 2045-2322
VL - 15
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 26895
ER -