TY - GEN
T1 - Modelling Photovoltaic power output using Machine Learning techniques
AU - May, Siyasanga Innocent
AU - Bokoro, Pitshou
AU - Pratt, Lawrence
AU - Roro, Kittessa
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - This work focuses on modelling the power output of multiple PV technologies installed at the outdoor test facility of the Council for Scientific and Industrial Research (CSIR) in Pretoria. Extreme Gradient Boosting (XGB) and Artificial Neural Network (ANN) machine learning models are trained with historic time-series datasets (measured meteorological and PV electrical parameters) to model the historical output power of the photovoltaic (PV) system. To facilitate the training, sub-hourly measured data from January to November 2019 were averaged at hourly intervals. For testing, sub-hourly data from January 2020 to March 2020 was divided into clear, moderate, and cloudy skies monthly records. Outliers were identified and removed from the data during pre-processing. The short circuit current (Isc) and PV output have shown a very strong correlation (r2=0.93) since PV output is heavily influenced by array irradiance and current generation. PV output strongly correlated with plane of array irradiance and albedo (r2=0.83,0.69), and with module temperature (r2=0.70), depending on the module type. To quantify model accuracy, root mean squared error (RMSE) was used. ANN outperforms XGB by a wide margin based on the RMSE values. ANN produced the lowest RMSE values with 4. 1W to XGB record high 17. 5W during moderate skies. The majority of the observed maximum RMSE values came from XGB modelling. The trained models will be used to predict PV output power using only forecasted weather data as inputs in future work.
AB - This work focuses on modelling the power output of multiple PV technologies installed at the outdoor test facility of the Council for Scientific and Industrial Research (CSIR) in Pretoria. Extreme Gradient Boosting (XGB) and Artificial Neural Network (ANN) machine learning models are trained with historic time-series datasets (measured meteorological and PV electrical parameters) to model the historical output power of the photovoltaic (PV) system. To facilitate the training, sub-hourly measured data from January to November 2019 were averaged at hourly intervals. For testing, sub-hourly data from January 2020 to March 2020 was divided into clear, moderate, and cloudy skies monthly records. Outliers were identified and removed from the data during pre-processing. The short circuit current (Isc) and PV output have shown a very strong correlation (r2=0.93) since PV output is heavily influenced by array irradiance and current generation. PV output strongly correlated with plane of array irradiance and albedo (r2=0.83,0.69), and with module temperature (r2=0.70), depending on the module type. To quantify model accuracy, root mean squared error (RMSE) was used. ANN outperforms XGB by a wide margin based on the RMSE values. ANN produced the lowest RMSE values with 4. 1W to XGB record high 17. 5W during moderate skies. The majority of the observed maximum RMSE values came from XGB modelling. The trained models will be used to predict PV output power using only forecasted weather data as inputs in future work.
KW - Artificial Intelligence
KW - Artificial Neural Networks
KW - Extreme Gradient Boosting
KW - Photovoltaic
UR - http://www.scopus.com/inward/record.url?scp=85141452352&partnerID=8YFLogxK
U2 - 10.1109/PowerAfrica53997.2022.9905279
DO - 10.1109/PowerAfrica53997.2022.9905279
M3 - Conference contribution
AN - SCOPUS:85141452352
T3 - 2022 IEEE PES/IAS PowerAfrica, PowerAfrica 2022
BT - 2022 IEEE PES/IAS PowerAfrica, PowerAfrica 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE PES/IAS PowerAfrica, PowerAfrica 2022
Y2 - 22 August 2022 through 26 August 2022
ER -