Separable Shadow Hamiltonian Hybrid Monte Carlo for Bayesian Neural Network Inference in wind speed forecasting

Rendani Mbuvha, Wilson Tsakane Mongwe, Tshilidzi Marwala

Research output: Contribution to journalArticlepeer-review

18 Citations (Scopus)

Abstract

Accurate wind speed and consequently wind power forecasts form a critical enabling tool for large scale wind energy adoption. Probabilistic machine learning models such as Bayesian Neural Network (BNN) models are often preferred in the forecasting task as they facilitate estimates of predictive uncertainty and automatic relevance determination (ARD). Hybrid Monte Carlo (HMC) is widely used to perform asymptotically exact inference of the network parameters. A significant limitation to the increased adoption of HMC in inference for large scale machine learning systems is the exponential degradation of the acceptance rates and the corresponding effective sample sizes with increasing model dimensionality due to numerical integration errors. This paper presents a solution to this problem by sampling from a modified or shadow Hamiltonian that is conserved to a higher-order by the leapfrog integrator. BNNs trained using Separable Shadow Hamiltonian Hybrid Monte Carlo (S2HMC) are used to forecast one hour ahead wind speeds on the Wind Atlas for South Africa (WASA) datasets. Experimental results find that S2HMC yields higher effective sample sizes than the competing HMC. The predictive performance of S2HMC and HMC based BNNs is found to be similar. We further perform hierarchical inference for BNN parameters by combining the S2HMC sampler with Gibbs sampling of hyperparameters for relevance determination. A generalisable ARD committee framework is introduced to synthesise the various sampler ARD outputs into robust feature selections. Experimental results show that this ARD committee approach selects features of high predictive information value. Further, the results show that dimensionality reduction performed through this approach improves the sampling performance of samplers that suffer from random walk behaviour such as Metropolis–Hastings (MH).

Original languageEnglish
Article number100108
JournalEnergy and AI
Volume6
DOIs
Publication statusPublished - Dec 2021

Keywords

  • Automatic Relevance Determination
  • Bayesian Neural Networks
  • Forecasting
  • Markov Chain Monte Carlo
  • Separable Hamiltonian
  • Shadow Hybrid Monte Carlo
  • Wind power
  • Wind speed

ASJC Scopus subject areas

  • Engineering (miscellaneous)
  • General Energy
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Separable Shadow Hamiltonian Hybrid Monte Carlo for Bayesian Neural Network Inference in wind speed forecasting'. Together they form a unique fingerprint.

Cite this