Twin-Delayed Deep Deterministic Policy Gradient Algorithm for Portfolio Selection

Nicholas Baard, Terence L. Van Zyl

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

State-of-the-art RL algorithms have shown suboptimal performance in some market conditions with regard to the portfolio selection problem. The reason for suboptimal performance could be due to overestimation bias in actor-critic methods through the use of neural networks as the function approximator. The resulting bias leads to a suboptimal policy being learned by the agent, hindering performance. This research focuses on using the Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithm for portfolio selection to achieve greater results than previously achieved. In addition, an analysis of the overall effectiveness of the algorithm in various market conditions is needed to determine the TD3's robustness. This research establishes a RL environment for portfolio selection and trains the TD3 alongside three state-of-the-art algorithms in five different market conditions. The algorithms are tested by allowing the agent to manage a portfolio in each market for a specified period. The results are used for the analysis of the algorithms. The research shows improved results achieved by the TD3 algorithm for portfolio selection compared to other state-of-the-art algorithms. Furthermore, the performance of the TD3 across the five selected markets proves the robustness of the algorithm in its use for the portfolio selection problem.

Original languageEnglish
Title of host publication2022 IEEE Symposium on Computational Intelligence for Financial Engineering and Economics, CIFEr 2022 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665442343
DOIs
Publication statusPublished - 2022
Event2022 IEEE Symposium on Computational Intelligence for Financial Engineering and Economics, CIFEr 2022 - Virtual, Helsinki, Finland
Duration: 4 May 20225 May 2022

Publication series

Name2022 IEEE Symposium on Computational Intelligence for Financial Engineering and Economics, CIFEr 2022 - Proceedings

Conference

Conference2022 IEEE Symposium on Computational Intelligence for Financial Engineering and Economics, CIFEr 2022
Country/TerritoryFinland
CityVirtual, Helsinki
Period4/05/225/05/22

Keywords

  • DDPG
  • Portfolio Selection
  • Reinforcement Learning
  • TD3

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Information Systems and Management
  • Economics and Econometrics
  • Finance
  • Computational Mathematics
  • Control and Optimization
  • Modeling and Simulation

Fingerprint

Dive into the research topics of 'Twin-Delayed Deep Deterministic Policy Gradient Algorithm for Portfolio Selection'. Together they form a unique fingerprint.

Cite this