Incremental Machine Learning-Based Approach for Credit Scoring in the Age of Big Data

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The determination of the financial credibility of a loan applicant by financial institutions is quantified using a credit score. Sources of credit, such as banks and financial institutions, play a crucial role in sustaining economies and keeping cash flowing in the market. Financial institutions solve the problem of lack of data in credit scoring by extracting customer information from data sources such as social networks. Such data sources store data in large quantities. Traditional data mining techniques fail to accurately distinguish between a credit-worthy applicant and a non-creditworthy applicant using big data. The problem of big data has necessitated the advent of machine learning algorithms capable of sifting through large volumes of credit data sourced from social networks. Recently, to automate, streamline and digitise business processes such as credit scoring, machine learning approaches have been widely used, but the design and deployment of effective and robust credit scoring models require a lot of time, and if the behaviour of customers changes or the customer variables drift over time, the credit score model becomes obsolete or outdated. As a result, credit scoring tasks should be considered as an ephemeral scenario due to big data, as variables tend to drift over time. Incremental and adaptive credit scoring models can help to mitigate the loss of time of re-creating credit models due to drifting variables, big data challenges and changes in customer behaviour. This necessitates the design of robust and effective credit score models capable of learning incrementally, adaptive and able to detect changes. This paper proposes the Incremental Adaptive and Heterogeneous ensemble (IAHE) credit scoring model capable of learning incrementally, adapt to drifting variables and detect changes in customer behaviour and learn big data in a streaming fashion. Empirical experiments conducted indicate that IAHE has the strongest ability to recognise default samples and demonstrated the best generalisation ability on the datasets and the same time maintained a strong interpretability of the results when compared to nine credit scoring models on four public datasets. The superior generalisation performance of IAHE is statistically significant and demonstrated excellent robustness and adaptation to drifting variables.

Original languageEnglish
Title of host publicationTowards Digitally Transforming Accounting and Business Processes - Proceedings of the International Conference of Accounting and Business iCAB, Johannesburg 2023
EditorsTankiso Moloi, Babu George
PublisherSpringer Nature
Pages547-565
Number of pages19
ISBN (Print)9783031461767
DOIs
Publication statusPublished - 2024
EventInternational Conference of Accounting and Business, iCAB 2023 - Johannesburg, South Africa
Duration: 29 Jun 202330 Jun 2023

Publication series

NameSpringer Proceedings in Business and Economics
ISSN (Print)2198-7246
ISSN (Electronic)2198-7254

Conference

ConferenceInternational Conference of Accounting and Business, iCAB 2023
Country/TerritorySouth Africa
CityJohannesburg
Period29/06/2330/06/23

Keywords

  • Credit scoring
  • Ensemble selection
  • Incremental learning
  • Machine learning

ASJC Scopus subject areas

  • General Business,Management and Accounting
  • General Economics,Econometrics and Finance

Fingerprint

Dive into the research topics of 'Incremental Machine Learning-Based Approach for Credit Scoring in the Age of Big Data'. Together they form a unique fingerprint.

Cite this