2602.19663

Total: 1

#1 The impact of class imbalance in logistic regression models for low-default portfolios in credit risk

Authors: Willem D. Schutte, Charl Pretorius, Neill Smit, Leandra van der Merwe, Robert Maxwell

In this paper, we study how class imbalance, typical of low-default credit portfolios, affects the performance of logistic regression models. Using a simulation study with controlled data-generating mechanisms, we vary (i) the level of class imbalance and (ii) the strength of association between the predictors and the response. The results show that, for a given strength of association, achievable classification accuracy deteriorates markedly as the event rate decreases, and the optimal classification cut-off shifts with the level of imbalance. In contrast, the Gini coefficient is comparatively stable with respect to class imbalance once sample sizes are sufficiently large, even when classification accuracy is strongly affected. As a practical guideline, we summarise attainable classification performance as a function of the event rate and strength of association between the predictors and the response.
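The simulation design described above can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' setup: a single standard-normal predictor, the intercept used as the knob for the event rate, the slope used as the strength of association, plain accuracy as the classification measure, and the helper names `simulate`, `fit_logistic`, `gini`, and `accuracy`. The Gini coefficient is computed as 2·AUC − 1 from the rank-sum form of the AUC.

```python
import math
import random


def simulate(n, intercept, slope, rng):
    """Draw x ~ N(0, 1) and y ~ Bernoulli(sigmoid(intercept + slope * x))."""
    xs = [rng.gauss(0.0, 1.0) for _ in range(n)]
    ys = [1 if rng.random() < 1.0 / (1.0 + math.exp(-(intercept + slope * x))) else 0
          for x in xs]
    return xs, ys


def fit_logistic(xs, ys, iters=25):
    """Fit a one-predictor logistic regression by Newton-Raphson (illustrative only)."""
    rate = sum(ys) / len(ys)
    b0, b1 = math.log(rate / (1.0 - rate)), 0.0  # start at the intercept-only fit
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for x, y in zip(xs, ys):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))
            w = p * (1.0 - p)
            g0 += y - p
            g1 += (y - p) * x
            h00 += w
            h01 += w * x
            h11 += w * x * x
        det = h00 * h11 - h01 * h01
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1


def gini(scores, ys):
    """Gini = 2 * AUC - 1, with AUC from the rank sum (assumes no tied scores)."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    rank_sum = sum(rank for rank, i in enumerate(order, start=1) if ys[i] == 1)
    n_pos = sum(ys)
    n_neg = len(ys) - n_pos
    auc = (rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)
    return 2.0 * auc - 1.0


def accuracy(probs, ys, cutoff):
    """Share of observations classified correctly when predicting 1 if p >= cutoff."""
    return sum((p >= cutoff) == (y == 1) for p, y in zip(probs, ys)) / len(ys)


if __name__ == "__main__":
    rng = random.Random(0)
    slope = 1.0  # assumed strength of association
    for intercept in (-1.0, -3.0, -5.0):  # lower intercept -> lower event rate
        xs, ys = simulate(20000, intercept, slope, rng)
        b0, b1 = fit_logistic(xs, ys)
        probs = [1.0 / (1.0 + math.exp(-(b0 + b1 * x))) for x in xs]
        cutoffs = [c / 20 for c in range(1, 20)]
        best = max(cutoffs, key=lambda c: accuracy(probs, ys, c))
        print(f"event rate={sum(ys) / len(ys):.4f}  gini={gini(probs, ys):.3f}  "
              f"best cut-off={best:.2f}  accuracy={accuracy(probs, ys, best):.4f}")
```

Running the script prints, for each assumed intercept, the realised event rate, the in-sample Gini, and the accuracy-maximising cut-off on a coarse grid, which is enough to compare how the two measures respond as the event rate falls. The abstract does not state which classification measure the paper uses, so plain accuracy here is a stand-in.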

Subjects: Risk Management, Computation

Published: 2026-02-23 10:10:46 UTC