Liang, J. orcid.org/0000-0001-9807-9949, Wei, X. orcid.org/0000-0002-6064-7290 and Summers, B. orcid.org/0000-0002-9294-0088 (Accepted: 2025) Tabular Image: a method to convert tabular data to images for convolutional neural networks. Annals of Operations Research. ISSN 0254-5330 (In Press)
Abstract
Improving the predictive capability of credit scoring models is always an active area of research in the financial sector. Recognising the impressive effectiveness of neural networks in different domains (such as computer vision and natural language processing), various neural networks have been tested to potentially improve loan default prediction on credit data. Nevertheless, a significant challenge emerges due to the predominantly tabular nature of credit data, which is not well-suited to the structure and strengths of neural networks, hindering their ability to surpass traditional machine learning models in credit scoring. To overcome the challenge, we propose a novel data transformation method called Tabular Image that converts tabular data into images to take advantage of the powerful two-dimensional convolutional neural networks that perform extremely well on images while mitigating the challenges tabular data poses to deep networks. The Tabular Image can convert tabular data into compact and resilient images compared with existing transformation methods by creatively embedding two crucial measures in credit scoring, the weight of evidence and information value, in the image. Applications to three credit scoring benchmark datasets suggest that simply training a two-dimensional convolutional neural network with Tabular Image can provide state-of-the-art predictive performance. In addition, the advantage of our proposed method’s prediction is more evident in the large dataset. Our innovative approach raises the possibility of leveraging two-dimensional convolutional neural networks in credit scoring using a proper data representation method. Furthermore, a flexible framework is provided to suit various tabular datasets in other domains.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | This is an author produced version of an article accepted for publication in the Annals of Operations Research, made available under the terms of the Creative Commons Attribution License (CC-BY), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. |
Keywords: | Risk management, Credit scoring, Deep learning, Convolutional neural networks, Tabular data |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Business (Leeds) > Analytics, Technology & Ops Department The University of Leeds > Faculty of Business (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 16 Jul 2025 12:47 |
Last Modified: | 16 Jul 2025 15:09 |
Status: | In Press |
Publisher: | Springer |
Sustainable Development Goals: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:229212 |