Abstract
The credit scoring is a statistical analysis performed by financial institutions to represent the creditworthiness of an individual or small and medium-sized enterprise. A credit score is a numerical quantity that can be qualified by a rating label showing the potential risk. In order to determine which customers are likely to bring in the most revenue at the exact interest rate and credit limits, the lenders take into accounts these credit scores. In this context, the robust statistical models are inevitable to reduce the number of wrong decisions in the credit evaluation process. Although the machine learning approaches provide superior performance to conventional statistical methods, they are mostly criticized due to the selection of model structure, model complexity, tuning parameters, time consumption in the high-dimensional and excessive nonlinear cases. For this reason, this study introduces an efficient model estimation and feature selection procedure for artificial neural network (ANN) classifiers in the context of credit scoring. Essentially, this procedure hybridizes training of ANNs with a novel feature selection approach based on genetic algorithms and information complexity criterion. In the application, the proposed procedure was performed on a couple of benchmark credit scoring datasets. According to analysis results, the proposed approach not only estimates robust models from ANNs in terms of model complexity, feature selection, and time consumption, but also outperforms the traditional training procedure for the classification accuracies, false positive, and false negative errors overtraining and test datasets.