Abstract
The increase in debt levels of families in different parts of the world has drawn the attention of organizations dedicated to the prevention of financial risk and has highlighted the need to develop early detection methods for over-indebtedness. In this paper, we propose a hybrid model of the adaptive neural fuzzy inference system (ANFIS) and Probit model for the prediction of household over-indebtedness. The proposed model is compared with Probit, artificial neural networks (ANN), classification and regression trees (CART), random forest (RF) and support vector machine (SVM) models. The most relevant parameters for the performance of each model are optimized, and we address data balance problems through the synthetic minority over-sampling technique (SMOTE). We use data obtained from the Financial Household Survey of the Central Bank of Chile. The results show that the proposed model performs significantly better than the reference models in terms of the correct classification of indebted individuals. Consequently, this model provides an innovative understanding of household over-indebtedness, which can be useful for different governmental entities focused on preventing excessive indebtedness and maintaining financial stability.

Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Notes
The information contained in the variables \({X}_{1},{X}_{2},{X}_{5},{X}_{6}{X}_{8}\) represent data of individuals designated as heads of household. More details about the data and variables used can be found in Appendix 1.
References
Altman EI, Marco G, Varetto F (1994) Corporate distress diagnosis: comparisons using linear discriminant analysis and neural networks (the Italian experience). J Bank Finance 18(3):505–529. https://doi.org/10.1016/0378-4266(94)90007-8
Anderloni L, Vandone D (2008) Households over-indebtedness in the economic literature. Universit’a Degli Studi Di Milano Working Paper, 46, 775
Angel S, Heitzmann K (2015) Over-indebtedness in Europe: the relevance of country-level variables for the over-indebtedness of private households. J Eur Soc Policy 25(3):331–351. https://doi.org/10.1177/0958928715588711
Athey S, Imbens GW (2019) Machine learning methods that economists should know about. Annual Rev Econom 11:685–725
Azayite FZ, Achchab S (2016) Hybrid discriminant neural networks for bankruptcy prediction and risk scoring. Procedia Comput Sci 83:670–674. https://doi.org/10.1016/J.PROCS.2016.04.149
Bahrammirzaee A (2010) A comparative survey of artificial intelligence applications in finance: artificial neural networks, expert system and hybrid intelligent systems. Neur Comput Appl 19(8):1165–1195
Betti G, Dourmashkin N, Rossi M, Ping Yin Y (2007) Consumer over-indebtedness in the EU: measurement and characteristics. J Econ Stud 34(2):136–156. https://doi.org/10.1108/01443580710745371
Chandra DK, Ravi V, Bose I (2009) Failure prediction of dotcom companies using hybrid intelligent techniques. Expert Syst Appl 36(3):4830–4837. https://doi.org/10.1016/J.ESWA.2008.05.047
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357
Charpentier A, Flachaire E, Ly A (2018) Econometrics and machine learning. Econ Stat 505(1):147–169
D’Alessio G, Iezzi S (2013) Household over-indebtedness: definition and measurement with Italian data. accessed from http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.642.1533&rep=rep1&type=pdf#page=502
Fedorova E, Gilenko E, Dovzhenko S (2013) Bankruptcy prediction for Russian companies: application of combined classifiers. Expert Syst Appl 40(18):7285–7293. https://doi.org/10.1016/J.ESWA.2013.07.032
Gathergood J (2012) Self-control, financial literacy and consumer over-indebtedness. J Econ Psychol 33(3):590–602
Gogas P, Papadimitriou T (2021) Machine Learning in Economics and Finance. Computat Econ. https://doi.org/10.1007/s10614-021-10094-w
Gutierrez P, Gérardy JY (2017) Causal inference and uplift modelling: A review of the literature. In International conference on predictive applications and APIs (pp. 1–13). PMLR
Henrique BM, Sobreiro VA, Kimura H (2019) Literature review: Machine learning techniques applied to financial market prediction. Expert Syst Appl 124:226–251
Hojman DA, Miranda Á, Ruiz-Tagle J (2016) Debt trajectories and mental health. Soc Sci Med 167:54–62
Hsu MW, Lessmann S, Sung MC, Ma T, Johnson JE (2016) Bridging the divide in financial market forecasting: machine learners vs. financial economists. Expert Syst Appl 61:215–234
Khairalla M, AL-Jallad NT (2017) Hybrid forecasting scheme for financial time-series data using neural network and statistical methods. Int J Adv Comput Sci Appl 8(9):319–327
Ladas A, Garibaldi J, Scarpel R, Aickelin U (2104) Augmented neural networks for modelling consumer indebtedness, 2104 International joint conference on neural networks
Lea SEG, Webley P, Walker CM (1995) Psychological factors in consumer debt: money management, economic socialization, and credit use. J Econ Psychol 16(4):681–701. https://doi.org/10.1016/0167-4870(95)00013-4
Melnychenko O (2020) Is artificial intelligence ready to assess an enterprise’s financial security? J Risk Finan Manage 13(9):191
Mi Y (2013) Imbalanced classification based on active learning SMOTE. Resear J Appl Sci Eng Tech 5:944–949
Min JH, Lee Y-C (2005) Bankruptcy prediction using support vector machine with optimal choice of kernel function parameters. Expert Syst Appl 28(4):603–614. https://doi.org/10.1016/J.ESWA.2004.12.008
Mselmi N, Lahiani A, Hamza T (2017) Financial distress prediction: The case of French small and medium-sized firms. Int Rev Finance Anal 50:67–80. https://doi.org/10.1016/J.IRFA.2017.02.004
Mullainathan S, Spiess J (2017) Machine learning: an applied econometric approach. J Econ Perspect 31(2):87–106
Ntsalaze L, Ikhide S (2016) Household over-indebtedness: understanding its extent and characteristics of those affected. J Social Scien 48(1–2):79–93. https://doi.org/10.1080/09718923.2016.11893573
Peng Y, Wang G, Kou G, Shi Y (2011) An empirical study of classification algorithm evaluation for financial risk prediction. Appl Soft Comput 11(2):2906–2915. https://doi.org/10.1016/J.ASOC.2010.11.028
Ruiz-Tagle J, García L, Miranda A (2013) Proceso de endeudamiento y sobre endeudamiento de los hogares en chile. Banco Central de Chile, Documento de trabajo, 703
Sarle WS (1997) Neural Network FAQ, periodic posting to the Usenet newsgroup comp. ai. neural-nets. URL:Ftp://Ftp.Sas.Com/Pub/Neural/FAQ.Html.
Stone B, Maury RV (2006) Indicators of personal financial debt using a multi-disciplinary behavioral model. J Econ Psychol 27(4):543–556. https://doi.org/10.1016/J.JOEP.2005.11.002
Syrgkanis V, Lewis G, Oprescu M, Hei M, Battocchi K, Dillon E, Lee JY (2021) Causal inference and machine learning in practice with econml and causalml: Industrial use cases at microsoft, tripadvisor, uber. In Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining (pp. 4072–4073)
Tkáč M, Verner R (2016) Artificial neural networks in business: Two decades of research. Appl Soft Comput 38:788–804. https://doi.org/10.1016/J.ASOC.2015.09.040
Varian HR (2014) Big data: New tricks for econometrics. J Econ Perspect 28(2):3–28
Veganzones D, Séverin E (2018) An investigation of bankruptcy prediction in imbalanced datasets. Decis Support Syst 112:111–124. https://doi.org/10.1016/J.DSS.2018.06.011
Zelenkov Y, Volodarskiy N (2021) Bankruptcy prediction on the base of the unbalanced data using multi-objective selection of classifiers. Expert Syst Appl 185:115559
Zhang G, Hu YM, Eddy Patuwo B, Indro DC (1999) Artificial neural networks in bankruptcy prediction: general framework and cross-validation analysis. Eur J Oper Res 116(1):16–32. https://doi.org/10.1016/S0377-2217(98)00051-4
Zoričák M, Gnip P, Drotár P, Gazda V (2020) Bankruptcy prediction for small-and medium-sized companies using severely imbalanced datasets. Econ Model 84:165–176
Acknowledgements
The authors are grateful for the financial support of the General Directorate for Research, Innovation, and Postgraduate Studies (DGIIP) of UTFSM Chile through the "Program of Incentives for Scientific Initiation" (PIIC).
Funding
The article was funded by Universidad Técnica Federico Santa María, PIIC, Nicole Astudillo
Author information
Authors and Affiliations
Contributions
Werner Kristjanpoller: Conceptualization, Methodology, Software, Methodology, Supervision, Data curation, Writing- Reviewing and Editing, Formal analysis, Validation. Nicole Astudillo: Conceptualization, Methodology, Software, Data curation, Writing Original draft preparation, Formal analysis. Josephine Olson: Methodology, Supervision, Writing- Reviewing and Editing, Formal analysis, Validation.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This paper does not contain any studies with human participants or animals performed by any of the authors.
Informed consent
For this type of study formal consent is not required.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix 1
1.1 Variable description
The following table describes the variables used in this study to describe household financial over-indebtedness for the developed models.
Variables | Description |
---|---|
\({X}_{1}\) Gender | Dichotomous variable that describes the gender of the head of household where:1: Woman / 0: Man |
\({X}_{2}\) Age | Integer variable that indicates the age of the head of household |
\({X}_{3}\) Savings | Continuous variable that describes the total declared amount of savings recorded by the household in the last 12 months. Expressed in millions of Chilean pesos |
\({X}_{4}\) Workers | Integer variable that describes the number of members of the household that are working |
\({X}_{5}\) Card | Dichotomous variable that indicates if any members of the household have a credit card with:1: Has a card/ 0: Does not have a card |
\({X}_{6}\) Marital status | Dichotomous variable that represents the marital status of the head of household with:1: Married or partner / 0: Other |
\({X}_{7}\) Income | Continuous variable that indicates the total amount of income that the household receives monthly. Expressed in millions of Chilean pesos |
\({X}_{8}\) Educational level | Integer variable that represents the years of schooling of the head of household. Chile's educational system has an average of 8 years for primary education, 4 years for secondary education, and 6 for higher education. The survey also considers postgraduate studies or second professions |
\({X}_{9}\) Mortgage burden | Continuous variable that indicates the total amount that the household pays monthly for mortgage debt. Expressed in millions of Chilean pesos |
\({X}_{10}\) Educational debt | Continuous variable which indicates the total amount owed by the household for the education of its members. Expressed in millions of Chilean pesos |
Appendix 2
1.1 Performance measurement
To evaluate and compare all the models, we use the confusion matrices obtained from each evaluated model and calculate the accuracy metrics (ACC), positive (\(A{R}_{P}\)) and negative \((A{R}_{N})\) class accuracy and area under curve (AUC). These are defined in Table
6 as follows:
In order to supplement the evaluation of over-indebtedness model performance, we consider the presence and significance of misclassification. Given a null hypothesis (over-indebted household), each time this hypothesis is rejected when it is true, a type I error is committed. On the other hand, when the null hypothesis is false (household is not over-indebted) and it is rejected, a type II error is committed. For the over-indebtedness models developed in this article, the relevance of a type I error is much greater than a type II error, since it impairs the usefulness and general applicability of the model. For this reason, the key performance measurement is the AUC.
Rights and permissions
About this article
Cite this article
Kristjanpoller, W., Astudillo, N. & Olson, J.E. An empirical application of a hybrid ANFIS model to predict household over-indebtedness. Neural Comput & Applic 34, 17343–17353 (2022). https://doi.org/10.1007/s00521-022-07389-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-022-07389-w