Credit_Scoring_System_Using_Machine_Learning
Credit_Scoring_System_Using_Machine_Learning
ARTICLEINFO ABSTRACT
Copyright © 2024 The Author(s): This is an open-access article distributed under the terms of the Creative 376
Commons Attribution 4.0 International License (CC BY-NC 4.0)
Abhishek Kumar et al Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol., May-June-2024, 10 (3) : 376-380
scoring, highlighting the transition from manual, SAS (Statistical Analysis System): SAS is very
judgment-based processes to automated, data-driven commonly used for predictive analysis , stats and data
systems. The foundation of credit scoring lies in analysis.
traditional models such as the FICO score and other R: A programming language and software environment
credit bureau-based assessments. Research by E. designed for statistical computing and graphics, which
Altman (1968) and subsequent studies by “L. Thomas is open-source.
(2000) have delved into the mechanics of traditional Python (with libraries like NumPy, Pandas, and Scikit-
credit scoring, emphasizing the significance of factors learn): Popular for machine learning and data analysis.
such as payment history, credit utilization, length of 2. Machine Learning Frameworks:
credit history, types of credit used, and new credit.” TensorFlow, developed by Google, is a machine
The literature highlights several challenges learning framework available as open-source software.
confronting current credit scoring systems. Research PyTorch, also open-source, is a popular machine
by S. Mullainathan and J. Spiess (2017) and others learning library commonly used for deep learning
discusses the inherent biases present in credit scoring applications.
models, raising concerns about fairness and Scikit-learn, which focuses on classical machine
discriminatory outcomes. learning algorithms in Python.
The lack of transparency and explainability in credit 3.DBMS :
scoring models, as explored by R. Chen and A. Rudin SQL: Important for querying and managing databases.
(2018), poses another challenge. Recent literature MySQL, PostgreSQL, Oracle: Common relational
explores the integration of emerging technologies in database management systems (RDBMS) for storing
credit scoring. Research by Y. LeCun et al. (2015) and retrieving structured data.
discusses the application of machine learning 4. Big Data Technologies:
algorithms in credit risk assessment, highlighting the Hadoop : employed for distributed storage and the
potential for increased accuracy and adaptability. processing of extensive datasets.
Blockchain technology has also been investigated for Spark : recognized for its speed and adaptability,
its potential to enhance the security and transparency functioning as a cluster computing framework tailored
of credit histories. Studies by M. Swan (2015) and for processing big data.
others discuss the implications of blockchain in 5. Cloud Computing Platforms:
mitigating fraud and providing individuals with Cloud computing platforms, such as Amazon Web
greater control over their credit information. Services (AWS), Microsoft Azure, and Google Cloud
Platform (GCP), offer a range of cloud services that
III.TOOLS AND TECHNOLOGY encompass computing power, storage solutions, and
machine learning tools.
Credit scoring systems leverage a variety of tools and 6. Data Visualization Tools:
technologies to process, analyze, and evaluate large Tableau, Power BI, and QlikView are employed to craft
volumes of financial and non-financial data. The tools interactive and visually engaging dashboards tailored
can be selected depending on the severity or for data analysis and reporting purposes.
complexity of this model, the available data sources, 7. Credit Scoring Software:
and the desired level of automation. Here are some FICO is recognized as one of the most prevalent credit
common tools and technologies used in credit scoring scoring models, providing software solutions primarily
systems: for credit risk assessment.
1. Statistical Software:
VII. REFERENCES