Upi Fraud Detection Using Machine Learning
Upi Fraud Detection Using Machine Learning
ISSN (P):2349-3968, ISSN (O): 2349-3976 Volume XI - Issue III, March - 2024
Abstract : The project focuses on the development of a machine learning model that can analyze UPI transaction data
in real-time to identify fraudulent activities. The primary objective is to create a system that enhances the security of
UPI transactions and reduces financial losses due to fraud. With the advancement of technology, today most of the
modern commerce is relying upon the online banking and cashless payments. Due to adaption of online payment
among businesses, the fraud cases are also increasing which cause financial losses to them. Fraudsters are inventing
new techniques to perform fraudulent transaction which seem legitimate. Hence, there is an urgent need to develop
fraud detection measures which can deal with these fraudsters on real time basis. Deep learning techniques have the
capability to detect these fraudulent transactions efficiently and has a huge scope in fraud detection. However, there are
many challenges faced by the researchers in online transactions fraud detection because the datasets are not publicly
available due to privacy issue of the financial institutions as customers data is sensitive and it can be misused and the
datasets which are available are imbalanced. This paper presents a review of deep learning techniques used for online
transactions fraud detection. It also provides the information about datasets used by the researchers and the results
achieved by them in their research work.
I. Introduction
patterns or anomalies. Although these systems have
The emergence of online banking has caused a
shown some degree of effectiveness, they have
paradigm shift in the current financial transaction
shortcomings in terms of accuracy, scalability, and
landscape, providing individuals and organisations
adaptability, especially when it comes to sophisticated
globally with unmatched ease and efficiency. But there
and ever-evolving fraud techniques.
are drawbacks to this digital transformation as well,
the most significant of which is the growing frequency A prevalent obstacle within the current framework
of fraudulent activity in online transactions. The swift is its dependence on static rules, which may not be
expansion of online banking services has made it able to identify subtle or evolving patterns that point to
easier for bad actors to take advantage of weaknesses, fraudulent activity. Furthermore, the rule-based
which puts the security and integrity of financial approach frequently finds it difficult to manage the
systems at serious risk. Furthermore, the COVID-19 subtleties and intrinsic complexity of transaction data,
pandemic's start has acted as a trigger, quickening the particularly given the ever-changing environment of
shift to remote operations and raising the possibility of online banking transactions. The current system's
fraudulent activity in the digital sphere. It is therefore vulnerability to false positives and false negatives is
more important than ever to create reliable fraud another drawback. Static rules have the potential to
detection systems in the face of the epidemic highlight unintentionally identify genuine transactions as
how important it is for both customers and financial fraudulent (false positives) or fail to identify actual
institutions to strengthen their defences against fraud. fraudulent activity (false negatives), which can result
The increasing trend of financial transactions occurring in inefficiencies, unhappy customers, and financial
on digital platforms underscores the need for advanced losses for financial institutions as well as consumers.
security measures and flexible approaches to protect Furthermore, particularly in the context of the Unified
the integrity of online banking systems. Payments Interface (UPI), the current systems would
find it difficult to handle the enormous volume and
Exiting System
variety of transaction data generated in real-time. The
Regarding the field of online banking fraud scalability and computing efficiency needed to
detection, the systems that are now in place primarily efficiently process and analyse large-scale transaction
depend on conventional techniques and rule-based datasets may be lacking in traditional methodologies.
methods. In order to identify potentially fraudulent
transactions, these systems frequently use static rules
and thresholds, usually based on established
33
International Journal On Engineering Technology and Sciences – IJETS
ISSN (P):2349-3968, ISSN (O): 2349-3976 Volume XI - Issue III, March - 2024
34
International Journal On Engineering Technology and Sciences – IJETS
ISSN (P):2349-3968, ISSN (O): 2349-3976 Volume XI - Issue III, March - 2024
with varying precision scores. Particularly noteworthy treatment of missing data played a crucial role in
is Logistic Regression with L1 regularization (LR1), getting the dataset ready for efficient model training.
which stands out by achieving the best overall The selection of algorithms was an important factor
accuracy among all algorithms. This remarkable in guaranteeing the effectiveness of the system in
accuracy underscores LR1's effectiveness in correctly detecting fraudulent actions. We made sure our system
identifying fraud cases with minimal false positives. could detect a wide range of fraudulent patterns by
Such precision is of paramount importance in fraud implementing a diverse ensemble of machine learning
detection, where minimizing false positives is crucial algorithms, from more sophisticated techniques like
to avoid inconveniencing legitimate users. These convolutional neural networks to more conventional
results provide actionable insights for refining fraud methods like logistic regression and decision trees.
detection strategies in real-world applications. For With the contributions of each algorithm, a
instance, the outstanding precision achieved by LR1, comprehensive fraud detection system that could
along with the perfect precision scores attained by handle a variety of fraud scenarios was created.
Decision Tree, Naive Bayes, and KNN models, The system architecture was crucial in
suggests their suitability for deployment in scenarios guaranteeing scalability, efficiency, and usability
where false positives must be minimized. On the other during the development and deployment stages. By
hand, while FNN and CNN models demonstrate high using technologies like Flask, HTML, CSS, Python,
accuracy, further optimization or consideration of and other programming languages, we were able to
additional factors may be necessary to address smoothly integrate the fraud detection system into the
potential challenges with false positives. current financial infrastructure, increasing its usability
Moreover, in the context of user interface design, and accessibility for both stakeholders and end users.
presenting these insights in a comprehensible manner The seamless deployment and functioning of the
becomes essential. A user-friendly interface should system in real-world settings were made possible by
provide clear visualizations and summaries of the this harmonious technology integration.
model performance, emphasizing the trade-offs The fraud detection system is a major improvement
between accuracy and precision. This aids decision- in protecting financial transactions from fraudulent
makers in selecting models aligned with the specific activity, as we can see when we consider the results of
requirements of the application. Additionally, our work. It is crucial to preserving the integrity and
incorporating user feedback mechanisms into the reliability of financial systems because of its capacity to
interface can enhance the adaptability of the system, precisely detect fraudulent transactions while reducing
allowing for iterative improvements based on real- false positives. But our adventure doesn't end here.
world usage and evolving fraud patterns. Overall, the Maintaining protection and security in the ever-
integration of insightful model evaluations and a user- changing world of financial transactions will require
friendly interface forms a cohesive strategy for constant system improvement and refinement to meet
refining and deploying effective fraud detection changing fraud trends and new security risks. We are
systems in practical applications. prepared to meet upcoming challenges and protect the
integrity of financial systems around the globe with a
IV. Conclusion dedication to innovation and vigilance.
We have gone through several phases of data
collection, pre-processing, algorithm selection, and REFERENCES
system implementation in this extensive effort to create
1. ALESKEROV E, FREISLEBEN, B., and, RAO B (1997)
a strong fraud detection system. The end result is a CARDWATCH: A neural network-based database mining
solution that has the potential to greatly improve the system for credit card fraud detection. In Conference (pp.
security and dependability of financial transactions. We 220–226). IEEE, Piscataway, NJ
started our trip by obtaining a large and comprehensive
dataset that included complex transactional 2. Sahin M (2017) Understanding Telephony Fraud as an
Essential Step to Better Fight it [Thesis]. École Doctorale
information, which served as the foundation for our
Informatique, Télécommunication et Électronique, Paris
later investigations.
Setting up a strong basis for later model training 3. Abdallah A, Maarof MA, Zainal A (2016) Fraud detection
required the first stage of data pre-processing. We system: A survey. J Netw Comput Appl 68:90–113
carefully addressed issues, including the dataset's
4. ANDREWS PP, PETERSON MB (eds) (1990) Criminal
unequal class distribution, using calculated pre-
Intelligence Analysis. Palmer Enterprises, Loomis, CA
processing techniques to reduce biases and guarantee
the stability of our models. Methods like 5. ARTÍS M, AyUSO M, GUILLÉN M (1999) Modeling
standardization based feature scaling and careful different types of automobile insurance fraud behavior in the
Spanish market. Insurance Math Econ 24:67–81
35
International Journal On Engineering Technology and Sciences – IJETS
ISSN (P):2349-3968, ISSN (O): 2349-3976 Volume XI - Issue III, March - 2024
36