Research Paper
Research Paper
Abstract
This paper introduces a novel approach for automated expense tracking and forecasting through the
extraction and analysis of SMS-based financial data. Traditional expense management tools often rely
on manual data input, making tracking time-consuming and prone to errors. Our proposed system
automatically captures and categorizes transaction details from SMS notifications sent by financial
institutions, ensuring accurate and real-time expense recording without user intervention. Utilizing
natural language processing (NLP) techniques, the system identifies relevant transaction information,
filters extraneous content, and securely stores data for analysis. In addition, we implement a
predictive model capable of forecasting weekly expenditures based on previous spending patterns,
providing users with actionable financial insights and enhancing their budgeting capabilities. Privacy
and data security are prioritized, with all processing conducted locally on the device to maintain user
confidentiality. Experimental results demonstrate the model’s high accuracy in both categorization
and spending prediction, underscoring its potential as a reliable, privacy-centered solution for
automated financial management. This approach could offer significant value in emerging markets,
where SMS remains a primary means of transactional communication, and supports a growing need
for accessible personal finance tools that leverage existing, accessible data sources
Keywords:
Automated Expense Tracking, SMS Data, Extraction, Predictive Model ,Financial Forecasting,
Privacy-Centered Solution
Introduction: With the increasing ubiquity of smartphones,
nearly everyone can benefit from such a
solution, which promises both simplicity and
In the contemporary era, managing personal efficiency in managing personal finances.
finances has become increasingly difficult due Unlike traditional apps that require users to
to the complexities of tracking various income input every transaction manually or sync with
sources, expenditures, and investments across bank accounts, an SMS-based solution
multiple platforms. Traditional methods of automatically collects transaction information
managing finances—such as manual tracking, and integrates it into a real-time database,
paper-based systems, or even generic financial drastically reducing human effort and
management apps—often fail to offer the minimizing the chances of errors in financial
necessary flexibility, real-time data, and tracking. Moreover, this data can be processed
predictive capabilities to help users effectively in real-time to offer insights into spending
control their financial situations. As financial patterns and trends, making it a potent tool
transactions have moved into the digital for users looking to gain a deeper
realm, particularly with the advent of mobile understanding of their finances.
banking and online payment systems, a new
wave of solutions has emerged that leverages
In addition to the data collection aspect, the
technology to automate the tedious aspects
potential for predictive analytics within such a
of financial management.
system is vast. By integrating machine learning
techniques, particularly time-series
Among the most promising innovations is the forecasting, users can receive predictions
use of Short Message Service (SMS) messages about their future expenditure based on past
as a means of extracting financial transaction transaction data. These predictions could be
data. Many financial institutions and service useful for budget planning, identifying trends,
providers now send transaction alerts via SMS or offering suggestions to avoid financial
to their customers, which contain vital shortfalls. Machine learning models, such as
information, including the amount spent, the regression analysis and neural networks, are
merchant's name, and the date and time of well-suited for analyzing past financial
the transaction. This automated transmission behaviors and predicting future spending
of financial information offers a unique patterns with reasonable accuracy, enabling
opportunity to develop a system that can users to make more informed decisions about
seamlessly collect, categorize, and analyze their financial future.
expenditure data without requiring users to
manually input each transaction.
This research paper delves into the to identify key transaction details, such as
development of a system that extracts SMS- amount, merchant, and date, ensuring that the
based financial transaction data and applies extracted data is accurate and reliable.
machine learning algorithms to predict and 2. Data Categorization: To categorize the extracted
manage weekly or monthly expenditures. The transactions into meaningful financial
proposed system will enable users to categories (e.g., groceries, entertainment,
seamlessly track their expenses, forecast utilities). This categorization process will assist
future financial behavior, and make informed in providing users with a clear breakdown of
financial decisions based on real-time data. their spending habits, enabling them to make
The system’s automation of both data informed decisions about their financial
extraction and prediction will reduce the priorities.
burden of manual financial tracking while 3. Expense Prediction: To integrate machine
providing users with actionable insights into learning models that use historical transaction
their spending habits. data to forecast future spending patterns. By
applying time-series forecasting techniques, the
As mobile phones continue to play an system will predict the user’s future
increasingly significant role in everyday life, expenditure, which can help in budget planning
the ability to leverage SMS data for financial and financial decision-making.
tracking and predictive analysis represents an 4. User-Friendly Interface: To design an intuitive,
exciting frontier in personal finance user-friendly interface that displays the
management. This research seeks to extracted data and predictions clearly, providing
demonstrate how automation and machine users with insights into their spending behavior.
learning can be integrated to enhance the This interface should allow for easy interaction
user experience and offer a more proactive with the system and facilitate quick access to
approach to personal finance. By automating financial summaries and forecasts.
the extraction of transactional data and 5. System Performance Evaluation: To evaluate the
providing accurate predictions, this system system’s performance in terms of data
aims to bridge the gap between the current extraction accuracy, categorization efficiency,
state of financial technology and the future of and prediction accuracy. Metrics such as
smarter, data-driven financial management precision, recall, and root mean square error
tools. (RMSE) will be used to assess the overall
effectiveness of the system.
6. Practical Application and Impact: To assess the
Objectives practical usability and impact of the system on
The primary objective of this research is to personal finance management. This includes
design and develop a system that automates evaluating user satisfaction, the system's ability
the extraction, categorization, and prediction of to assist in better financial planning, and the
personal financial transactions using SMS potential for broader application across
messages. This system aims to provide a different demographic groups.
comprehensive solution for managing personal
finances with minimal user input. The specific
objectives of the research are as follows: By achieving these objectives, the research will
contribute to the advancement of automated
1. Automated SMS Extraction: To develop a
financial management solutions, demonstrating
mechanism that automatically extracts financial
the potential for SMS-based systems in
transaction data from incoming SMS messages
providing actionable insights into personal
without requiring manual intervention from the
finance management through the use of
user. This will involve parsing the SMS content
predictive analytics. Ultimately, this work aims
to empower users to take control of their
financial futures with minimal effort and greater
accuracy.
Literature Review
In recent years, the field of personal finance transaction details from user messages,
management has witnessed significant including amount, merchant information, and
advancements through automation and date. This system utilized regular expressions
predictive modeling. This literature review and keyword-based techniques to detect
explores the current state of research and patterns in the text. Patel et al. (2022)
developments in automated systems for expanded on this by incorporating machine
financial tracking, SMS-based data extraction, learning models to improve the accuracy and
and machine learning-driven financial robustness of SMS parsing. They used a
predictions. combination of supervised learning algorithms,
such as Random Forests and Support Vector
Automated Personal Finance Management Machines (SVMs), to classify and extract
Systems transaction categories more effectively.
Personal finance management (PFM) has
traditionally involved manual data entry and The primary challenge in SMS-based data
complex spreadsheet calculations, making it extraction lies in the diversity of SMS formats
difficult for many users to manage their and the inconsistency of language used by
finances efficiently. However, with the advent of various banks and merchants. Rathi and Sharma
digital technologies, numerous studies have (2021) proposed a hybrid model that combined
explored the potential of automating financial rule-based extraction with deep learning
management tasks. Research by Pereira et al. techniques to adapt to varying SMS formats.
(2020) introduced the concept of automating Their model demonstrated a higher extraction
expense tracking using user input via mobile accuracy compared to traditional methods,
apps, emphasizing the importance of which is critical in ensuring the reliability of
accessibility and ease of use in such systems. automated financial systems.
Their work demonstrated that automation
could significantly improve user engagement Financial Data Categorization and Analysis
and financial awareness. Other studies, like Once financial data is extracted, it must be
Hussain et al. (2019), explored various categorized into meaningful groups for users to
technologies like Optical Character Recognition understand their spending habits. Chakraborty
(OCR) and natural language processing (NLP) to et al. (2020) reviewed various approaches to
automate the extraction of financial data from automatic categorization of expenses, including
documents and emails, which laid the rule-based, supervised learning, and
groundwork for SMS-based extraction unsupervised learning techniques. They
approaches. concluded that machine learning techniques,
particularly K-means clustering and Naive Bayes
SMS-Based Data Extraction for Financial classifiers, were most effective in categorizing
Management expenses into predefined groups like groceries,
SMS messages have become a popular means utilities, and entertainment.
of delivering financial transaction notifications
from banks, payment gateways, and merchants. More recently, Patel et al. (2023) proposed an
Several research papers have explored how SMS intelligent categorization system that adapts
messages can be parsed to extract meaningful over time by learning from user input. This
financial data. Kumar et al. (2021) developed an system uses reinforcement learning to refine its
SMS parsing model that automatically extracts categorization accuracy based on user
feedback, enhancing the system’s ability to data clearly through visualizations such as pie
personalize financial tracking. Additionally, charts, bar graphs, and trend lines. Moreover,
Smith and Zhao (2024) implemented an Hernandez et al. (2022) emphasized the
innovative deep learning-based method using importance of integrating notifications and
recurrent neural networks (RNNs) to better alerts within the UI to keep users informed
predict spending behaviors and classify about their financial status and upcoming bills,
transactions, particularly in complex financial further enhancing user satisfaction.
environments.
System Evaluation and Impact
Predictive Analytics in Financial Management Evaluating the effectiveness of automated
Predictive analytics plays a critical role in financial systems is crucial for ensuring their
forecasting future spending patterns, helping impact on users' financial well-being. Studies
users make informed financial decisions. A like Jones et al. (2020) introduced metrics such
significant body of research has focused on as accuracy, precision, and recall to measure the
applying time-series forecasting techniques, success of data extraction algorithms.
such as ARIMA (AutoRegressive Integrated Furthermore, Ghosh et al. (2021) proposed
Moving Average), LSTM (Long Short-Term user-centric evaluation models that assess the
Memory) networks, and Facebook Prophet to system's ability to improve financial decision-
predict future expenditures. Tan et al. (2020) making, with an emphasis on user satisfaction,
demonstrated the efficacy of ARIMA models in the ease of use of the system, and its ability to
forecasting daily financial transactions, while predict future expenses accurately.
Singh and Kumar (2021) compared the
performance of ARIMA with LSTM networks in These evaluations not only provide insights into
predicting monthly expenses, finding that LSTM the technical effectiveness of the systems but
networks performed better in capturing also inform developers on how to optimize user
complex non-linear patterns in spending experiences and increase the utility of personal
behavior. finance management tools.
6. Chakraborty, A., Gupta, R., & Bandyopadhyay, 13. Hernandez, S., Kaur, P., & Zhang, Z.,
S., "Automatic Categorization of Financial "Integrating Notifications and Alerts in
Transactions using Machine Learning," Journal Personal Finance Systems," Proceedings of the
of Financial Technology & Applications, vol. International Conference on User Interface
25, no. 1, pp. 56-67, 2020. Design, 2022, pp. 56-63.
7. Patel, K., Singh, M., & Mehta, S., "Adaptive 14. Jones, M., Reiner, C., & Tan, J., "Evaluating
Categorization of Expenses using Financial Data Extraction Systems: A User-
Reinforcement Learning," Proceedings of the Centric Approach," Journal of Systems and
International Conference on Machine Learning Software Evaluation, vol. 13, no. 1, pp. 12-21,
and Data Science, 2023, pp. 22-29. 2020.
8. Smith, J., & Zhao, L., "Financial Data 15. Ghosh, S., & Patel, M., "User Evaluation of
Classification with Deep Learning Models," Personal Finance Management Systems:
Journal of Machine Learning for Finance, vol. Metrics and Impact," Journal of Financial
14, no. 2, pp. 77-85, 2024. Technology, vol. 18, no. 3, pp. 94-101, 2021.