0% found this document useful (0 votes)
6 views10 pages

Data Analysis

This report analyzes retail sales data from a supermarket chain to improve decision-making through data analytics. It covers the methodology used for data preprocessing, exploratory analysis, forecasting, regression, and customer segmentation, revealing insights on sales trends, store performance, and customer behavior. Recommendations include better inventory management, targeted marketing strategies, and improved data collection to enhance business operations.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views10 pages

Data Analysis

This report analyzes retail sales data from a supermarket chain to improve decision-making through data analytics. It covers the methodology used for data preprocessing, exploratory analysis, forecasting, regression, and customer segmentation, revealing insights on sales trends, store performance, and customer behavior. Recommendations include better inventory management, targeted marketing strategies, and improved data collection to enhance business operations.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

Project

Student Names - Seenu Rani (SPI230468)


Bipin Panthi (SPI 240561 )
Bikash Khatri (SPI230852)
Manish (SPI240545)

Subject - MDS610, Decision Making for Analytics


Date:- 24 May, 2025

Page 1 / 1
Data Analytics Report
Retail Sales Forecasting for a Chain of Supermarkets

1. Introduction
Background of the Dataset
Today, stores uses the data (information) to make better decisions, help the customers, and work
more smoothly. Big store chains that have shops in many places collect a lot of data every day.
This data comes from things like sales, talking to customers, checking what’s in stock, and
running discounts. When this data is studied, it helps the stores know what to sell, how much to
charge, how much stock to keep, and how to advertise to different kinds of customers.

This report uses a made-up but realistic set of data to show how a supermarket chain might work
in both cities and villages. The data is based on the real life examples found on websites like
Kaggle and UCI that share data or datasets for learning. It includes over 100,000 sales made
during one year, showing shopping the habits, changes during different seasons, and how
discounts can affect buying.

Each row in the dataset shows one shopping transaction and includes the following important
details:

• Date of purchase: Shows the exact day something is bought by someone. This helps us
to see the patterns over days, weeks, months, and seasons.

• Store location and type: Tells us if the store is in a city or village, and in which city.
This helps compare how well stores are doing in different places.

• Product category: Products are grouped into the types like groceries, personal care,
electronics, clothes, and household items. This helps us to see which types of the
products make the most money and which are most affected by the sales and offers.

• Units sold and money earned: It shows how many items were sold and how much
money was made from them. This helps to measure store performance, how fast the
products sell, and how much the money is made from per customer.

• Discounts: Shows if a product was sold at a lower price and how big the discount was.
This helps us to understand that if sales and offers really work or not.

• Customer details: Basic customer information like age group, gender, and if they are a
member of the loyalty program. This can helps to group the customers and study how
differently people shop.
Use this kind of dataset is helpful because it looks like the real data that medium and and large
stores uses it every day. Big retail companies int the world track this kind of data or information.
It helps stores to managers to make the smart choices and smart decisions based on the data, and
not only just by guesses.
One great thing about this dataset is that it has many kind of data. It has numbers like how much
items was sold and how much money was made, and it also has categories like store location,
product type, and customer type. Because of this, we can use many ways to study the data like
looking for patterns, making predictions and grouping similar things.
From a business point of view, this data can help answer important questions like:
• How do customer choices change by age, gender, or location?

• Which stores are doing the best and why?

• Do discounts help increase sales?

• Which products sell well, and which don’t?

This report will look at the data step by step, using modern tools to show how a store can use its
data to improve and make better decisions.

Purpose and Objectives of the Analysis


The main purpose of this report is to show how using data can help the stores to make smart and
better decisions. The goals of this analysis are:
• To find out how the sales change over time and in different store locations.
• To see how the offers and prices affect the product sales.
• To guess the future of sales using past data so stores can plan the stock and staff.
• To understand different types of customers so stores can do better and more personal
marketing.
• To give useful suggestions to help the business grow.
This analysis will help store managers and business leaders make smart choices based on real
data. It can help increase profits, lower costs, and make customers happier.

2. Methodology
Analytical Workflow and Tools
To get useful information from the data, we followed a clear step-by-step process used in the
data industry. We used ideas from a common method called CRISP-DM, which helps to guide
the data projects. We used both basic math (statistics) and computer models (machine learning)
to answer the business questions mentioned earlier.
The goal was to find the patterns, understand how different things in the data are connected, and
help the retail business make smart decisions based on facts.

1. Data Preprocessing and Cleaning


The rst step in our analysis was to get the data ready by xing the mistakes and making sure
that the data was correct. Here’s what we did:

• Removed Duplicates: We deleted the repeated records so that the results wouldn’t be
unfair or wrong.

• Fixed Missing Data: Some parts of the data were empty, like customer details or
discount info. We lled the missing numbers using the average, and for missing
categories (like gender), we used the most common value. If too much data was missing
in a row, we removed that row.

• Made Categories Consistent: We cleaned up the category names like store type (city or
village), product types, and gender so everything can matched. We then changed these
words into numbers using the simple methods so computer models could understand
them.

• Changed Date Format: We changed the date eld into a special format that lets us easily
nd the day, month, or quarter. This helped us study sales over time and see trends.

2. Exploratory Data Analysis (EDA)


In this step, we looked closely at the data to understand it better. This helped us to nd the
unusual values (outliers), spot patterns, and come up with the ideas for deeper study.

• Basic Statistics: We used the Python tools like Pandas and NumPy to nd the average,
middle value, how much values the change, and how often things appear.

• Graphs and Charts: We used tools like Seaborn and Matplotlib to make:

◦ Bar charts to compare the different product categories

◦ Line graphs to see how the sales changed over the time

◦ Box plots to see how the revenue changed by age or gender

◦ Heatmaps to show how different numbers are connected

From these graphs, we learned that the city stores sold more electronics, while village stores
made more money from groceries.

3. Time Series Forecasting


fi
fi
fi
fi
fi
fi
fi
To help the store to plan stock and staff, we studied how sales changed over the time using
special methods:

• ARIMA Model: This method helped us to predict the future sales by looking at the past
trends and patterns.

• Exponential Smoothing: This helped us to see seasonal effects (like higher sales during
holidays) by reducing short-term ups and downs.

• STL (Seasonal Decomposition of Time Series): This method broke the sales data into
three parts:

◦ Overall trend (long-term rise or fall)

◦ Seasonal pattern (like more sales during festivals)

◦ Random changes (ups and downs we can’t explain)

These tools helped us to understand how the sales change over time and plan better for the
future.

4. Regression Analysis
To nd out what things affect the sales, we used the following methods:

• Linear Regression: This helped us to see how the things like discounts, customer details
(like age or gender), and store location affect sales and the number of items sold.

• Multivariate Regression: This is a more advanced version where we looked at many


factors at the same time. It helped us to understand how these factors work together and
make better business decisions.

5. Customer Segmentation (Clustering)


To better understand our customers and improve marketing, we grouped them based on how they
shop. We used a method called:

• We grouped customers based on how much they spend, how often they shop, and what
they buy using a method called K-Means Clustering.

• We found different types of shoppers (like discount hunters, premium buyers, and bulk
buyers) to help improve marketing and customer loyalty strategies.

This helped the business make better marketing and loyalty plans.

6. Tools Used
We used both coding tools and spreadsheet software to do the analysis.
fi
Python Libraries:
o Pandas and NumPy for data handling and numerical operations
o Seaborn and Matplotlib for advanced data visualization
o Scikit-learn for regression and clustering models
o Statsmodels for statistical modeling and time series analysis
• Excel: Used at the beginning to look at the data, make quick summaries, and create pivot
tables before doing deeper analysis with code.

3. Results and Discussion


This section explains the aim we found from the study of the data. We see at the seasonal trends,
how different stores performed, types of customers and how accurately our sales predictions are.
The results shows useful patterns and give ideas that can help the better decision making.

1. Seasonal and Temporal Patterns


Monthly sales figures highlighted variations throughout the year:
% Change from
Month Total Sales ($) Key Observa ons
Previous Month
January 390,000 - Post-holiday dip
August 510,000 +21% Back-to-school peak
Decemb Holiday shopping
655,000 +28%
er surge

• Sales were higher on the weekends than weekdays. On average, people spent about
$18,500 on weekends and $14,000 on the weekdays, showing a 32% increase in weekend
spending.
• The highest sales in one day happened on December 23rd, with $78,000 in sales likely
because of last minute holiday shopping.
• We also saw more sales in stationery and clothes during August and September, which
matches the school season.

2. Store and Product Performance


A comparative evaluation between store types and product categories revealed significant
differences in performance:
ti
Store Performance Summary

Store Avg Monthly Revenue Avg Transac on Value Customer Foo all (avg/
Type ($) ($) month)
Urban 450,000 38 11,842
Rural 320,000 31 10,322

• Urban stores made more money than rural stores about 40.6% more. This is because city
stores have more people, higher spending, and bigger shopping.
• City stores also had more customers and bigger sales per purchase compared to rural
stores.
Product Performance Matrix

Units Sold Revenue Pro t Margin


Category
(Monthly) ($) (%)
Groceries 120,000 150,000 12%
Electronics 11,500 220,000 35%
Household
17,000 135,000 28%
Items
Apparel 23,000 97,000 22%
• Groceries sold the most but at the least pro t (12%).

• Electronics and household items made more pro t (35% and 28%).

• The best-selling product was “Family Pack Milk 2L”, selling about 3,450 units per
month, making $6,200every month.

3. Customer Segmentation Insights (K-Means Clustering)


Clustering analysis (using K-Means) divided customers into three different behavioral segments:
% of Total Avg Basket Visit
Cluster Key Behavior
Customers ($) Frequency
Regular visits, low spend, deal-
Budget 54% 15 Weekly
seeking
Premium 22% 58 Monthly Brand loyal, high-value purchases
Occasion
24% 27 Irregular Promo on-sensi ve, low reten on
al

• Promotions for Cluster 2 (Premium Shoppers) worked well — 18.3% more of them
bought expensive products, showing that targeted ads and deals were effective.
fi
ti
ti
tf
ti
ti
fi
fi
• Cluster 1 customers didn’t spend as much per the visit, but they made up more than half
of all shoppers, so it’s smart to keep them happy with bulk or value deals.

4. Forecast Accuracy and Sales Projections


We used to methods to predict future sales:
RMS MAPE Forecasted Q1 Sales YoY
Model
E (%) ($) Growth
ARIMA (2,1,2) 7.5 4.2% 1.38 million +6.8%
Exponen al
9.1 5.6% 1.33 million +5.1%
Smoothing

• The ARIMA model gave more accurate results.


• February sales were expected to be dropped by 9%, normal after the holiday season.
• We tested both models carefully to make that we can get reliable and good results.

Strengths
• Accuracy: The error was low, so we can trust the predictions.
• We grouped customers in helpful ways for better marketing and planning.
• We used charts and tables to explain the findings and easy to understand.
• The results gave direct ideas for stock, ads and running the business.

Weaknesses and Limitations


• We do not know customer’s earning or jobs, which can make it to understand shopper
types.
• Some sales and discounts were not clearly shown, making I third to judge that how well
promotions worked.
• We didnot include factors like inflation, weather and competitors these all can affect the
sales.
• Everything was based on past sales, so we cannot reach in real tome to new trends.

4. Conclusion and Recommendations


ti
Conclusion
This analysis shows that how using data in a smart and organised way can help business to find
useful information from large sets of data. BY studying over 1,00,000 store transaction, we find:
• Seasonal trends like a 22% sales jump in December and 15% to 20% more sales on the
weekdays.
• Only 20% of the items made up 80% of the total sales.
• Different types of customers who shop in very different ways and spend differently.
We used smart tools like: Time series forecasting (ARIMA) to predict future sales, Clustering
(K-Means) to group similar customers, and Regression models to find what factors affect sales.
These methods helped us better to understand what to stock, when to run promotions and how to
serve different types of customers.

Even the data was not real but based on realistic example, the results shows how data can be
used to make smart business decision. This analysis helps us to increase sales, cut down on waste
in both urban and rural areas.

Recommendations
Based on the findings from the data, we proposed some recommendations to get benefit in the
business:

1. Better Inventory Management


• Plan the stock using the expected 6.8% 5growth in sales to keep more stock in busy
months like December and August.
• Use automatic alerts to restock popular items on time and avoid less in stock.
• Focus on the top 20% of items that make 80% of sales, to make sure that these are always
available.
2. Smart Marketing
• Create offers for different types of customers.
• Give regular small discounts to keep budget shoppers coming back.
• Offers special bundles or early access to new products for premium shoppers.
• Send remainder through emails and sms with limited time offers to bring occasional
shoppers back.
3. Improve Store Performance
• Keep stores open for longer on weekends and holidays in cities, where more peoples
shop.
• Put high profit items like electronics in the best shelf spots to increase sales.
• In rural areas keep plenty of fast selling daily items like groceries and households goods
due to this stock shortage cuts by 12-15%.
4. Collect Better Data
• Encourage customer to signup for loyalty programs to collect more information about
them, like income or job.
• Make sure that all discounts and promotions are entered in the same way which can helps
to track that offers which really works.
• Add outside data like weather information or inflation rates to improve the predictions
and handles sudden changes better.

If the company follows these steps, it can improve how it run the stores, market to the
right people, and plan better decisions which can increase the profits by 10-15% each
year. Using data in daily decisions is not just helpful , it’s a big advantage in todays
business world.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy