100% found this document useful (1 vote)
244 views23 pages

BI Retail Industry v1.1

This document discusses using business intelligence for the retail industry. It begins with an introduction to the complex nature of the retail industry and how business intelligence can help retailers make better data-driven decisions. It then discusses how BI can help retailers achieve their goals by analyzing sales, customers, inventory, suppliers and more. Several key performance indicators for the retail sector are identified. Finally, sample BI dashboards are shown and steps for designing a BI data warehouse for retail are outlined.

Uploaded by

vivek5110
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
244 views23 pages

BI Retail Industry v1.1

This document discusses using business intelligence for the retail industry. It begins with an introduction to the complex nature of the retail industry and how business intelligence can help retailers make better data-driven decisions. It then discusses how BI can help retailers achieve their goals by analyzing sales, customers, inventory, suppliers and more. Several key performance indicators for the retail sector are identified. Finally, sample BI dashboards are shown and steps for designing a BI data warehouse for retail are outlined.

Uploaded by

vivek5110
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 23

“Business Intelligence for

Retail”

Submitted by:

Vivek Sharma

Ashutosh Sinha
Retail Industry
1.Introduction
Retail is the most complex industry handling a wide range of
products from large numbers of suppliers and servicing the
highest number of customers. Retail Industry market trends
change most frequently than any other industry. To remain
competitive in the retail Industry, keeping an eye on all the
operations in retail business is very crucial. Understanding
customer requirements and providing them with what they
want and still maintaining profitability requires valuable and
highly analysed information on management part to deal
with ever changing dynamic market conditions and to stay
ahead of the competition.

As retail markets become more competitive, the ability to


react quickly and decisively to ever changing trends and to
customize products and services to individual customer
needs is more critical than ever. A business intelligence
system can be a very effective means of organizing and
analyzing the vast amount of information generated in a
retail business, and helps you to generate a more effective
business model for keeping your business profitable.

2.Retail and Business Intelligence


Successful retailers strive to accomplish three basic
objectives:

• to align their business with client needs

• to differentiate from competitors; and

• to optimize product mix and space utilization.


To achieve these basic goals, retailers must be able to
successfully manage inventory, product mixes, promotions,
advertisements, supply chain dynamics, and a host of other
factors. Furthermore, As retail markets become more
competitive, the ability to react quickly and decisively to
ever changing trends and to customize products and services
to individual customer needs is more critical than ever. Lack
of information is not the problem—data to assist in making
these kinds of decisions is readily available but the problem
is that the complexity and volume of information available to
organizations is overwhelming.

Technology plays an indispensable role in supporting the


backbone of retail businesses. In a retail environment,
transactional systems, such as Point of Sales (POS) systems
are efficient in what they are intended to do - record and
retrieve large volumes of transactions and operations.
Embedded in the Point of sales systems is a "treasure trove"
of dormant, unclassified and often unused information about
what happened in the business in the preceding period like
last week etc. Legacy reporting systems often present
historical information in the form of standard static layouts.
These reports can neither be viewed from different
perspectives at different times nor can they provide critical
insight for retailers in answering questions such as

• How can inventory levels be optimized

• What are the most profitable products

• Who are the most valuable customers

Increasingly, successful retailers will be those that can


effectively categorize and utilize these data and use it for
decision making.
3.Retail Industry analysis through BI
BI can provide useful insights into the retail sector:

 Sales and profitability analysis.

 Store operations analysis.

 Customer analysis.

 Merchandise management

 Supplier performance measurement

 Marketing and e-commerce analysis.

 Brand and marketing research.

 Market Share analysis.

4.Key performance indicators of retail sector:


Sales per hour – this statistic tells us about the speed at
which each individual salesperson is selling or attending to
customers compared to everyone else on the shift.

Average Sale – the average selling price of each individual


salesperson compared to everyone else on the shift – higher
averages show a greater knowledge of product as the
salesperson is able to sell higher ticket items. Low statistics
reveal the salesperson lacks skill in either product knowledge
or effective probing.

Items Per Sale – tells us about the ability of the salesperson


to add-on to a sale.

Conversion Rate – tracks how many visitors to the store are


turned into customers.

Wage to Sales Ratio – compares a salesperson’s hourly


wages to hourly sales. This KPI differentiates clear
performers from underperformers
Average sales per customer or transaction – Total sales
for a given period divided by the number of customers or
transactions for the same period

Sales per square foot / meter – Actual sales for a given


period divided by the total floor area (in sq.ft. or meters) of
the store.

Inventory Sales per selling hour – Actual sales for the


store divided by the number of selling hours during the same
period

Sales per labour hour – Actual sales for the store divided
by the number of labor hours used during the same period
Inventory Turns – This KPI tells us how often the average
inventory over a given period of time (usually a year) is sold
in that same period of time

Inventory Store conversion rate – The number of


transactions in a given period divided by the total number of
customers who entered the store during the same period

Coupon conversion percentage – Percentage of coupons


that have been used by customers

Profit per Customer Visit – Profit obtained from each


customer visit. This way you can easily set goals for your
sales team in order to increase profits.

Units per customer or transaction – Total number of


units sold in a given period divided by the number of
customers or transactions for the same period

Shelf space profitability – measured per product

Price premium – The relative price of a product compared


to a benchmark price (average retail price)
Promotion share – Share of promotion products in
percentage (%) of total sales

Shortages/overages in cash registers

Sell-through percentage (%) – Is a percentage of units


sold during a period and it is calculated by dividing the
number of units sold by the beginning on-hand inventory (for
that same time period)

Percentage of perishable items with past due date –


Number of perishable items with past due date as a
percentage of all items in store
Product visibility on shelf – Measures the amount of frontal
views of a single product-package on a fully stocked shelf

Sales Gross Margin Return on Inventory Investment –


The GM ROII multiplies Inventory Turns (which tells us how
healthy our stock is) by Gross margin (which tells us the
percentage of profit we make on each sale)

5. Business Intelligence software for retail: Major


vendors
1.

• The IBM Cognos BI suite serves more than


23,000 customers in more than 135 countries.
• Complete corporate performance management
solution. Supports enterprise planning, scorecarding,
and business intelligence.
2.
• Exceptionally strong in advanced analytic
analysis.
• Full analysis power revealed in the hands of
uses with knowledge of SAS programming.
• Subscription-based pricing model.
3.

• Widely established and respected vendor of


business intelligence solutions.
• Market leader for data warehousing tools.
• Gartner recognized the Siebel Business
Analytics platform as one of the most comprehensive
and visionary BI platforms available.
• Oracle solutions are designed to integrate
seamlessly with any existing
database systems.
• Hosted and on-premise B.I. solutions available.

4.

• As one of the largest business software


companies, SAP can integrate BI
throughout SAP applications and your company.
• Strong data scalability and performance thanks
to in-memory analytics and column-based vectoring.
• Acquisition of Business Objects makes SAP the
largest BI vendor.

5.

6.

6.Sample retail Business Intelligence


dashboards:
7. Designing a Business Intelligence data ware house for
Retail Industry

This section would deal with the question of how to design a


business intelligence data ware house for retail industry. We
will explain this step by step through a small retail case
study to make it simple to understand.

Case study:

 We are running a retail business which has 100 retail


stores spread over five states.

 Each of the stores has a full complement of departments,


including grocery, frozen foods, dairy, meat, produce,
bakery, floral, and health/beauty aids.

 Each store has roughly 60,000 individual products on its


shelves.

 About 55,000 of the SKUs come from outside


manufacturers and

have bar codes imprinted on the product package.

These bar codes are called universal product codes


(UPCs). UPCs are at the same grain as individual SKUs.

 The remaining 5,000 SKUs come from departments such


as meat, produce, bakery, or floral. While these products
don’t have nationally recognized UPCs, the grocery chain
assigns SKU numbers to them.

 Data is collected at two points:

 Point-of-sale(POS)-when customer make purchases.

 Data collection point-when vendors deliver materials.

 Management concern: maximizing profit.

 Charging as much as possible.


 Lowering cost for product acquisition and overheads.

 Attracting more and more customers in the highly


competitive pricing environment.

 Other management’s major decisions revolve around:

 Promotions.

 Pricing.

We shall use dimensional design process and it consists of


following four steps:

1. Selecting the Business process.

2. Declaring the Grain.

3. Choosing the dimensions.

4. Identifying the facts.

Step 1: Selecting the business process

In our retail case study, management wants to better


understand customer purchases as captured by the POS
system.

 Thus the business process we’re going to model is POS


retail sales.

 This data will allow us to analyze what products are selling


in which stores on what days under what promotional
conditions.

Step 2: Declaring the grain

Grain here is defined as the modularity level of the data.

 In our case study, the most granular data is an individual


line item on a POS transaction. To ensure maximum
dimensionality and flexibility, we will proceed with this
grain.

 While users probably are not interested in analyzing single


items associated with a specific POS transaction, we can’t
predict all the ways that they’ll want to cull through that
data.

 For example, they business users may want to


understand the difference in sales on Monday versus
Sunday. Or they may want to assess whether it’s
worthwhile to stock so many individual sizes of certain
brands, such as cereal. Or they may want to understand
how many shoppers took advantage of the 50-cents-off
promotion on shampoo.

 While none of these queries calls for data from one


specific transaction, they are broad questions that require
detailed data sliced in very precise ways.

Step 3: Choosing the dimensions

The dimension table contains the textual descriptors of the


business.

 Major primary dimensions for the our grain are the date,
product, and store.

 We assume that the calendar date is the date value


delivered to us by the POS system. Later, we will see what
to do if we also get a time of day along with the date.

 Within the framework of the primary dimensions, we can


ask whether other dimensions can be attributed to the
data, such as the promotion under which the product is
sold.

Step 4 : Identifying the facts


A fact table is the primary table in a dimensional model
where the numerical performance measurements of the
business are stored

 The facts collected by the POS system include the sales


quantity (e.g., the number of cans of chicken noodle
soup), per unit sales price, and the sales dollar amount. In
some cases it may include the dollar cost.

 Three of the facts, sales quantity, sales dollar amount, and


cost dollar amount, are beautifully additive across all the
dimensions. We can slice and dice the fact table with
impunity, and every sum of these three facts is valid and
correct.

 Whereas, dimensions like gross profit and unit price are


non additive and can be calculated through query.

Figure: Measured facts in the retail sales schema

Date Dimension

The Date dimension is the most frequently used dimension of a


datawarehouse. Some data in this table can not be calculated.
For example, whether the date was a holiday or not has to be
calculated through holiday calander input from the company.
Moreover, fiscals can be cutomized (say the financial year
extends from January to December) ending A typical date
dimension is shown below.
Date dimension is most frequently used for slicing operations,
and most reports are prepared on time basis. Drilldown is also
done on the basis of this dimension.

If the rows in a fact table are coming from several timezones, it


might be useful to store date and time in both local time and a
standard time. This can be done by having two dimensions for
each date dimension needed – one for local time, and one for
standard time. Storing date in both local and standard time, will
allow for analysis on when facts are created in a local setting
and in a global setting as well.

Product dimension

Product dimension helps in slicing and dicing through the


various reports that can be prepared using this dimension.
The important function of this dimension is to hold as many
descriptive attributes of each SKU as possible. Typically
each SKU rolls up to brands, brands to categories, and
categories to departments. Generally, 50 attributes is
considered a reasonably good descriptive dimension. This
dimension is one of the 3 primary dimensions in data marts
on retail.

Store dimension

Store dimension is very similar to the product dimension, and


is subject to slow change.
Promotion dimension

The promotion dimension describes the promotion conditions


under which a product was sold. Instead of having a single
attribute of promotion like discount/ free product etc, this
dimension Promotion conditions include temporary price,
media name, free product name etc. Each promotion flags
appropriate flags in the row to discribe itself including
reductions, end-aisle displays, newspaper ads, and coupons.
This dimension is often called a causal dimension (as opposed
to a casual dimension) because it describes factors thought to
cause a change in product sales.

It may be difficult for users to comprehend, but is very useful


in protection against changing dimensions. Here, another tip
is to avoid null key in fact table, even if the product is not
under any promotion as it helps to get right results from
joins.

Rationale for the approach

1) Selecting a single business process to model at a go:


We model processes instead of trying o pursue a single
requirement/report. This is because most the
needs/requirements for the end users is on a case basis.
One can not economically use time & effort to set up
new reporting routines, everytime a new requirement
comes up. Also, the scaling of an ad hoc reporting
system could take toll on the OLTP systems, for getting
the data feed. Hence, it is wise to approach the problem
in a more systematic way, even if the immediate
requirement seems small.

Moreover, the Process approach cuts through the


department barriers and makes it easier to see the
larger pictures. Another need of production DW
designers is to see the clarity in relation to availability
of data, ie. What data is already there in the DW (A
large DW is usually replenished by multiple data feeds,
and it gets increasingly difficult for a designer to keep
track of what is already there) and what needs to be
fetched, when implementing a new requirement.

2) Declaring the grain

The general thumbrule in case of grain declaration is '


the smaller, the better'. For example, one could also
capture the denomination and currency number of the
notes used in a transaction, and that would allow for
very interesting analyses to take place. But there is a
huge chance that such level of details would mostly get
unused. Also, granularity takes a toll on the processing
of cumuative results. Additional cumulative tables
would be needed to be maintained and updated for
every added level of granularity. However, with the
growing powers of hardware, this does not seem to be a
big concern.

Not providing the smallest possible level of granularity


would lead to problems of redesign, in case a
requirement comes up. So, designing for a system to be
working 10 years from now, the designer should take
the freedom of keeping the granularity as high as
possible.

3) Choosing the dimensions:

Choosing dimensions is the art part in the design of DW.


How business people describe the data they use, can be
learned from anticipation and expertise. Dimensions
help reduce the run processing time for queries, as
dealing with large databases is tricky.

Keeping the processing time in check is essential as the


database size for Data warehouses can exceed
terabytes. Also, the cost of query gets reduced if we can
use less of SQL functions.

Keeping as many dimensions helps in slicing and dicing.


However, with shifting IT contracts, vendors tend to add
new dimensions for every new requirement without
studying the system properly. Can lead to degenrate
dimensions resulting in higher query cost and less
friendliness to the user.

4) Identifying the facts:

One should avoid using ratios and percentage data in


fact table. This data needs to be updated every time an
entry is made to the affecting variables. This could
imply either inconsistency due to delay in updation or
increased load due to updation. Instead such data
should be calculated by the application logic.

Best practices explained


1) Use of surrogate keys:

Though operational changes happen once in a decade


or two, it is always wise to be ready for the change.
Thus, it is recommended that surrogate keys be used to
connect fact and dimensions. This helps to remove the
dependency between the data stored in the row and the
key. The key must not hold any peice of information
that can possibly be used.

2) Design for extensibility

Ideally, the design should be able to accomodate new


requirements just by adding new dimension tables and
an rows in fact table. In case of product launches, where
product description is not available till late, using
dummy data can be helpful. However, some
requirements can not be fulfilled this simply. For
example, if it is desired to increase the granularity of a
dimension table along with the fact table, it would be
necessary to drop the fact table and rebuild it. However,
existing applications would be unaffected.

Also, when a new data source is involving unexpected


new dimensions is encountered, it is wise not to force fit
the measurements into the existing fact table.

3) Snowflaking

Snowflaking refers to the practice of dimensional


normalization in the data warehouse design.
Snowflaking makes the query design for users more
difficult. Also, the queries become more costly due to
the joins. Monetary value of disk space saving is not
very high and the higher cost of maintainance of
consistency is bore by the ETL process, which is rather
acceptable than putting the load on OLAP queries.
Moreover, datawarehouses are not meant for much of
data changes afterwards, as historic data keeping is
done here.
4) Centipede dimensions

Keeping too many dimensions in the model implies that


two or more dimensions need to be combined togather.
The thumrule here is that if there are more than 25
dimensions, you may need to revisit the design. If the
resulting dimension is noticably smaller than the cartesian
product of the seperate dimensions, then it could be a
good decision to combine.

5) Protection against slowly changing dimensions

Dimensions such as product and store are subject to


slow changes. For example, a store may change it's
location and thus other related fields in address
columns such as city, state and PIN code etc. Similarliry,
a product may be renamed and it's cost of production/
acquisition may change over time.

It is generally desirable to preserve the historic


information as well as the warehouses being actionable
on the basis of current information. Author of DW
toolkit, Kimball suggests various ways to cope with the
problem of changing dimensions. Of these solutions,
there are 2 types that preserve historical data.

Type 2 preserves changing attruibutes by adding 2


more attributes, start date and end date. Type 6 also
works in a similar way, but also mentions the
immediately previous applicable value to the current
value.

References
1)Kimball Ralph, Ross Margy: 2nd Edition , The Data
warehouse toolkit, the complete guide to dimensional
modelling

2) http://en.wikipedia.org/wiki/Measure_(data_warehouse) on
13/04/2011

3) http://en.wikipedia.org/wiki/Slowly_changing_dimension on
13/04/2011

4) http://en.wikipedia.org/wiki/Dimension_table on 13/04/2011
5) http://www.biretail.com/

6) http://www.elegantjbi.com/Solutions/industry_BI_for_retail.htm

7) http://blog.maia-intelligence.com/2009/09/30/kpi-for-retail/

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy