0% found this document useful (0 votes)
10 views21 pages

Week 03 Part 01

The document discusses Online Analytical Processing (OLAP) and its role in business intelligence and analytics, emphasizing the importance of data cubes for multidimensional data representation. It explains various concepts such as star and snowflake schemas, as well as operations like slicing, dicing, drill-down, roll-up, and pivoting in data analysis. The document highlights how OLAP enhances data analysis capabilities beyond traditional SQL methods.

Uploaded by

Riya singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views21 pages

Week 03 Part 01

The document discusses Online Analytical Processing (OLAP) and its role in business intelligence and analytics, emphasizing the importance of data cubes for multidimensional data representation. It explains various concepts such as star and snowflake schemas, as well as operations like slicing, dicing, drill-down, roll-up, and pivoting in data analysis. The document highlights how OLAP enhances data analysis capabilities beyond traditional SQL methods.

Uploaded by

Riya singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 21

BUSINESS INTELLIGENCE & ANALYTICS

Online Analytical Processing (OLAP)

Saji K Mathew, PhD


Professor, Department of Management Studies
INDIAN INSTITUTE OF TECHNOLOGY MADRAS
What business wants

“Ten years ago I could have told you how Doritos were
selling west of the Mississippi.Today not only can I tell you
how well they are selling in California, in Orange County,
in the town of Irvine, in the local Vons Supermarket, in the
special promotion, at the end of Aisle 4, on Thursdays.”

D. Wayne Calloway
(ex CEO, PepsiCo)
Sale Transaction Schema
Sales Cube
A Data Cube

Total annual sales


Date of TV in U.S.A.
1Qtr 2Qtr 3Qtr 4Qtr sum
t
uc

TV
od

PC U.S.A
Pr

VCR

Country
sum
Canada

Mexico

sum
Cuboids Corresponding to the Cube
A cube is a lattice of cuboids
all
0-D(apex) cuboid
product date country
1-D cuboids

product,date product,country date, country


2-D cuboids

3-D(base) cuboid
product, date, country
A lattice of cuboids
From tables and spreadsheets to
data cubes
} A data warehouse is based on a multidimensional data model which views
data in the form of a data cube
} A data cube, such as sales, allows data to be modeled and viewed in
multiple dimensions
} Dimension tables, such as item (item_name, brand, type), or time(day,
week, month, quarter, year)
} Fact table contains measures (such as dollars_sold) and keys to each of
the related dimension tables
} In data warehousing literature, an n-D base cube is called a base cuboid.
The topmost 0-D cuboid, which holds the highest-level of summarization,
is called the apex cuboid. The lattice of cuboids forms a data cube.
Data Cube Computation
} Data cube can be viewed as a lattice of cuboids
} The bottom-most cuboid is the base cuboid

} The top-most cuboid (apex) contains only one cell


} How many cuboids* in an n-dimensional cube with Li levels
each?
n
T = Õ ( Li +1)
i =1

*Number of cuboids determined by no of dimensions and levels


Conceptual Modeling of Data
Warehouses

} Modeling data warehouses: dimensions & measures


} Star schema: A fact table in the middle connected to a set of
dimension tables
} Snowflake schema: A refinement of star schema where some
dimensional hierarchy is normalized into a set of smaller
dimension tables, forming a shape similar to snowflake
} Fact constellations: Multiple fact tables share dimension
tables, viewed as a collection of stars, therefore called galaxy
schema or fact constellation
Example of Star Schema
time
time_key item
day item_key
day_of_the_week Sales Fact Table item_name
month brand
quarter time_key type
year supplier_type
item_key
branch_key
branch location
location_key
branch_key location_key
branch_name units_sold street
branch_type city
dollars_sold province_or_street
country
avg_sales
Measures
Example of Snowflake Schema
time
time_key item
day item_key supplier
day_of_the_week Sales Fact Table item_name supplier_key
month brand supplier_type
quarter time_key type
year item_key supplier_key

branch_key
location
branch location_key
location_key
branch_key
units_sold street
branch_name
city_key city
branch_type dollars_sold
city_key
avg_sales city
province_or_street
Measures country
OLAP – OnLine Analytical Processing

} A definition:

} Data representation is in the form of a CUBE


} OLAP goes beyond SQL with its analysis capabilities
} Key feature of OLAP: Relevant multi-dimensional views
such as products, time, geography
Data warehousing & OLAP architecture
Slicing
A slice is a subset of
a multi-dimensional
array corresponding
to a single value for
one or more
members of the
dimensions
Dicing
The dice operation is
a slice on more than
two dimensions of a
data cube (or more
than two consecutive
slices)
Drill-down
In drill-down the user
navigates among
levels of data
ranging from the
most summarized
(up) to the most
detailed (down)
Roll up
(Aggregate,
Consolidate) A roll-
up involves
computing all of the
data relationships for
one or more
dimensions.
Pivoting
This operation is
also called rotate
operation. It rotates
the data in order to
provide an
alternative
presentation of
data – the report or
page display takes
a different
dimensional
orientation.
Gartner Magic Quadrant-2019

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy