Week 03 Part 01
Week 03 Part 01
“Ten years ago I could have told you how Doritos were
selling west of the Mississippi.Today not only can I tell you
how well they are selling in California, in Orange County,
in the town of Irvine, in the local Vons Supermarket, in the
special promotion, at the end of Aisle 4, on Thursdays.”
D. Wayne Calloway
(ex CEO, PepsiCo)
Sale Transaction Schema
Sales Cube
A Data Cube
TV
od
PC U.S.A
Pr
VCR
Country
sum
Canada
Mexico
sum
Cuboids Corresponding to the Cube
A cube is a lattice of cuboids
all
0-D(apex) cuboid
product date country
1-D cuboids
3-D(base) cuboid
product, date, country
A lattice of cuboids
From tables and spreadsheets to
data cubes
} A data warehouse is based on a multidimensional data model which views
data in the form of a data cube
} A data cube, such as sales, allows data to be modeled and viewed in
multiple dimensions
} Dimension tables, such as item (item_name, brand, type), or time(day,
week, month, quarter, year)
} Fact table contains measures (such as dollars_sold) and keys to each of
the related dimension tables
} In data warehousing literature, an n-D base cube is called a base cuboid.
The topmost 0-D cuboid, which holds the highest-level of summarization,
is called the apex cuboid. The lattice of cuboids forms a data cube.
Data Cube Computation
} Data cube can be viewed as a lattice of cuboids
} The bottom-most cuboid is the base cuboid
branch_key
location
branch location_key
location_key
branch_key
units_sold street
branch_name
city_key city
branch_type dollars_sold
city_key
avg_sales city
province_or_street
Measures country
OLAP – OnLine Analytical Processing
} A definition: