0% found this document useful (0 votes)
118 views24 pages

LWTN Data Modeling

This document provides an overview of dimensional data modeling concepts and techniques for Mitchell Pearson's LWTN Data Modeling class. It discusses key dimensional modeling terminology like facts, dimensions, and star schemas. It also covers best practices for dimensional modeling like keeping models simple, scalable, and focused on key business questions. The document provides an agenda for the class that will cover dimensional modeling in Power BI, multiple fact tables, aggregate tables, and calculation groups.

Uploaded by

garimabh02
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
118 views24 pages

LWTN Data Modeling

This document provides an overview of dimensional data modeling concepts and techniques for Mitchell Pearson's LWTN Data Modeling class. It discusses key dimensional modeling terminology like facts, dimensions, and star schemas. It also covers best practices for dimensional modeling like keeping models simple, scalable, and focused on key business questions. The document provides an agenda for the class that will cover dimensional modeling in Power BI, multiple fact tables, aggregate tables, and calculation groups.

Uploaded by

garimabh02
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 24

LWTN Data Modeling

Mitchell Pearson
About Mitchell
Highlights:

• Consultant and Trainer

• Pragmatic Works 10 years

• Author

• Blogs at MitchellPearson.com

• Family: Wife & 3 kids

• Table Top Games

• YouTube.com/c/PragmaticWorks

• mpearson@pragmaticworks.com
Let’s Review
Logistics
Class Files

Course Completion Certification

11:00am – 2:00pm (Eastern)

Break – 12:25 (15 minute break)


Agenda
Dimensional Modeling
Facts and Dimensions
Star Schema

Creating a Data Model in Power BI

Multiple Fact Tables

Aggregate Tables

Calculation Groups (Role Playing Tables)

Storage Modes
Recommended Books
Data Warehouse Toolkit
- Ralph Kimball

Star Schema
- Christopher Adamson
Dimensional Modeling
What is a Data Model?
Things to consider….
1) What are you measuring?
2) What types of business problems are you trying to solve?
3) How much data are you working with?
4) What are your data sources?
Attributes of a Good Data Model
Can be easily understood and consumed

Large data changes are scalable

Provides predictable performance

Is flexible and adaptable, but not at the


expense of the other attributes
What is easier with a good data model?
Managing Storage constraints

Performance Tuning

Managing Row Level Security

Authoring DAX

Everything!
Star Schema
1
Many Fact table in the middle
1
Surrounded by Dims

1 Looks like a ‘Star’

Fact table is the “Many”


side of the (one to many)
relationship
Snowflake Schema
Dim Fact Dim Center is a Star schema
Fact table in middle
Surrounded by Dims
Dims “snowflake” off of other Dims
Flake
Dim If you have many, it looks like a
‘Snowflake’
Snowflake Dim or Fact tables can be the “Many”
side of the relationship
Model Types
Conceptual Logical Physical
Sales Fact Sales
Sales Transaction Number Fact Sales ID [int] IDENTITY(1,1) NOT NULL
Product Transaction Number [varchar](64) NULL
Sale Date Product ID [int] NOT NULL
Sales Amount Sale Date ID [int] NOT NULL
Sales Amount [decimal](19,2) NULL

Products Product
Product
UPC
Name Product ID [int] IDENTITY(1,1) NOT NULL
Description UPC Code [varchar](12) NOT NULL
Product Name [varchar](128) NOT NULL
Product Description [varchar](512) NULL
Conceptual
Model
Dimensional Model Design
Dimensional Model Basics | Model Types
Dimensional Model – Terminology
Dimensional Model
Organizes the data so it is easy to retrieve for
reporting purposes
Fact Table
A fact is an event that may or may not include
measures.
Dimension Table
Category of information, or a noun, descriptive
Attribute (column in dimension table)
Descriptor of the object
Fact Tables
Fact Table
Contains Measures
(or items to be aggregated)
of a business process
Examples
Claim Amount, Screenings, Total
Claims, Cost
Measures
Usually sliceable
Examples:
By Month, By Member
Fact
Table
Fact Tables
Fact
A fact is an event that may or may not
include measures

Granularity
Lowest level of information that will
be stored in the fact table, or the
values that would make the row
distinct compared to all other rows
Dimensions
Dim Table
A Dim (or Dimension)
table contains descriptive
attributes that define how a
fact should roll up

Examples:
By Month
By Customer
Dim Dim By Geo
Table Table
Relationships

Relationships
Connection between 2
tables (usually fact & Dim
tables) using columns
from each
Types of Relationships
1 to Many
1 to 1
Many to Many
(with a bridge table)
Relationships
Dimensional Model - Structure

Highly Denormalized
Tables merged logically for
reporting
Table Types
OR
Only Facts and Dimensions
Necessary Fields Only
No Unnecessary Attributes
Defining Dimension
• “Dimensions provide the “who, what, where, when, why, and how” context
surrounding a business process event.
• - Ralph Kimball
Dimension Architecture
• Wide Table
• Surrogate Key (Unique ID)
• Natural Key
• Best Attributes are Desciptive
• Start Date / End Date
• Flags
DEMO TIME!!

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy