
Data Warehouse

This document provides an overview of key concepts related to data warehousing including: what a data warehouse is, its components and architecture, how it differs from transactional databases, and common data models used including star schemas, snowflake schemas, and fact constellation schemas. The document outlines common data warehousing processes like ETL and dimensions of data modeling. It also discusses analytical tools like OLAP that are used to analyze data stored in data warehouses.

Uploaded by

sharukh misbah
Copyright
© © All Rights Reserved

DATA WAREHOUSE

AGENDA
1. What is Data Warehouse?
2. Real Time Data Warehouse
3. Data Warehouse architecture
4. Data Warehousing processes
5. Characteristics
6. Components
7. Applications
8. DBMS vs Data Warehousing
9. OLAP
10. OLTP
11. Data modelling and data warehouse
12. Multidimensional data model
13. Architectural component
14. Star schema
15. Snowflake schema
16. Fact constellation schema
What is a Data Warehouse?

• A data warehouse is a system for storing, analyzing, and reporting on data.
• It is designed for query and analysis rather than for transaction processing.
• It usually contains historical data derived from transaction data, and can also include data from other sources.
• It is a type of data management system designed to enable and support Business Intelligence (BI) activities.
Real time Data Warehouse
Characteristics

• Subject-oriented: it can be used to analyze a particular subject area.
• Integrated: it integrates data from multiple data sources.
• Time-variant: historical data is kept in a data warehouse.
• Non-volatile: once data is in the warehouse, it does not change.
Components of Data Warehouse

• Central database: a database serves as the foundation of the data warehouse.

• Metadata: it specifies the source, usage, values, and other features of the data set.

• Access tools: they allow users to interact with the data in the data warehouse, e.g. OLAP tools.
Applications of Data Warehouse
DBMS vs DATA WAREHOUSING

DBMS                          DATA WAREHOUSING
• It is transaction oriented  • It is subject oriented
• Contains detailed data      • Contains historical data
• It records data             • It analyzes data
• Not for reporting           • It is for reporting
Data Warehouse architecture

ETL Process
Data Warehousing processes

• Data cleaning: filling in missing values, smoothing noisy data, removing outliers, and resolving inconsistencies.
• Data integration: integrating multiple databases, data cubes, and files.
• Data transformation: converting data from the host format to the warehouse format.
• Data loading: sorting, summarizing, consolidating, computing views, and checking integrity.
• Data refreshing: propagating updates from the data sources to the warehouse.
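
The cleaning, transformation, and loading steps above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a production ETL pipeline: the record layout, the field names (`store`, `amount`), and the policy of filling a missing amount with the mean of the known amounts are all hypothetical.

```python
from statistics import mean

# Hypothetical source rows: inconsistent store names and one missing amount.
source_rows = [
    {"store": " north ", "amount": 100.0},
    {"store": "North", "amount": None},
    {"store": "SOUTH", "amount": 50.0},
]

def clean(rows):
    """Data cleaning: fill missing amounts with the mean of the known amounts."""
    known = [r["amount"] for r in rows if r["amount"] is not None]
    fill = mean(known)
    return [{**r, "amount": fill if r["amount"] is None else r["amount"]} for r in rows]

def transform(rows):
    """Data transformation: normalize store names to one warehouse format."""
    return [{**r, "store": r["store"].strip().title()} for r in rows]

def load(rows):
    """Data loading: summarize the rows into a total amount per store."""
    totals = {}
    for r in rows:
        totals[r["store"]] = totals.get(r["store"], 0.0) + r["amount"]
    return totals

warehouse = load(transform(clean(source_rows)))
print(warehouse)
```

Each stage is a separate function so that a real pipeline could swap in a different cleaning policy or target format without touching the other stages.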
Data Modeling in Data Warehousing
Multidimensional data Model
• The multidimensional data model stores data in the form of data cubes.

• A data cube allows data to be viewed in multiple dimensions.

• The dimensions are the entities with respect to which an organization keeps records.

• The model provides both a mechanism for storing data and a basis for business analysis.

Dimensions and facts are the two components of the multidimensional data model.

• Dimensions are the descriptive (text) attributes by which data is analyzed.

• Facts are the numeric measures used to analyze the business.

• The model helps provide fast and accurate answers to complex business queries.
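
One way to picture a data cube is as a set of facts, each keyed by its dimension values, that can be aggregated along any subset of dimensions. The sketch below assumes made-up sales figures and hypothetical dimension names (`product`, `region`, `quarter`); it only illustrates the idea of viewing the same data along different dimensions.

```python
from collections import defaultdict

# Each fact holds (product, region, quarter) dimension values plus a numeric measure.
facts = [
    ("pen", "east", "Q1", 10),
    ("pen", "west", "Q1", 5),
    ("book", "east", "Q1", 20),
    ("pen", "east", "Q2", 7),
]
DIMENSIONS = ("product", "region", "quarter")

def view(facts, *dims):
    """Aggregate the measure over the chosen dimensions, summing out the rest."""
    idx = [DIMENSIONS.index(d) for d in dims]
    totals = defaultdict(int)
    for row in facts:
        key = tuple(row[i] for i in idx)
        totals[key] += row[-1]  # the last field is the measure
    return dict(totals)

by_product = view(facts, "product")
by_region_quarter = view(facts, "region", "quarter")
print(by_product)
print(by_region_quarter)
```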
Architectural component

Dimensional Model

Fact (Measure 1, Measure 2, Measure 3; FK)
Day-to-day business transactional data. A fact table mostly contains numerical data. In the retail domain, sales transactions, supply chain transactions, and customer feedback or complaints can be termed facts.

Dimension (PK; Attribute 1, Attribute 2, Attribute 3)
A complete description of an object. Dimensions may contain hierarchical data. In the retail domain, Products, Employees, Customers, Vendors, Promotions, and Time are considered dimensions.
OLAP (online analytical processing)
• It is a computing method that enables users to easily and selectively extract data.

• It is used to query data in order to analyze it from different points of view.

• OLAP business intelligence often helps in trend analysis, financial reporting, sales forecasting, budgeting, and other planning processes.
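
Analyzing data "from different points of view" usually comes down to a few standard OLAP operations. The slides do not name them, so the two shown here, slice (fix one dimension value) and roll-up (aggregate to a coarser level), are an illustrative choice, and the sales rows are made up.

```python
from collections import defaultdict

# Hypothetical monthly sales rows: (year, month, region, amount).
sales = [
    (2023, 1, "east", 100),
    (2023, 2, "east", 120),
    (2023, 1, "west", 80),
    (2024, 1, "east", 150),
]

def slice_by_region(rows, region):
    """Slice: fix one dimension value and keep only the matching rows."""
    return [r for r in rows if r[2] == region]

def roll_up_to_year(rows):
    """Roll-up: aggregate amounts from month level up to year level."""
    totals = defaultdict(int)
    for year, _month, _region, amount in rows:
        totals[year] += amount
    return dict(totals)

east_by_year = roll_up_to_year(slice_by_region(sales, "east"))
all_by_year = roll_up_to_year(sales)
print(east_by_year)
print(all_by_year)
```

A yearly trend per region, as used in sales forecasting, is a slice followed by a roll-up like the one above.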
OLTP (online transaction processing)

• It facilitates and manages transaction-oriented applications.

• OLTP enables a large number of database transactions made by a large number of people, typically over the internet.

• A database transaction is a change, insertion, deletion, or query of data in a database.

• OLTP systems drive many of the financial transactions made every day, including online transactions, ATM transactions, e-commerce, and in-store purchases.
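
A financial transaction of the kind listed above must either complete fully or leave no trace. The following sketch uses Python's built-in `sqlite3` module with an in-memory database; the `account` table, its `CHECK` constraint, and the `transfer` helper are hypothetical and exist only to illustrate this all-or-nothing behaviour.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE account (id INTEGER PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO account VALUES (?, ?)", [(1, 100), (2, 50)])
conn.commit()

def transfer(conn, src, dst, amount):
    """Move money between accounts; both updates commit or neither does."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE account SET balance = balance + ? WHERE id = ?", (amount, dst)
            )
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE id = ?", (amount, src)
            )
        return True
    except sqlite3.IntegrityError:
        # The CHECK constraint rejected an overdraft; the credit is rolled back too.
        return False

ok = transfer(conn, 1, 2, 30)       # succeeds
failed = transfer(conn, 1, 2, 500)  # would overdraw account 1, so it is undone
balances = dict(conn.execute("SELECT id, balance FROM account ORDER BY id"))
print(ok, failed, balances)
```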
Schema for Multidimensional Model

Star Schema:
• It is the simplest data warehouse schema; it is called a star schema because its diagram resembles a star.

• This schema organizes data in the form of facts and dimensions.

• In a star schema every dimension table is directly connected to the fact table.

• Each dimension table is joined to the fact table through a primary key to foreign key relationship.

[Diagram: a central Fact table surrounded by four Dimension tables]
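
The star layout described above can be sketched with Python's built-in `sqlite3` module. The table names (`fact_sales`, `dim_product`, `dim_date`), columns, and rows are hypothetical; the point is only that each dimension's primary key appears as a foreign key in the fact table, so every query joins the fact table directly to the dimensions it needs.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, name TEXT, category TEXT);
CREATE TABLE dim_date    (date_id INTEGER PRIMARY KEY, year INTEGER, month INTEGER);
-- The fact table holds the measure plus one foreign key per dimension.
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    date_id    INTEGER REFERENCES dim_date(date_id),
    amount     INTEGER
);
INSERT INTO dim_product VALUES (1, 'pen', 'stationery'), (2, 'novel', 'books');
INSERT INTO dim_date    VALUES (1, 2024, 1), (2, 2024, 2);
INSERT INTO fact_sales  VALUES (1, 1, 10), (1, 2, 15), (2, 1, 40);
""")

# Total sales per category and month: one join per dimension, no dimension-to-dimension joins.
rows = conn.execute("""
    SELECT p.category, d.month, SUM(f.amount)
    FROM fact_sales f
    JOIN dim_product p ON p.product_id = f.product_id
    JOIN dim_date d    ON d.date_id = f.date_id
    GROUP BY p.category, d.month
    ORDER BY p.category, d.month
""").fetchall()
print(rows)
```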
Schema for Multidimensional Model

Snowflake schema:

• It is more complex than the star schema.

• In a snowflake schema, dimension tables are connected to other dimension tables.

• The dimension tables of a snowflake schema are normalized to reduce redundancy.

[Diagram: a central Fact table whose Dimension tables connect to further Dimension tables]
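
Normalizing a dimension means moving a repeated attribute into its own table. In the sketch below, which again uses Python's built-in `sqlite3` module with hypothetical table names and rows, the category name is stored once in `dim_category` instead of being repeated on every product row, at the cost of one extra join per query.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- The category is split out of the product dimension (normalization).
CREATE TABLE dim_category (category_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE dim_product (
    product_id  INTEGER PRIMARY KEY,
    name        TEXT,
    category_id INTEGER REFERENCES dim_category(category_id)
);
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    amount     INTEGER
);
INSERT INTO dim_category VALUES (1, 'stationery'), (2, 'books');
INSERT INTO dim_product  VALUES (1, 'pen', 1), (2, 'pencil', 1), (3, 'novel', 2);
INSERT INTO fact_sales   VALUES (1, 10), (2, 5), (3, 40);
""")

# Reaching the category now takes two hops: fact -> product -> category.
rows = conn.execute("""
    SELECT c.name, SUM(f.amount)
    FROM fact_sales f
    JOIN dim_product p  ON p.product_id = f.product_id
    JOIN dim_category c ON c.category_id = p.category_id
    GROUP BY c.name
    ORDER BY c.name
""").fetchall()
print(rows)
```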
Schema for Multidimensional Model
Fact constellation schema:
• This schema has multiple fact tables.

• These fact tables share many dimension tables.

• This type of schema can be viewed as a collection of star and snowflake schemas; hence it is called a galaxy schema or a fact constellation schema.

• The main disadvantage of this schema is its more complex design.

[Diagram: two Fact tables with a shared Dimension table between them, surrounded by further Dimension tables]
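
A small sketch of two fact tables sharing one dimension, using Python's built-in `sqlite3` module. The tables `fact_sales` and `fact_shipments` and their rows are hypothetical. Each fact table is aggregated separately before being combined through the shared dimension; joining both fact tables to the dimension in a single join would multiply rows and inflate the sums.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, name TEXT);
-- Two fact tables reference the same dimension table.
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    units_sold INTEGER
);
CREATE TABLE fact_shipments (
    product_id    INTEGER REFERENCES dim_product(product_id),
    units_shipped INTEGER
);
INSERT INTO dim_product    VALUES (1, 'pen'), (2, 'novel');
INSERT INTO fact_sales     VALUES (1, 10), (1, 5), (2, 4);
INSERT INTO fact_shipments VALUES (1, 20), (2, 4), (2, 6);
""")

# Aggregate each fact table on its own, then line the totals up per product.
rows = conn.execute("""
    SELECT p.name,
           (SELECT SUM(s.units_sold) FROM fact_sales s
             WHERE s.product_id = p.product_id),
           (SELECT SUM(h.units_shipped) FROM fact_shipments h
             WHERE h.product_id = p.product_id)
    FROM dim_product p
    ORDER BY p.name
""").fetchall()
print(rows)
```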
Thank you