Data Warehouse
Supervisor
Mr. Ahmed Ala’
By
Millat hakem merza 2023-2024
Contents
Introduction
Data Warehouse vs. Database, Data Lake, and Data Mart
Conclusion
Reference
Abstract
This report examines data warehousing: its foundational concepts,
architecture, components, types, and the benefits it offers to
organizations. A data warehouse, often referred to as an enterprise data
warehouse (EDW), is a system designed to gather and consolidate data from
diverse sources in order to support advanced data analysis, artificial
intelligence, and machine learning. The report also distinguishes data
warehouses from other data management systems and emphasizes their role in
facilitating better-informed decision-making and gaining a competitive
edge.
Introduction
In the era of data-driven decision-making, data warehouses play a pivotal
role in supporting organizations' analytical needs. A data warehouse, often
referred to as an enterprise data warehouse (EDW), is a sophisticated
system designed to gather and consolidate data from various sources into
a unified, central, and consistent data store. This centralization of data
empowers organizations to leverage data analysis, data mining, artificial
intelligence (AI), and machine learning to make informed business
decisions. Unlike standard databases, data warehouses are purpose-built
to handle vast volumes of historical data, often reaching petabytes in size.
A data warehouse is commonly organized as a three-tier architecture:
Bottom tier: The bottom tier consists of a data warehouse server, usually a
relational database system, which collects, cleanses, and transforms data
from multiple data sources through either Extract, Transform, and Load
(ETL) or Extract, Load, and Transform (ELT). The two differ in whether the
data is transformed before or after it is loaded into the warehouse.
Middle tier: The middle tier consists of an OLAP (online analytical
processing) server, which enables fast query speeds. Three OLAP models can
be used in this tier: ROLAP (relational OLAP), MOLAP (multidimensional
OLAP), and HOLAP (hybrid OLAP). The choice of model depends on the type of
database system in use.
Top tier: The top tier is a front-end user interface or reporting tool,
which enables end users to conduct ad-hoc analysis of their business data.
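The three tiers above can be illustrated with a minimal sketch. It assumes an in-memory SQLite database as a stand-in for the warehouse server, a plain SQL aggregation as a ROLAP-style stand-in for the OLAP server, and a text report as the front end. The table name, sample rows, and function names are all hypothetical and chosen only for illustration; a real warehouse would use dedicated tools at each tier.

```python
import sqlite3

# Hypothetical source rows: (region, product, amount exported as text).
RAW_SALES = [
    ("north", "widget", "10.50"),
    ("north", "gadget", "4.00"),
    ("south", "widget", "7.25"),
    ("south", "widget", "2.75"),
]


def load_bottom_tier(conn, rows):
    """Bottom tier: cleanse the rows and store them in a relational table."""
    conn.execute("CREATE TABLE sales (region TEXT, product TEXT, amount REAL)")
    cleaned = [(r.strip().title(), p.strip().lower(), float(a)) for r, p, a in rows]
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", cleaned)


def rollup_by_region(conn):
    """Middle tier (ROLAP-style): aggregate directly over the relational table."""
    cur = conn.execute(
        "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
    )
    return cur.fetchall()


def render_report(totals):
    """Top tier: format the aggregated result for an end user."""
    return "\n".join(f"{region}: {total:.2f}" for region, total in totals)


conn = sqlite3.connect(":memory:")
load_bottom_tier(conn, RAW_SALES)
totals = rollup_by_region(conn)
report = render_report(totals)
print(report)
```

Each function stands in for one tier, so the storage, the aggregation logic, and the presentation can be changed independently of one another.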
Components of Data Warehouse Architecture
Critical components of data warehouse architecture include:
• ETL (Extract, Transform, Load): ETL is the process employed by
database analysts to move data from source systems to the data
warehouse. It converts data into a usable format for analysis and
querying.
• Data Layer: The data layer provides access to the data and may include
data marts, which segment data for different user groups.
• Data Warehouse vs. Data Mart: Data marts are subsets of data
warehouses tailored for specific departments or business units,
allowing them to quickly access focused insights.
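The components listed above can be sketched in a few lines. The example assumes two hypothetical source systems with inconsistent formatting, an in-memory SQLite table as the warehouse, and a database view as a simple stand-in for a data mart. All names and sample records are illustrative assumptions, not part of any particular product.

```python
import sqlite3

# Hypothetical records from two source systems with inconsistent formats.
CRM_ROWS = [{"dept": "Sales", "customer": " alice ", "revenue": "120"}]
ERP_ROWS = [
    {"dept": "finance", "customer": "Bob", "revenue": "80.5"},
    {"dept": "SALES", "customer": "carol", "revenue": "40"},
]


def extract():
    """Extract: gather raw records from every source."""
    return CRM_ROWS + ERP_ROWS


def transform(rows):
    """Transform: normalize casing, whitespace, and numeric types."""
    return [
        (r["dept"].strip().lower(), r["customer"].strip().title(), float(r["revenue"]))
        for r in rows
    ]


def load(conn, rows):
    """Load: write the cleaned rows into the warehouse table."""
    conn.execute("CREATE TABLE warehouse (dept TEXT, customer TEXT, revenue REAL)")
    conn.executemany("INSERT INTO warehouse VALUES (?, ?, ?)", rows)


conn = sqlite3.connect(":memory:")
load(conn, transform(extract()))

# Data mart: a department-specific subset, modeled here as a view.
conn.execute(
    "CREATE VIEW sales_mart AS "
    "SELECT customer, revenue FROM warehouse WHERE dept = 'sales'"
)
sales_rows = conn.execute(
    "SELECT customer, revenue FROM sales_mart ORDER BY customer"
).fetchall()
print(sales_rows)
```

Modeling the data mart as a view keeps a single copy of the data in the warehouse while giving the sales department a focused subset to query.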