Ab Initio - V1.1
GDE 1.10.9.3
[Figure: data warehouse architecture. Labels: End User Workstations; Central Repository; Meta Data; System Monitoring. Function groups:
• Extract, Scrub, Transform
• Load, Index, Aggregation
• Design, Mapping
• Replication, Data Set Distribution
• Access & Analysis, Resource Scheduling & Distribution]
There Are Many Options
[Figure: warehousing options. Labels: Operational Source Systems; Extraction Systems; Operational Data Store; Architected Data Mart; Data Warehouse; Independent Data Mart; User Workstations]
Types of Warehousing Solutions
Volatility of Contents: Very Volatile, Volatile, Non-Volatile
[Figure: ETL data flow. Sources: External Data, SQL Server, Informix, eCRM, SAP R/3. ETL stages: Extract, Transform, Cleanse, Loader. Target: Enterprise Data Warehouse (DW Storage Mgmt.)]
ETL Process
• Extraction:
– The extraction process replicates data by selecting it
from one or more source databases
• Transformation:
– After data is extracted, business rules, such as filtering,
summarizing, merging, transposing, or derivations, should be
applied to the extracted data
• Cleansing:
– The cleansing function determines which values violate the
business rules and either rejects them or transforms them to “cleanse”
the data, bringing it into compliance with the warehouse
• Loading:
– After cleansing the data, this process loads the transformed
records into the enterprise data warehouse
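The four stages above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not how Ab Initio implements them: the row layout, the `amount >= 0` business rule, and every function name here are hypothetical.

```python
# Hypothetical source rows; in practice they would be selected from
# one or more source databases.
SOURCE_ROWS = [
    {"product_id": 1, "store_id": 10, "amount": "19.99"},
    {"product_id": 1, "store_id": 10, "amount": "5.01"},
    {"product_id": 2, "store_id": 11, "amount": "-3.00"},  # breaks the rule below
    {"product_id": 2, "store_id": 11, "amount": "7.50"},
]


def extract(source):
    """Extraction: replicate rows by selecting them from a source."""
    return [dict(row) for row in source]


def transform(rows):
    """Transformation: apply a derivation (text amount -> numeric amount)."""
    return [{**row, "amount": float(row["amount"])} for row in rows]


def cleanse(rows):
    """Cleansing: split rows on an assumed business rule, amount >= 0."""
    accepted = [row for row in rows if row["amount"] >= 0]
    rejected = [row for row in rows if row["amount"] < 0]
    return accepted, rejected


def load(rows, warehouse):
    """Loading: append the cleansed rows to the warehouse store."""
    warehouse.extend(rows)
    return len(rows)


def run_etl(source):
    """Run extract -> transform -> cleanse -> load; return both outcomes."""
    warehouse = []  # stand-in for the enterprise data warehouse
    accepted, rejected = cleanse(transform(extract(source)))
    load(accepted, warehouse)
    return warehouse, rejected


warehouse, rejected = run_etl(SOURCE_ROWS)
```

Keeping the rejected rows rather than dropping them silently mirrors the "rejects or transforms" choice in the cleansing step: a real pipeline would usually write them to a reject file for review.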
Warehouse Models
– Star
– Snowflake
– Constellation
Warehouse Model - Star
Product Table      Store Table
Product_id         Store_id
Product_desc       District_id
…                  ...
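As a rough illustration of how a star model is queried, the sketch below builds the two dimension tables named on the slide in an in-memory SQLite database and joins them to a central fact table. The `sales_fact` table, its `units` column, and all sample rows are assumptions made for this example; the slide shows only the Product and Store columns listed above.

```python
import sqlite3


def build_star(conn):
    """Create two dimension tables and one (hypothetical) fact table."""
    conn.executescript("""
        CREATE TABLE product (product_id INTEGER PRIMARY KEY, product_desc TEXT);
        CREATE TABLE store (store_id INTEGER PRIMARY KEY, district_id INTEGER);
        -- Assumed fact table: one row per sale, keyed by both dimensions.
        CREATE TABLE sales_fact (
            product_id INTEGER REFERENCES product(product_id),
            store_id INTEGER REFERENCES store(store_id),
            units INTEGER
        );
    """)
    conn.executemany("INSERT INTO product VALUES (?, ?)",
                     [(1, "widget"), (2, "gadget")])
    conn.executemany("INSERT INTO store VALUES (?, ?)",
                     [(10, 100), (11, 100), (12, 200)])
    conn.executemany("INSERT INTO sales_fact VALUES (?, ?, ?)",
                     [(1, 10, 5), (2, 11, 3), (1, 12, 4)])
    conn.commit()


def units_by_district(conn):
    """Join the fact table to the store dimension and total units per district."""
    cursor = conn.execute("""
        SELECT s.district_id, SUM(f.units)
        FROM sales_fact AS f
        JOIN store AS s ON s.store_id = f.store_id
        GROUP BY s.district_id
        ORDER BY s.district_id
    """)
    return cursor.fetchall()


def units_by_product(conn):
    """Join the fact table to the product dimension and total units per product."""
    cursor = conn.execute("""
        SELECT p.product_desc, SUM(f.units)
        FROM sales_fact AS f
        JOIN product AS p ON p.product_id = f.product_id
        GROUP BY p.product_desc
        ORDER BY p.product_desc
    """)
    return cursor.fetchall()


conn = sqlite3.connect(":memory:")
build_star(conn)
```

Each query touches the fact table plus exactly one join per dimension it needs, which is the property that gives the star model its name.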
• Ab Initio’s Core
• Highly extensible
• Enterprise Meta>Environment
[Figure: the EME shown with three Sandboxes]
Component Groups
• Compress components
• Continuous components
• Database components
• Dataset components
• Departition components
• Deprecated components
• FTP components
• Miscellaneous components
• Partition components
• Sort components
• Transform components
• Translate components
• Validate components
Component Organizer