0% found this document useful (0 votes)
117 views26 pages

Ab Initio - V1.1

The document provides an overview of data warehousing and the Ab Initio ETL tool. It discusses data warehouse models including star schemas and snowflake schemas. It also describes the key components of Ab Initio including the CO operating system, component library, graphical development environment, and EME version control software. Sample Ab Initio code is referenced but not shown.

Uploaded by

Praveen Joshi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
117 views26 pages

Ab Initio - V1.1

The document provides an overview of data warehousing and the Ab Initio ETL tool. It discusses data warehouse models including star schemas and snowflake schemas. It also describes the key components of Ab Initio including the CO operating system, component library, graphical development environment, and EME version control software. Sample Ab Initio code is referenced but not shown.

Uploaded by

Praveen Joshi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 26

CO> Os 2.10.

15
GDE 1.10.9.3

Prepared by Praveen Joshi


Praveen.hans@gmail.com
Objectives

• Data Warehousing overview


• Introduction to ETL
• Overview of Ab Initio
• Ab Initio products
• Sample Ab Initio code
The Decision Making Roadmap
Business Planning
Actions Vision
Knowledge

Data Decision Information Executive


Transaction
Support Information
Systems
Systems Systems

RUN MANAGE GROW

• Operational • Analytical • Dimensional


• Functional • Subject • Subject
• Current • History • History
• Detailed • Detailed • Summary
Users Knowledge Brokers Management
A Data Warehouse Is A Process
Data Characteristics
• Raw Detail • Integrated • History • Targeted
• No/Minimal History • Scrubbed • Summaries • Specialized (OLAP)
Source OLTP Architected
Systems Data Mart
Data
Warehouse

End User
Workstations
Central
Repository

•Extract •Load
•Design •Replication •Access & Analysis
•Scrub •Index
•Mapping •Data Set Distribution •Resource Scheduling & Distribution
•Transform •Aggregation

Meta Data

System Monitoring
There Are Many Options
Operational User
Source Workstations
Systems E Operational
x Data Store
t
r
a
c Architected
t Data Mart
i
o Data
n
Warehouse
S
y
s
t
e
m Independent
s Data Mart
Types of Warehousing Solutions

• Operational Data Store


– integrated, current, detailed data for operational
activities
• Central Data Warehouse
– integrated, historic, summary and detailed data for
company-wide data analysis
• Data Mart
– independent, historic, summary data for a small group
of business users analyzing a specific business process
Operational Operational Data
Properties Data Mart
Source Systems Data Store Warehouse

Detailed Data Summary Information Single Function


Contents Detailed Data + Appropriate Summary + Appropriate Detail Summary

Timeliness Current Nearly Current Point-in-Time Point-in-Time

Updated Continually Frequently Periodically Periodically

Performance Tuned for Production Tuning Not Usually


Tuned for Update Tuned for Query
Needs Environment An Issue

Presentation Static Both Static & Flexible Flexible Management Focus

Amount of Data Controlled for Moderate


Low May Be Very High
Accessed Performance

Volatility of Non-Volatile
Very Volatile Volatile Non-Volatile
Contents
External
Data

SQL
Server
ETL
DW Storage Mgmt.
Extract
Informix
Transform Enterprise
Data
eCRM Cleanse Warehouse

Loader
SAP
R/3
ETL Process
• Extraction:
– The extraction process is a process of replicating by selecting
from one or more source databases
• Transformation:
– After data is extracted, business rules, such as filtering,
summarizing, merging, transposing, or derivations, should be
applied to the extracted data
• Cleansing:
– The cleansing function determines what values violate the
business rules and either rejects or transforms them to “cleanse”
the data, brining it into compliance warehouse
• Loading:
– After cleansing the data, this process loads the transformed
records into the enterprise data warehouse
Warehouse Models
– Star
– Snowflake
– Constellation
Warehouse Model - Star
Product Table Store Table
Product_id Store_id
Product_desc District_id
… ...

Sales Fact Table


Product_id
Store_id
– Central fact table Item_id
Day_id
– Radiating dimensions Sales_dollars
Sales_units
– Denormalized model ...
Time Table Item Table
Day_id Item_id
Month_id Item_desc
Period_id ...
Year_id
Warehouse Model - Star

– Easy for users to understand


– Fast response to queries
– Simple metadata
– Supported by many front end tools

– Less robust to change


– Slower to build
– Does not support history
Warehouse Model - Snowflake
Product Table Store Table
District Table
Product_id Store_id
District_id
Product_desc Store_desc
District_desc
District_id

Sales Fact Table


Item_id
Store_id
Sales_dollars
Sales_units

Time Table Item Table Dept Table Mgr Table


Week_id Item_id Dept_id Dept_id
Period_id Item_desc Dept_desc Mgr_id
Year_id Dept_id Mgr_id Mgr_name
Warehouse Model - Snowflake

– Direct use by some tools


– More flexible and suited to requirements
– Provides for speedier data loading
– May become large and unmanageable
– May degrade performance
– More complex metadata
Country State County City
Warehouse Model - Constellation
Warehouse Table Product Table Store Table
Warehouse_id Product_id Store_id
Warehouse_loc Product_desc District_id

Inventory Fact Table Sales Fact Table


Product_id Item_id
Shelf_id Store_id
Cost_dollars Sales_dollars
Qty_on_hand Sales_units
Time Table
Week_id Item Table
Period_id Item_id
Year_id Dept_id
Ab Initio
• High quality ETL tool

• Used in solving huge data


processing
problems in insurance, telecom, e-
commerce vertical

• Easy to use and deliver solutions


• High performance, scalability and
robust
Ab Initio - Products

This is what I want!!!


Co>Operating System

• Ab Initio’s Core

• Powerful Engine which supports all kind of data processing

• Delivers Parallelism and distributed computing services

• Gives platform independent data transport

• Check points and process monitoring

• Provides unlimited scalability, portability and flexibility


Component Library

• Contains set of very powerful reusable modules

• Best of the breed transformation capability

• Highly extensible

• Meta Data Driven – Adaptability

• Drag and Drop facility


Graphical Development
Environment
• Helps to build applications faster

• DRAG AND DROP facility

• From Ideas to Execution – Philosophy

• Good quantification of data volumes and execution times.

• Easy to understand and self documented

• Easy to Modify applications

Show the GDE window


EME

• Enterprise Meta>Environment

• Version control Software from AB INITIO

• Automatic Version control with the help of the TIME STAMPS

• Sandboxes – The local container for AB INITIO objects

• Check In processes – From Sandbox to EME

• Check out processes – From EME to sandbox


Check In – Check Out

Sandbox

EME Sandbox

Sandbox
Component Groups
• Compress components
• Continuous components
• Database components
• Dataset components
• Departition components
• Deprecated components
• FTP components
• Miscellaneous components
Contd...
...Contd

• Partition components
• Sort components
• Transform components
• Translate components
• Validate components
Component Organizer

• On the GDE menu bar, choose


View Component Organizer
• Press F9
Functions Of the Component
Organizer
• To display Component Folders in
the Component List Box.
• To view information about
components and folders by using
the Description Panel.
• To insert components into a graph.
• To create custom components.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy