Transformation Excel

The document describes several ETL transformations:
• The Aggregator transformation performs aggregate calculations like averages and counts on grouped data.
• The Expression transformation performs non-aggregate calculations on a single row.
• The Union transformation merges data from multiple pipelines into one.
• The Filter transformation filters out rows that do not meet specified conditions.
• The Joiner transformation joins data from two sources based on matching columns.
It provides important details on how each transformation handles data and considerations for optimizing performance.

Uploaded by

pankaj1986pank
Transformation   Active/Passive   Connected/Unconnected   Cache/No Cache
Aggregator       Active           Connected               Cache
Expression       Passive          Connected               No Cache
Union            Active           Connected               No Cache
Filter           Active           Connected               No Cache
Joiner           Active           Connected               Cache

Used For
The Aggregator transformation performs aggregate calculations, such as
average, count, first, last, max, median, min, percentile, stddev,
sum, and variance.
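
As a rough illustration of what grouped aggregate calculations look like, here is a minimal Python sketch. It is not Informatica code: the function name `aggregate_by_group`, the sample rows, and the choice of grouping key are all assumptions made for the example.

```python
from collections import defaultdict
from statistics import mean, median

# Hypothetical input rows: (group-by value, numeric value)
salary_rows = [
    ("sales", 100.0),
    ("sales", 300.0),
    ("hr", 200.0),
]

def aggregate_by_group(rows):
    """Collect values per group, then compute a few aggregates per group."""
    groups = defaultdict(list)
    for key, value in rows:
        groups[key].append(value)
    return {
        key: {
            "count": len(values),
            "sum": sum(values),
            "avg": mean(values),
            "min": min(values),
            "max": max(values),
            "median": median(values),
            "first": values[0],   # first value seen for the group
            "last": values[-1],   # last value seen for the group
        }
        for key, values in groups.items()
    }

salary_stats = aggregate_by_group(salary_rows)
```

Note that all rows of a group must be collected before the result can be produced, which is one way to picture why this transformation needs a cache.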

The Expression transformation is used to perform
non-aggregate calculations.
Use the Expression transformation to calculate values in a single
row before writing it to the target.
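
A row-level calculation of the kind the Expression transformation performs can be pictured with the short Python sketch below. The column names (`quantity`, `unit_price`, `total`) and the helper `add_total` are hypothetical and chosen only for illustration.

```python
def add_total(row):
    """Return a copy of the row with one derived column added."""
    out = dict(row)
    out["total"] = row["quantity"] * row["unit_price"]
    return out

order_rows = [
    {"quantity": 2, "unit_price": 5.0},
    {"quantity": 3, "unit_price": 1.5},
]

# One output row per input row: the row count does not change.
order_rows_with_total = [add_total(row) for row in order_rows]
```

Because each output row depends only on its own input row, nothing has to be held back between rows, which matches the Passive / No Cache classification.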

The Union transformation is a multiple input group
transformation that you use to merge data from multiple
pipelines or pipeline branches into one pipeline branch.
It merges data from multiple sources in the same way that
the SQL UNION ALL statement combines the results
of two or more SELECT statements.
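
The UNION ALL behaviour can be sketched in Python as a plain concatenation of two row streams. The pipeline names and sample rows are made up for the example, and the deduplication step is shown only to stress that it is a separate operation, not part of the merge.

```python
from itertools import chain

# Two hypothetical pipelines with identical column layout: (id, name)
pipeline_a = [("1", "Ann"), ("2", "Bob")]
pipeline_b = [("2", "Bob"), ("3", "Cy")]

# UNION ALL semantics: every row from every input, duplicates included.
merged = list(chain(pipeline_a, pipeline_b))

# Removing duplicates is a separate, later step (order-preserving here).
deduplicated = list(dict.fromkeys(merged))
```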

The Filter transformation filters rows as they pass through it
in a mapping. It allows rows
that meet the specified filter condition to pass through and drops
rows that do not meet the condition. You can filter data based
on one or more conditions.
A filter condition returns TRUE or FALSE for each row that
the Integration Service evaluates, depending on the specified
condition. Each row that returns TRUE passes through the
transformation. Each row that returns FALSE is dropped, and
the Integration Service writes a message to the session log.
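
The pass/drop behaviour of a filter condition can be sketched as follows. This is a simplified Python model, not the Integration Service itself: the helper `apply_filter`, the sample employee rows, and the use of a second list to stand in for the session log are all assumptions.

```python
def apply_filter(rows, condition):
    """Split rows into those that satisfy the condition and those that do not."""
    passed, dropped = [], []
    for row in rows:
        if condition(row):
            passed.append(row)    # condition is TRUE: row continues downstream
        else:
            dropped.append(row)   # condition is FALSE: row is discarded
    return passed, dropped

employees = [
    {"name": "Ann", "dept": "SALES", "salary": 1500},
    {"name": "Bob", "dept": "sales", "salary": 2000},
    {"name": "Cy", "dept": "SALES", "salary": 800},
]

# Two conditions combined; the string comparison is case sensitive.
kept, discarded = apply_filter(
    employees,
    lambda row: row["dept"] == "SALES" and row["salary"] > 1000,
)
```

In this sample only Ann passes: Bob fails the case-sensitive department check and Cy fails the salary check.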
The Joiner transformation joins source data from two
related heterogeneous sources residing in different
locations or file systems. You can also join data from
the same source. To use the Joiner transformation, the sources
must have at least one matching column.
The Joiner transformation uses a condition that matches
one or more pairs of columns between the two sources.
Important Points
• The expression can include non-aggregate expressions and conditional clauses
You can configure an input, input/output, output, or variable port for the group. When grouping data, the Aggregator tran
each group unless otherwise specified.
• To handle the Null values in Aggregate functions you have to configure the Integration Service. You can choose to treat n
NULL or zero. By default, the Integration Service treats null values as NULL in aggregate functions.
• If the Sorted Input option is checked but the input data is not sorted before it reaches the transformation, the session will
• Do not use sorted input if either of the following conditions is true:
> The aggregate expression uses nested aggregate functions.
> The session uses incremental aggregation
• Aggregate cache
Data and Index Cache
• Aggregate expression
• Group by port
• Sorted input

• We can create multiple input groups, but only one output group.
• All input groups and the output group must have matching ports. The precision, datatype, and scale must be identical ac
• The Union transformation does not remove duplicate rows. To remove duplicate rows, you must add another transforma
transformation.
• We cannot use a Sequence Generator or Update Strategy transformation upstream from a Union transformation.
• The Union transformation does not generate transactions.

• The filter condition is case sensitive.


• To maximize session performance, keep the Filter transformation as close as possible to the sources in the mapping. Rat
to discard through the mapping, you can filter out unwanted data early in the flow of data from sources to targets.
• The Source Qualifier transformation provides an alternate way to filter rows. Rather than filtering rows from within a ma
transformation filters rows when read from a source.
• You cannot use a Joiner transformation when either input pipeline contains an Update Strategy transformation.
• You cannot use a Joiner transformation if you connect a Sequence Generator transformation directly before the Joiner tr
The Joiner transformation supports the following types of joins:
Normal Join
With a normal join, the Integration Service discards all rows of data from the master and detail source that do not match,

Master Outer Join


A master outer join keeps all rows of data from the detail source and the matching rows from the master source. It discards the unmatched rows from the
master source.

Detail Outer Join


A detail outer join keeps all rows of data from the master source and the matching rows from the detail source. It discards the unmatched rows from the
detail source.

Full Outer Join


A full outer join keeps all rows of data from both the master and detail sources

A normal or master outer join performs faster than a full outer or detail outer join.
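
The four join types can be compared side by side with a small Python sketch. It is a simplified model under stated assumptions, not the Joiner's actual implementation: the function `join_rows`, the join-type strings, and the sample data are hypothetical, the join condition is a single equality on one key column, and the master rows are held in an in-memory dictionary while the detail rows are streamed past it.

```python
def join_rows(master, detail, key, join_type="normal"):
    """Join two lists of dict rows on `key` (hypothetical helper)."""
    valid = {"normal", "master_outer", "detail_outer", "full_outer"}
    if join_type not in valid:
        raise ValueError(f"unknown join type: {join_type}")

    columns = {col for row in master + detail for col in row}

    def pad(row):
        # Columns missing from one side come back as None (NULL).
        return {col: row.get(col) for col in columns}

    # Index the master rows by key so each detail row is one lookup.
    master_index = {}
    for m in master:
        master_index.setdefault(m[key], []).append(m)

    result = []
    matched_keys = set()
    for d in detail:
        matches = master_index.get(d[key], [])
        if matches:
            matched_keys.add(d[key])
            result.extend(pad({**m, **d}) for m in matches)
        elif join_type in ("master_outer", "full_outer"):
            # Master outer and full outer keep unmatched detail rows.
            result.append(pad(d))

    if join_type in ("detail_outer", "full_outer"):
        # Detail outer and full outer keep unmatched master rows.
        result.extend(pad(m) for m in master if m[key] not in matched_keys)
    return result

joiner_master = [{"id": 1, "name": "Ann"}, {"id": 2, "name": "Bob"}]
joiner_detail = [{"id": 2, "amount": 50}, {"id": 3, "amount": 70}]

normal_rows = join_rows(joiner_master, joiner_detail, "id", "normal")
master_outer_rows = join_rows(joiner_master, joiner_detail, "id", "master_outer")
detail_outer_rows = join_rows(joiner_master, joiner_detail, "id", "detail_outer")
full_outer_rows = join_rows(joiner_master, joiner_detail, "id", "full_outer")
```

With this sample data the normal join returns only the row for id 2, the master outer join adds the unmatched detail row (id 3), the detail outer join adds the unmatched master row (id 1), and the full outer join returns all three.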
If a result set includes fields that do not contain data in either of the sources, the Joiner transformation populates the emp
know that a field will return a NULL and you do not want to insert NULLs in the target, you can set a default value on the P

The number of rows the Integration Service stores in the cache depends on the partitioning scheme, the source data, and
transformation for sorted input.

To improve performance for an unsorted Joiner transformation, use the source with fewer rows as the master source. To i
Joiner transformation, use the source with fewer duplicate key values as the master.
