INFORMATICA
S.No
|
Transformation
|
I
|
O
|
V
|
R
|
M
|
L
|
Important
|
PAGE
|
1
|
Source Qualifier SQ A/C
|
*
|
*
|
|
|
|
|
Distinct,
|
8
|
2
|
Filter FIL A/C
|
*
|
*
|
|
|
|
|
Expression
|
4
|
3
|
Expression
EXP P/C
|
*
|
*
|
*
|
|
|
|
Group by port, Sorted input, Expression
|
|
4
|
Sequence Generator
SEQ P/C
|
*
|
*
|
|
|
|
|
NEXTVAL,CURVAL
|
10
|
5
|
Router RTR A/C
|
*
|
*
|
|
|
|
|
Expression
|
|
6
|
Union UN A/C
|
*
|
*
|
|
|
|
|
|
|
7
|
Transactional control
TC A/C
|
*
|
*
|
|
|
|
|
|
|
8
|
Sorter SRT A/C
|
*
|
*
|
|
|
|
|
Distinct
|
7
|
9
|
Normalizer
NRM A/C
|
*
|
*
|
|
|
|
|
Sequence, restart
|
|
11
|
Stored Procedure
SP P/C-UC
|
*
|
*
|
|
*
|
|
|
Bad file created
|
|
12
|
Joiner JNR A/C
|
*
|
*
|
|
|
*
|
|
Sorted Input
|
|
13
|
Aggregator AGG A/C
|
*
|
*
|
*
|
|
|
|
Group by port, Sorted input
|
|
14
|
Rank RNK A/C
|
*
|
*
|
*
|
*
|
|
|
Group by port, Expression
|
|
15
|
Look-up LKP A/C-UC
|
*
|
*
|
|
*
|
|
*
|
|
|
16
|
XML Source Qualifier
XML A/C
|
|
|
|
|
|
|
|
|
17
|
Custom CT A-P/C
|
*
|
*
|
|
|
|
|
|
|
18
|
External Procedure EP P/C-UC
|
*
|
*
|
|
*
|
|
|
|
|
19
|
Update Strategy
UPD A/C
|
*
|
*
|
|
|
|
|
DD_INSERT,
DD_UPDATE,
DD_DELETE, DD_REJECT - UP
as UP , UP as IN, UP else In , Tranc
EXPRESSION - Reject file created
|
|
20
|
Midstream XML Parser
|
|
|
|
|
|
|
|
|
21
|
Midstream XML Generator
|
|
|
|
|
|
|
|
|
22
|
Application Source Qualifier
|
|
|
|
|
|
|
|
|
23
|
MQ Series Qualifier
|
|
|
|
|
|
|
|
|
24
|
Application Multi group Qualifier
|
|
|
|
|
|
|
|
|
1. AGGREGATOR TRANSFORMATION
a/c - AGG
Definition
The
Aggregator transformation is Active and Connected.
The
Aggregator transformations allow you to perform to aggregate calculation.
You can
use the aggregator transformation to perform calculation on groups.
The Row
which meet the condition are passed to target.
The
doesn’t meet the condition, rejected row store rejected file or Bad file
directory.
Aggregator Functions
AVG MAX STDDEV
COUNT MEDIAN SUM
FIRST MIN VARIANCE
LAST PERCENTILE
PORTS
INPORTS -
(I) each input Receive data
OUTPORTS - (O) Pass the data to other transformation
VRIABLE PORTS - (V) its stores the Intermediate result it can reference input ports
Not
to out ports
GROUP BY PORT -
PROPERTIES
Cache Directory - $PMCaheDir
Tracing Level - Normal ( Terse / Normal / Verbose
initialization /
verbose data )
Sorted
Input -
Aggregator Data Cache -
2000000 Bytes
Aggregator Index Cache - 1000000 Bytes
Transformation Scope - All input (Transaction / All Input)
COMPONENT
Aggregate
Expression - Non aggregate Expression / Conditional
Class
Aggregate
Cache -
Group by
Port - Which column you want group by
Eg. Dept
Sorted
Input - Reduce the amount of data
cached
AGGREGATOR CACHE:
The PCS stores data
in the aggregate cache until it complete the aggregator calculation
Index
Cache : It stores the group value, As Configured in the group by
port
Data
Cache : Stores calculation ( Row data Stores, output value) Based
on group-by-ports
OPTIMIZATION
-
Group
by simple columns like numbers instead of string or date
-
Use
sorted input
-
Use incremental aggregation
-
Minimize the aggregate function
-
Before filter transformation best ( Reduce the Data)
-
Lookup unconnected & stored procedure – we can call
2. EXPRESSION Transformation
P/C - EXP
Definition
Expression
Transformation Passive and connected transformation
This can
be calculate values in a single row
before writing to the target.
Row by
row calculation,
Perform the any non aggregate
function
EXPRESSION Functions
The Expression Transformation is used for data cleansing
and scrubbing
There are over 80 functions within PowerCenter, such as
salary, concatenate, instring, rpad,
ltrim and we use many of them in the Expression
Transformation.
We can also create derived columns
and variables in the Expression Transformation.
COMPONENT
Expression - we can call – Unconnected Stored Procedure and
Unconnected Lookup
PORTS
INPORTS - (I) Each input port Receive data
OUTPORTS - (O) which provide the value to either target or
next transformation
in the mapping is called output ports
VARIABLE PORTS - (V) Its
stores the Intermediate result it can reference input ports
(
-Which stores the variable information )
PROPERTIES
Tracing Level -
Normal ( Terse / Normal / Verbose initialization /
verbose data )
OPTIMIZATION
-
Factoring
out common logic
-
Minimizing
aggregator function calls. For Eg.use SUM(A+B)
instead of using SUM(A) + SUM(B)
-
Replacing
common sub expression with local variables
-
Choosing
Numeric Vs String operation
-
Choose
DECODE function Vs LOOK UP operation
-
Choose CONCAT operation for Eg use ||’|| instead of CONCAT (Fname,
Lastname)
-
you can enter multiple expression in a Single Expression Transformation.
3. FILTER Transformation
a/c - FIL
DEFINITION
-
This is a type of
active and connected Transformation which is used to filter of the source rows
based on a
condition.
-
Only the row which
meet the condition are pass through to target.
-
Any kind of source we can use filter
Transformation
-
Filter condition drops
data that does not match the condition
-
We can put one or more condition (more condition
means we can use AND , OR operator)
-
Discards rows don’t appear in the session log or
reject files
PORTS
INPORTS
- (I) Receive data from source
OUTPORTS - (O) Pass the data to other Transformation
PROPERTIES
Filter
Condition : <put
Condition>
Tracing
Level : Normal (Normal / Terse / Verbose
init / Verbose data )
OPTIMIZATION (TIPS)
¨ Use the filter transformation early in the mapping (or) nearly in SQ
¨ The filter condition is case sensitive, and queries in some database do
not take this into
account.
TROUBLESHOOTING
Case
sensitivity : the filter condition is case sensitive
Appended spaces :
use the RTRIM function to remove
additional space
FUNCTIONS
-
You can use one or more condition in filter
transformation
-
AND , OR logical
operator through
4.
JOINER Transformation
a/c - jnr
Definition
-
This is active and
connected Transformation.
-
Can be used to join
two sources coming form two different locations or same location.
-
We can use homo genius and hetero genius sources
-
Join a flat file and a
relational sources or to join two flat files or to join a relational source
and
a XML source.
CONDITION
1).
Two sources there must be at least one matching ports or columns
2).
Two sources there should have Primary key and Foreign key relationship
PORTS
INPORTS - (I) Receive
data from source
OUTPORTS - (O) Pass the data to other Transformation
MASTERPORTS - (M) If checked master(small) otherwise details
(large)
(to switch the Master Details relationship for the source )
PROPERTIES
1 Cache
sensitive String Comparison - (Character
data only enable)
2 Cache Directory - $PMCacheDir
3 Join Condition -
4 Join
Type - NORMAL (Normal
- M.outer -D.outer- Full Outer)
5 Null ordering in Master - Null is highest value
(Null is lowest value)
6 Null
ordering in Detail - Null is highest value
(Null is lowest value)
7 Tracing Level - Normal (Normal / Terse / Verbose
init / Verbose data )
8 Joiner
Data cache size - 2000000
9 Joiner
Index cache size - 1000000
10 Sorted
Input -
11 Transformation Scope - All input (Transaction / All Input )
COMPONENT
Case sensitive string comparison - (Character data only enable)
Cache directory -
Join
condition -
Joiner
type - ( Normal ,
Master Outer, Detail Outer, Full outer)
CACHE
Joiner Data cache size : Out
put value only
Joiner Index cache size : The
index cache holds rows from the master source that are in
the join condition.
Index
cache
Stores
index values for the master source table as configured in the join condition.
|
Data
Cache
Stores
master source rows.
|
Functions
Following types of source can be used in a
joiner
-
Two relational tables
existing in separate databases
-
Two flat files in
potentially different file systems
-
Two different ODBC
sources
-
Two instances of the
same XML sources
-
A relational table and
a Flat file source
-
A relational table and
a XML source
A joiner cannot contain the following types
of source
-
Both pipelines begin
with the same original data sources.
-
Both input pipeline
originate from the same source qualifier transformation
-
Both input pipeline
originate from the same normalizer transformation.
-
Both input pipeline
originate from the same joiner transformation.
-
Either input pipeline
contains an update strategy transformations
-
Either input pipeline contains a connected or unconnected
sequence Generator transformation.
PERFORMANCE
-
Use sorted input (flat file ,relational data,)
-
Minimizing the disk
input and output
-
Use in front of sorted
transformation
-
For an unsorted joiner
transformation, designate as the master source the source with fewer rows
-
For an sorted joiner
transformation, designate as the master source the source with fewer duplicate
key values
-
Following Transformation
we can’t use before the joiner Transformation.
-
Sequence Generator
Transformation directly
-
Update strategy
Transformation
TIPS
-
sorted input – improve the session performance.
-
Don’t use following transformation sort origin and
joiner transformation
-
Custom , Unsorted aggregator, Normalizer, Rank.
-
Sort order from both table( master & Detail)
-
Normal
or Master outer join perform than a full outer or detail outer join.
Normal - Matched Rows form master and detail
source
Master -
all rows data from the detail source and the matching rows from the master
source
Detail -
all rows data from the master source and the matching rows from the detail
source
Full outer - all rows rows of data from both the
master and detail sources
5.
RANK TRANSFORMATION
A/C - RNK
Definition
This
an Active and Connected Transformation
Which is
used to identify the Top or Bottom rank of data based on condition.
Rank
transformation to return the largest or smallest numeric value in a port or
group
We can use a rank transformation to return the
strings at the top or the bottom of a session
sort order.
Functions
ascii - Binary
sort order
UNICODE -
Session sort order in session properties (code Page)
Binary
sort order
Binary value string and returns rows with the highest
binary values for string
PORTS
INPUT ( I) - minimum of one
OUTPUT (O) -
minimum of one
VARIABLE (V) -
Stores values or calculations to
use in an expressions
RANK (R) - Only one (default port-only out put return only)
EXPRESSION -
GROUP BY PORT -
PROPERTIES
CACHE DIRECTORY -
$PMCacheDir
TOP / BOTTOM -
TOP
NUMBER OF
RANKS -
CASE
SENSITIVE STING COMPARISON -
TRACING LEVEL -
normal
RANK DATA
CACHE SIZE -
2000000
RANK INDEX
CACHE SIZE -
1000000
TRANSFORMATION
SCOPE -
All input (All input/ Transformation)
CACHE
Index Cache
|
Data Cache
|
Stores
group values as configured in the group by ports.
|
Stores
ranking information based on the group by ports.
Can
must run the session on a 64bit PoweCen
|
PERFORMANCE
- Configure ASCII mode
6. Sorter Transformation
A/C - SRT
DEFINITION
¨
It allows to sort data
either in ascending or descending according to a specify sort key (field)
¨
Also used to configure
for case- sensitive sorting and specify whether the output rows should
be distinct.
FUNCTIONS
- Sort data from relational or flat file
source.
- The
sorter transformation treats the data passing through each successive
sort key port as a
Secondary sort of the previous port.
COMPONENT
DIRECTION (V) - Ascending
or Descending
PORTS
INPORTS (I) - Receive
data from source
OUTPORTS (O) - Pass
the data to other Transformation
KEY (V) - Which
one u want to sort the A/D)
PROPERTIES
Sorter cache size :10000000 #input rows +
[ (∑column
size) + 16]
Case sensitive : (enable)Uppercase
higher than lower case
Work directory : #PMTempDir
(Temp file store-sorting time
Distinct : enable– eliminate duplicate value in out put
Tracing
level : normal (
Terse / Normal
/ Verb init / Verb data )
Null
treated low : (enable–treat
null values higher than any other value)
Transformation scope :All input 1)
Transaction 2) All Input
SORT DATA
Each
successive sort key port as a secondary sort of the previous port
FORMULA
# input
rows [( S column size ) + 16]
PERFORMANCE
-
Sorter transformation to sort data passing through an
Aggregator transformation configured to use
sorted
transformation
-
You should configure sort criteria to PCS applies to
all sort key ports
7. SOURCE QUALIFIER Transformation
A/C - SQ
DEFINITION
When adding a
relational or flat file source definition to a mapping it is must to connect it
to a
Source Qualifier Transformation.
The
Source Qualifier transformation represents the rows that the powerCenter server
reads when
it runs a session.
FUNCTIONS
& Perform
Overriding
the default SQL query - Only relational
Filtering
the record -
Only relational
Join the
data from two or more tables etc, - Same
source database
IMPORTANT TOPIC
Target load order - Constraint
based load
Parameter & variable - $$$
session start time
Default query -
SQL Overwrite -
Override the default SQL query (user
defined join, source filter, no of sorted ports, select distinct
setting
PORTS
INPORTS (I) - Receive data from source
OUTPORTS (O) - Pass the data to other Transformation
PROPERTIES
SQL Query - (custom query replace the default query)
User
Defined join - (user defined join)
Source filter - (filter condition)
No of Sorted ports -
0 (order
by includes no of ports-sort order)
Tracing
level - normal (
Terse / Normal
/ Verb init / Verb data )
Select
Distinct - (enable-unique values from source) only enable flat file
Pre SQL - (before reads to the source)
Post SQL - (after it writes to the target)
OPTIMIZATION
- Use the
source qualifier to filer. The source qualifier limits the row set extracted
from the source where
as filter limits the row set sent to a
target.
PERFORMANCE
-
Join data originating from the same source database
-
Filter rows when the PCS reads source data
-
Specify an outer rather than the default inner join
-
Specify sorted ports
-
Select only distinct values from the source
-
Create custom query to issue a special select statement
for the PCS to read source data
-
Data type we can’t change, if you can change mapping is invalid.
Target Load Order:
-
Multiple SQ connected multiple target.
-
One SQL provide multiple target you can enable
constraint based loading in a session to have the
PCS load data based on target table PK
& FK relationship.
Default Join:
-
PK – FK Relationship
-
Matching Data Type
Custom Join:
-
Custom don’t
have PK & FK relationship
-
Data type of columns used for the join don’t match
Outer Join support :
- Default query outer join statement nested
query created -( left outer, right outer, full outer)
8. rOUTER Transformation
A/C - RTR
DEFINITION
-
This is an active and
connected Transformation
-
Similar to Filter
Transformation
-
Single input multiple
Target opp to union Transformation.
-
Processing the
incoming data only once and passes the output to multiple groups and routes
data to the default O/P group that do not meet the condition
FUNCTIONS
-
Router Transformation
in a mapping the PCS process the incoming data only once.
-
Router Transformation
efficient of Filter Transformation
-
Router Transformation
one input group multiple output group
(user
define output group (many) & default output group (one only)
PORTS
Group
Input group - user
define group to test a condition based on incoming data
Output group - 1.
user defined group
2. Default group
-
we can’t modify on
delete output ports
-
only connected target
group
-
out put group of
sequential only default created.
-
If you want the PCS to
drop all rows in the default group, don’t connect it to a transformation or a
Target in a mapping.
-
If rows meet more then
one group filter condition, the PCS Passes this rows multiple time
PROPERTIES
Tracing Level -
COMPONENT
Input and Output groups
Input and Output ports
Group filter conditions
PERFORMANCE &TIPS
-
One group can be
connected to more then one transformation or target
-
One output ports in a
group can be connected to multiple transformation or targets.
-
Multiple output ports
in one group can be connected to multiple transformations or targets
-
More than one group
cannot be connected to one transformation or target
-
We can’t connect more
then one group to multiple input group Transformation, except for joiner
transformations, when you connect each output group to different input group.
9. sEQUENCE GENERATOR Transformation
P/C - SEG
DEFINITION
Sequence Generator
Transformation generates the numeric
values.
SGT
to create unique primary key values, cycle through a sequential range of
numbers
Common
Use: - SG when you perform multiple loads to a
single target
- Replacing the missing values.
We
can’t connect to more then one transformation
FUNCTIONS
CURRVAL - NEXTVAL + INCREMENTAL BY VALUE
One row in each block
Currval port without connecting the nextval
port
PowerCenter server passes a constant value
for each row
One row in each block
NEXTVAL - primary key – down stream transformation
PORTS ( Both are default port )
INPORTS
- Receive data through unconnected Transformation
OUTPORTS
- Pass the data to other transformation
- 2 Default output ports 1. NEXTVAL ,
2. CURVAL
PROPERTIES
Start value - 0 cycle
option (enter the value complete the cycle value after restart the
value
Increment by - 1 D
b/w 2 consecutive values from the nextval port the default values is 1
End value
- (1-2147483647) the
maximum values powerCenter generates.
- sequence is not configured to cycle
it fails the session.
Current value- enter
you want first value PC server to use in sequence
-must
be generate than or equal to the start value and less than the end
value.
Cycle - If
selected –sequence range (up to limit)
- If not
selected – session failure with overflow error.
Number
of cached values –1 no of cached values determine the number
of values the PC
server caches at one time
Reset - If
selected, PC generates values based on the original current value for each
session (other wise)
Reflect the last – generated value
(Reusable is disabled for reusable sequence generator
Transformati
Tracing level - level
of information
PERFORMANCE
NON REUSABLE SEQUENCE GENERATOR
Cache
enable limit grater than 0.
-
row skipped the value.
-
discards the unused values.
REUSABLE
SEQUENCE GENERATOR
-Cache
enable some upto limit Eg.1000
OpTIMIZATION
-
Use reusable sequence
generator if the same sequence generator is to be used in more than one sessions.
-
Optimize performance
by connecting only the nextval port in a mapping
-
Sequential reusable
and use it in multiple mapping.
functions
-
Perform the following task with a
sequence generator Transformation
-
Create keys
-
Replace missing the value
-
Cycle through a
sequential range of numbers
10.
update strategy Transformation
A/C
- UPD
DEFINITION
This is an active
and connected transformation.
It
is used to update in data in target table, either to maintain history of data
or recent changes
It
is used to flag the records for Insert, update, Delete and Reject rows in the
Target database.
It is
used in slowly changing dimension to update the target table.
This
transformation used to SCD-1,SCD-2 and SCD-3 type.
FUNCTIONS
Set at a 2 levels
1. Within a session – treat
all records in the same way
for example,
treat all records as
DD_INSERT-0,DD_UPDATE-1,DD_DELETE-2,DD_REJECT-3,
2. Within a Mapping Levels – Flag records for insert, update, delete or reject
·
Insert.
Select this option to insert a row into a target table.
· Delete.
Select this option to delete a row from a table.
·
Update.
You have the following options in this situation:
Update
as update. Update each row flagged for update if it
exists in the target table.
Update
as insert. Inset each row flagged for update.
Update
else Insert. Update the row if it exists. Otherwise,
insert it.
·
Truncate table.
Select this option to truncate the target table before loading data.
PORTS
INPORTS (I) - Receive data from source
OUTPORTS
(O) - Pass the data to other Transformation
PROPERTIES
Update strategy expression -0 (DD_INSERT-0,DD_UPDATE-1,DD_DELETE-2,DD_REJECT-3,)
Forward rejected rows - enable - flags the rows for reject and writes them to the session
reject file.
Tracing level - normal (
Terse / Normal
/ Verb init / Verb data )
FUNCTION
SCD-1 : It keeps the
most recent updated values in the target
SCD-2 : It keeps the
full historical business information in the target
The full history is maintain by inserting
the new record in the target
SCD-3 : It keeps
previous value and current
PERFORMANCE
1.
Whenever use Dynamic
cache - at the time you must use UPD
transformation
2.
Dynamic lookup use you
can must select
1.
Select insert 2. select update as update 3. Don’t select
delete
3.
upd - > AGG
Use only – Update , Insert , Delete
4.
AGG - > UPD
Use only – Update , Insert , Delete, Reject
Update Strategy
Dynamic lookup – u
must use UPD transformation
Business Logic :
IFF((current
date>Previousdate) , DD_reject, DD_update)
|
|
|
|
- Dynamic lookup use you can must select
- Select Insert
- Select Update as update
- Don’t select delete
11. lookup Transformation
P/C & UC - LKP
DEFINITION
Look
up Transformation in lookup data in flat file or a relational table, view or
synonym
Get
a related value
Look up
transformation is used to perform the following task.
Get a related value
Perform a calculation
Update slowly changing dimension
FUNCTIONS
Relational
look up - dynamic cache use
Flat
file lookup - must use in
static cache
-
we can configure sorted input
PORTS
INPUT ( I) -
OUTPUT (O) -
LOOKUP (L) -
RETURN
(R) -
PROPERTIES
Look up SQL Override
(R) -
Lookup table name (R) -
Lookup caching enabled
(R/F) -
Lookup policy on multiple match ” -
Lookup condition -
Location information -
Source type -
Re
cache if stale
Tracing
level
Lookup
cache directory name
Lookup
cache initialize
COMPONENT
Look up table
Ports
Properties
Condition
Metadata
Extensions
LOOK UP CACHE
Persistent cache
Re cache from database
Static cache
Dynamic cache
Shared cache
PERFORMANCE
Cached lookup:By
indexing the columns in the lookup Order by
LOOK UP TRANSFORMATION
Look up
F
To look up data in
Flat File, Table, Synonym, View.
F
Use multiple lookup
transformation in a mapping
F
Performs the following
tasks
§
Get related value
§
Perform calculations
§
Updated slowly
changing dimension tables.
Connected Lookup
Static cache :
return value from the lookup query.
Dynamic cache :
Case 1 : No rows found in cache –
inserts the record
Case 2 :
Row found in cache - updates the
records
Unconnected Lookup
- Common use into
update slowly changing dimension – tables
- Returns one value
into the return port of look up transformation
Connected or Unconnected
- Receive input and
send output in different ways
Relational or flat file lookup
Cached or uncached
Dynamic -
Relational
Static -
Flat file
Cached -
Performance ( store the value whenever you want lookup table refer only
Uncached - each time lookup the value.
Connected Lookup Transformation
Unconnected Lookup Transformation
Relational & flat files lookups
v Relational Lookups -
Dynamic cache
v Flat file lookup -
Can use sorted input
-
Can use indirect file
-
Can sort null date
high
-
Can use case sensitive
string comparison
Look up Components
v Look source - cached lookup – order by
v Ports - Uncached lookup – select
v Properties -
v Conddtion -
Ports
Out port (O) -
Lookup (L) -
Return (R) - only
in connected lookup transformation
Properties
v Lookup SQL override
(R) -
v Lookup table name (R) -
Table, Synonym, View
v Lookup caching
enabled (R/F) -
v Lookup policy on multiple match (F/R)- enable mean (first,
last, return an error)
v Look Condition (F/R) -
v Connection information (R) -
v Source Type
(R/F) -
v Tracing Level (R/F) -
v Lookup cache Directory name (F/R) -
v Lookup cache Persistent
(F/R) -
v Look up Data cache size
(F/R) -
v Look up index cache size
(F/R) -
v Dynamic lookup cache
(R) - insert (or) updates – (only lookup cache enabled)
v Output old value on update (F/R) - use only with dynamic cache enabled
v Cache file name prefix (F/R) - use only with persistent lookup cache
- name prefix to use persistent lookup cache file
v Re cache from lookup source(F/R) - Rebuild the persistent cache file
v Insert else update (R) - use only with dynamic cache enabled
v Update else Insert (R) -
”
v Date Time format
(F) -
v Thousand separator (F) -
default no separator (‘,’
‘.’)
v Decimal separator (F) -
default period (,
.)
v Case sensitive string comparison(F) -
v Null ordering (F) -
v Sorted Input (F) -
Configuring Lookup Properties in
a Session
v Flat file lookups - (file name and file type)
v Relational Lookup - (u can define $source & $Target variable
in session)
Configuring Properties Flat file Lookup in a Session
v Lookup source file directory -
$LookupFileDir (default)
v Lookup source file name -
v Lookup source file name -
Direct
Configuration Relational Lookups in a Session
v Choose any relational
connection
v Connection variable , $DBconnection
v Specify database connection for $Source and $Target
Lookup Query
Default lookup query
·
SELECT - SQL override
·
ORDER BY - we can use enabled the cache ( u can’t view this )
Overriding the lookup Query
·
override the ORDER BY
statement
·
A lookup table
name(or) columns contain a reserved word - 'reserved word’
·
Use mapping parameter
& variables
·
A Lookup column name
contains a slash (/) character
·
Add where statement
·
Other
Overriding the ORDER BY Statement
Order
by - -
Reserved words
-
lookup or column names
contain a database reserved word
such as Month,Year – session fails
-
resword.txt (PC
initialization Directory)
Guideline to Overriding the Lookup Query
-
SQL override only
lookup SQL query relational only
-
Cache not enable PCS doesn’t recognize the override
-
Default query or
configure override – lookup / output port – add or subtract port from the SELECT statement, the session fails,
-
Filter before lookup
using dynamic cache when you add where clause to lookup SQL override
-
Override the ORDER BY
statement – session fail (because doesn’t contain condition port)
Don’t suppress order only
-
Reserved word session
fail if you want use quotes ( “ ”)
Step overriding the Lookup Query
Properties
Tab enter SQL override
Validate
to test
Lookup condition
1 .
Data type in condition must match
2.
Multiple condition –
use AND, OR
3.
Flat file for sorted
input – session fail (condition are not grouped so you select group column)
Uncached Static cache
1.
=, >, <, >=,
<= , !=
2.
multiple condition –
use AND , OR
3.
more then one lookup
condition (first meet all condition after another condition so you select GROUP
columns
Dynamic cache
1.
Only = operator
2.
can’t handling for
multiple matches – otherwise PCS fail
Lookup Cache
Index cache - Condition
value
Data cache - Output value
1). Persistent cache -
2). Recache from
lookup source -
3). Static cache -
4). Dynamic cache -
5). Shard cache -
Configuring Unconnected Lookup Transformation
-
:LKP – reference
qualifier to call the lookup within another transformation
-
Calling the same
lookup multiple time in one mapping
-
Syntax :
:LKP.Lookup_transformation_name(argument,argument,…..)
Unconnected use following kinds.
-
Add input port -
more then one condition
-
Add the lookup
condition
-
Designate a return
value
-
Call the lookup from
another transformation
Add input port
Design for source and target
Item_id out
IN_Item_id in
Add Lookup Condition
Item_id =
IN_Item_id
-
return condition is
false lookup return null
Designate a return value
-
Multiple input &
single output only
-
Update strategy or
filter expression
Call the lookup through an Expression
Eg.
IFF(Isnull(:LKP.lkpitems_dim(item_id,price)),DD_Update,DD_Reject
Creating
a Look Transformation
1.
Choose an existing
table or file definition
2.
Import a definition
from a relational or file
3.
skip a create a manual
definition
TIPs
1.
Add an index to the
column used in a lookup condition
2.
place condition with
an equality operator(=)first
LOOKUP CACHE
Index - PCS Condition value
Data Cache - Output Value
Default - $PMCacheDir
-
Data doesn’t fit in the memory cache
-
PCS stores the
overflow value in the cache file when
the session complete
-
Flat file lookup for
sorted input
1.
Persistent
Cache :
- Save and reuse
them the next time
2.
Recache from
source
-
persistent cache is
not synchronized with the lookup table
-
Rebuild
3.
Static Cache
-
Read only
-
Default cache
-
PCS doesn’t update the
cache
4.
Dynamic Cache
-
insert new rows or
update existing row
-
Dynamic insert &
update – pass data to target table
-
Can’t use flat file
5.
Shared Cache
-
use can use multiple
transformation
1. PERSISTENT CACHE :
-
PCS save or delete lookup cache files after
successful session based on the lookup cache persistent property.
-
Lookup table doesn’t
change between session you can configure the lookup transformation to use a
persistent lookup cache.
-
PCS saves & reuses
cache files from session to session so
eliminating time required to read the lookup table.
Non Persistent Cache
-
Enable caching in lookup transformation the PCS delete the
cache files at the end of a session
-
Next time you run the
session the PCS build the memory cache from the database.
Persistent cache
-
If you want save and
reuse the cache files you can configure the transformation
-
Use persistent cache
the lookup table doesn’t change between session runs
-
Lookup table changes
occasionally, can override session properties to re caches the lookup from the
database
-
Use Persistent cache
means you can specify a name for the cache file
PCS server handling of persistent caches
2. Rebuilding the Lookup
caches
-
Rebuild the Lookup
caches, lookup sources changed size the last time the PCS build the cache
-
When you rebuild the
cache the PCS create new cache file overriding existing persistent cache file
-
The PCS server write a
message to the session log file when if rebuild the caches
-
Don’t choose the to
recache the lookup source PCS automatically rebuild the persistent cache
3. STATIC CACHE (or) UCACHED LOOKUP
- The PCS builds the
cache when it process the first lookup request
-
PCS doesn’t update the cache.
-
Condition true –
connected lookup transformation return values represent by lookup/output ports.
-
Condition true -
unconnected lookup transformation return values represent by return ports.
-
Condition is not true
– connected lookup transformation return values to output port.
-
Condition is not true
– unconnected lookup transformation return null values to returns port.
-
Multiple partition
means PCS create one memory cache for each partition .
4. DYNAMIC LOOKUP CACHE
-
Insert the row into
the cache
-
Update the row in the cache
-
Makes no changes to
the cache
- Some situation
when you can use dynamic lookup cache
1.
Update a master customer table with new & updated
customer information
- Static lookup
cache -
fact file
- Dynamic lookup
cache - Relational table
2.
Loading data into a
slowly changing dimension table & a fact table
3.
Router or filter - use
4.
Multiple partition in
a pipeline that use a dynamic lookup cache the PCS create one me memory cache
and one disk cache for each transformation .
New lookup row
port
-
Target table
synchronized
-
Ignore Null input for
updates
-
Ignore in comparison
·
Ignore Null values
Using the
Associated Input port :
-
You must associated each lookup/out port with an
input/output port or a sequence ID
-
The PCS uses the data
in the associated port to insert or update rows in the lookup cache.
Sequence ID
Generate following Process
-
PCS create dynamic
lookup cache – tracks the range of value in the cache associated with
any port using a sequence ID
-
Maximum value for a
sequence ID is 2147483647.
Working with
lookup Transformation values
-
Associated an input/output ports or a sequence ID with a
Lookup/output port – following
Value match default.
·
Input Value - PCS passes into the
Transformation
·
Lookup Values -
PCS Passes insert into the cache.
Input /output port output value – PCS Passes out of the
Input/output port
-
Out put old value on
update – PCS output the value that existed in the cache before it updated the
row.
-
Out put new value on
update – PCS output the updated value that it write in the cache
-
When the update a
dynamic lookup cache & Target table.
-
PCS can handle the
null values in the following ways.
·
Insert Null values -
·
Ignore Null values -
(Not null values)
-
When you run a session
that use a dynamic lookup cache PCS compares the value in all lookup ports with
the value.
-
If compare the value
to determine whether or not to update the row in the lookup cache.
Update strategy
Transformation with a Dynamic chache.
1.
Row entering the
lookup Transformation : (By default) all
row type all rows entering a
lookup
transformation is insert.
2.
Row Leaving the Lookup
Transformation : PCS changed the lookup cache but it does not change the row type
-
Update Strategy transformation & a dynamic lookup cache
you must define certain
session properties
- PCS result the
lookup cache & Target table might become unsynchronized.
5. Sharing the Lookup Cache
-
You can configure multiple lookup Transformation in a mapping to
store a single lookup cache.
- You can share cache that are unnamed &
named.
1.
Unnamed Cache: - Compatible
caching structure
- The PCS share the cache by default you can
share static unnamed caches.
2. Named Cache : -
Use a persistent named cache
- when you want to share cache files across
mapping or share and a static cache
Sharing an Unnamed Lookup cache :
-
When 2 Lookup transformation share an unnamed
cahce.
-
You can share static
unnamed cache.
Sharing a Named Lookup Cache.
-
We can share the cache
between multiple lookup transformation by using a Persistent
-
We can share one cache
between lookup Transformation is the same mapping (or) across mapping
-
Named cache – cache
directory for cache files with the same files name prefix.
-
Specify the cache file
directory.
-
PCS rebuild the memory
cache from the persisted file
-
PCS structure don’t
match the PCS fails the session.
-
PCS process multiple
session simultaneously when the lookup transformation only need to read the
cache files.
-
A named cache created
by a dynamic lookup transformation with a lookup policy
TIPs
- PCS then saves & reuses cache
files from session to session, eliminating the time required to read the lookup
table
14. UNION Transformation
A/C
- UN
DEFINITION
-
Union Transformation is a multiple
input group transformation which is responsible for merging the data
coming from more then one source.
-
Union Transformation also merge the data Hetero geneous sources also.
-
Union Transformation is newly introduced in Informatica 7.1 version
onwards.
-
Union Transformation to the UNION ALL statement
-
Union Transformation is developed using the custom Transformation.
FUNCTIONS
-
Create Multiple input groups but only one output groups
-
All Input groups and the out put groups must have matching port. The
precision, data type, and scale must be identical across all groups
-
Union Transformation doesn’t remove duplicate rows.
-
To remove duplicate rows you must add another transformation upstream
from a union Transformation.
-
Can’t use sequence generator or update strategy transformation upstream
from a Union Transformation.
-
Union Transformation doesn’t generate transaction.
COMPONENT
Transformation
Tab :
you can
rename the transformation and add a description
Properties
Tab : you can specify the Tracing level
Groups :
you can create & delete input groups (Design displays groups you
create
on the ports tab
Groups
ports tab :
you can create & delete ports for the input groups
We can’t
modify ports, Initialization properties, meta data Extension or port attributes
definition Tab
PORTS
Groups & ports :
Multiple
input groups & one output groups, Design create output groups by default we
can’t edit or delete the outputs groups
MAPPING
-
Union Transformation is a non
blocking multiple input group Transformation
-
When you add a Union Transformation to a mapping you must verify that you
connect the same ports in all inputs groups. If you connect all ports in one
input group but don’t connect a port in another input groups. If you connect
all ports in one input groups, but don’t connect a port in another input group
the PCS passes Nulls to the unconnected ports
PROPERTIES
Mapping level
- Session Level
Module identifies - Pmuniontrans
Function identifiers - pmunionfunc
Runtime Location - enable
Tracing Level - Normal enable
Is Partition able -
Inputs Must Block -
Is Active -
Update
Strategy Transfomat -
Transformation Scope - Row
Generate Transformation -
Output Repeatable - Never
PERFORMANCE
No comments:
Post a Comment