Friday, November 14, 2014

ETL & Informatica


ETL stands for Extraction, Transformation and Loading. Informatica provides a single enterprise data integration platform to help access, transform, and integrate data from a large variety of systems and deliver that information to other transactional systems, real-time business processes and users.

Extraction
The first part of an ETL process involves extracting the data from the source systems. PowerCenter has the ability to extract structured, unstructured, semi-structured, relational, mainframe, or file data.

Transformation
In this stage, a series of rules or functions are applied to the extracted data from the source. Some data sources will require very little or even no manipulation of data. Informatica Developer provides a set of transformations that perform specific functions like selecting only certain columns to load , deriving a new calculated value, filtering, sorting, joining data from multiple sources, aggregation, generating surrogate-key values, etc.

Loading
In this stage, some data are loaded to the target directly without applying any transformation logic and some data are loaded to the target after applying the logic or business rules. The load phase loads the data into the end target.

1 comment: