Incremental Aggregation

Incremental aggregation allows a PowerCenter session to update aggregate targets incrementally based on changes in the source data, rather than completely recalculating aggregates from the entire source each time. It works by capturing and processing only new or changed source records, and storing historical aggregate data in index and data files to update targets incrementally. Consider using incremental aggregation when source changes are incremental and do not significantly alter existing target data.

Uploaded by

gandhidasan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

1K views2 pages

Incremental Aggregation

Uploaded by

gandhidasan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 2

Incremental Aggregation Overview

When using incremental aggregation, you apply captured changes in the source to aggregate calculations in a session. If the source changes only incrementally and you can capture changes, you can configure the session to process only those changes. This allows the PowerCenter Server to update your target incrementally, rather than forcing it to process the entire source and recalculate the same data each time you run the session. For example, you might have a session using a source that receives new data every day. You can capture those incremental changes because you have added a filter condition to the mapping that removes pre-existing data from the flow of data. You then enable incremental aggregation. When the session runs with incremental aggregation enabled for the first time on March 1, you use the entire source. This allows the PowerCenter Server to read and store the necessary aggregate data. On March 2, when you run the session again, you filter out all the records except those time-stamped March 2. The PowerCenter Server then processes only the new data and updates the target accordingly. Consider using incremental aggregation in the following circumstances:

You can capture new source data. Use incremental aggregation when you can capture new source data each time you run the session. Use a Stored Procedure or Filter transformation to process only new data. Incremental changes do not significantly change the target. Use incremental aggregation when the changes do not significantly change the target. If processing the incrementally changed source alters more than half the existing target, the session may not benefit from using incremental aggregation. In this case, drop the table and re-create the target with complete source data.

Note: Do not use incremental aggregation if your mapping contains percentile or median functions. The PowerCenter Server uses system memory to process Percentile and Median functions in addition to the cache memory you configure in the session property sheet. As a result, the PowerCenter Server does not store incremental aggregation values for Percentile and Median functions in disk caches. The first time you run an incremental aggregation session, the PowerCenter Server processes the entire source. At the end of the session, the PowerCenter Server stores aggregate data from that session run in two files, the index file and the data file. The PowerCenter Server creates the files in a local directory. Each subsequent time you run the session with incremental aggregation, you use only the incremental source changes in the session. For each input record, the PowerCenter Server checks historical information in the index file for a corresponding group. If it finds a corresponding group, the PowerCenter Server performs the aggregate operation incrementally, using the aggregate data for that group, and saves the incremental change. If it does not find a corresponding group, the PowerCenter Server creates a new group and saves the record data. When writing to the target, the PowerCenter Server applies the changes to the existing target. It saves modified aggregate data in the index and data files to be used as historical data the next time you run the session. If the source changes significantly, and you want the PowerCenter Server to continue saving aggregate data for future incremental changes, configure the PowerCenter Server to overwrite existing aggregate data with new aggregate data. For details, see Reinitializing the Aggregate Files. When you partition a session that uses incremental aggregation, the PowerCenter Server creates one set of cache files for each partition. The PowerCenter Server creates new aggregate data, instead of using historical data, when you perform one of the following tasks:

Save a new version of the mapping. Configure the session to reinitialize the aggregate cache. Move the aggregate files without correcting the configured path or directory for the files in the session property sheet. Change the configured path or directory for the aggregate files without moving the files to the new location. Delete cache files. Decrease the number of partitions.

Note: When the PowerCenter Server rebuilds incremental aggregation files, the data in the previous files is lost. Reinitializing the Aggregate Files If the source tables change significantly, you might want to run the session with the entire source data. To do this, you can configure the session to reinitialize the aggregate cache.

Incremental Aggregation Overview

For example, you can reinitialize the aggregate cache if the source for a session changes incrementally every day and completely changes once a month. When you receive the new monthly source, you might configure the session to reinitialize the aggregate cache, truncate the existing target, and use the new source table during the session. After you run a session that reinitializes the aggregate cache, edit the session properties to disable the Reinitialize Aggregate Cache option. If you do not clear Reinitialize Aggregate Cache, the PowerCenter Server overwrites the aggregate cache each time you run the session. Note: When you move from Windows to UNIX, you must reinitialize the cache. Therefore, you cannot change from a Latin1 code page to an MSLatin1 code page, even though these code pages are compatible. Moving or Deleting the Aggregate Files Once you run an incremental aggregation session, avoid moving or modifying the index and data files that store historical aggregate information. If you do move the files into a different directory, and you want the PowerCenter Server to use the aggregate files, you must also change the path to those files in the session properties. As well, if you change the path to the files, but you do not move the files, the PowerCenter Server rebuilds the files the next time you run the session. If you change certain session or server properties, the PowerCenter Server cannot use the incremental aggregation files, and it fails the session. To avoid session failure, delete existing incremental aggregation files when you perform any of the following tasks:

Change the PowerCenter Server data movement mode from ASCII to Unicode or from Unicode to ASCII. Change the PowerCenter Server code page to an incompatible code page. Change the session sort order when the PowerCenter Server runs in Unicode mode. Change the Enable High Precision session option.

Finding Index and Data Files By default, the PowerCenter Server stores the index and data files in the directory entered in the server variable, $PMCacheDir, in the Workflow Manager. The PowerCenter Server names the index file PMAGG*.idx. The PowerCenter Server names the data file PMAGG*.dat. If you run the session using Verbose Init mode, the PowerCenter Server writes the file names in the session log. To locate the files, look in the previous session log for the TE_7034 and TE_7035 messages that indicate the cache file name and location. The following messages show sample entries in the session log: MAPPING> TE_7034 Aggregate Information: Index file is [D:\Informatica\InformaticaServer\Cache\PMAGG8_4_2.idx] MAPPING> TE_7035 Aggregate Information: Data file is [D:\Informatica\InformaticaServer\Cache\PMAGG8_4_2.dat] If you do not run the session using Verbose Init mode or use an identifiable transformation naming convention, you may have difficulty determining which files belong to each session. For more information about cache file storage and naming conventions, see Cache Files.

Data Integration Workflow Essentials
No ratings yet
Data Integration Workflow Essentials
8 pages
Filters
No ratings yet
Filters
7 pages
What Are The Best Mapping Development Practices and What Are The Different Mapping Design Tips For Informatica?
No ratings yet
What Are The Best Mapping Development Practices and What Are The Different Mapping Design Tips For Informatica?
29 pages
Incremental Aggregation in Informatica
No ratings yet
Incremental Aggregation in Informatica
3 pages
Name of Solution:: Please Rate This Solution and Share Your Feedback On Website
No ratings yet
Name of Solution:: Please Rate This Solution and Share Your Feedback On Website
2 pages
PowerCenter Session Partitioning Guide
No ratings yet
PowerCenter Session Partitioning Guide
4 pages
Powercenter Version 8.6 New Features and Enhancements: Command Line Programs
No ratings yet
Powercenter Version 8.6 New Features and Enhancements: Command Line Programs
4 pages
Working With Powercenter 8 Desinger
No ratings yet
Working With Powercenter 8 Desinger
67 pages
Informatica Advanced Training
100% (3)
Informatica Advanced Training
94 pages
Informatica Repository Manager
No ratings yet
Informatica Repository Manager
5 pages
Informatica PowerCenter Tips
No ratings yet
Informatica PowerCenter Tips
8 pages
Informatica PDF
No ratings yet
Informatica PDF
55 pages
Understanding Cache in PowerCenter
No ratings yet
Understanding Cache in PowerCenter
37 pages
A FAQs
No ratings yet
A FAQs
9 pages
Dynamic Partitioning in Informatca 8.X
No ratings yet
Dynamic Partitioning in Informatca 8.X
32 pages
Transformations
No ratings yet
Transformations
7 pages
PowerCenter 8.6 Enhancements Overview
No ratings yet
PowerCenter 8.6 Enhancements Overview
14 pages
Informaticalakshmi
No ratings yet
Informaticalakshmi
15 pages
Developer Lab Guide
No ratings yet
Developer Lab Guide
154 pages
Informatica PowerCenter Guide
No ratings yet
Informatica PowerCenter Guide
9 pages
A Interview Questions and Answers - Cool Interview
100% (16)
A Interview Questions and Answers - Cool Interview
30 pages
Answers 1
No ratings yet
Answers 1
68 pages
Checklist For Best Practices in Powercenter
No ratings yet
Checklist For Best Practices in Powercenter
7 pages
Informatica Corporation Powercenter Version 8.6.0 Hotfix 4 Release Notes
No ratings yet
Informatica Corporation Powercenter Version 8.6.0 Hotfix 4 Release Notes
7 pages
Essbase11 - Aggregate Storage Overview
No ratings yet
Essbase11 - Aggregate Storage Overview
15 pages
A Interview Questions and Answers
No ratings yet
A Interview Questions and Answers
34 pages
Optimize Informatica Aggregation
No ratings yet
Optimize Informatica Aggregation
64 pages
Lecture 2
No ratings yet
Lecture 2
31 pages
Informatica Interview
No ratings yet
Informatica Interview
32 pages
Informatica PowerCenter 7 Training
No ratings yet
Informatica PowerCenter 7 Training
10 pages
Lecture 1
No ratings yet
Lecture 1
37 pages
Dimension Data Interview Insights
No ratings yet
Dimension Data Interview Insights
4 pages
Performance Tuning in Informatica
No ratings yet
Performance Tuning in Informatica
26 pages
Informatica Guide
No ratings yet
Informatica Guide
159 pages
PowerCenter Lookup Optimization Guide
No ratings yet
PowerCenter Lookup Optimization Guide
2 pages
PowerCenter Transformation Guide
No ratings yet
PowerCenter Transformation Guide
9 pages
Logs & Error Handling Settings
No ratings yet
Logs & Error Handling Settings
4 pages
Is Sorter Transformation Passive or Active ?: 1. When We Want To Get Single Return Value
No ratings yet
Is Sorter Transformation Passive or Active ?: 1. When We Want To Get Single Return Value
7 pages
Faq Infa Forum
No ratings yet
Faq Infa Forum
15 pages
Sorter Transformation Properties 1. Sorter Cache Size
No ratings yet
Sorter Transformation Properties 1. Sorter Cache Size
3 pages
Lookup and Lookup Caches
No ratings yet
Lookup and Lookup Caches
17 pages
Update Strategy Transformation Overview
No ratings yet
Update Strategy Transformation Overview
6 pages
Informatica Interview Q&A Guide
100% (4)
Informatica Interview Q&A Guide
12 pages
Data Warehouse and Data Cube
No ratings yet
Data Warehouse and Data Cube
30 pages
Bakery Workers' Job Satisfaction Study
No ratings yet
Bakery Workers' Job Satisfaction Study
3 pages
BSC CS Ty Practice Exam
No ratings yet
BSC CS Ty Practice Exam
4 pages
AI Sectoral Report Feb2024
No ratings yet
AI Sectoral Report Feb2024
44 pages
Descriptive Qualitative
No ratings yet
Descriptive Qualitative
15 pages
Hair 4e IM Ch03
No ratings yet
Hair 4e IM Ch03
20 pages
Siemens Written Test Questions
No ratings yet
Siemens Written Test Questions
4 pages
Camote Tops Juice Research Methodology
No ratings yet
Camote Tops Juice Research Methodology
11 pages
BlackBook Template Demat DebtMarket
No ratings yet
BlackBook Template Demat DebtMarket
11 pages
Understanding Digital Service Innovation
No ratings yet
Understanding Digital Service Innovation
12 pages
Minor Project Report
No ratings yet
Minor Project Report
5 pages
MBA Project Report Guide
No ratings yet
MBA Project Report Guide
23 pages
Comp Arch Chapter 6
No ratings yet
Comp Arch Chapter 6
93 pages
Marketing Talent Skill Gap
No ratings yet
Marketing Talent Skill Gap
19 pages
Airtel Project
No ratings yet
Airtel Project
60 pages
Business Statistics: A First Course: Fifth Edition
No ratings yet
Business Statistics: A First Course: Fifth Edition
20 pages
SQL Server Machine Learning & Services
No ratings yet
SQL Server Machine Learning & Services
7 pages
Web Service Manual: March 2016 Author Tecnoteca SRL
No ratings yet
Web Service Manual: March 2016 Author Tecnoteca SRL
114 pages
Client-Server Databases PDF
No ratings yet
Client-Server Databases PDF
15 pages
NAND Flash Memory: Serial Peripheral Interface (SPI) MT29F1G01AAADD Features
No ratings yet
NAND Flash Memory: Serial Peripheral Interface (SPI) MT29F1G01AAADD Features
43 pages
Data Analysis Vocabulary Guide
No ratings yet
Data Analysis Vocabulary Guide
10 pages
Amadeus Airline Ticketing Course Guide
No ratings yet
Amadeus Airline Ticketing Course Guide
6 pages
Architectural Research Report Format
No ratings yet
Architectural Research Report Format
4 pages
9th 2ut Partb PDF
No ratings yet
9th 2ut Partb PDF
6 pages
Connect Visual FoxPro to SQL Server
No ratings yet
Connect Visual FoxPro to SQL Server
63 pages
Oracle Database 12c: OR1, 5 Tage
No ratings yet
Oracle Database 12c: OR1, 5 Tage
1 page
ODI 10G and Salesforce Integration Guide
100% (1)
ODI 10G and Salesforce Integration Guide
42 pages
PL/SQL Practice Quizzes Final Exam
No ratings yet
PL/SQL Practice Quizzes Final Exam
3 pages
M2 - Entity Relationship (ER) Model
No ratings yet
M2 - Entity Relationship (ER) Model
22 pages
The Impact of Small Scale Business On The Economy Development in Ilorin
No ratings yet
The Impact of Small Scale Business On The Economy Development in Ilorin
15 pages

Incremental Aggregation

Uploaded by

Incremental Aggregation

Uploaded by

Incremental Aggregation Overview

Incremental Aggregation Overview

You might also like