UBS OCF – IDQ Capabilities Review

IDQ – 1WMP Phoenix Data Migration Use Cases
1WMP – Data Migration Implementation Use Cases

Data Integration
• Disparate source data extracts
• Support for multi-byte characters
• Dynamic mapping

Data Profiling / Data Quality
• Out-of-the-box profiling – null check, min / max check, cardinality, etc.
• Business-rule-based profiling
• Data drill-down
• Scorecard generation and monitoring

Data Transformation
• Expression
• Aggregator
• Rank
• Joiner
• Filter
• Java
• Router
• Sorter
• Read / Write
• Lookup
• Union
• Merge

Business Verification
• Direct business user access
• Step-by-step data verification
• Easy interpretation of business logic / lineage

Reusability
• Reusable mappings through logical and customized data objects
• Reference data management

UBS NFR
• Single sign-on through UBS smart card
• Controlled user access via BBS
OCF – Use Cases & Tool Requirements
OCF – Use Cases

1. Random Sampling – From a transformed result set of around 100,000 to 500,000 records, a random sample of 10 to 20 records needs to be selected for reporting. IDQ capability: out of the box, through profiling. (A sketch of the sampling logic follows this item.)
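Purely as an illustration of the random-sampling step (not IDQ's internal implementation, which is available out of the box), reservoir sampling draws a uniform sample of k records in a single pass, so the 100K–500K result set never has to be held in memory:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Illustrative only: one-pass reservoir sampling of k records from a stream.
// IDQ provides this outcome out of the box; this just shows the logic.
public final class ReservoirSampler {

    public static <T> List<T> sample(Iterable<T> records, int k, long seed) {
        List<T> reservoir = new ArrayList<>(k);
        Random rnd = new Random(seed);
        long seen = 0;
        for (T record : records) {
            seen++;
            if (reservoir.size() < k) {
                reservoir.add(record);            // fill the reservoir first
            } else {
                // Replace an existing element with probability k / seen,
                // which keeps every record equally likely to be sampled.
                long j = (long) (rnd.nextDouble() * seen);
                if (j < k) {
                    reservoir.set((int) j, record);
                }
            }
        }
        return reservoir;
    }
}
```

A fixed seed makes the sample reproducible, which helps when the selection has to be evidenced for audit.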
2. Risk-Based Sampling – Scenario-based sampling: a large data set (500K to 1MM records) is ranked on requirement criteria (e.g. trades of the Client Advisors with the largest risk exposure), and from the top of that ranking random samples of 10 to 20 records are chosen. IDQ capability: out of the box, through profiling. (See the sketch after this item.)
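A hedged sketch of the rank-then-sample idea, reusing the ReservoirSampler above; the Trade record and its riskExposure field are invented stand-ins, not fields from the actual 1WMP data model:

```java
import java.util.Comparator;
import java.util.List;

// Illustrative rank-then-sample logic. Trade and riskExposure are
// hypothetical names for this sketch only.
record Trade(String clientAdvisor, double riskExposure) {}

final class RiskBasedSampler {

    // Rank by risk exposure, keep the top slice, then sample from it.
    static List<Trade> sample(List<Trade> trades, int topN, int k, long seed) {
        List<Trade> ranked = trades.stream()
                .sorted(Comparator.comparingDouble(Trade::riskExposure).reversed())
                .limit(topN)                      // highest-exposure trades only
                .toList();
        return ReservoirSampler.sample(ranked, k, seed);
    }
}
```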
3. CA Sweep – Based on the historical samples from use cases 1 and 2, the trades associated with Client Advisors already selected in those samples are excluded from the universe (the full data set), and from the remaining set 1 or 2 sample records are randomly selected for reporting. IDQ capability: through transformations.

4. Fuzzy Logic – String Comparison (Jaro-Winkler distance) – Address comparison to find matching addresses using fuzzy-logic comparison techniques. IDQ capability: out of the box, through the MATCH transformation, which supports Hamming, Bigram, Edit, and Jaro distance. (See the sketch after this item.)
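Outside IDQ, the same comparison can be illustrated with Apache Commons Text's JaroWinklerSimilarity; the normalization steps and the 0.90 threshold are assumptions for this sketch, not UBS-mandated values:

```java
import org.apache.commons.text.similarity.JaroWinklerSimilarity;

// Illustrative fuzzy address comparison; IDQ's MATCH transformation
// provides this out of the box. The 0.90 threshold is an assumption.
public final class AddressMatcher {

    private static final JaroWinklerSimilarity JW = new JaroWinklerSimilarity();

    static boolean sameAddress(String a, String b) {
        // Normalize before comparing so case and spacing don't dominate.
        String left = a.trim().toUpperCase().replaceAll("\\s+", " ");
        String right = b.trim().toUpperCase().replaceAll("\\s+", " ");
        return JW.apply(left, right) >= 0.90;
    }

    public static void main(String[] args) {
        System.out.println(sameAddress("12 High  Street, Zurich",
                                       "12 high street Zurich")); // likely true
    }
}
```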
5. Large Volumes of Data (~10 million, time-series data) – Process large volumes of data sets (~10MM records) across multiple scenarios and combine them based on logic (joins, filters, sorts) to generate a result of roughly 12 to 15 MM records in a reasonable timeframe.

6. Email Scheduling of Output – An emailing interface to Outlook that sends scheduled emails to configured recipients, with results embedded or attached (Excel / CSV file or table format) from ETL (e.g. an email to Desk Heads on the CAs under them whose clients had high-value trades for the day). IDQ capability: no out-of-the-box solution; however, this can be done through a Java transformation. (A sketch follows this item.)
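As a hedged sketch of what such a Java transformation could contain, the snippet below emails a result file over SMTP with the Jakarta Mail API; the host, addresses, and file path are placeholders, and in IDQ this logic would sit inside a Java transformation rather than a main method:

```java
import jakarta.mail.*;
import jakarta.mail.internet.*;
import java.util.Properties;

// Illustrative only: the kind of logic a Java transformation could host
// to email an ETL result file. Host, sender, recipient, and path are
// placeholder values, not actual UBS configuration.
public final class ResultMailer {

    public static void main(String[] args) throws MessagingException {
        Properties props = new Properties();
        props.put("mail.smtp.host", "smtp.example.com");   // placeholder host

        Session session = Session.getInstance(props);
        MimeMessage message = new MimeMessage(session);
        message.setFrom(new InternetAddress("etl@example.com"));
        message.addRecipient(Message.RecipientType.TO,
                new InternetAddress("desk.head@example.com"));
        message.setSubject("Daily high-value trade sample");

        // Body text plus the CSV result set as an attachment.
        MimeBodyPart body = new MimeBodyPart();
        body.setText("Attached: today's sampled trades.");
        MimeBodyPart attachment = new MimeBodyPart();
        try {
            attachment.attachFile("trades_sample.csv");    // placeholder path
        } catch (java.io.IOException e) {
            throw new MessagingException("Could not attach result file", e);
        }

        Multipart multipart = new MimeMultipart();
        multipart.addBodyPart(body);
        multipart.addBodyPart(attachment);
        message.setContent(multipart);

        Transport.send(message);
    }
}
```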
OCF – Tool Requirements

1. Enterprise and Server Based – Need for an enterprise, server-based tool covering the end-to-end reporting requirements: sampling, analytics, ETL, and report generation. IDQ capability: server based, with intermediate analytics and reporting support.

2. Visualize Process Flow – Ability to visualize process flows and business rules, so the outcome of each business rule can be determined easily. IDQ capability: easy to understand through the existing interface.

3. Reusability & Rerunability – Ability to repeat the same business rules with different data sets and over different time periods, with configurable sampling parameters (allowing the same business rules to be applied across regions with a different sampling volume per location). IDQ capability: demonstrated in the Phoenix migration.

4. Traceability – Traceability of data sources, aggregation of data, and application of business rules, documented visually. IDQ capability: easy to understand through the existing interface.

5. Automation – Ability to automate execution of business rules so that samples / cases are available for investigation by case managers before business hours in each region. IDQ capability: the Informatica scheduler can be used.

6. Agility – Ability to test and evaluate changes quickly and easily, document inline, and deploy the same asset. IDQ capability: can be done with adequate privileges.
IDQ Features & Benefits

Informatica Data Quality Features

Data Profiling
• Access data for anomalies and inconsistencies
• Build metrics and scorecards
• Build rules for profiling
• Trend analysis on DQ metrics

Reference Data Management
• Easy maintenance of enterprise-wide reference data
• Audit trail for capturing changes to the LOV list

Data Quality Enhancement
• Data cleansing & enrichment through mapplets and rules
• Address standardization
• Publish DQ rules as web services

De-Duplication
• Configure probabilistic and deterministic match rules
• Creation of clusters
• Consolidation of match candidates

Exception Handling
• Data stewardship
• Manual data correction
• Manual consolidation of duplicate data
• Audit mechanism for exception handling

Data Quality Monitoring
• Configuration of dashboards and reports for continuous DQ monitoring
• Reactive and proactive monitoring capability

Benefits
• Proactively cleanse and monitor data for all applications and keep it clean
• Huge savings on ongoing data quality maintenance by business users
• Enable the business to share in the responsibility for data quality and data governance
• Enhance IT productivity with powerful business-IT collaboration and a common data quality environment
Data Profiling Features & Benefits

Provide immediate insight into the basic quality of data and quickly expose potential areas of risk.

• End-to-end data profiling to discover the content, quality, and structure of a data source:
  • Column profiling
  • Primary key profiling
  • Functional dependency profiling
  • Data domain discovery
  • Enterprise data discovery
• Statistics identify outliers and anomalies in the data; value and pattern frequency isolate inconsistent / dirty data and unexpected patterns (see the sketch below)
• Customized business rules can be created and used during profiling
• The Rule Builder allows business users to efficiently collaborate with developers on building complex business rules
• The rich GUI enables easy readability of rule specifications
• Scorecards can be created to display the value frequency for columns in a profile
• Trend charts can be configured to view the history of scores over time
• Drill down into actual data values to inspect results across the entire data set, including potential duplicates
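To make the value-frequency idea concrete (outside IDQ; the singleton rule below is an invented heuristic, not how the profiler actually scores anomalies), a column can be summarized and its one-off values flagged:

```java
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

// Illustrative value-frequency profiling: count distinct values in a
// column and flag singletons as candidate anomalies. The singleton
// rule is an invented heuristic for this sketch.
public final class ValueFrequencyProfile {

    static Map<String, Long> frequencies(List<String> column) {
        return column.stream()
                .collect(Collectors.groupingBy(v -> v, Collectors.counting()));
    }

    static List<String> rareValues(List<String> column) {
        return frequencies(column).entrySet().stream()
                .filter(e -> e.getValue() == 1)   // one-off values
                .map(Map.Entry::getKey)
                .toList();
    }

    public static void main(String[] args) {
        List<String> country = List.of("CH", "CH", "GB", "CH", "C H");
        // Flags "GB" and the malformed "C H" (order not guaranteed).
        System.out.println(rareValues(country));
    }
}
```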
Scorecards and Trend Charts Features & Benefits

Quantify the quality of data with scorecards and trend charts.

• Enables the business to “measure the data fitness” against defined metrics before using the data in data-driven projects.
• Critical for making good decisions about data-quality improvement initiatives.
• Trend charts allow the business to evaluate the progression and ROI of data quality programs.
• Weighted scores across multiple metrics help find root causes and the most significant contributors to poor data quality scores (see the sketch below).
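A minimal sketch of a weighted scorecard roll-up, assuming each metric reports a 0–100 score and a weight; the metric names and weights below are invented for illustration:

```java
import java.util.List;

// Illustrative weighted scorecard roll-up: overall = sum(w * s) / sum(w).
// Metric names and weights are invented for this sketch.
record Metric(String name, double score, double weight) {}

final class Scorecard {

    static double overallScore(List<Metric> metrics) {
        double weighted = metrics.stream()
                .mapToDouble(m -> m.score() * m.weight()).sum();
        double totalWeight = metrics.stream()
                .mapToDouble(Metric::weight).sum();
        return weighted / totalWeight;
    }

    public static void main(String[] args) {
        List<Metric> metrics = List.of(
                new Metric("address completeness", 92.0, 3.0),
                new Metric("valid country code", 99.5, 1.0),
                new Metric("duplicate rate", 80.0, 2.0));
        // (92*3 + 99.5*1 + 80*2) / 6 = 89.25
        System.out.println(overallScore(metrics));
    }
}
```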
Reference Data Management Features & Benefits

Enriching or standardizing data using reference data.

• Enables the business to create and manage reference data
• Maintains audit trails to monitor changes to reference data objects
• Reference data objects are used to standardize and enrich source data during data quality operations (see the sketch below)
• The same reference data objects can be used across multiple data quality projects
• Reference data objects can be created from column profile values, patterns, flat files, and database tables
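To make the standardization step concrete, here is a hedged sketch using a plain map as a stand-in for an IDQ reference data object; the value pairs are invented:

```java
import java.util.Map;

// Illustrative standardization against a reference table. The map is a
// stand-in for an IDQ reference data object; the values are invented.
final class CountryStandardizer {

    private static final Map<String, String> REFERENCE = Map.of(
            "SWITZERLAND", "CH",
            "SCHWEIZ", "CH",
            "UNITED KINGDOM", "GB",
            "GREAT BRITAIN", "GB");

    // Return the standard code, or the original value when no entry
    // exists, so unmatched rows can be routed to exception handling.
    static String standardize(String raw) {
        return REFERENCE.getOrDefault(raw.trim().toUpperCase(), raw);
    }

    public static void main(String[] args) {
        System.out.println(standardize(" Schweiz "));      // CH
        System.out.println(standardize("Great Britain"));  // GB
    }
}
```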
Rule Builder Features & Benefits

Define and design business rules.

• Enables business users to define the data requirements of a business rule as a reusable software object that can be run against the data to check its validity.
• Allows business users to efficiently collaborate with developers on building complex business rules.
• Rule specifications define condition-action pairs for business rules, which can be evaluated in a particular order for validity (see the sketch below).
• The rich GUI enables easy readability of rule specifications.
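As a minimal sketch of the condition-action idea (the Rule type and example rules are invented for illustration, not the Rule Builder's internal model), each rule pairs a predicate with an action, and the rules are evaluated in their declared order:

```java
import java.util.List;
import java.util.function.Consumer;
import java.util.function.Predicate;

// Illustrative condition-action rule evaluation in declared order.
// This type and the example rules are invented for this sketch.
record Rule<T>(String name, Predicate<T> condition, Consumer<T> action) {}

final class RuleEngine {

    // Apply the action of every rule whose condition matches, in order.
    static <T> void evaluate(List<Rule<T>> rules, T row) {
        for (Rule<T> rule : rules) {
            if (rule.condition().test(row)) {
                rule.action().accept(row);
            }
        }
    }

    public static void main(String[] args) {
        List<Rule<String>> rules = List.of(
                new Rule<>("non-empty", s -> s.isBlank(),
                        s -> System.out.println("reject: blank value")),
                new Rule<>("max length", s -> s.length() > 35,
                        s -> System.out.println("flag: value too long")));
        evaluate(rules, "12 High Street, Zurich"); // passes both checks silently
    }
}
```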
Data Quality Enhancement Features & Benefits

Cleanse and standardize data, resolving and addressing data quality issues.

• Build rules and mapplets to address data quality issues
• Address validation corrects errors in addresses and completes partial addresses
• Reference data usage for enhancing the DQ process
• Exception handling for manual review and correction
• Export mappings to PowerCenter for metadata reuse in physical data integration
• Web service consumer / provider for integration with any SOAP-based application
Data Quality Mapping

[Diagram: an IDQ data quality mapping, annotated with the following capabilities]
• Address validation and geocoding enrichment across 260 countries
• Standardization and reference data management
• Parsing of unstructured data / text fields of all data types (customer / product / social / logs)
• DQ logic pushed down / run natively or on Hadoop
De-Duplication Features & Benefits

Identify duplicates and consolidate.

• Customizable match rules
• Supports both fuzzy and exact match rules
• Duplicate analysis and consolidation of source data
• Identity matching capability using population files
• Auto-merging of data based on customizable de-duplication rules
• Manual merging / unmerging for data with a low match score (see the sketch below)
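To illustrate the auto-merge versus manual-review split, a match score can route each candidate pair; the 0.95 and 0.80 thresholds are assumptions for this sketch, not IDQ defaults:

```java
// Illustrative routing of match candidates by score. The 0.95 and 0.80
// thresholds are assumptions, not IDQ defaults.
enum MatchDecision { AUTO_MERGE, MANUAL_REVIEW, NO_MATCH }

final class MatchRouter {

    static MatchDecision route(double matchScore) {
        if (matchScore >= 0.95) return MatchDecision.AUTO_MERGE;    // confident duplicate
        if (matchScore >= 0.80) return MatchDecision.MANUAL_REVIEW; // steward decides
        return MatchDecision.NO_MATCH;                              // distinct records
    }

    public static void main(String[] args) {
        System.out.println(route(0.97)); // AUTO_MERGE
        System.out.println(route(0.85)); // MANUAL_REVIEW
        System.out.println(route(0.40)); // NO_MATCH
    }
}
```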