Informatica Data Quality Content

This course can be delivered in an online, instructor-led format and comprises 17 units.

Topics covered include:

  • Profiling, Standardization, Data Cleansing using Labeler & Parser, De-duplication and Address Validation
  • Matching and Consolidation Techniques
  • Reference Table Management and its usage

Course Objectives

On completion of this course, attendees will be able to:

  • Navigate the Developer Tool and collaborate on projects with Analysts using the Analyst Tool
  • Perform column, join, multi-object and mid-stream profiling, preview mid-stream data, and work with Logical Data Objects (LDOs), scorecards and DQ transformations
  • Manage Reference Tables in the Developer & Analyst Tool
  • Design rules, mapplets, mappings and workflows, and develop applications
  • Create standardization, cleansing and parsing routines
  • Identify duplicate records
  • Build mappings used to associate and consolidate matched records
  • Process exceptions for bad records and duplicate records
  • Validate addresses

Course Agenda

Introduction to Data Quality Management

Unit 1: Working with Informatica Developer 10.X

  • GUI
  • Mappings
  • Mapplets
  • Transformations
  • Content Sets
  • Data Objects
  • Reference Tables
  • LDO

Unit 2: Analyst Collaboration

  • Creating Profiles and Scorecards
  • Adding Comments/Tags
  • Reviewing information from the Analyst
  • Creating/adding to Reference tables
  • Creating Profiles and Reference Tables

Unit 3: Developer Profiling and Logical Data Objects

  • Perform:
    1. Column Profiling
    2. Join Profiling
    3. Mid-Stream Profiling
  • Create a Logical Data Object
  • Create Mappings and work with DQ and Core transformations

Unit 4: Labeler and Data Standardization

  • Cleanse and transform data using Labeler and Standardization Transformations
  • Develop data standardization mapplets and mappings
  • Working with Reference tables

Unit 5: Parsing

  • Perform parsing using a variety of methods such as:
    1. Token Parser
    2. Pattern Parser
    3. Working with Reference Tables

Unit 6: Field Matching

  • Grouping data
  • DQ Matching
  • Match Cluster Analysis
  • Matching Performance Analysis

Unit 7: Identity Matching

  • Build Matching mappings using Identity matching
  • Identity Populations and Strategies

Unit 8: Automatic Consolidation & Key Generator

  • Associate and Consolidate data

Unit 9: Manual Exception and Consolidation Management

  • Build and execute mappings that use the Exception Transformation to identify bad records and duplicate records

Unit 10: Task and Workflow Management

  • Build and execute workflows to populate Informatica Data Director user inboxes with exception and duplicate records

Unit 11: Informatica Data Director (Informatica Analyst)

  • Update exception and duplicate records in IDD

Unit 12: PowerCenter Integration

  • Export DQ Mapping to PowerCenter
  • Run DQ Mappings/Mapplets in PowerCenter
  • Build and execute a workflow in PowerCenter Developer using DQ mapplets

Unit 13: Running DQ in a Standalone environment

  • Schedule DQ mappings to run in DQ Standalone using Windows Task Scheduler

Unit 14: Object Import/Export to Informatica PowerCenter

  • Import Projects using both Basic and Advanced methods
  • Export Projects
  • Deploy DQ jobs to an Application

Unit 15: Content

  • What content is available with IDQ 10.X?
  • Content Management Service
  • Accelerators
  • Core Accelerator

Unit 16: Parameters and Schedule

  • How to use Parameters in Data Quality mappings, transformations and reference tables
  • Scheduling Profiles, Scorecards and Applications

Unit 17: Address Validation

  • Create a Reusable AV Transformation
  • AV Transformation Properties, Inputs and Outputs
  • Build and execute an Address Validation Mapping
  • Reusable AV Mapplet
