CA ERwin Data Profiler
CA ERwin Data Profiler helps increase the quality of your data by performing cross-system analysis and profiling of both database and legacy systems. Find hidden inconsistencies in data, and leverage robust statistics to correct data errors in your database or modeling environment. Cleansed metadata can be exported to CA ERwin® Data Modeler to create quality data models that match real-world data.
Key Features
- Column Profiling and Analysis: Helps reduce analysis cycles and increase their effectiveness through consistent and standardized results. In addition, this functionality enables the automatic discovery of standard column statistics.
- Primary Key-Foreign Key Discovery: Offers fully automated discovery, definition and visualization of relationships within a single data source. Inferred relationships provide critical structural insights into legacy data structures, as well as points of comparison and confirmation for documented metadata - allowing your analysts and designers to account for inherent relationships when building target systems.
- Cross-System Attribute Overlap Analysis: Performs an automated cross-comparison of all columns across up to 20 data sources in order to establish a baseline of overlapping data. By leveraging this functionality, you can discover attribute supersets and subsets, as well as overlapping and unique attributes, to speed reconciliation of disparate data sources and effectively identify and document transformation requirements.
- Data Synchronization Analysis: Validates the uniqueness of a potential global identifier in each data source and then confirms that data across sources can be aligned and synchronized using this identifier. This functionality enables you to prototype and test survivorship rules between sources before you move the data into a master structure, helping you ensure quality and consistency across existing sources and well-aligned target structures.
- CA ERwin Data Modeler Integration: Enables the creation of data models that can be persisted and reused within CA ERwin Data Modeler for documentation, impact analysis and stakeholder visualization. Based on metadata inferred from the instance data and profiling results in CA ERwin Data Profiler, these models help you achieve proper business alignment, improved end-user understanding and reduced analysis cycles in subsequent projects.
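The profiling checks described in the features above can be illustrated with a small, generic sketch. It is not CA ERwin Data Profiler's implementation or API; the function names (`profile_column`, `is_key_candidate`, `is_foreign_key_candidate`, `overlap`) and the sample data are hypothetical, and columns are assumed to be plain Python lists with `None` standing in for a null value.

```python
from collections import Counter


def profile_column(values):
    """Basic column statistics: row count, nulls, distinct values, min, max, mode."""
    non_null = [v for v in values if v is not None]
    counts = Counter(non_null)
    return {
        "rows": len(values),
        "nulls": len(values) - len(non_null),
        "distinct": len(counts),
        "min": min(non_null) if non_null else None,
        "max": max(non_null) if non_null else None,
        "most_common": counts.most_common(1)[0][0] if counts else None,
    }


def is_key_candidate(values):
    """A column can serve as a key only if it has no nulls and no duplicates."""
    return None not in values and len(set(values)) == len(values)


def is_foreign_key_candidate(child_values, parent_values):
    """Inclusion check: every non-null child value must exist in the parent column."""
    parent = set(parent_values)
    return all(v in parent for v in child_values if v is not None)


def overlap(a_values, b_values):
    """Compare two columns from different sources: shared and unique values."""
    a = {v for v in a_values if v is not None}
    b = {v for v in b_values if v is not None}
    return {"shared": a & b, "only_a": a - b, "only_b": b - a}


# Hypothetical sample data: two tables in one source, plus a second source.
customer_ids = [1, 2, 3, 4]
order_customer_ids = [1, 1, 3, None]
crm_customer_ids = [3, 4, 5]

stats = profile_column(order_customer_ids)
pk_ok = is_key_candidate(customer_ids)
fk_ok = is_foreign_key_candidate(order_customer_ids, customer_ids)
shared = overlap(customer_ids, crm_customer_ids)
```

Here `is_key_candidate` doubles as the uniqueness test that a synchronization analysis would run on a potential global identifier in each source, and `overlap` shows the kind of superset, subset and unique-value comparison that an attribute overlap analysis reports. A real profiler would run such checks against database tables rather than in-memory lists.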