Data profiling is the process of examining the data available in existing data sources (e.g. databases, applications, files, etc.) and collecting statistics and information about this data. Data profiling enables the assessment of the quality level of the data contained in the information system, according to a defined set of metrics and goals.
Talend Open Profiler is a sophisticated, yet simple-to-use open source data profiling tool that defines the content, structure, and quality of highly complex data structures. The open source data profiler allows business users and data management staff to perform a large variety of analyses using a set of indicators, patterns and rules for each data element being analyzed or monitored. It analyzes data on an ongoing basis, and analyzes changes to source data over time to help improve data quality.
These data quality indicators can range from simple or advanced statistics to text string analysis, including summary data and statistical distributions of records. The patterns are preset or customized expressions that define the expected form of data analyzed and the data quality rules help define custom business thresholds and value ranges.
Talend Open Profiler produces sophisticated reports and graphs that let users gauge at a glance the data quality, and the status of the predefined indicators. In addition an embedded data explorer allows users to directly drill down into the tables of the analyzed databases.
Download Talend Open Profiler now!
Want to learn more about open source data quality tool Talend Open Profiler? Then watch an online demo or check out our users' testimonials.
Not sure if you need Talend Open Profiler or Talend Data Quality? Check out the features comparison matrix.