Quality
Last updated
Last updated
The Quality tab provides a detailed view of data quality across various dimensions, supported by different test types. Here’s what you can expect:
Quality Dimensions and Test Types
Our platform currently supports four quality dimensions, each associated with specific test types:
Accuracy: Measures how close the data values are to the true values. Tests include “Regex” and “Value in.”
Completeness: Measures the extent to which all required data elements are present. Tests include “Not Null.”
Uniqueness: Checks each data record to ensure it is unique within the dataset. Tests include “Is Unique.”
Validity: Ensures data conforms to acceptable standards, such as ranges and formats. Tests include “Is Email” and “Is UUID.”
The Data Quality (DQ) score is calculated daily using the formula:
where is the count of failed rows, and is the total count of rows in a monitor scan.
The health score for each dimension is the average of all monitors over the selected time period.
Only a select few types of monitors can produce the health score. To know which monitors generate health scores, read this article.
Data Health Score
The Data Health Score represents the average score for all dimensions over the selected time period. The scores are color-coded for easy interpretation:
• Green (> 98%): Excellent health
• Yellow (95% - 98%): At risk
• Red (< 95%): Poor health
Custom Date Range and Filters
The custom date range supports up to six months, allowing for in-depth analysis over a quarter. The Quality dashboard also includes various filters to help you narrow down your data view, such as:
• Domains
• Data sources
• Data Owners
• Monitor mode (Scheduled, On-demand)
• Row creation preferences (filter for 'All Records' scan only)
• Tags
• Classifications
Source/Domain Summary
The Source/Domain Summary in the Quality tab provides results based on selected domains and shows scores for key quality metrics. This helps you gain a deeper understanding of your data’s health across different data sources and domains, making it easier to pinpoint areas for improvement.