Decube
Try for free
  • πŸš€Overview
    • Welcome to decube
    • Getting started
      • How to connect data sources
    • Security and Compliance
    • Data Policy
    • Changelog
    • Public Roadmap
  • Support
  • πŸ”ŒData Warehouses
    • Snowflake
    • Redshift
    • Google Bigquery
    • Databricks
    • Azure Synapse
  • πŸ”ŒRelational Databases
    • PostgreSQL
    • MySQL
    • SingleStore
    • Microsoft SQL Server
    • Oracle
  • πŸ”ŒTransformation Tools
    • dbt (Cloud Version)
    • dbt Core
    • Fivetran
    • Airflow
    • AWS Glue
    • Azure Data Factory
    • Apache Spark
      • Apache Spark in Azure Synapse
    • OpenLineage (BETA)
    • Additional configurations
  • πŸ”ŒBusiness Intelligence
    • Tableau
    • Looker
    • PowerBI
  • πŸ”ŒData Lake
    • AWS S3
    • Azure Data Lake Storage (ADLS)
      • Azure Function for Metadata
    • Google Cloud Storage (GCS)
  • πŸ”ŒTicketing and Collaboration
    • ServiceNow
    • Jira
  • πŸ”’Security and Connectivity
    • Enabling VPC Access
    • IP Whitelisting
    • SSH Tunneling
    • AWS Identities
  • βœ…Data Quality
    • Incidents Overview
    • Incident model feedback
    • Enable asset monitoring
    • Available Monitor Types
    • Available Monitor Modes
    • Catalog: Add/Modify Monitor
    • Set Up Freshness & Volume Monitors
    • Set Up Field Health Monitors
    • Set Up Custom SQL Monitors
    • Grouped-by Monitors
    • Modify Schema Drift Monitors
    • Modify Job Failure Monitors (Data Job)
    • Custom Scheduling For Monitors
    • Config Settings
  • πŸ“–Catalog
    • Overview of Asset Types
    • Assets Catalog
    • Asset Overview
    • Automated Lineage
      • Lineage Relationship
      • Supported Data Sources and Lineage Types
    • Add lineage relationships manually
    • Add tags and classifications to fields
    • Field Statistcs
    • Preview sample data
  • πŸ“šGlossary
    • Glossary, Category and Terms
    • Adding a new glossary
    • Adding Terms and Linked Assets
  • Moving Terms to Glossary/Category
  • AI Copilot
    • Copilot's Autocomplete
  • 🀝Collaboration
    • Ask Questions
    • Rate an asset
  • 🌐Data Mesh [BETA]
    • Overview on Data Mesh [BETA]
    • Creating and Managing Domains/Sub-domains
    • Adding members to Domain/Sub-domain
    • Linking Entities to Domains/Sub-domains
    • Adding Data Products to Domains/Subdomains
    • Creating a draft Data Asset
    • Adding a Data Contract - Default Settings
    • Adding a Data Contract - Freshness Test
    • Adding a Data Contract - Column Tests
    • Publishing the Data Asset
  • πŸ›οΈGovernance
    • Governance module
    • Classification Policies
    • Auto-classify data assets
  • β˜‘οΈApproval Workflow
    • What are Change Requests?
    • Initiate a change request
    • What are Access Requests?
    • Initiate an Access Request
  • πŸ“‘Data reconciliation
    • Adding a new recon
    • Understand your recon results
    • Supported sources for Recon
  • πŸ“‹Reports
    • Overview of Reports
    • Supported sources for Reports
    • Asset Report: Data Quality Scorecard
  • πŸ“ŠDashboard
    • Dashboard Overview
    • Incidents
    • Quality
  • ⏰Alert Notifications
    • Get alerts on email
    • Connect your Slack channels
    • Connect to Microsoft Teams
    • Webhooks integration
  • πŸ›οΈManage Access
    • User Management - Overview
    • Invite users
    • Deactivate or re-activate users
    • Revoke a user invite
  • πŸ”Group-based Access Controls
    • Groups Management - Overview
    • Create Groups & Assign Policies
    • Source-based Policies
    • Administrative-based Policies
    • Module-based Policies
    • What is the "Owners" group?
  • πŸ—„οΈOrg Settings
    • Multi-factor authentication
    • Single Sign-On (SSO) with Microsoft
    • Single Sign-On (SSO) with JumpCloud
  • ❓Support
    • Supported Features by Integration
    • Frequently Asked Questions
    • Supported Browsers and System Requirements
  • Public API (BETA)
    • Overview
      • Data API
        • Glossary
        • Lineage
        • ACL
          • Group
      • Control API
        • Users
    • API Keys
Powered by GitBook
On this page
  1. Dashboard

Quality

PreviousIncidentsNextGet alerts on email

Last updated 2 months ago

The Quality tab provides a detailed view of data quality across various dimensions, supported by different test types. Here’s what you can expect:

Quality Dimensions and Test Types

Our platform currently supports four quality dimensions, each associated with specific test types:

  • Accuracy: Measures how close the data values are to the true values. Tests include β€œRegex” and β€œValue in.”

  • Completeness: Measures the extent to which all required data elements are present. Tests include β€œNot Null.”

  • Uniqueness: Checks each data record to ensure it is unique within the dataset. Tests include β€œIs Unique.”

  • Validity: Ensures data conforms to acceptable standards, such as ranges and formats. Tests include β€œIs Email” and β€œIs UUID.”

  • Timeliness: Measures how up-to-date the data is.

  • Consistency: Measures reliability and uniformity of data within datasets.

  • Granularity: Measures level of detail or the degree of aggregation present.

  • Others: Any other tests not within the other categories.

The Data Quality (DQ) score is calculated daily using the formula:

The health score for each dimension is the average of all monitors over the selected time period.

Data Health Score

The Data Health Score represents the average score for all dimensions over the selected time period. The scores are color-coded for easy interpretation:

β€’ Green (> 98%): Excellent health

β€’ Yellow (95% - 98%): At risk

β€’ Red (< 95%): Poor health

Custom Date Range and Filters

The custom date range supports up to six months, allowing for in-depth analysis over a quarter. The Quality dashboard also includes various filters to help you narrow down your data view, such as:

β€’ Domains

β€’ Data sources

β€’ Data Owners

β€’ Monitor mode (Scheduled, On-demand)

β€’ Row creation preferences (filter for 'All Records' scan only)

β€’ Tags

β€’ Classifications

Source/Domain Summary

The Source/Domain Summary in the Quality tab provides results based on selected domains and shows scores for key quality metrics. This helps you gain a deeper understanding of your data’s health across different data sources and domains, making it easier to pinpoint areas for improvement.

DQScore=1βˆ’(RfmRtm)DQ Score = 1 - \left( \frac{R_{fm}}{R_{tm}} \right)DQScore=1βˆ’(Rtm​Rfm​​)

where RfmR_{fm}Rfm​ is the count of failed rows, and RtmR_{tm}Rtm​ is the total count of rows in a monitor scan.

Only a select few types of monitors can produce the health score. To know which monitors generate health scores, .

πŸ“Š
read this article
Overview of Quality Dashboard
Overview for Filter pop-up/module
Overview for Source/Domain Sumamry with poor, at risk and excellent health score