Data quality
Last updated
Last updated
The Data quality dashboard helps you monitor the health status of data assets in your project. Health status is essentially the interpretation of test results of assets with test categories.
Data quality dashboard currently supports monitoring data model assets.
You can easily view data quality in two ways:
Get a bird's-eye view through test coverage summaries and quality matrix, or
Zoom in on specific assets to investigate potential issues in detail.
This guide walks you through the interface of the dashboard.
To access the dashboard, from the top navigation bar, go to Health tracking > Data quality.
First, the overview page displays two sections: Test coverage and Health matrix, showing data quality of a specific environment. You can switch between the Development and Production environments using the toggle in the top-right corner. Recurve fetches all the assets and health statuses of the environment and reflects in the dashboard.
There are two types of coverages:
Monitoring coverage (pie chart) shows the percentage of models in your project that are being monitored through testing. Higher percentage means more assets are being covered.
Monitoring coverage by test category (horizontal bar chart) breaks down the Monitoring coverage by showing the percentage of assets with tests in specific categories. This helps identify areas that may lack sufficient testing. For example, high coverage in "Completeness" but lower coverage in "Validity" could indicate that more attention is needed to ensure data accuracy.
The Health Matrix provides a comprehensive view of all monitored assets. Assets are listed vertically, while columns across the top represent the asset owner, materialization status, and each test category.
Owner: displays the owner of each asset, configurable through metadata.
Materialization Status: indicates the current status of materialization (Success, Failed, or No Status if not yet materialized).
Test Categories: each column represents a test category, with each cell showing the health status of the asset in that category (Passed, Warning, or Error).
The matrix allows you to quickly assess health across multiple categories.
By default the the dashboard shows all types of health statuses with data of the last 14 days. You narrow that down using the Issues only toggle and a customized time range.
You can use the search bar to search for certain assets by names, and use the filter functions to filter out the asset by different properties.
To view detailed health status of an asset, click on the name of the asset from the matrix.
By default, this opens the Data quality tab.
This tab shows all test cases configured to the asset, with test category and target column. Expanding a test case shows all the executions of the test, with the configured severity level and test results.
This section provides a convenient way to view, edit, and add new test cases to an asset.
Switching to the Build history tab shows you all the tasks that involve the asset. Tasks shown here include both those initiated manually and those triggered by scheduled pipeline jobs. This provides users with a comprehensive timeline of the asset’s activity and performance.
Each record provides details about the task run, such as build time, execution duration, and overall health status of the asset.
You can further inspect a task by clicking on it.