About quality score aggregations

A quality score aggregation is a chain of relations from the target asset to the source of the data quality score that instructs Collibra how to calculate scores. The source of the data quality score can be via Data Quality & Observability or an external data quality tool.

Quality score aggregations allow you to:

  • View data quality scores on physical data assets in Data Catalog.
  • View data quality scores on Business, Data, and Governance assets.
  • Control how data quality is compiled across assets.
  • View the health of your assets regardless of the source of your data quality insights.

Quality scores on assets in Data Catalog require a mapping between data quality monitors from a data quality tool, such as Data Quality & Observability, and a corresponding Column or Column subtype asset. Scores from Column and Table assets are aggregated out-of-the-box. However, relation types between assets and non-asset objects require custom aggregation paths for quality scores to be shown.

Example 
This example shows the score aggregation CDQO Database to Column and its fields are explained in the following sections.

data quality rule aggregation path diagram

Type

The source data quality tool from which quality scores and results come.

In the example above, both Collibra data quality and External data quality are shown. However, in your environment, you might see only one. Support for the Data Quality & Observability capability and enabling external data quality tools are controlled via settings in Collibra Console. To edit these settings, please create a support ticket.

Name and Description

The quality score aggregation consists of a unique name and a description that is shown in the Settings page and in the assignments of an asset type.

Target asset type

The asset type that will show the data quality score when the aggregation path is created.

In the example above, Database and Schema Assets will show the data quality score.

Aggregation path

The quality score aggregation aggregates values that are collected through a defined aggregation path. An aggregation path consists of a chain of relations that is to be followed from the asset to which the quality score aggregation is assigned, to the asset containing the actual values.

In the example above, values from Column Assets are aggregated for Database Assets by looking up:

  • The Schema Assets that these Database Assets have.
  • The Table Assets that these Schema Assets contain.
  • The Column Assets that these Table Assets contain.

Examples of aggregation paths

Table to Column

In the following example, the chain of relations in the aggregation path is Table contains Column.

table to column aggregation path diagram

Database to Column

In the following example, the chain of relations in the aggregation path is Technology Asset has Schema, Schema contains Table, and Table contains Column.

database to column aggregation path diagram

Table to Rule

In the following example, the chain of relations in the aggregation path is Table contains Column and Column is governed by Data Quality Rule.

table to data quality rule aggregation path diagram

Database to Metric

In the following example, the chain of relations in the aggregation path is Technology Asset has Schema, Schema contains Table, Table contains Column, Column is governed by Data Quality Rule, and Data Quality Rule executed by Data Quality Metric.

database to metric aggregation path diagram

What's next?