Outliers

The Outliers monitor detects values that differ significantly from the rest of the data and may indicate bad or incorrect data. Numerical outliers are detected with the IQR and box plot methods.
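To illustrate the general idea behind IQR and box plot detection, the following is a minimal Python sketch. It is not Collibra DQ's implementation: the function names `iqr_bounds` and `find_outliers`, the multiplier `k = 1.5` (the conventional box plot fence), and the sample data are all assumptions chosen for illustration.

```python
from statistics import quantiles


def iqr_bounds(values, k=1.5):
    """Return the (lower, upper) box plot fences for a list of numbers.

    k=1.5 is the conventional fence multiplier; it is an assumption here,
    not a documented Collibra DQ setting.
    """
    # Quartiles computed with linear interpolation over the sorted data.
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def find_outliers(values, k=1.5):
    """Return the values that fall outside the lower or upper fence."""
    lower, upper = iqr_bounds(values, k)
    return [v for v in values if v < lower or v > upper]


# Hypothetical sample: six similar values and one extreme value.
sample = [10, 12, 11, 13, 12, 11, 95]
print(iqr_bounds(sample))     # (8.75, 14.75)
print(find_outliers(sample))  # [95]
```

In this sketch, any value below the lower fence or above the upper fence is flagged as a potential outlier, which mirrors the lower and upper bounds referred to by the confidence score described in the table below.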

The following table describes the information available on the Outliers tab of the Findings page.

Column Description
Key The column assigned as a Key column. When a Key is assigned, any outliers that Collibra DQ detects are grouped by this column.
Column The column where a potential outlier is detected.
Value The column value that is detected as a potential outlier.
Count The number of potential outliers in the column.
Predicted The type of value that Collibra DQ predicts for a given run, for example, categorical. This prediction is based on the observed values of previous runs.
Conf

The confidence score, ranging from 0 to 100, indicates how far the current value is from the lower or upper bound.

Lower scores, such as 0 or 1, indicate a higher likelihood of the value being an outlier. Conversely, higher scores, such as 97, suggest a lower likelihood of the value being an outlier.

Status

Lets you label and train a finding. The available dropdown menu options are Validate, Invalidate, and Resolve.

Validate instructs Collibra DQ to either assign a finding to a specific user for review, in which case it appears in the Assignment Queue, or acknowledge, without an assignee, that the finding is a valid observation.

Invalidate instructs Collibra DQ to ignore a finding and allow the value to pass. There are two invalidation options:

  • Save lets you mark a finding as invalidated.
  • Save & Retrain lets you invalidate a finding together with any previously saved invalidated findings.

Tip When you have many findings to invalidate, it may be best to use the Save option to invalidate them at the same time, once all findings are reviewed.

Resolve instructs Collibra DQ to mark the finding as an observation and prevent it from appearing in future runs. Resolving a finding does not immediately affect data quality scores.

Profile

The user account that is assigned to this outlier finding. When the Status is Assigned, a user profile displays in this column.

Note When an outlier finding is unassigned, the Profile column is empty.

Link ID

Links back to the detected record for remediation.

Note Link ID is not available for categorical outliers.

Action

In Pushdown mode, you can download either a CSV or JSON file containing details of the break records.

Note When you assign a Date and Key column in an Outlier configuration, Collibra DQ may also discover Record findings.

Invalidate All

Invalidate All instructs Collibra DQ to ignore all outlier findings and allow the values to pass.

Exporting outlier records

There are two options above the drill-in table to export the details of your outlier records as .xlsx files:

  • Export generates an Excel file with the details from the drill-in.
  • Export with Details generates an Excel file with the details from the drill-in and the data preview, when available.