Outliers
The Outliers monitor detects values that differ significantly from the rest of the data and may indicate bad or incorrect data. Numerical outliers are detected with the IQR and box plot methods.
The following table describes the information available on the Outliers tab of the Findings page.
Column | Description |
---|---|
Key | The column assigned as a Key column. Any outliers that Collibra DQ are grouped by this column if and when a Key is assigned. |
Column | The column where a potential outlier is detected. |
Value | The value of the column of the detected outlier. |
Count | The number of potential outliers in the column. |
Predicted | The type of value that Collibra DQ predicts for a given run, for example, categorical. This prediction is based on the observed values of previous runs. |
Conf |
The confidence score, ranging from 0 to 100, indicates how far the current value is from the lower or upper bound. Lower scores such as 0 or 1, indicate a higher likelihood of the value being an outlier. Conversely, higher scores, such as 97, suggest a lower likelihood of the value being an outlier. |
Status |
Lets you label and train a finding. a finding. The available dropdown menu options are Validate, Invalidate, and Resolve. Validate instructs Collibra DQ to either assign a finding to a specific user for review, which then appears in the View the Assignment Queue or acknowledge without an assignee that the finding is a valid observation. Invalidate instructs Collibra DQ to ignore a finding and allow the value to pass. There are two invalidation options:
Tip When you have many findings to invalidate, it may be best to use the Save option to invalidate them at the same time, once all findings are reviewed. Resolve Instructs Collibra DQ to mark the finding as an observation and prevents it from appearing in future runs. Resolving a finding does not immediately affect data quality scores. |
Profile |
The user account that is assigned to this outlier finding. When the Status is Assigned, a user profile displays in this column. Note When an outlier finding is unassigned, the profile column is empty. |
Link ID |
Links back to the detected record for remediation. Note Link ID is not available for categorical outliers. |
Action |
In Pushdown mode, you can download either a CSV or JSON file containing details of the break records. |
Note When you assign a Date and Key column in an Outlier configuration, Collibra DQ may also discover Record finding.
Invalidate All
Invalidate All instructs Collibra DQ to ignore all outlier findings and allow the values to pass.
Exporting outlier records
There are two options above the drill-in table to export the details of your outlier records as .xlsx files:
- Export generates an Excel file with the details from the drill-in.
- Export with Details generates an Excel file with the details from the drill-in and the data preview, when available.