Working with Dataset Rules
The Dataset Rules page lists all previously saved rules for a given dataset and provides an overview of their details, such as the definitions of SQL conditions, rule types, and whether or not rules pass validation checks. You can also take several quick actions from this page, such as accessing the Rule Workbench or deleting a rule from a dataset. The ability to manage rules from a single page can shorten the amount of time you spend assessing what is needed to meet your business requirements.
Opening Dataset Rules
The following steps show you how to open the list of applicable rules on your dataset.
- Click the in the sidebar menu, then click Rule Builder. The Dataset Rules page opens.
- On the Dataset Rules page, enter the name of your dataset in the Search for a Dataset searchbar, then select it. The Dataset Rules page displays any existing rules on your dataset.
Viewing rule details
When one or more rules are available on the selected dataset, the table on the Dataset Rules page contains the following detailed information.
Column | Description |
---|---|
Rule Name |
The name of your rule. Hover your cursor over the rule name and click to edit your rule. |
SQL | The SQL condition of your rule. |
Type | The type of rule. |
Column | The primary column that your rule queries. |
Repo | The data class or template from which a rule is created. This only applies to custom rules, such as Data Type, Data Class, and Template. |
Valid |
Shows whether your rule passes rule validation. shows that your rule passes validation. shows that your rule does not pass validation. Note If you see , you can click it to view more details about why your rule does not pass validation. |
Active |
Shows whether your rule is active for future runs of the dataset. Click the icon to change the active status of your rule. shows that your rule is active. shows that your rule is inactive. |
Points |
The number of points that Collibra DQ deducts from your data quality score when data breaches the conditions of your rule. You can set this value on the Rule Details modal of the Rule Workbench. If you do not customize this value, then Collibra DQ uses the default value of 1. |
% | If you do not customize this value on the Rule Details modal, then Collibra DQ uses the default value of 1. |
Category |
The data category that you optionally define on the Workbench. |
Dimension | The DQ Dimension that you optionally assign to your rule on the Workbench. |
Timeout Limit |
The number of minutes that any active rule can take to process before it times out and the tool automatically cancels the job. This limit is useful for keeping problematic rules from consuming too many resources. The maximum timeout limit is determined by the greatest Timeout Limit value of any active rule displayed on the table under the Rules tab. Example If there are 3 active rules associated with a dataset, where Rule 1 has a timeout limit of 20 minutes, Rule 2 has a limit of 30 minutes, and Rule 3 has a limit of 60 minutes, then the dataset has 60 minutes per rule to process. If Rule 3 is inactive, the maximum timeout limit for the dataset is 30 minutes, because Rule 2 has the next highest maximum limit of any active rule. Tip You can increase this value for an individual rule from Rule Details on the Rule Workbench. |
Actions | Click Actions to edit or delete your rule. |
Adding new rules
Click Add Rule to create new rules to run against your dataset.