Configure the automatic acceptance and rejection of classification suggestions

Important 

In Collibra 2024.05, we launched a new user interface (UI) for Collibra Data Intelligence Platform! You can learn more about this latest UI in the UI overview.

Use the following options to see the documentation in the latest UI or in the previous, classic UI:

You can configure the classification acceptance and rejection thresholds in two places: Collibra Settings and the Data Classification Configuration. If you configure the classification thresholds in one place, they are also updated in the other place.

Via Collibra settings

Depending on your environment, follow this procedure either on the Services Configuration tab of the Collibra settings or in Collibra Console:

Important You can't edit the Services Configuration from the Settings page in the latest UI. If you use the latest UI, you can configure settings only in Collibra Console. For more information, go to DGC service configuration settings.

Before you start

Steps

  1. Open the Services Configuration page.
    1. On the main toolbar, click Products icon, and then click Cogwheel icon Settings.
      The Collibra settings page opens.
    2. Click Services Configuration.
    3. Click Edit configuration.
    Open the DGC service settings for editing:
    1. Open Collibra Console.
      Collibra Console opens with the Infrastructure page.
    2. In the tab pane, expand an environment to show its services.
    3. In the tab pane, click the Data Governance Center service of that environment.
    4. Click Configuration.
    5. Click Edit configuration.
  2. Go to the Data Classification- Classification Thresholds section.
  3. Enter the required information:

    Setting

    Description

    Enable automatic classification acceptance and rejection

    True: The automatic acceptance and rejection of data classification suggestions is active.

    False (default): Data classification suggestions are not automatically accepted or rejected.

    Tip Start by manually accepting and rejecting a suggested data class. Only activate the automatic acceptance and rejection feature if you are comfortable with the data classification results.

    Automatic acceptance threshold

    The percentage from which data classification suggestions must be accepted automatically.
    If you set this value to 75, then the classification suggestions with a confidence level of 75% or higher are automatically accepted.

    If multiple classification suggestions meet the threshold condition for a column, the classification suggestion with the highest confidence level percentage is accepted automatically if this classification suggestion is the only one to have that confidence level percentage.

    Example 

    You set the automatic acceptance threshold to 85%. You classify a table with 2 columns.

    • For column A, three classification suggestions are possible, one with confidence level 93%, one with 92%, and one with 90%.
    • For column B, two classification suggestions are possible. Their confidence level is the same, 86%.

    The results of the automatic acceptance will be:

    • For column A, the classification suggestion with 93% will be accepted automatically.
    • For column B, nothing is done, both suggestions will be visible.

    The default acceptance threshold is 90.

    Automatic rejection threshold

    The percentage from which data classification suggestions must be rejected automatically. If you set this value to 49, then all data classification suggestions with a confidence level of 49% or lower are automatically rejected.

    The default rejection threshold is 10.

    Note If the acceptance threshold and rejection threshold are set to the same value, and a data classification suggestion has this confidence level percentage, the classification suggestion will be rejected.

  4. Click Save all.

Via Data Classification Configuration

Requirements and permissions

Make sure you have the required permissions.

Steps

  1. On the main toolbar, click Products icon, and then click Stewardship.
  2. Click Data Classification.
    A list of all data classes opens.
  3. Click the Data Classification Configuration icon .
    The Data Classification Configuration dialog opens.
  4. Select Enable automatic classification acceptance and rejection.

    Tip Start by manually accepting and rejecting a suggested data class. Only activate the automatic acceptance and rejection feature if you are comfortable with the data classification results.

  5. Enter the rejection threshold value.
    This is the percentage from which data classification suggestions must be rejected automatically. The default rejection threshold is 10%.
    The bar automatically updates.
  6. Enter the acceptance threshold value.
    This is the percentage from which data classification suggestions must be accepted automatically. The default acceptance threshold is 90%.
    The bar automatically updates.
    Example 

    In this example, we set the rejection value to 25 and the acceptance value to 95. This means that all data classification suggestions with a confidence level of 25% or lower will be rejected automatically. Data classification suggestions with a confidence level of 95% or higher will be accepted automatically.

  7. Move the rejection threshold indicator to the required value.
    The default rejection threshold is 10%.
    Example 

    In this example, we move the rejection indicator to value 25. This means all data classification suggestions with a confidence level of 25% or lower will be rejected automatically. They won't appear as data classification suggestions.

  8. Move the acceptance threshold indicator to the required value.
    The default acceptance threshold is 90%.
  9. Click Save.