Example: Configuring a data class based on a regular expression, importing a data class, and starting the automatic classification for a table

Important 

In Collibra 2024.05, we launched a new user interface (UI) for Collibra Data Intelligence Platform! You can learn more about this latest UI in the UI overview.

Use the following options to see the documentation in the latest UI or in the previous, classic UI:

You want to add two new data classes in the Unified Data Classification method:

  • Add an extra data class, Date in dd/mm/yyyy format
  • Import the out-of-the-box data class, Phone number.

Before you begin

Make sure you know which regular expressions you want to use for the data classes. For more information and references to useful resources, go to Add a data class.

Steps

  1. Create and configure the Date data class.

    1. On the main toolbar, click Products icon, and then click Stewardship.
    2. Click the Data Classification tab.
    3. Add the data class.
      1. Click Add.
      2. Add the Name of the data class. In our case, Date.
      3. Press Enter to add the data class.
      4. Click Create.
        The data class is created and is available in the list.
    4. Define the data class parameters.
      1. In the Data Classification tab, select the row of the new data class.
        The data class parameters appear in a pane on the right-hand side.
      2. Optionally, add a description by clicking the Description field, typing the description, and clicking outside the field.
      3. Optionally, add a description by clicking the Edit icon next to the Description field.
      4. Open the Details section.
      5. Complete the fields as required.
        For information on the fields, go to Configuring data classes.
        Data class parameterDescription
        Minimum confidence threshold

        We set this value to 80.

        Include empty values

        We leave this field as the default value (False).

        ExamplesFor Date, we add the following examples:
        23/11/2026, 09/02/2023
      6. Open the Classification rules section.
      7. Click Add new rule.
      8. In the Type list, select Regular expression.
        Extra fields appear.
      9. Complete the fields as required.
        For information on the fields, go to Configuring data classes.
        Data class parameterDescription
        Regular expression

        For Date, we add the following expression:
        (0[1-9]|[12][0-9]|3[01])\/(0[1-9]|1[1,2])\/(19|20)\d{2}

        DescriptionWe leave this field empty.
      10. Click Save.
        The classification rule for the data class is configured.
        If you expand the Classification rules section, you see the details.
  2. Import the Phone number data class.

    1. On the main toolbar, click Products icon, and then click Stewardship.
    2. Click the Data Classification tab.
    3. Click Import.
      A dialog box opens, listing all the out-of-the-box data classes and their status. For information on the possible statuses, go to About out-of-the-box data classes.
    4. Clear all data classes, except Phone number.

    5. Click Import.
      The data class is added.
      If you click the data class, you see the details.
  3. Start the automatic classification.

    1. Navigate to a Table asset.
    2. Select ActionsClassify.
      The data classification process starts. For more information, go to Automatically classify assets
      If a data class matches a column in the Table asset, a data classification suggestion will be assigned to the Column asset with a confidence percentage. For more information, go to accepting and rejecting data classification suggestions.

What's Next?

You can now configure an additional data class that is based on a list of values instead of a regular expression.