Create a data class

You can create data classes in multiple locations:

  • The Data Classification Data Classes page in the Stewardship application.
    • To add classification rules to a data class, or to update and delete data classes, you must always go to the Data ClassificationData Classes page in the Stewardship application.
    • If you want to use automatic data classification, create data classes on the Data ClassificationData Classes page. This allows immediate configuration of the classification rules.
  • The asset pages where you can update the classification, such as Column asset pages.
  • Asset views where the Data Classification column has been added.

Prerequisites

  • You have a global role that has the Product Rights > Catalog global permission.
  • You have a global role that has the Data Stewardship Manager global permission.
  • You have a global role that has the Classification > Data Classes > Read global permission.
  • You have a global role that has the Classification > Data Classes > Add global permission.

For more information, go to Required permissions.

Steps

  1. On the main toolbar, click Products iconStewardship.
  2. Go to Data Classification Data Classes.
    The Data Classes page opens.
  3. If the data class doesn't exist yet:
    1. Click Add.
    2. Type the name of the data class and press Enter.
    3. Click Add.
  4. Hover over the data class name and click Preview.
    The data class parameters appear in a pane on the right-hand side.
  5. Optionally, change the name by clicking the Name field, typing the name, and clicking the Save icon.
  6. Make sure the data class is enabled, unless you don't want the data classification process to use it yet.
  7. Optionally, add a description by clicking the Description field, typing the description, and clicking the Save icon.
  8. Open the Details section.
  9. Complete the fields as required.

    To save a value, click the Save icon.

  10. Open the Classification rules section.
  11. Click Add new rule.

    A data class without a classification rule can be used only for manual classification.
    To allow the automatic data classification process to pick up the data class, you need to add at least one classification rule.
    A data class can include multiple rules, and the rules can be of different types.

  12. From the Type list, select the type of classification rule that you want to add to the data class.
    • Add a Regular expression for column names rule to check the name of a column in the data source.
      Unlike the Column name filter, which makes the name a prerequisite to consider the data class, a rule based on name serves as a criteria to apply the data class.
    • Add a Data type rule to check the data type of a column in the data source.
      Unlike the Column type filter, which makes the data type mandatory to consider the data class, a rule based on data type serves as a criteria to apply the data class. For an example, go to Example | Importing data classes and starting automatic classification for a table.
    • Add a Regular expression for data rule to validate a pattern, such as the format of email addresses.
    • Add a List of values for data rule to check for specific, predefined options, such as T-shirt sizes.

    Depending on your selection, extra fields appear.

  13. Complete the fields as required.

  14. Click Save.
    The classification rule for the data class is configured.
    A new section appears. If you expand the section, the details are shown.
  15. If needed, click Add new rule to add another classification rule to the data class.
    • You can combine regular expression for column names, regular expression for data, list of values for data, and data type rules in one data class.
    • The maximum number of rules in a data class is 25.
    • During the automatic data classification process, each rule is verified and the data class is assigned as soon as one of the rules applies.
    • Important 

      By default, rules based on column name and data type are evaluated before rules based on samples, such as regular expressions for data and lists of values for data. Rules based on samples are evaluated in the order in which they appear in the data class.

What's next

Related topics

Helpful resources

Contextualize your data with Unified Data Classification course in Collibra University