Classify columns in a table

When you classify columns in a table, Collibra's Automatic Data Classification platform predicts their data structures. You can then accept or reject each prediction.

There are three methods to classify columns:

  • Via the Database asset page
  • Via the Schema asset page
  • Via the Table asset page

Tip You can also use the physical data connector to manually select a data class for individual columns.

Prerequisites

  • You have a global role with the Catalog global permission, for example Catalog Author.
  • You have created a support ticket via Zendesk to get access to the Automatic Data Classification platform.
  • You have configured Automatic Data Classification for the DGC service.
  • You have the correct permissions to classify tables and columns.
  • You have registered a data source, including these options:
    • Store Data Profile
    • Store Sample Data
  • Data Catalog experience is enabled in the DGC service configuration.
    This will give you access to the improved Schema asset page.

Via the Database asset page

  1. Open the Database asset that contains the tables and columns in the schema you want to classify.
    1. In the main menu, click , then Catalog.
      The Catalog Home opens.
    2. In the subpages, click Technology Assets.
    3. Filter on the Database asset type.
  2. Open the relevant database, and then click Actions → Classify.
    You can follow the status of the classification in Activities.
  3. Open the database asset with the classified columns.
  4. Add the Data Classification column to the table.

    In the Data Classification column, you find the suggested data classes.
    Example of data classification result

  5. Hover over the classification percentages and accept or reject the suggested data class.
    Accept or reject data classification
    • Accepting the classification leaves the classification in the list.
    • Rejecting the classification removes the result from the data classification list.
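
The accept and reject behavior described above can be pictured with a small model. This is only an illustrative sketch, not Collibra's API: the `Suggestion` and `ColumnClassification` classes, the `accepted` flag, and the sample column and percentages are all hypothetical. It shows that accepting keeps a suggested data class in the list, while rejecting removes it.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Suggestion:
    """One suggested data class for a column (hypothetical model)."""
    data_class: str
    confidence: float  # percentage shown in the Data Classification column
    accepted: bool = False  # assumption: acceptance is tracked as a flag


@dataclass
class ColumnClassification:
    """Suggested data classes for a single column (hypothetical model)."""
    column: str
    suggestions: List[Suggestion] = field(default_factory=list)

    def accept(self, data_class: str) -> None:
        # Accepting leaves the suggestion in the list and marks it accepted.
        for suggestion in self.suggestions:
            if suggestion.data_class == data_class:
                suggestion.accepted = True

    def reject(self, data_class: str) -> None:
        # Rejecting removes the suggestion from the list.
        self.suggestions = [
            s for s in self.suggestions if s.data_class != data_class
        ]


# Made-up sample data for illustration only.
col = ColumnClassification(
    "email_address",
    [Suggestion("Email", 92.0), Suggestion("URL", 8.0)],
)
col.accept("Email")
col.reject("URL")
remaining = [(s.data_class, s.accepted) for s in col.suggestions]
print(remaining)
```

After these two calls, only the accepted suggestion is left in the list for the column.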

Via the Schema asset page

  1. Open the Schema asset that contains the tables and columns that you want to classify.
    1. In the main menu, click , then Catalog.
      The Catalog Home opens.
    2. In the subpages, click Data Sources.
    3. Click the relevant schema.
  2. Click the Tables tab.
  3. Select one or more tables from the schema.
  4. To classify all columns in the table, click Actions → Classify.

    Tip To classify one or more specific columns, select the columns, then click Actions → Classify.

    You can follow the status of the classification job in Activities.
  5. Open the Table asset with the classified columns.
  6. Add the Data Classification column to the table.

    In the Data Classification column, you find the suggested data classes.
    Example of data classification result

  7. Hover over the classification percentages and accept or reject the suggested data class.

Via the Table asset page

  1. Open a Table asset that has columns you want to classify.
  2. On the Table asset page, do one of the following:
    1. To classify all columns in the table, click Actions → Classify in the upper right corner.
    2. To classify specific columns in the table, select the columns and click Actions → Classify in the upper right corner.
      You can follow the status of the classification job in Activities.
  3. Open the relevant table, and then add the Data Classification column to the table.

    In the Data Classification column, you find the suggested data classes.

    Example of data classification result

  4. Hover over the classification percentages and accept or reject the suggested data class.
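
The choice in step 2 of the Table asset page procedure, all columns versus selected columns, can be sketched as follows. This is a hypothetical helper for illustration only, not part of any Collibra interface: the function name `columns_to_classify` and the sample column names are assumptions. It returns every column when nothing is selected, and only the selected columns otherwise.

```python
from typing import List, Optional


def columns_to_classify(
    table_columns: List[str],
    selected: Optional[List[str]] = None,
) -> List[str]:
    """Return the columns a Classify action would cover (hypothetical helper)."""
    if not selected:
        # No selection: the whole table is classified.
        return list(table_columns)
    wanted = set(selected)
    # Selection present: only the selected columns, in table order.
    return [column for column in table_columns if column in wanted]


# Made-up column names for illustration only.
all_columns = ["id", "email", "phone"]
print(columns_to_classify(all_columns))
print(columns_to_classify(all_columns, ["phone", "email"]))
```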