Enable profiling and classification via Edge

To enable Edge profiling and classification of synchronized metadata in Data Catalog, you need to run a command and enable multiple settings.

Depending on your environment, follow this procedure either in the Services Configuration section of the Collibra settings or in Collibra Console:

 

Before you begin

You have enabled Database registration via Edge.

Required permissions

Steps

  1. Run the command to enable classification on your Edge site.
  2. Open the Services Configuration page.
    1. On the main menu, click , and then click Settings.
      The Settings page opens.
    2. In the tab pane, click Services Configuration.
    3. Click Edit configuration.
    Open the DGC service settings for editing:
    1. Open Collibra Console.
      Collibra Console opens with the Infrastructure page.
    2. In the tab pane, expand an environment to show its services.
    3. In the tab pane, click the Data Governance Center service of that environment.
    4. Click Configuration.
    5. Click Edit configuration.
  3. In the Data profiling section, enter the required information:

    Setting

    Description

    Database profiling via Edge

    An option to enable profiling and classifying of synchronized metadata via Edge instead of Jobserver.

    • True: Profiling and classification via Edge.
    • False: Profile via Jobserver and classify via the Data Classification Platform.

    Note You can enable Database profiling via Edge only if you also enabled Database registration via Edge.

    Parallel database profiling via Edge

    The maximum number of databases that Edge can profile and classify at the same time.

    Note Schemas in a database are always processed sequentially.

    By default, the value of the setting is one. This means Edge processes one profiling job at a time. The maximum value is four.
    If you change this setting, you must restart Collibra.

    Note 
    • You don't need to enable setting Anonymize data because this setting is not relevant for Edge. Edge only sends the profiling results and classification suggestions to Collibra Data Intelligence Cloud. The profiling results are automatically anonymized for columns of data type Text and Geo before they are sent to Data Catalog.
    • You don't need to enable setting Enable Data Classification in the Data Classification configuration section. This setting relates only to the Data Classification Platform.
      If this setting is set to true, the Classify button is available on Column and Table asset pages. This button allows you to classify data via the Data Classification Platform. However, when using profiling and classification via Edge, you don't need the Data Classification Platform.
  4. Click Save all.