Profile and classify data via Edge

After synchronizing schemas, you can start the profiling and classification process.

Note Collibra Data Intelligence Cloud only has access to synchronized metadata, anonymized profiling results and classification suggestions, but not to actual data from your data source.

Prerequisites

Steps

  1. Open the Database asset page of a registered database.
  2. In the tab pane, click Configuration.
  3. Click the Profiling and Classification tab.
  4. On the Profiling and Classification tab page, click Run profiling and classification.
    Data Catalog triggers the Edge site to start a profiling and classification job.
    Depending on your profiling options, the Edge site profiles and classifies based on all synchronized metadata or on a sample.
  1. Open the Database asset page of a registered database.
  2. In the tab pane, click Configuration.
  3. Click the Profiling and Classification tab.
  4. In the Profiling options section, click Edit.
  5. Select Automatically run when a metadata extraction is synchronized.
  6. Synchronize one or more schemas.
    When the schemas are synchronized, Data Catalog automatically triggers the Edge site to start a profiling and classification job.
  1. Open the Database asset page of a registered database.
  2. In the tab pane, click Configuration.
  3. Click the Profiling and Classification tab.
  4. In Synchronization schedule, click Add schedule to add a new schedule, or to edit an existing schedule.
    The Edit scheduling dialog box appears.
  5. Enter the required information.
    FieldDescription
    RepeatThe interval when you want to synchronize the schemas automatically, for example daily, weekly or based on a Cron expression.
    Cron

    The Quartz Cron expression that determines when the synchronization takes place.

    This field is only visible if you select Cron expression in the Repeat field.

    Every

    The day on which you want to synchronize the schemas, for example Sunday.

    This field is only visible if you select Weekly in the Repeat field.

    Every first

    The day of the month on which you want to synchronize the schemas , for example Tuesday.

    This field is only visible if you select Monthly in the Repeat field.

    At

    The time at which you want to synchronize the schemas automatically, for example 14:00.

    This field is only visible if you select Daily, Weekly or Monthly in the Repeat field.

    TimezoneThe time zone for the schedule.
  6. Click Save.
    All synchronized schemas rules are profiled and classified according to the schedule.
    Depending on your profiling options, the Edge site profiles and classifies based on all synchronized metadata or on a sample.

What's next?

The Edge site starts the profiling and classification process and sends the results to Collibra Data Intelligence Cloud. You can see the profiling and classification job in the list of activities. Click the Result button to open the data profiling results.