Synchronize Azure ML

Synchronizing Azure ML is the process of integrating metadata from Azure ML and making the data available in Collibra Data Intelligence Platform.

You can synchronize manually, or you can automate it by adding a synchronization schedule.

Before you begin

Required permissions

  • You have a Microsoft Azure AI service account with the following permissions:
    • Microsoft.MachineLearningServices/workspaces/jobs/read
    • Microsoft.MachineLearningServices/workspaces/models/versions/read
  • You have a resource role with the Configure external system resource permission, for example, Owner.
  • You have a global role with the Catalog global permission, for example, Catalog Author.
  • You have a global role with the View Edge connections and capabilities global permission, for example, Edge integration engineer.
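One way to grant exactly the two Azure read permissions listed above is an Azure custom role. The following is a minimal sketch, not a Collibra-provided procedure: the role name, the description, and the subscription ID are hypothetical placeholders, and your organization may prefer a built-in role that already includes these actions.

```python
import json

# The two read-only actions listed in the required permissions above.
REQUIRED_ACTIONS = [
    "Microsoft.MachineLearningServices/workspaces/jobs/read",
    "Microsoft.MachineLearningServices/workspaces/models/versions/read",
]


def build_role_definition(subscription_id, role_name="Collibra Azure ML Reader"):
    """Return an Azure custom role definition that grants only the required actions.

    The role name and description are hypothetical; choose your own.
    """
    return {
        "Name": role_name,
        "IsCustom": True,
        "Description": "Read access to Azure ML jobs and model versions.",
        "Actions": list(REQUIRED_ACTIONS),
        "NotActions": [],
        # Assumption: the role is assignable at subscription scope.
        "AssignableScopes": [f"/subscriptions/{subscription_id}"],
    }


if __name__ == "__main__":
    # Placeholder subscription ID; replace it with your own.
    role = build_role_definition("00000000-0000-0000-0000-000000000000")
    print(json.dumps(role, indent=2))
```

The printed JSON can be saved to a file and, for example, passed to the Azure CLI with `az role definition create --role-definition @role.json`, after which the role is assigned to the service account.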

Steps

  1. On the main toolbar, click the Products icon, and then click Catalog.
    The Catalog Home opens.
  2. On the main toolbar, click the Create icon.
    The Create dialog box appears.
  3. In the Register with Edge section of the Create dialog box, click Integration Configuration.
    The Integration Configuration page opens.
  4. In the Connection name column, locate the Azure connection that you used when you added the Azure ML capability and click the link in the Capabilities column.
    The Synchronization page opens.
  5. In the Synchronization Configuration section, click Add Configuration.
  6. In Domain, select the Domain asset in which you want to add the Azure ML assets.
  7. In Resource Group Name, enter the name of the resource group which holds resources related to your Azure ML model. For more information, go to Azure's resource documentation.
  8. In Workspace Name, enter a name for your workspace. This must be a unique name within your resource group. For more information, go to Azure's resources documentation.
  9. Click Save.
  10. Click Synchronize.
    A notification indicates the synchronization has started.
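The Resource Group Name and Workspace Name values entered in the steps above identify one Azure ML workspace within an Azure subscription. As a rough illustration, the sketch below composes the Azure Resource Manager ID of that workspace from the two values. The subscription ID is an assumption here: it does not appear on the synchronization page, and the function name is hypothetical.

```python
def workspace_resource_id(subscription_id, resource_group, workspace_name):
    """Compose the Azure Resource Manager ID of an Azure ML workspace."""
    fields = {
        "subscription_id": subscription_id,
        "resource_group": resource_group,
        "workspace_name": workspace_name,
    }
    for label, value in fields.items():
        # Reject empty values and stray whitespace, a common copy-paste mistake.
        if not value or value != value.strip():
            raise ValueError(f"{label} must be non-empty with no surrounding spaces")
    return (
        f"/subscriptions/{subscription_id}"
        f"/resourceGroups/{resource_group}"
        f"/providers/Microsoft.MachineLearningServices"
        f"/workspaces/{workspace_name}"
    )


if __name__ == "__main__":
    # All three values are placeholders.
    print(workspace_resource_id(
        "00000000-0000-0000-0000-000000000000", "my-resource-group", "my-workspace"
    ))
```

Comparing this ID with the one shown for the workspace in the Azure portal is a quick way to confirm that both names were entered correctly.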
To synchronize automatically, add a synchronization schedule instead:

  1. On the main toolbar, click the Products icon, and then click Catalog.
    The Catalog Home opens.
  2. On the main toolbar, click the Create icon.
    The Create dialog box appears.
  3. In the Register with Edge section of the Create dialog box, click Integration Configuration.
    The Integration Configuration page opens.
  4. In the Connection name column, locate the Azure connection that you used when you added the Azure ML capability and click the link in the Capabilities column.
    The Synchronization page opens.
  5. In the Synchronization Configuration section, click Add Configuration.
  6. In Domain, select the Domain asset in which you want to add the Azure ML assets.
  7. In Resource Group Name, enter the name of the resource group which holds resources related to your Azure ML model. For more information, go to Azure's resource documentation.
  8. In Workspace Name, enter a name for your workspace. This must be a unique name within your resource group. For more information, go to Azure's resources documentation.
  9. Click Save.
  10. In the Synchronization Schedule section, click Add schedule.
  11. Enter the required information and click Save:
    Field         Description
    Repeat        The interval when you want to synchronize automatically. The possible values are: Daily, Weekly, Monthly, and Cron expression.
    Cron          The Quartz Cron expression that determines when the synchronization takes place.
                  This field is only visible if you select Cron expression in the Repeat field.
    Every         The day on which you want to synchronize, for example, Sunday.
                  This field is only visible if you select Weekly in the Repeat field.
    Every first   The day of the month on which you want to synchronize, for example, Tuesday.
                  This field is only visible if you select Monthly in the Repeat field.
    At            The time at which you want to synchronize automatically, for example, 14:00.
                  • You can only schedule on the hour. For example, you can add a synchronization schedule at 8:00, but not at 8:45.
                  • This field is only visible if you select Daily, Weekly, or Monthly in the Repeat field.
    Time zone     The time zone for the schedule.
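For the Cron expression option, it can help to see how the other Repeat choices would look in Quartz syntax. The sketch below is an illustration only: the function name is hypothetical, and it assumes that "Every first" means the first occurrence of the chosen weekday in each month. Quartz fields run in the order seconds, minutes, hours, day of month, month, day of week, and Quartz numbers weekdays from 1 (Sunday) to 7 (Saturday).

```python
# Quartz day-of-week numbers: 1 = Sunday through 7 = Saturday.
QUARTZ_DAY_NUMBERS = {"SUN": 1, "MON": 2, "TUE": 3, "WED": 4, "THU": 5, "FRI": 6, "SAT": 7}


def quartz_cron(repeat, hour, day=None):
    """Build an on-the-hour Quartz Cron expression for a Daily, Weekly, or Monthly schedule."""
    if not 0 <= hour <= 23:
        raise ValueError("hour must be between 0 and 23")
    if repeat == "Daily":
        # Every day at the given hour; '?' leaves the day of week unspecified.
        return f"0 0 {hour} * * ?"
    key = (day or "").upper()[:3]
    if key not in QUARTZ_DAY_NUMBERS:
        raise ValueError("day must be a weekday name, for example, Sunday")
    dow = QUARTZ_DAY_NUMBERS[key]
    if repeat == "Weekly":
        # Every week on the given weekday.
        return f"0 0 {hour} ? * {dow}"
    if repeat == "Monthly":
        # '#1' selects the first occurrence of that weekday in the month.
        return f"0 0 {hour} ? * {dow}#1"
    raise ValueError("repeat must be Daily, Weekly, or Monthly")


if __name__ == "__main__":
    print(quartz_cron("Daily", 14))
    print(quartz_cron("Weekly", 14, "Sunday"))
    print(quartz_cron("Monthly", 14, "Tuesday"))
```

Under these assumptions, a weekly schedule on Sunday at 14:00 corresponds to `0 0 14 ? * 1`. Whether the Cron field accepts schedules that are not on the hour is not stated here, so test any such expression before relying on it.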

What's next?

The synchronization job synchronizes the Azure ML data.
After the synchronization: