Synchronize Amazon Bedrock

Synchronizing AWS Bedrock AI is the process of integrating metadata from Amazon Bedrock and making the data available in Collibra Platform.

You can synchronize manually, or you can automate it by adding a synchronization schedule.

In your Collibra environment

In your Amazon Bedrock environment

You must have an AWS IAM role with the following permissions:

  • bedrock:ListFoundationModels
  • bedrock:ListImportedModels
  • bedrock:ListCustomModels
  • bedrock:GetCustomModel
  • bedrock:ListCustomModelDeployments
  • bedrock:GetCustomModelDeployment
  • bedrock:ListAgents
  • bedrock:GetAgent
  • bedrock:ListAgentActionGroups
  • bedrock:GetModelInvocationLoggingConfiguration

For more information about Amazon Bedrock permissions, go to Amazon Bedrock documentation.

Steps

  1. On the main toolbar, click Products icon Catalog.
    The Catalog homepage opens.
  2. Click Integrations.
    The Integrations page opens.
  3. Click the Integration configuration tab.
  4. In the Connection name column, locate the AWS connection that you used when you added the AWS Bedrock AI capability and click the link in the Capabilities column.
    The Synchronization page opens.
  5. In the Synchronization Configuration section, click Add Configuration.
  6. In Domain, select the Domain asset in which you want to add the AWS Bedrock AI assets.

    Important Ensure that you select a domain of the type Technology Asset Domain.

  7. Optionally, in AWS Regions, select the region of the AWS Bedrock AI assets. If no regions are selected, the integration searches all regions were Bedrock is available.
    Warning If you ingest metadata from a region containing a deprecated foundational model, Collibra creates a placeholder model for the deprecated model. This is to ensure the synchronization doesn't fail due to references to a model that no longer exists in your AWS Bedrock AI catalog.
  8. Optionally, select the Do you want to ingest input and output assets? checkbox if you want to ingest the following S3 data:
    • Input: Training data ingested as File and Storage Container assets.
    • Output: Region-level invocation logging data ingested as Storage Container asses.

    For more information, go to AI model traceability: automatic linking of AI Governance assets.

  9. Optionally, in Custom AI Metrics Mappings, define which custom AWS Bedrock AI Model metrics you want to integrate. You do this by adding the mapping between the custom metric and the Collibra attribute.
  10. Click Save.
  11. Click Synchronize.
    A notification indicates the synchronization has started.
  1. On the main toolbar, click Products icon Catalog.
    The Catalog homepage opens.
  2. Click Integrations.
    The Integrations page opens.
  3. Click the Integration configuration tab.
  4. In the Connection name column, locate the AWS connection that you used when you added the AWS Bedrock AI capability and click the link in the Capabilities column.
    The Synchronization page opens.
  5. In the Synchronization Configuration section, click Add Configuration.
  6. In Domain, select the Domain asset in which you want to add the AWS Bedrock AI assets.

    Important Ensure that you select a domain of the type Technology Asset Domain.

  7. Optionally, select the Do you want to ingest input and output assets? checkbox if you want to ingest the following S3 data:
    • Input: Training data ingested as File and Storage Container assets.
    • Output: Region-level invocation logging data ingested as Storage Container asses.

    For more information, go to AI model traceability: automatic linking of AI Governance assets.

  8. Optionally, in Custom AI Metrics Mappings, define which custom AWS Bedrock AI Model metrics you want to integrate. You do this by adding the mapping between the custom metric and the Collibra attribute.
  9. Click Save.
  10. In the Synchronization Schedule section, click Add schedule.
  11. Enter the required information and click Save:
    FieldDescription
    RepeatThe interval when you want to synchronize automatically. The possible values are: Daily, Weekly, Monthly, and Cron expression.
    Cron

    The Quartz Cron expression that determines when the synchronization takes place.

    This field is only visible if you select Cron expression in the Repeat field.

    Every

    The day on which you want to synchronize, for example, Sunday.

    This field is only visible if you select Weekly in the Repeat field.

    Every first

    The day of the month on which you want to synchronize, for example, Tuesday.

    This field is only visible if you select Monthly in the Repeat field.

    At

    The time at which you want to synchronize automatically, for example, 14:00.

    • You can only schedule on the hour. For example, you can add a synchronization schedule at 8:00, but not at 8:45.
    • This field is only visible if you select Daily, Weekly, or Monthly in the Repeat field.
    Time zoneThe time zone for the schedule.

What's next

The synchronization job synchronizes the AWS Bedrock AI data.
After the synchronization: