About Edge and Collibra Cloud capabilities

A capability, like Sampling or S3 synchronization, is an application that can run on an Edge or Collibra Cloud site. It can access a data source to extract and process data as needed. This data can be stored in an encrypted cache to improve the security of your data and platform. A capability for a specific data source runs as a job and delivers the output to Collibra Platform in a secure and reliable way.

A capability has a capability template that defines a specific use case, for example, data source ingestion.

Capability templates

A capability template is developed for a specific task on a specific data source type. The capability template also determines which properties are available to configure the capability.

Note If there is an integration you want that is not listed below, contact your Account Executive for more options.
 
Capability template Description Supported for Edge sites? Supported for Collibra Cloud sites?
ADLS synchronization Used to connect to Azure Data Lake Storage (ADLS) Yes Yes
AWS Bedrock AI

Used to integrate with Amazon Bedrock.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes Yes
AWS SageMaker AI

Used to integrate with Amazon SageMaker.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes Yes
Azure AI Foundry

Used to integrate with Azure AI Foundry.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes Yes
Azure ML

Used to integrate with Microsoft Azure AI.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes Yes
Catalog Data Classification

Used to classify data from a registered JDBC data source in the site.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes

Yes, but limited.

Catalog JDBC ingestion

Used to register a data source and synchronize schemas from a data source via a JDBC connection.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes

Yes, but limited.

Catalog JDBC Sampling

Used to collect and cache sample data from a data source in the site via a JDBC connection.

Ensure that you meet the additional Catalog JDBC Sampling hardware requirements, in addition to the Edge site requirements.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes

Yes, but limited.

Collibra Protect for AWS Lake Formation

Used to set up Protect for AWS Lake Formation.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes Yes
Collibra Protect for Databricks

Used to set up Protect for Databricks. This capability appears only if the following parameter is added to the JVM configuration in Collibra Console: -Dfeature.protect.databricks=true

This capability can't be added to an Edge site that uses a MITM proxy.

Yes Yes
Collibra Protect for Google BigQuery

Used to set up Protect for BigQuery.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes Yes
Collibra Protect for Snowflake

Used to set up Protect for Snowflake.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes Yes
GCS synchronization Used to connect to Google Cloud Storage. Yes Yes
Google Dataplex Catalog synchronization

Used to connect to Google Dataplex Catalog. The way to complete this capability depends on the Dataplex integration type you want to use: Dataplex ingestion or Dataplex Catalog ingestion.

Yes Yes
Google Vertex AI

Used to integrate Google Vertex AI.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes Yes
JDBC Profiling

Used to profile and classify data from a registered data source.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes

Yes, but limited.

MLflow AI

Used to integrate MLflow.

This capability can't be added to an Edge site that uses a MITM proxy.

Yes Yes
SAP AI Core Used to integrate with SAP AI Core. Yes Yes
S3 synchronization Used to connect to Amazon S3. Yes Yes
Databricks Unity Catalog synchronization Used to connect to Databricks Unity Catalog. Yes Yes
Technical Lineage Admin

Used to run any of the following technical lineage admin options:

  • List sources
  • Ignore sources
  • Analyze files
  • Sync

Yes Yes
Technical lineage capabilities

Used to create technical lineage for different data sources. For details, go to: Add a technical lineage capability to an Edge site.

Ensure that you meet the additional Technical Lineage minimum network requirements, in addition to the Edge site requirements.

You can use a man-in-the-middle (MITM) proxy between your Edge site and the Collibra Data Lineage service instances. For details on which data sources support the use of proxies, go to Create a technical lineage via Edge, select your data source, and see our test results in the Connect to a proxy server section.

Yes Yes, but limited.

Important While these capability templates are available for all customers, the features for which you use them might still be in beta.

Capability template structure

Each capability template contains the following:

File

Description

A manifest file (YAML)

This file contains the capability metadata and input parameter requirements.

A workflow file (YAML)

This file defines the workflow and binds the parameters to capability containers.

Docker images

One or more Docker images that implement the business logic.

Note Each type of capability has its own required custom properties. These properties appear after you select a capability template from the dropdown menu.