Databricks Unity Catalog lineage integration preflight checks

To ensure successful metadata ingestion and lineage generation, complete the following preflight checks.

In your Databricks Unity Catalog environment

Important: Collibra Data Lineage support for Databricks Unity Catalog leverages the system tables feature in Databricks Unity Catalog. The system tables feature is in Public Preview. For details, go to Databricks Previews support & details in Databricks documentation.

As a Databricks Unity Catalog user, ensure that you have the following privileges and permissions:

  • The CAN ATTACH TO and CAN RESTART permissions for your compute resource.
    You can create a dedicated compute resource for generating technical lineage, or use an existing compute resource with Unity Catalog support. For details, go to Compute permissions in Databricks documentation.
    To prevent clusters from running for the entire synchronization duration, you can also configure the Terminate after ... minutes of inactivity setting in Databricks. The setting ensures that clusters automatically stop after a period of inactivity. For more information, go to the Databricks documentation.
  • The following permissions on the system tables, or on a custom catalog as a workaround if you prefer not to grant permissions on the system tables:
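
The compute resource checks in the first bullet above can be sketched in Python. This is a minimal sketch, not part of the Collibra or Databricks tooling. It assumes that you have already retrieved the cluster permissions and the cluster configuration as JSON, for example from the Databricks REST API, and that the payloads use the access_control_list, all_permissions, permission_level, and autotermination_minutes fields. It also assumes that a higher permission level includes the lower ones. The function names are hypothetical.

```python
# Permission levels that, by assumption, satisfy each requirement.
# Assumption: higher cluster permission levels include the lower ones.
SATISFIED_BY = {
    "CAN_ATTACH_TO": {"CAN_ATTACH_TO", "CAN_RESTART", "CAN_MANAGE"},
    "CAN_RESTART": {"CAN_RESTART", "CAN_MANAGE"},
}


def principal_levels(permissions, principal):
    """Collect the permission levels granted to one user, group, or service principal."""
    levels = set()
    for entry in permissions.get("access_control_list", []):
        names = (
            entry.get("user_name"),
            entry.get("group_name"),
            entry.get("service_principal_name"),
        )
        if principal in names:
            for perm in entry.get("all_permissions", []):
                levels.add(perm.get("permission_level"))
    return levels


def missing_compute_permissions(permissions, principal):
    """Return the required compute permissions that the principal lacks."""
    levels = principal_levels(permissions, principal)
    return sorted(
        required
        for required, accepted in SATISFIED_BY.items()
        if not (levels & accepted)
    )


def auto_terminates(cluster):
    """True when the cluster stops after a period of inactivity (0 means never)."""
    return int(cluster.get("autotermination_minutes", 0)) > 0
```

An empty list from missing_compute_permissions means that both compute permissions are covered. A False result from auto_terminates suggests that the cluster might keep running after synchronization finishes.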

In your Collibra environment

Lineage enablement

  • Technical lineage via Edge is enabled in your Collibra environment.
  • You are using Collibra Platform 2024.02 or newer.
  • Be sure to review the Supported transformation details topic to understand the lineage information Collibra Data Lineage ingests from Databricks Unity Catalog.
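
The version requirement above can be checked with a short comparison. The following is a minimal sketch that assumes the version is available as a dotted numeric string such as "2024.02"; the function names are hypothetical and are not part of any Collibra API.

```python
MINIMUM_VERSION = "2024.02"  # minimum Collibra Platform version stated above


def parse_version(version):
    """Turn a dotted numeric string such as '2024.02' into a tuple of integers."""
    return tuple(int(part) for part in version.strip().split("."))


def meets_minimum_version(version, minimum=MINIMUM_VERSION):
    """True when the version is the minimum version or newer."""
    return parse_version(version) >= parse_version(minimum)
```

Comparing integer tuples instead of raw strings avoids ordering mistakes such as "2024.10" sorting before "2024.2".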

Edge

Network and proxy configuration

  • Edge can connect to all Collibra Data Lineage service instances in your geographic location.
  • Optionally, you've connected to a proxy server.
  • Optionally, you use a custom certificate to allow the Edge capability to connect to your data source. In this case, you've saved the certificate as "ca.pem" in the same directory as the Edge site installer. If you saved the certificate in another directory, use the --ca argument in the Edge site installation command.
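
The certificate rule in the last bullet can be sketched as a small helper that decides whether the --ca argument is needed. This is a minimal sketch: the function name is hypothetical, and it assumes that --ca takes the certificate path as a separate value. Check the Edge site installation instructions for the exact syntax.

```python
from pathlib import Path


def ca_arguments(installer_dir, certificate_path=None):
    """Return the extra installer arguments needed for a custom certificate.

    Assumption: the --ca argument takes the certificate path as its value.
    """
    if certificate_path is None:
        return []  # no custom certificate in use

    installer_dir = Path(installer_dir).resolve()
    certificate = Path(certificate_path).resolve()
    if not certificate.is_file():
        raise FileNotFoundError(f"Certificate not found: {certificate}")

    if certificate == installer_dir / "ca.pem":
        return []  # default name and location, no argument needed
    return ["--ca", str(certificate)]
```

The returned list is empty when the certificate is saved as ca.pem next to the installer, and contains the --ca argument and the path otherwise.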

Collibra permissions

As a technical lineage user, you can connect to Collibra Data Lineage by using the basic or OAuth authentication method. If you use the basic authentication method, ensure that you have the Catalog Author global role with the following global permissions. The username you use as the technical lineage user must match the value you entered in the DGC user name field when you enabled technical lineage via Edge.

Note: If you use the OAuth authentication method, these requirements and the following permissions do not apply.
  • Catalog > Advanced Data Type > Add
  • Catalog > Advanced Data Type > Remove
  • Catalog > Advanced Data Type > Update
  • Catalog > Technical lineage

As a Data Catalog user, ensure that your Edge integration engineer global role has the following global permissions. With these permissions, you can create connections and capabilities on Edge, configure the integration, and synchronize the integration.

  • Manage connections and capabilities
  • View Edge connections and capabilities
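
The role and permission requirements above can be expressed as a simple checklist. The following is a minimal sketch that compares a list of granted global permissions against the permissions named in this topic. The function name and the "basic" and "oauth" labels are hypothetical, and the granted permissions are assumed to be collected by hand from the Collibra settings, not retrieved through an API.

```python
# Global permissions per global role, as listed in this topic.
REQUIRED_PERMISSIONS = {
    "Catalog Author": {
        "Catalog > Advanced Data Type > Add",
        "Catalog > Advanced Data Type > Remove",
        "Catalog > Advanced Data Type > Update",
        "Catalog > Technical lineage",
    },
    "Edge integration engineer": {
        "Manage connections and capabilities",
        "View Edge connections and capabilities",
    },
}


def missing_permissions(role, granted, auth_method="basic"):
    """Return the permissions the role still needs, sorted by name."""
    if auth_method not in ("basic", "oauth"):
        raise ValueError(f"Unknown authentication method: {auth_method}")
    if role not in REQUIRED_PERMISSIONS:
        raise ValueError(f"Unknown role: {role}")

    # With OAuth, the Catalog Author requirements do not apply.
    if role == "Catalog Author" and auth_method == "oauth":
        return []
    return sorted(REQUIRED_PERMISSIONS[role] - set(granted))
```

An empty result means that the role has every permission listed for it in this topic.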

To add an Edge capability:

To synchronize technical lineage: