Synchronize Databricks Unity Catalog

Updated: April 16, 2026

After Edge is ready to integrate Databricks Unity Catalog, you can start the metadata synchronization process. Synchronizing Databricks Unity Catalog is the process of integrating metadata from the databases connected to Databricks Unity Catalog and making this metadata available in Collibra.

You can either synchronize manually or automate the process by adding a synchronization schedule.

Tip

You can also integrate AI models using the Databricks Unity Catalog for AI integration. If you want to integrate Databricks AI models, ensure that AI Governance is enabled. Without AI Governance, the AI integration functionality is limited. For more information, go to the Databricks Unity Catalog for AI documentation.
If you're looking for information on how to create technical lineage for Databricks Unity Catalog via Edge, go to the technical lineage for Databricks Unity Catalog documentation.

Prerequisites

In your Collibra environment:

You have added the Databricks Unity Catalog capability for the connection.
For metadata synchronization, you know the System asset to use to add the Databricks Unity Catalog assets.
- If you have registered Databricks databases via the Databricks JDBC driver before, use the same System asset.
- If you never registered Databricks databases before, create a new System asset manually and use that one.
- If you will be creating multiple Databricks Unity Catalog integration capabilities, use a unique System asset for each one.
You have a resource role with the Configure external system resource permission, for example, Owner.
You have a global role with the Catalog global permission, for example, Catalog Author.
You have a global role with the View Edge connections and capabilities global permission, for example, Edge integration engineer.

Steps

On the main toolbar, click → Catalog.
The Catalog homepage opens.
In the tab bar, click Integrations.
The Integrations page opens.
Click the Integration configuration tab.
Locate the Databricks connection that you used when you added the Databricks Unity Catalog capability and click the link in the Capabilities column.
The synchronization configuration page opens.
In the Metadata inbound tab, click the Edit icon.
In Ingestion type, select Metadata ingestion to integrate only metadata from Databricks Unity Catalog.
AI model ingestion only and combined Metadata and AI model ingestion are deprecated from this tab. To integrate Databricks AI models, go to the Databricks Unity Catalog for AI documentation instead.

Complete the fields as needed.

Field	Action	Required
System	In System, select the System asset in which you want to link the Databricks assets. You can update the System asset in the Databricks Unity Catalog integration synchronization, if needed. However, if you change the System asset, don't reuse a previously used System asset for the integration. For example: `sys1` > `sys2` > ... > `sysN` > `sys1`.	Yes
Default asset status (Deprecated)	In Default asset status, select how you want to set the status of the synchronized assets. The possible values are: Implemented: Implemented means that all assets receive the "Implemented" status. No Status: No status means that newly created assets receive the first status listed in your Operating Model statuses, and that existing assets keep their assigned status. This field for applying the default asset status in the synchronization configuration is deprecated and will be removed in the future. You can define the default status using the Default Asset Status field in the capability configuration instead. Ensure that the value in this field matches the one in the capability configuration, as this field still takes precedence over the capability value.	No
Ingestion mapping version	Select one of the following values to determine how schemas are integrated into domains: V0 All schemas are included in domain mappings. Schemas that are not explicitly mapped must be excluded manually using domain exclude mapping. V1 Only schemas explicitly mapped are included. All other schemas are automatically excluded. With this option, you do not need to exclude schemas you do not want to integrate. Example If you specify the following mappings in the Domain include mappings field: Path `Orders` and domain `Domain A` Path `Orders > fk` and domain `Domain B` `Order` is the database, and `fk` is any schema with a name that starts with `fk`. If you select `V0`, the Order database and all other schemas that do not start with `fk` are integrated into `Domain A`. All schemas starting with `fk` are integrated into `Domain B`. You must manually exclude any additional schemas you do not want. If you select `V1`, the Order database is integrated into `Domain A`, and all schemas starting with `fk` are integrated into `Domain B`. Schemas not explicitly mapped are automatically excluded, so you do not need to specify Domain exclude mappings.	No
Domain include mappings	Optionally, in Domain include mappings, specify the databases and schemas that you want to integrate. Optionally again, also specify the Collibra domains where they need to be added. This means you can use this field to limit the databases and schemas you want to integrate and define where they need to be added. Important If you don't define include any mappings, the integration automatically creates new domains for each Database and Schema asset in the same community as the System asset. For more information, go to Integrated Databricks Unity Catalog data. If you include a path but don't define a domain, the integration automatically creates new domains in the same community as the System asset. If you selected `V0` for the Version field and add a domain include mapping for the database but not for a related schema, the automatically created domain for the schema is added in the same community as the domain of the database. A match with a schema has priority over a match with a database. To limit the scope of metadata ingestion to specific domains in Collibra, add a domain include mapping: Click Add domain include mappings. In Path, add the path to the databases and schemas in Databricks Unity Catalog for which you want to integrate the metadata. You can use the question mark (`?`) and asterisk (``) wildcards in the catalog and schema names. If a catalog or schema matches multiple lines, the most detailed match is taken into account. Optionally, in Domain, select the Collibra domain in which you want to integrate the metadata. If you don't define a domain, the integration automatically creates new domains in the same community as the System asset. Example Show examples Path `Orders` and domain `Domain B` In this case, the Orders Database asset and all its related assets will be integrated in Domain B. Path `Orders > fk` and domain `Domain B` In this case, the Orders Database asset will be integrated into its own domain in the same community as the System asset. All schemas that start with fk and their related assets will be integrated in Domain B. Path `Orders > ` and domain `Domain B` In this case, the Orders Database asset will be integrated in the same domain as the System asset. All schemas in the Orders catalog and their related assets will be integrated in Domain B. Show full scenario You have a database Orders that includes multiple schemas. If you want to make sure that the Orders database and related schemas are added to Domain B, add the following include mappings: Path `Orders` and domain `Domain B`, to make sure the Database asset is added to Domain B. Path `Orders > ` and domain `Domain B`, to make sure all Schema assets in Orders are added to Domain B. If you want to make sure that the Orders database and related schemas are added to domain B, except for the schemas that start with `test_`, add the following include mappings: Path `Orders` and domain `Domain B`, to make sure the Database asset is added to Domain B. Path `Orders > test_` and domain `Domain C`, to make sure that all schemas in Order that start with test_ are added to Domain C. Path `Orders > ` and domain `Domain B`, to make sure all other Schema assets in Orders are added to Domain B.	No
Domain exclude mappings	Optionally, in Domain exclude mappings, specify the path to databases and schemas in Databricks Unity Catalog that you don't want to integrate. Note The exclude mapping has priority over the include mapping. To exclude specific metadata from being ingested into Collibra, add a domain exclude mapping: Click Add domain exclude mappings. In the field, add the path to the databases and schemas in Databricks Unity Catalog that you want to exclude. You can use the question mark (`?`) and asterisk (``) wildcards in the catalog and schema names. For example: ` > test`.	No
Extensible properties mappings	Via the Extensible properties mapping field, Databricks Unity Catalog allows you to add additional properties to Catalog, Schema, and Table objects. Optionally, in Extensible properties mapping, specify the additional default system properties or custom properties that you want to integrate from Databricks Unity Catalog into Collibra. You can integrate most values from the Details page from Catalog, Schema, Table, and View objects into specific attributes in Collibra assets. You can do this by adding the mapping between the fields for the objects in Databricks Unity Catalog and the Collibra attribute. Important If you use this feature, make sure to add any custom attributes/characteristics, as needed, to the asset type assignment. The name of the property starts with the object type, for example `catalogs.systemAttributes.metastore_id`. `catalogs` refers to Database assets, `schemas` to Schema assets, `table` to Table assets, and `views` to Database View assets. The following system properties are supported: Catalogs: "browse_only", "catalog_type", "connection_name", "created_at", "created_by", "isolation_mode", "metastore_id", "provider_name", "provisioning_info", "securable_kind", "securable_type", "share_name", "storage_location", "storage_root", "updated_at" , and "updated_by". Schemas: "catalog_type", "created_at", "created_by", "metastore_id", "securable_type", "securable_kind", "storage_location", "storage_root", "updated_at", and "updated_by". Table: "access_point", "catalog_name", "created_at", "created_by", "data_access_configuration_id", "data_source_format", "deleted_at", "metastore_id", "schema_name", "securable_type", "securable_kind", "sql_path", "storage_credential_name", "storage_location", "table_type", "updated_at", "updated_by", and "view_definition". Views: "access_point", "catalog_name", "created_at", "created_by", "data_access_configuration_id", "data_source_format", "deleted_at", "metastore_id", "schema_name", "securable_type", "securable_kind", "sql_path", "storage_credential_name", "storage_location", "table_type", "updated_at", "updated_by", and "view_definition". To add an additional property mapping: Click Add another mapping. In Property name, do one of the following: To add a system attribute, select the Databricks Unity Catalog property name from the dropdown list. To add a custom attribute, type the name of the custom property manually. Use the following naming convention: `[object type].customParameters.[name of parameter]`. For example: catalogs.customParameters.Parameter1 schemas.customParameters.catalogAndNamespace.part.1 table.customParameters.view.catalogAndNamespace.part.2 views.customParameters.Paramerer2 In Attribute, select the attribute in which you want to see the value.	No
Stop compute resource	This field is important if the Compute Resource HTTP Path field was completed in the Databricks Unity Catalog capability to allow for source tag integration. If this field is set to Yes, the compute resource in Databricks Unity Catalog will be stopped right after the source tags are extracted. If this field is set to No, the compute resource remains active. Tip To prevent clusters from running for the entire synchronization duration, you can also configure the Terminate after ... minutes of inactivity setting in Databricks. The setting ensures that clusters automatically stop after a period of inactivity. For more information, go to the Databricks documentation.	No
Ingest metric views (in preview)	Optionally, ingest metric views from Databricks Unity Catalog as dedicated assets. This option is turned off by default. Note When this feature is generally available, Databricks metric views will be ingested as Metric View assets in Collibra by default. The option to ingest metric views will be removed from the integration settings. If you're also using the Databricks JDBC driver, Metric View assets are initially ingested as Database View assets via JDBC ingestion: When the same assets are ingested via the Databricks Unity Catalog integration with the Ingest metric views option selected, these are automatically converted into Metric View assets. If you do not select the Ingest metric view option and run the synchronization, the same assets are automatically converted to Table assets.	No

Important When you integrate a data source without applying include or exclude mappings rules, and later on, exclude a registered asset using an include or exclude mapping during resynchronization, the related assets receive the Missing from Source status.

Click Save.
Click Synchronize.
A notification indicates the synchronization has started.

On the main toolbar, click → Catalog.
The Catalog homepage opens.
In the tab bar, click Integrations.
The Integrations page opens.
Click the Integration Configuration tab.
Locate the Databricks connection that you used when you added the Databricks Unity Catalog capability and click the link in the Capabilities column.
The synchronization configuration page opens.
In the Metadata Inbound tab, click the Edit icon.
In Ingestion Type, select Metadata ingestion to integrate only metadata from Databricks Unity Catalog.
AI model ingestion only and combined Metadata and AI model ingestion are deprecated from this tab. To integrate Databricks AI models, go to the Databricks Unity Catalog for AI documentation instead.

Complete the fields as needed.

Field	Action	Required
System	In System, select the System asset in which you want to link the Databricks assets. You can update the System asset in the Databricks Unity Catalog integration synchronization, if needed. However, if you change the System asset, don't reuse a previously used System asset for the integration. For example: `sys1` > `sys2` > ... > `sysN` > `sys1`.	Yes
Default asset status (Deprecated)	In Default asset status, select how you want to set the status of the synchronized assets. The possible values are: Implemented: Implemented means that all assets receive the "Implemented" status. No Status: No status means that newly created assets receive the first status listed in your Operating Model statuses, and that existing assets keep their assigned status. This field for applying the default asset status in the synchronization configuration is deprecated and will be removed in the future. You can define the default status using the Default Asset Status field in the capability configuration instead. Ensure that the value in this field matches the one in the capability configuration, as this field still takes precedence over the capability value.	No
Ingestion mapping version	Select one of the following values to determine how schemas are integrated into domains: V0 All schemas are included in domain mappings. Schemas that are not explicitly mapped must be excluded manually using domain exclude mapping. V1 Only schemas explicitly mapped are included. All other schemas are automatically excluded. With this option, you do not need to exclude schemas you do not want to integrate. Example If you specify the following mappings in the Domain include mappings field: Path `Orders` and domain `Domain A` Path `Orders > fk` and domain `Domain B` `Order` is the database, and `fk` is any schema with a name that starts with `fk`. If you select `V0`, the Order database and all other schemas that do not start with `fk` are integrated into `Domain A`. All schemas starting with `fk` are integrated into `Domain B`. You must manually exclude any additional schemas you do not want. If you select `V1`, the Order database is integrated into `Domain A`, and all schemas starting with `fk` are integrated into `Domain B`. Schemas not explicitly mapped are automatically excluded, so you do not need to specify Domain exclude mappings.	No
Domain include mappings	Optionally, in Domain include mappings, specify the databases and schemas that you want to integrate. Optionally again, also specify the Collibra domains where they need to be added. This means you can use this field to limit the databases and schemas you want to integrate and define where they need to be added. Important If you don't define include any mappings, the integration automatically creates new domains for each Database and Schema asset in the same community as the System asset. For more information, go to Integrated Databricks Unity Catalog data. If you include a path but don't define a domain, the integration automatically creates new domains in the same community as the System asset. If you selected `V0` for the Version field and add a domain include mapping for the database but not for a related schema, the automatically created domain for the schema is added in the same community as the domain of the database. A match with a schema has priority over a match with a database. To limit the scope of metadata ingestion to specific domains in Collibra, add a domain include mapping: Click Add domain include mappings. In Path, add the path to the databases and schemas in Databricks Unity Catalog for which you want to integrate the metadata. You can use the question mark (`?`) and asterisk (``) wildcards in the catalog and schema names. If a catalog or schema matches multiple lines, the most detailed match is taken into account. Optionally, in Domain, select the Collibra domain in which you want to integrate the metadata. If you don't define a domain, the integration automatically creates new domains in the same community as the System asset. Example Show examples Path `Orders` and domain `Domain B` In this case, the Orders Database asset and all its related assets will be integrated in Domain B. Path `Orders > fk` and domain `Domain B` In this case, the Orders Database asset will be integrated into its own domain in the same community as the System asset. All schemas that start with fk and their related assets will be integrated in Domain B. Path `Orders > ` and domain `Domain B` In this case, the Orders Database asset will be integrated in the same domain as the System asset. All schemas in the Orders catalog and their related assets will be integrated in Domain B. Show full scenario You have a database Orders that includes multiple schemas. If you want to make sure that the Orders database and related schemas are added to Domain B, add the following include mappings: Path `Orders` and domain `Domain B`, to make sure the Database asset is added to Domain B. Path `Orders > ` and domain `Domain B`, to make sure all Schema assets in Orders are added to Domain B. If you want to make sure that the Orders database and related schemas are added to domain B, except for the schemas that start with `test_`, add the following include mappings: Path `Orders` and domain `Domain B`, to make sure the Database asset is added to Domain B. Path `Orders > test_` and domain `Domain C`, to make sure that all schemas in Order that start with test_ are added to Domain C. Path `Orders > ` and domain `Domain B`, to make sure all other Schema assets in Orders are added to Domain B.	No
Domain exclude mappings	Optionally, in Domain exclude mappings, specify the path to databases and schemas in Databricks Unity Catalog that you don't want to integrate. Note The exclude mapping has priority over the include mapping. To exclude specific metadata from being ingested into Collibra, add a domain exclude mapping: Click Add domain exclude mappings. In the field, add the path to the databases and schemas in Databricks Unity Catalog that you want to exclude. You can use the question mark (`?`) and asterisk (``) wildcards in the catalog and schema names. For example: ` > test`.	No
Extensible properties mappings	Via the Extensible properties mapping field, Databricks Unity Catalog allows you to add additional properties to Catalog, Schema, and Table objects. Optionally, in Extensible properties mapping, specify the additional default system properties or custom properties that you want to integrate from Databricks Unity Catalog into Collibra. You can integrate most values from the Details page from Catalog, Schema, Table, and View objects into specific attributes in Collibra assets. You can do this by adding the mapping between the fields for the objects in Databricks Unity Catalog and the Collibra attribute. Important If you use this feature, make sure to add any custom attributes/characteristics, as needed, to the asset type assignment. The name of the property starts with the object type, for example `catalogs.systemAttributes.metastore_id`. `catalogs` refers to Database assets, `schemas` to Schema assets, `table` to Table assets, and `views` to Database View assets. The following system properties are supported: Catalogs: "browse_only", "catalog_type", "connection_name", "created_at", "created_by", "isolation_mode", "metastore_id", "provider_name", "provisioning_info", "securable_kind", "securable_type", "share_name", "storage_location", "storage_root", "updated_at" , and "updated_by". Schemas: "catalog_type", "created_at", "created_by", "metastore_id", "securable_type", "securable_kind", "storage_location", "storage_root", "updated_at", and "updated_by". Table: "access_point", "catalog_name", "created_at", "created_by", "data_access_configuration_id", "data_source_format", "deleted_at", "metastore_id", "schema_name", "securable_type", "securable_kind", "sql_path", "storage_credential_name", "storage_location", "table_type", "updated_at", "updated_by", and "view_definition". Views: "access_point", "catalog_name", "created_at", "created_by", "data_access_configuration_id", "data_source_format", "deleted_at", "metastore_id", "schema_name", "securable_type", "securable_kind", "sql_path", "storage_credential_name", "storage_location", "table_type", "updated_at", "updated_by", and "view_definition". To add an additional property mapping: Click Add another mapping. In Property name, do one of the following: To add a system attribute, select the Databricks Unity Catalog property name from the dropdown list. To add a custom attribute, type the name of the custom property manually. Use the following naming convention: `[object type].customParameters.[name of parameter]`. For example: catalogs.customParameters.Parameter1 schemas.customParameters.catalogAndNamespace.part.1 table.customParameters.view.catalogAndNamespace.part.2 views.customParameters.Paramerer2 In Attribute, select the attribute in which you want to see the value.	No
Stop compute resource	This field is important if the Compute Resource HTTP Path field was completed in the Databricks Unity Catalog capability to allow for source tag integration. If this field is set to Yes, the compute resource in Databricks Unity Catalog will be stopped right after the source tags are extracted. If this field is set to No, the compute resource remains active. Tip To prevent clusters from running for the entire synchronization duration, you can also configure the Terminate after ... minutes of inactivity setting in Databricks. The setting ensures that clusters automatically stop after a period of inactivity. For more information, go to the Databricks documentation.	No
Ingest metric views (in preview)	Optionally, ingest metric views from Databricks Unity Catalog as dedicated assets. This option is turned off by default. Note When this feature is generally available, Databricks metric views will be ingested as Metric View assets in Collibra by default. The option to ingest metric views will be removed from the integration settings. If you're also using the Databricks JDBC driver, Metric View assets are initially ingested as Database View assets via JDBC ingestion: When the same assets are ingested via the Databricks Unity Catalog integration with the Ingest metric views option selected, these are automatically converted into Metric View assets. If you do not select the Ingest metric view option and run the synchronization, the same assets are automatically converted to Table assets.	No

Important When you integrate a data source without applying Include or Exclude Mappings rules, and then later exclude a registered asset using an Include or Exclude Mapping during resynchronization, the related assets receive the Missing from Source status.

Click Save.
In the Synchronization Schedule section, click the Add synchronization schedule icon.

Enter the information.

Field	Description
Repeat	The interval when you want to synchronize automatically. The possible values are: Daily, Weekly, Monthly, and Cron expression.
Cron	The Quartz Cron expression that determines when the synchronization takes place. This field is only visible if you select `Cron expression` in the Repeat field.
Every	The day on which you want to synchronize, for example, Sunday. This field is only visible if you select `Weekly` in the Repeat field.
Every first	The day of the month on which you want to synchronize, for example, Tuesday. This field is only visible if you select `Monthly` in the Repeat field.
At	The time at which you want to synchronize automatically, for example, 14:00. You can only schedule on the hour. For example, you can add a synchronization schedule at 8:00, but not at 8:45. This field is only visible if you select `Daily`, `Weekly`, or `Monthly` in the Repeat field.
Time zone	The time zone for the schedule.

Click Save

What's next

The synchronization job integrates the metadata of databases, schemas, tables, database views, and columns from Databricks Unity Catalog.

After the synchronization:

You can view a summary of the results from the Activities list.
For metadata synchronization, the resulting assets get a relation to the System asset that you selected.
For information on the integrated data, go to Integrated Databricks Unity Catalog data.
You can set up and configure data profiling and configure the profiling options and profile the data,
You can enable and set up Unified Data Classification and automatically classify the data.
You can set up and configure the use of sample data and request sample data.