Steps overview: Integrate Microsoft Fabric
The Microsoft Fabric integration follows a dual-path ingestion model.
- Path 1: The Fabric synchronization capability uses Microsoft APIs to ingest the overarching Fabric workspaces as well as lakehouse and warehouse assets.
- Path 2: The JDBC connection handles the SQL Server metadata and profiling.
To integrate metadata from Fabric down to the column level, first synchronize the Fabric integration and then synchronize the schemas of the registered SQL Server databases. You can integrate metadata only, or integrate metadata and also profile the data, classify it, and request sample data.
Complete the steps listed below to integrate Microsoft Fabric.
Path 1: Integrate Fabric metadata
| No. | Step | Description |
|---|---|---|
| 1 | Set up service principal access for Microsoft Fabric. | Sets up a service principal and grants the service principal the required permissions for integration with Fabric. |
| 2 | Create an Azure connection to Edge. | Creates a connection to Azure in your Edge or Collibra Cloud site. |
| 3 | Add the Microsoft Fabric synchronization capability to Edge. | Adds the Microsoft Fabric synchronization capability to the Azure Edge connection. The capability allows you to retrieve metadata from Fabric. |
| 4 | Synchronize Microsoft Fabric. | You can manually synchronize Fabric or add a synchronization schedule to automatically synchronize it. As a result, the Fabric metadata is integrated. |
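
Before adding the synchronization capability, it can help to confirm that the service principal from step 1 can reach Fabric at all. The following is a minimal sketch and not part of the Collibra product. It builds a client-credentials token request against the Microsoft Entra ID v2.0 token endpoint and a List Workspaces request against the Fabric REST API, using only the Python standard library. The environment variable names (`FABRIC_TENANT_ID`, `FABRIC_CLIENT_ID`, `FABRIC_CLIENT_SECRET`) and the helper function names are assumptions chosen for illustration.

```python
import json
import os
import urllib.parse
import urllib.request

# Public Microsoft endpoints; the tenant ID is filled in at call time.
TOKEN_URL = "https://login.microsoftonline.com/{tenant}/oauth2/v2.0/token"
FABRIC_SCOPE = "https://api.fabric.microsoft.com/.default"
WORKSPACES_URL = "https://api.fabric.microsoft.com/v1/workspaces"


def build_token_request(tenant_id, client_id, client_secret):
    """Build the OAuth 2.0 client-credentials request for the service principal."""
    body = urllib.parse.urlencode({
        "grant_type": "client_credentials",
        "client_id": client_id,
        "client_secret": client_secret,
        "scope": FABRIC_SCOPE,
    }).encode("utf-8")
    return urllib.request.Request(
        TOKEN_URL.format(tenant=tenant_id), data=body, method="POST"
    )


def build_list_workspaces_request(access_token):
    """Build the Fabric REST API request that lists visible workspaces."""
    return urllib.request.Request(
        WORKSPACES_URL,
        headers={"Authorization": f"Bearer {access_token}"},
        method="GET",
    )


def list_workspace_names(tenant_id, client_id, client_secret):
    """Return the display names of the first page of visible workspaces."""
    token_request = build_token_request(tenant_id, client_id, client_secret)
    with urllib.request.urlopen(token_request) as resp:
        token = json.load(resp)["access_token"]
    with urllib.request.urlopen(build_list_workspaces_request(token)) as resp:
        payload = json.load(resp)
    # Pagination (continuation tokens) is deliberately ignored in this sketch.
    return [ws["displayName"] for ws in payload.get("value", [])]


if __name__ == "__main__":
    # Hypothetical variable names; adapt them to your own secret handling.
    keys = ("FABRIC_TENANT_ID", "FABRIC_CLIENT_ID", "FABRIC_CLIENT_SECRET")
    creds = [os.environ.get(key) for key in keys]
    if all(creds):
        for name in list_workspace_names(*creds):
            print(name)
    else:
        print("Set " + ", ".join(keys) + " to run the check.")
```

If the script prints the workspaces you expect to integrate, the service principal can likely be used for the steps above. An HTTP 401 or 403 error, or an empty list, suggests that the permissions from step 1 need another look.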
Path 2: Integrate SQL Server metadata
| No. | Step | Description |
|---|---|---|
| 1 | Create a JDBC connection for SQL Server. | Creates a JDBC connection to SQL Server in an Edge or Collibra Cloud site. If you created a SQL Server JDBC connection previously, you can use that JDBC connection. The only identity provider supported for authenticating to SQL databases in Fabric is Microsoft Entra ID. When setting up the JDBC connection in Collibra, for AuthScheme, you must use … Note: Create a JDBC connection for each database you want to integrate. |
| 2 | Synchronize schemas for the registered SQL Server databases. | Synchronizes schemas to update the metadata of the registered SQL Server databases in Collibra Platform. You can synchronize a schema manually or automatically at fixed intervals. This step is required for the Fabric integration to collect metadata from Fabric down to the column level. |
| 3 | Optionally, set up and configure data profiling. | Goes through the required permissions and steps to prepare Edge and Collibra to profile columns in Fabric. |
| 4 | Optionally, enable and set up Unified Data Classification. | Goes through the required permissions and steps to prepare Edge and Collibra to classify columns in Fabric using the Unified Data Classification method. |
| 5 | Optionally, set up and configure the use of sample data. | Goes through the required permissions and steps to prepare Edge and Collibra to show sample data for columns in Fabric. |
| | Result | Users with the correct permissions can now configure the profiling options and profile the data, automatically classify the data, or request sample data. |
Integration workflow
The following graphic shows the process of integrating Fabric, profiling and classifying the data, and requesting sample data.