Create an Azure Data Lake Storage connection to an Edge site
Before you begin
- In Azure:
- To integrate ADLS folders, you need an Azure Service Principal user that is defined in Azure and that has permissions to list the files which need to be integrated into Collibra. The Azure Service Principal user must have the "Reader" and "Storage Blob Data Reader" roles for the storage locations of your data. For information, go to the Azure documentation.
- If you use Microsoft Purview:
- The Azure Service Principal user must have the "Data reader" role to fetch entities and assets from the Microsoft Purview REST API. For information, go to the Microsoft Purview documentation.
- If your ADLS storage is private, make sure that the Allow Azure services on the trusted services list to access this storage account checkbox under Networking → Firewalls and virtual networks is selected.
- You have created and installed an Edge site.
- You have given the Edge Site role the required permissions.
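To check that the Service Principal can list the files you want to integrate, one option is to call the ADLS Gen2 REST API "Path - List" operation with a token issued to that principal. The following is a minimal sketch, assuming Python with only the standard library. The function name, account name, file system name, and token are hypothetical placeholders, and the `x-ms-version` value is an assumption that you may need to adjust. The sketch only builds the request; it does not send it.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_list_paths_request(account: str, filesystem: str,
                             access_token: str, directory: str = "") -> Request:
    """Build (but do not send) an ADLS Gen2 "Path - List" request.

    account, filesystem and directory are placeholders for your own
    storage account, container and folder.
    """
    query = {"resource": "filesystem", "recursive": "false"}
    if directory:
        query["directory"] = directory
    url = f"https://{account}.dfs.core.windows.net/{filesystem}?{urlencode(query)}"
    headers = {
        "Authorization": f"Bearer {access_token}",
        # Assumption: any storage service version supported by your account works here.
        "x-ms-version": "2021-08-06",
    }
    return Request(url, headers=headers, method="GET")
```

Passing the returned object to `urllib.request.urlopen` sends the request. A successful response typically contains a JSON `paths` array, while an HTTP 403 response usually suggests that the "Storage Blob Data Reader" role is missing on that storage location.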
Required permissions
- You have a global role that has the Manage connections and capabilities global permission, for example, Edge integration engineer.
Steps
- Open an Edge site.
  - On the main menu, click [icon], and then click Settings.
    The Collibra settings page opens.
  - In the tab pane, click Edge.
    The Sites tab opens and shows a table with an overview of the Edge sites.
  - In the table, click the name of the Edge site whose status is Healthy.
    The Edge site page opens.
- In the Connections section, click Create connection.
  The Create connection page appears.
- Enter the required information.
Connection settings: This section contains the general settings of your connection.

| Field | Description | Required |
| --- | --- | --- |
| Name | The name of the Edge connection for Azure Data Lake Storage. | Yes |
| Description | The description of the connection. | No |
| Connection provider | The connection provider, which determines the available connection parameters. Select the Azure connection to connect to Azure Data Lake Storage. | Yes |

Connection parameters: This section contains the settings to connect to your data source.

| Field | Description | Required |
| --- | --- | --- |
| Service Principal ID | The Application ID of the Azure Service Principal used to connect to Azure. For information on the Azure Service Principal user and the Application ID, go to the Azure documentation. | Yes |
| Service Principal Secret | The application secret for the Service Principal. For information on the application secret value, go to the Azure documentation. | Yes |
| Encryption options | The type of encryption used to store the Service Principal Secret. The default is To be encrypted by Edge management server. | Yes |
| Tenant ID | The Tenant ID of your Azure Active Directory. For information on the Directory (tenant) ID, go to the Azure documentation. | Yes |
- Click Create.
The connection is added to the Edge site.
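The Tenant ID, Service Principal ID, and Service Principal Secret from the connection parameters correspond to the inputs of a standard OAuth 2.0 client-credentials request against the Microsoft identity platform. If you want to sanity-check the values outside Collibra, the following minimal sketch, assuming Python with only the standard library, verifies that the two IDs look like GUIDs and builds the token request without sending it. The function names are hypothetical and not part of Collibra or Edge, and the storage scope is an assumption. How Edge itself authenticates is not described in this topic.

```python
import uuid
from urllib.parse import urlencode
from urllib.request import Request

# Assumption: the token is requested for Azure Storage data access.
STORAGE_SCOPE = "https://storage.azure.com/.default"


def is_guid(value: str) -> bool:
    """Return True if value is a canonical GUID, such as a Tenant ID."""
    try:
        return str(uuid.UUID(value)) == value.lower()
    except ValueError:
        return False


def build_token_request(tenant_id: str, client_id: str, client_secret: str) -> Request:
    """Build (but do not send) a client-credentials token request."""
    if not is_guid(tenant_id):
        raise ValueError("Tenant ID is not a GUID")
    if not is_guid(client_id):
        raise ValueError("Service Principal ID is not a GUID")
    if not client_secret:
        raise ValueError("Service Principal Secret is empty")
    url = f"https://login.microsoftonline.com/{tenant_id}/oauth2/v2.0/token"
    body = urlencode({
        "grant_type": "client_credentials",
        "client_id": client_id,
        "client_secret": client_secret,
        "scope": STORAGE_SCOPE,
    }).encode("ascii")
    return Request(url, data=body, method="POST")
```

Passing the returned object to `urllib.request.urlopen` sends the request. A JSON response that contains an `access_token` field indicates that the three values belong together; an error response usually points to a wrong tenant, application ID, or secret.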
What's next?
You can now add the ADLS synchronization capability to an Edge site.