Airflow: Create an Azure Data Lake Storage connection

Do you use a vault?

You can use a vault to add your data source information to your Edge site connection.

Check the connection property table below to see which information is available for your vault.

Vaults are not available for Collibra Cloud site sites.

No vault
AWS Secrets Manager
Azure Key Vault
CyberArk Vault
Google Secret Manager
HashiCorp Vault
 

Prerequisites

In your Collibra environment

In your Azure environment

Steps

  1. Open a site.
    1. On the main toolbar, click Products iconCogwheel icon Settings.
      The Settings page opens.
    2. In the tab pane, click Edge.
      The Sites tab opens and shows a table with an overview of your sites.
    3. In the table, click the name of the site whose status is Healthy.
      The site page opens.
  2. In the Connections section, click Create connection.
    The Create connection page appears.
  3. Select the Azure connection to connect to Azure Data Lake Storage.
  4. Enter the required information.
    FieldDescriptionRequiredAvailable for vaults?
    Name

    The name of the Edge or Collibra Cloud site connection for Azure Data Lake Storage.

    Yes No
    Description

    The description of the connection.

    No No
    Azure US Government Cloud Host

    Option to indicate that the authentication must go through the government-specific Microsoft Entra authentication endpoint instead of the global Azure endpoint.
    Select this option if you are using a government cloud host.
    For information about cloud hosts, go to the Azure documentation.

    No No
    Vault The vault where you store your data source values. No No
    Service Principal ID

    The Application account ID to connect to the Azure.
    For information on the Azure Service Principal user and the Application ID, go to the Azure documentation.

    Yes Yes
    Service Principal Secret

    The application secret for the Service Principal.
    For information on the application secret value, go to the Azure documentation.

    Yes Yes
    Tenant ID

    The Tenant ID of your Azure Active Directory.
    For information on the Directory (tenant) ID, go to the Azure documentation.

    Yes Yes
  5. Click Create.
    The connection is added to the Edge or Collibra Cloud site.

What's next

Add the Technical Lineage for Airflow - OpenLineage (Cloud) capability for Cloud Storage connections.