Airflow: Create an AWS connection

For Collibra Data Lineage to ingest metadata and generate technical lineage from files stored in Amazon S3, you need to create an AWS connection on an Edge or Collibra Cloud site.

Do you use a vault?

You can use a vault to add your data source information to your Edge site connection.

Check the connection property table below to see which information is available for your vault.

Vaults are not available for Collibra Cloud site sites.

No vault
AWS Secrets Manager
Azure Key Vault
CyberArk Vault
Google Secret Manager
HashiCorp Vault
 

Before you begin

Required permissions

Steps

  1. Open a site.
    1. On the main toolbar, click Products iconCogwheel icon Settings.
      The Settings page opens.
    2. In the tab pane, click Edge.
      The Sites tab opens and shows a table with an overview of your sites.
    3. In the table, click the name of the site whose status is Healthy.
      The site page opens.
  2. In the Connections section, click Create connection.
    The Create connection page appears.
  3. Select the AWS connection to connect to Amazon S3.
  4. Enter the required information.
    FieldDescriptionRequired
    Name

    The name of the Edge or Collibra Cloud site AWS connection.

    Yes
    Description

    The description of the connection.

    No
    Vault The vault where you store your data source values. No
    Authentication type

    The type of authentication you use. Select one of the following values:

    IAM
    Use the AWS Identity and Access Management (IAM) authentication method.
    EC2
    Use this authentication method if your Edge site runs on an AWS EC2 instance with an attached IAM role. This allows Collibra Data Lineage to authenticate securely by using the instance profile, without requiring access keys.
    By selecting this option, ensure that EC2 authentication is configured for the AWS connection.
    Yes
    Access Key ID

    The access key ID of the programmatic AWS user.

    Yes
    Secret Access Key

    The secret access key of the programmatic AWS user.

    Yes
  5. Click Create.
    The connection is added to the Edge or Collibra Cloud site.
    The fields become read-only.

What's next

Add the Technical Lineage for Airflow - OpenLineage (Cloud) capability for Cloud Storage connections.