Deploying Collibra Unstructured AI infrastructure on AWS

Updated: May 12, 2026

Deploying Unstructured AI infrastructure involves setting up the necessary resources in your cloud environment to support advanced AI applications. This page provides instructions for deploying Collibra Unstructured AI infrastructure on Amazon Web Services (AWS). It guides you through the necessary prerequisites, configuration, deployment steps, and troubleshooting, ensuring a successful setup.

Prerequisites

Before deployment, ensure the following resources and configurations are in place.

Expected final state

Resource	Description	Example value
AWS Account	Dedicated subaccount recommended.	`006352514257`
IAM Role	Terraform provisioner role with required permissions.	`UnstructuredTerraformProvisioner`
S3 Bucket	For Terraform state file.	`unstructured-tf-state`
Route53 Hosted Zone	DNS zone for ingress records.	`env-id.company.com`
ACM Certificate	TLS certificate for application domain (must be in `ISSUED` status).	`app.env-id.company.com`
Cognito User Pool	User authentication (app client must have no client secret).	`us-east-1_xxxxxxxx`
LLM Provider	LLM provider used when configuring an LLM profile in the application, such as OpenAI or Amazon Bedrock.	`OpenAI`, `Bedrock`

Required tools

The following tools must be installed on your local machine:

Terraform version v1.12.2 or newer.
AWS CLI version 2.23.6 or newer.
kubectl for cluster verification.
Helm for troubleshooting Helm releases.
Go version 1.25.7 or higher for the configuration tool.

Optional AWS subaccount setup

It is recommended to deploy Collibra Unstructured AI infrastructure in a dedicated AWS subaccount for better resource isolation and management. If you choose to set up a subaccount, follow the AWS Organizations documentation to create a new account under your organization.

IAM provisioner role

Create an IAM role, such as UnstructuredTerraformProvisioner, that Terraform will assume. The role must:

Trust your AWS account (or the specific user or role running Terraform):

Copy

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Principal": {
        "AWS": "arn:aws:iam::<YOUR_ACCOUNT_ID>:root"
      },
      "Action": "sts:AssumeRole"
    }
  ]
}

Have the following inline policy attached:

Important Update the S3 statement Resource ARNs to match the actual Terraform state bucket name.

Note The CoreInfrastructureAndNetworking and AllowCreationWithProjectTag statements use tag-based conditions. The Terraform AWS provider is configured with default_tags to automatically tag all resources with Project = "unstructured". For BYOVPC deployments, you must also tag your existing VPC and subnets with Project = "unstructured". See BYOVPC requirements for more information.

View an example of the IAM policy JSON

Copy

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Sid": "CoreInfrastructureAndNetworking",
            "Effect": "Allow",
            "Action": [
                "ec2:*",
                "elasticloadbalancing:*",
                "eks:*",
                "rds:*",
                "secretsmanager:*",
                "iam:*",
                "kms:*",
                "sqs:*",
                "events:*",
                "logs:*",
                "wafv2:*"
            ],
            "Resource": "*",
            "Condition": {
              "StringEquals": {
                "aws:ResourceTag/Project": "unstructured"
              }
            }
        },
        {
            "Sid": "AllowCreationWithProjectTag",
            "Effect": "Allow",
            "Action": [
                "ec2:*",
                "elasticloadbalancing:*",
                "eks:*",
                "rds:*",
                "secretsmanager:*",
                "iam:*",
                "kms:*",
                "sqs:*",
                "events:*",
                "logs:*",
                "wafv2:*"
            ],
            "Resource": "*",
            "Condition": {
              "StringEquals": {
                "aws:RequestTag/Project": "unstructured"
              }
            }
        },
        {
            "Sid": "DenyProvisionerSelfModification",
            "Effect": "Deny",
            "Action": [
                "iam:AttachRolePolicy",
                "iam:DetachRolePolicy",
                "iam:PutRolePolicy",
                "iam:DeleteRolePolicy",
                "iam:PutRolePermissionsBoundary",
                "iam:DeleteRolePermissionsBoundary",
                "iam:UpdateAssumeRolePolicy",
                "iam:DeleteRole",
                "iam:TagRole",
                "iam:UntagRole",
                "iam:CreatePolicy",
                "iam:DeletePolicy",
                "iam:CreatePolicyVersion",
                "iam:DeletePolicyVersion"
            ],
            "Resource": [
                "arn:aws:iam::*:role/UnstructuredTerraformProvisioner",
                "arn:aws:iam::*:policy/UnstructuredTerraformProvisioner*"
            ]
        },
        {
            "Sid": "GlobalDiscoveryActions",
            "Effect": "Allow",
            "Action": [
                "ec2:Describe*",
                "eks:Describe*",
                "eks:List*",
                "iam:Get*",
                "iam:List*",
                "kms:ListAliases",
                "kms:DescribeKey",
                "sqs:GetQueueAttributes",
                "sqs:GetQueueUrl",
                "sqs:ListQueueTags",
                "rds:Describe*",
                "route53:ListHostedZones",
                "secretsmanager:ListSecrets",
                "ssm:GetParameter",
                "logs:DescribeLogGroups",
                "events:DescribeRule",
                "wafv2:GetWebACL",
                "wafv2:ListWebACLs",
                "wafv2:GetWebACLForResource",
                "wafv2:ListTagsForResource",
                "wafv2:DescribeManagedRuleGroup",
                "wafv2:ListAvailableManagedRuleGroups",
                "wafv2:CheckCapacity"
            ],
            "Resource": "*"
        },
        {
            "Sid": "PassRole",
            "Effect": "Allow",
            "Action": "iam:PassRole",
            "Resource": [
                "arn:aws:iam::*:role/unstructured-*",
                "arn:aws:iam::*:role/KarpenterController-*"
            ]
        },
        {
            "Sid": "ServiceLinkedRoles",
            "Effect": "Allow",
            "Action": [
                "iam:CreateServiceLinkedRole"
            ],
            "Resource": "*"
        },
        {
            "Sid": "AutoScalingGroupManagement",
            "Effect": "Allow",
            "Action": [
                "autoscaling:*"
            ],
            "Resource": "*",
            "Condition": {
                "StringEquals": {
                    "ec2:ResourceTag/eks:cluster-name": "unstructured-eks-cluster"
                }
            }
        },
        {
            "Sid": "AutoScalingGroupCreation",
            "Effect": "Allow",
            "Action": [
                "autoscaling:*"
            ],
            "Resource": "*",
            "Condition": {
                "StringEquals": {
                    "aws:RequestTag/eks:cluster-name": "unstructured-eks-cluster"
                }
            }
        },
        {
            "Sid": "Route53DNSManagement",
            "Effect": "Allow",
            "Action": [
                "route53:GetHostedZone",
                "route53:ChangeResourceRecordSets",
                "route53:GetChange",
                "route53:ListResourceRecordSets",
                "route53:ListTagsForResource"
            ],
            "Resource": [
                "arn:aws:route53:::hostedzone/<YOUR_HOSTED_ZONE_ID>",
                "arn:aws:route53:::change/*"
            ]
        },
        {
            "Sid": "Misc",
            "Effect": "Allow",
            "Action": [
                "ec2:RunInstances",
                "ec2:DisassociateAddress",
                "ec2:ReleaseAddress",
                "acm:DescribeCertificate",
                "acm:ListCertificates",
                "acm:GetCertificate",
                "acm:ListTagsForCertificate",
                "cognito-idp:DescribeUserPoolClient",
                "ecr:GetAuthorizationToken"
            ],
            "Resource": "*"
        },
        {
            "Sid": "KMSAliasManagement",
            "Effect": "Allow",
            "Action": [
                "kms:CreateAlias",
                "kms:DeleteAlias",
                "kms:ListAliases"
            ],
            "Resource": [
                "arn:aws:kms:*:*:alias/eks/unstructured-eks-cluster",
                "arn:aws:kms:*:*:key/*"
            ]
        },
        {
            "Sid": "WAFv2WebACLManagement",
            "Effect": "Allow",
            "Action": [
                "wafv2:CreateWebACL",
                "wafv2:UpdateWebACL"
            ],
            "Resource": [
                "arn:aws:wafv2:us-east-1:<YOUR_ACCOUNT_ID>:regional/webacl/unstructured-web-acl/*",
                "arn:aws:wafv2:us-east-1:*:regional/managedruleset/*/*"
            ]
        },
        {
            "Sid": "KMSDefaultServiceKeys",
            "Effect": "Allow",
            "Action": [
                "kms:CreateGrant",
                "kms:DescribeKey",
                "kms:GenerateDataKey",
                "kms:Decrypt"
            ],
            "Resource": "*",
            "Condition": {
                "StringLike": {
                    "kms:ViaService": [
                        "rds.*.amazonaws.com",
                        "ec2.*.amazonaws.com"
                    ]
                }
            }
        },
        {
            "Sid": "ECRCrossAccountPull",
            "Effect": "Allow",
            "Action": [
                "ecr:BatchGetImage",
                "ecr:GetDownloadUrlForLayer",
                "ecr:BatchCheckLayerAvailability",
                "ecr:DescribeRepositories",
                "ecr:ListImages",
                "ecr:DescribeImages"
            ],
            "Resource": "arn:aws:ecr:us-east-1:139228973453:repository/release/*"
        },
        {
            "Sid": "ECRLocalRegistryManagement",
            "Effect": "Allow",
            "Action": [
                "ecr:CreateRepository",
                "ecr:DescribeRepositories",
                "ecr:ListImages",
                "ecr:DescribeImages",
                "ecr:BatchGetImage",
                "ecr:GetDownloadUrlForLayer",
                "ecr:BatchCheckLayerAvailability",
                "ecr:InitiateLayerUpload",
                "ecr:UploadLayerPart",
                "ecr:CompleteLayerUpload",
                "ecr:PutImage"
            ],
            "Resource": "arn:aws:ecr:*:<YOUR_ACCOUNT_ID>:repository/unstructured/*"
        },
        {
            "Sid": "S3",
            "Effect": "Allow",
            "Action": [
                "s3:ListBucket",
                "s3:GetBucketLocation",
                "s3:GetObject",
                "s3:PutObject",
                "s3:DeleteObject"
            ],
            "Resource": [
                "arn:aws:s3:::<YOUR_S3_BUCKET>",
                "arn:aws:s3:::<YOUR_S3_BUCKET>/*"
            ]
        }
    ]
}

Instructions to create prerequisite resources

S3 backend for Terraform state

Create an S3 bucket for storing Terraform state:

Copy

aws s3api create-bucket \
  --bucket unstructured-tf-state \
  --region us-east-1

aws s3api put-bucket-encryption \
  --bucket unstructured-tf-state \
  --server-side-encryption-configuration '{"Rules":[{"ApplyServerSideEncryptionByDefault":{"SSEAlgorithm":"AES256"}}]}'

Route53 hosted zone

A Route53 hosted zone must exist for the domain you plan to use. Terraform creates DNS records in this zone, but it does not create the zone itself.

Copy

aws route53 create-hosted-zone \
  --name env-id.company.com \
  --caller-reference "$(date +%s)"

After you create the zone, delegate your domain by updating the nameservers of your registrar to the ones returned by Route53.

ACM certificate

An ACM certificate must be issued for the application domain. It must be in the same region as your deployment and in ISSUED status.

Copy

aws acm request-certificate \
  --domain-name app.env-id.company.com \
  --validation-method DNS \
  --region us-east-1

Complete DNS validation by adding the CNAME record to your Route53 zone. Terraform handles the association of this certificate with the ingress load balancer.

Cognito user pool

Configure AWS Cognito for application authentication. This includes user pool creation, app client settings, custom user attributes, and optional SSO with a federated identity provider.

Prerequisites

AWS account with permissions to create and manage Amazon Cognito resources.
AWS CLI v2 installed and configured (optional if you use CLI commands).
The application URL (for example, https://app.unstructured.<your domain>.com).

Expected final state

Resource	Description	Example value
User Pool	Cognito user pool for authentication.	`unstructured-ai-pool`
App Client	Public app client (no client secret).	`unstructured-frontend`
Custom Attributes	User properties for roles and tenancy.	`custom:permission_level`, `custom:tenant_name`, `custom:group_memberships`
Admin User	Initial admin user with permanent password.	Username: `admin`, permission level: `admin`
Hosted UI Domain (Optional)	Cognito domain for OAuth/SSO (if you use SSO).	`your-company.auth.us-east-1.amazoncognito.com`
SAML Identity Provider (Optional)	Federated IDP for SSO (if you use SSO).	Provider name: `SAML`

1. Create a user pool

Using the AWS console

Navigate to the AWS Console.
Enter "Cognito" in the search bar.
Click Create user pool.
Define your application:
- Select Single-page application (SPA).
- Name your application.
Configure options:
- For sign-in identifiers, select Username and Email.
- For self-registration, clear Enable self-registration.
- For required attributes for sign-up, select email in the dropdown menu.
Skip the return URL section. You only need this for SSO, and you can configure it later if required.
Click Create User Directory.

On the next page, click Go to overview. You will land on the user pool overview page. Note the following values, that are required for application configuration:

Value	Where to find	Description
User Pool ID	User pool overview page.	Unique identifier (for example, `us-east-1_AbCdEfGhI`).
App Client ID	User pool overview page, or App clients tab.	Client identifier (for example, `7649pb0etcv84u0rskudhd5pel`).
AWS Region	Visible in the URL bar or User Pool ARN.	Region where the pool was created (for example, `us-east-1`).

Note The wizard automatically creates an app client. You will configure its settings in the Configure app client step.

Add custom attributes: In your user pool, navigate to Sign-up experience (under Authentication in the left panel). Scroll to Custom attributes. Click Add custom attributes and enter the following:

Attribute name	Type	Mutable	Description
`custom:permission_level`	String	Yes	User's role: `admin`, `contributor`, or `viewer`.
`custom:tenant_name`	String	Yes	Tenant ID for the organization of the user.
`custom:group_memberships`	String	Yes	Comma-separated list of group names.

Important You cannot rename or delete custom attributes after creation. Verify the attribute names match exactly as shown.

View the CLI equivalent

Note The create-user-pool CLI command does not support setting the feature plan (Essentials) or selecting the SPA app type. The CLI creates a standard user pool, and the app client is created separately. After creation, you can verify settings in the AWS Console.

2. Configure app client

The app client was automatically created during user pool creation. This section covers verifying and updating its settings.

Using the AWS console

In your user pool, go to the App clients tab.
Click the app client created during the wizard. It has the name you provided previously.

App client information

Verify the following settings:

Setting	Expected value
App client name	The name you provided during user pool creation.
Client secret	`-` (no client secret for the SPA app type).

Authentication flows

Verify that the following authentication flows are enabled. If they are not, click Edit and enable them:

Authentication flow	API name	Required	Description
Sign in with secure remote password (SRP)	`ALLOW_USER_SRP_AUTH`	Yes	Standard username and password authentication.
Get new user tokens from existing authenticated sessions	`ALLOW_REFRESH_TOKEN_AUTH`	Yes	Enables session continuity and token refresh.

Note You do not need other authentication flows such as ALLOW_USER_PASSWORD_AUTH, ALLOW_ADMIN_USER_PASSWORD_AUTH, ALLOW_CUSTOM_AUTH, or ALLOW_USER_AUTH. You can leave them cleared unless you have a specific need for them.

Authentication flow session duration

Setting	Recommended value
Authentication flow session duration	3 minutes.

Token expiration

Token	Recommended value	Notes
Refresh token expiration	5 days.	Used for automatic session renewal without re-login.
Access token expiration	60 minutes.	Short-lived for security.
ID token expiration	60 minutes.	Used by the application for API requests and role resolution.

Adjust token lifetimes according to the security requirements of your organization. Shorter refresh token expiration increases security but requires more frequent re-authentication.

Advanced authentication settings

Setting	Value
Enable token revocation	Yes (allows invalidating tokens on sign-out).
Enable prevent user existence errors	Yes (returns generic error messages to prevent username enumeration attacks).

Attribute read and write permissions

The app client needs Read and Write access to the attributes listed below. Click Edit under the attribute permissions section to verify and update the settings.

Required custom attributes:

Attribute	Read	Write	Used for
`custom:permission_level`	Yes	Yes	Role-based access control (`admin`, `contributor`, or `viewer`).
`custom:tenant_name`	Yes	Yes	Tenant ID for tenant identification.
`custom:group_memberships`	Yes	Yes	Group-based access control.

Required standard attributes:

Attribute	Read	Write	Used for
`email`	Yes	Yes	User identification and communication.

Recommended standard attributes:

Attribute	Read	Write	Used for
`given_name`	Yes	Yes	User's first name shown in the UI.
`family_name`	Yes	Yes	User's last name shown in the UI.

The application does not actively use other standard attributes such as address, birthdate, gender, locale, middle_name, name, nickname, or phone_number. You can leave them at defaults or disable them according to your requirements.

Click Save changes after verifying or updating the settings.

Important The app client must not have a client secret. The application is a browser-based SPA that uses Secure Remote Password (SRP) authentication, which requires a public client. The SPA app type selected during user pool creation ensures this by default.

View the CLI equivalent

Note The CLI command uses update-user-pool-client (not create) because the client already exists. When you use this command, you must specify all settings. Any omitted values will be reset to defaults.

3. Create the initial admin user

Create your first admin user. This user will have full access to manage users, groups, and settings through the application.

Using the AWS console

In your user pool, go to Users.
Click Create user.
Enter the details:
Field Value
User name admin (or your preferred admin username).
Email ID Admin's email ID.
Mark email as verified Yes.
Temporary password Set a temporary password.
Click Create user.
After creation, go to the user detail page.
Navigate to User attributes.
Click Edit.
Set custom attributes:
Attribute Value
custom:permission_level admin.
custom:tenant_name Your tenant ID (for example, default).
custom:group_memberships Leave empty (or set initial groups).
Set a permanent password to move the user out of FORCE_CHANGE_PASSWORD status. See the CLI command below for an example.

Field	Value
User name	`admin` (or your preferred admin username).
Email ID	Admin's email ID.
Mark email as verified	Yes.
Temporary password	Set a temporary password.

Attribute	Value
`custom:permission_level`	`admin`.
`custom:tenant_name`	Your tenant ID (for example, `default`).
`custom:group_memberships`	Leave empty (or set initial groups).

View the CLI equivalent

4. Optional SSO setup

4.1 Configure hosted ID domain

If you plan to use SSO with a federated identity provider, you must configure a hosted UI domain.

Using the AWS console

In your user pool, go to Branding, then Domain in the left panel.
Under Cognito domain, click Edit (or Create Cognito domain if none exists).
Enter a domain prefix (for example, unstructured-ai-sso). This creates a domain in the following format:
```
https://<your-prefix>.auth.<region>.amazoncognito.com
```
For example: https://unstructured-ai-sso.auth.us-east-1.amazoncognito.com
For Branding version, select Hosted UI (classic).
Click Save.
Note: If you prefer to use your own domain, for example, auth.yourcompany.com, use the Custom domain section instead. This requires an ACM certificate in us-east-1. For most deployments, the Cognito domain is sufficient.
Callback URLs: The application manages OAuth redirect URLs in its own configuration via environment variables. See Section 5. You do not need to configure callback or sign-out URLs on the Cognito app client.
Note the Cognito domain, as it is required for application configuration:
Value Example
Cognito domain prefix unstructured-ai-sso.
Full domain URL https://unstructured-ai-sso.auth.us-east-1.amazoncognito.com.

Value	Example
Cognito domain prefix	`unstructured-ai-sso`.
Full domain URL	`https://unstructured-ai-sso.auth.us-east-1.amazoncognito.com`.

View the CLI equivalent

4.2 Add a SAML identity provider

To enable "Login with SSO" through a SAML 2.0 identity provider, such as Okta, Azure AD, OneLogin, or PingFederate:

Using the AWS console

In your user pool, go to Authentication, then Social and external providers in the left panel.
Click Add identity provider.
Select SAML.
Configure the provider:
Setting Value
Provider name SAML.
Metadata source Upload metadata file or provide metadata URL from your IDP.
Configure Attribute mapping:
IDP attribute Cognito attribute
email email.
name name.
Click Save.

Setting	Value
Provider name	`SAML`.
Metadata source	Upload metadata file or provide metadata URL from your IDP.

IDP attribute	Cognito attribute
`email`	`email`.
`name`	`name`.

Important The provider name must be SAML. The application uses this name when it initiates SSO login redirects.

View the CLI equivalent using metadata URL

View the CLI equivalent using metadata file

4.3 Configure your identity provider

In your identity provider, create a SAML application with the following settings:

Setting	Value
SSO URL / ACS URL	`https://<your-cognito-domain>/saml2/idpresponse`.
Audience URI / Entity ID	`urn:amazon:cognito:sp:<your-user-pool-ID>`.
Name ID format	`EmailAddress` or `Persistent`.

Okta-specific setup

In the Okta Admin Console, go to Applications.
Click Create App Integration.
Select SAML 2.0.
Click Next.
Set:
- Single sign-on URL: https://<your-cognito-domain>/saml2/idpresponse.
- Audience URI (SP Entity ID): urn:amazon:cognito:sp:<your-user-pool-ID>.
Under Attribute Statements, add:
- email → user.email.
- name → user.displayName.
Complete the wizard. Note the Metadata URL from the Sign On tab, as you will need this for the Cognito SAML provider configuration.

4.4 Enable the IDP on the app client

Using the AWS console

Go to Applications, then App clients.
Click your app client.
Select the Login pages tab.
Click Edit.
Under Identity providers, enable both:
- Cognito user pool (for username and password login).
- SAML (for SSO login).
Click Save changes.

View the CLI equivalent

When you use update-user-pool-client, you must re-specify all existing settings because the command replaces the full client configuration.

Deployment scenarios

Standard deployment

In a standard deployment, Terraform creates all networking resources, such as VPC, subnets, NAT gateways, internet gateway, and route tables. The ingress load balancer is internet-facing.

BYOVPC (bring your own VPC)

In a BYOVPC deployment, you provide an existing VPC and subnets. Terraform skips networking creation and deploys directly into your infrastructure. The ingress load balancer is automatically set to internal, meaning the application is only accessible via private network connectivity, for example, VPN, Direct Connect, or peering.

Note VPN or network connectivity is your responsibility and is managed outside of this Terraform deployment.

BYOVPC requirements

Your VPC and subnets must meet the following requirements:

Requirement	Details
DNS Support	VPC must have DNS support and DNS hostnames enabled.
Private Subnets (EKS)	Exactly 2 private subnets in different Availability Zones with outbound internet access (NAT Gateway or an equivalent NAT Gateway).
Private Subnets (DB)	Exactly 2 additional private subnets in different Availability Zones for the RDS database. Not required if using BYODB.
Outbound Internet	Required for pulling container images from ECR, Helm charts, and other external dependencies.
Tagging	VPC and all subnets must be tagged with `Project = "unstructured"` (required by the tag-based conditions of the IAM policy).

Tag your VPC and subnets:

Copy

aws ec2 create-tags \
  --resources <vpc-id> <subnet-1> <subnet-2> <subnet-3> <subnet-4> \
  --tags Key=Project,Value=unstructured

CIDR range requirements

When you use BYOVPC, the following CIDR ranges must not overlap:

CIDR range	Purpose	Default
VPC CIDR	Your VPC network.	Customer-provided.
Kubernetes Service CIDR	ClusterIP services.	`172.20.0.0/16` (you can configure this via `eks_cluster_service_cidr`).
VPN Client CIDR	VPN tunnel client IPs.	Depends on your VPN configuration.

BYOVPC configuration

To use your own VPC, include the networking.byovpc section in config.yaml:

Copy

networking:
  byovpc:
    vpc_id: "vpc-xxxxxxxxxxxxxxxxx"
    private_subnet_ids:
      - "subnet-xxxxxxxx"   # EKS subnet (AZ 1)
      - "subnet-yyyyyyyy"   # EKS subnet (AZ 2)
    db_subnet_ids:
      - "subnet-aaaaaaaa"   # RDS subnet (AZ 1)
      - "subnet-bbbbbbbb"   # RDS subnet (AZ 2)

Omit this section entirely to have the infrastructure create a new VPC automatically.

BYOR (bring your own registry)

By default, the cluster pulls all container images and Helm charts directly from the Unstructured release ECR using the provisioner role cross-account pull permissions. If your environment cannot pull from a third-party AWS account at runtime (for example, due to air-gapped networking, an internal-only registry policy, or because you want to scan or sign images before deploying), use BYOR to host the artifacts in a registry you control.

When BYOR is enabled, the configure tool mirrors every image and Helm chart that the deployment needs from the Unstructured ECR to your registry at configure time. Terraform and the cluster then pull exclusively from your registry. There is no runtime dependency on the Unstructured account.

BYOR has two modes, selected automatically based on the registry URL in config.yaml:

Mode	Registry URL	Authentication	When to use
Customer ECR	`<account>.dkr.ecr.<region>.amazonaws.com/<prefix>`	Provisioner role (no extra credentials).	You already have an AWS ECR registry in the deployment account. Repositories are created automatically under the path prefix.
Non-ECR	Any other URL (Artifactory, Harbor, Quay, Docker Hub, GHCR, and so on).	`byor.username` and `byor.password`.	You host images outside AWS or in a non-ECR product. The destination must already exist. The tool will not create repositories for you.

BYOR requirements

Requirement	Details
Outbound from your machine	The configure tool runs locally and needs network access to both the Unstructured ECR (source) and your registry (destination) to mirror images.
Disk space	Mirroring streams images directly between registries (no on-disk copy), but expect approximately 10 GB of network transfer for a fresh mirror.
Customer ECR: IAM	The provisioner role `ECRLocalRegistryManagement` statement covers `CreateRepository`, `PutImage`, and upload actions for repositories under `arn:aws:ecr::<YOUR_ACCOUNT_ID>:repository/unstructured/`. If you use a different repository prefix, update the resource ARN accordingly.
Customer ECR: repository path	Use a single shared prefix (for example, `<account>.dkr.ecr.<region>.amazonaws.com/unstructured`). The tool creates one repository per artifact under that prefix.
Non-ECR: push credentials	A user or token with permission to push images and OCI Helm charts to the destination registry. Store these in `byor.username` and `byor.password` in `config.yaml`.
Non-ECR: OCI Helm support	The destination must support OCI artifacts (Helm 3 charts are pushed as OCI). Artifactory, Harbor, GHCR, Quay, and Docker Hub all support this.

BYOR configuration

Add the registry section to config.yaml. For non-ECR destinations, also add a byor section:

Copy

# Customer ECR - no byor section needed; provisioner role handles auth
registry:
  url: "123456789012.dkr.ecr.us-east-1.amazonaws.com/unstructured"

# Non-ECR registry - byor credentials required
registry:
  url: "your-registry.example.com/unstructured"

byor:
  username: "your-registry-username"
  password: "your-registry-password-or-token"

Omit both sections to pull directly from the Unstructured release ECR (the default).

What the configure tool does

When registry.url differs from the default, running configure (after writing terraform.auto.tfvars and backend.hcl) performs the following steps:

Authenticates to the source ECR using the provisioner role.
Authenticates to the destination: assumed-role ECR token for customer ECR, or byor basic auth for non-ECR.
For customer ECR, creates any missing repositories under the configured path prefix.
For each artifact, compares source and destination digests and copies only what is missing or out-of-date (idempotent, safe to re-run).
Prints a summary: N copied, N skipped, N failed.

The Terraform stack is then configured to pull from your registry instead of the Unstructured one. For non-ECR destinations, an imagePullSecret is created in-cluster from the byor credentials and attached to all relevant ServiceAccounts.

Note Each new release pins new image and chart versions. Re-run the configure tool before terraform apply to mirror the new artifacts. Existing tags are skipped, so the operation is incremental.

BYOVPC + BYODB (bring your own database)

BYODB extends the BYOVPC scenario by skipping database creation. You provide an existing PostgreSQL endpoint and a Secrets Manager secret containing its credentials. Terraform deploys the EKS cluster and application stack against your database. Because no Aurora cluster is created, DB subnets are not required. Only the 2 EKS private subnets are needed.

BYODB requirements

Note BYODB requires BYOVPC. You must provide the networking.byovpc configuration.

Requirement	Details
PostgreSQL	The database must be PostgreSQL-compatible (RDS PostgreSQL, Aurora PostgreSQL, or self-managed).
Network reachability	The EKS cluster must be able to reach the database endpoint on its configured port. If the database is in a different VPC, ensure connectivity via VPC peering, Transit Gateway, or equivalent, and that security groups allow inbound traffic from the EKS subnets.
Secrets Manager secret	A Secrets Manager secret containing the database credentials in the expected format (see below).
Liquibase migrations	The application Liquibase migration job runs at deploy time against the provided database. The database user must have sufficient privileges to create and alter tables.
Tagging	The database instance and its associated resources (security groups, subnet groups, Secrets Manager secret) must be tagged with `Project = "unstructured"` to satisfy the tag-based conditions of the IAM policy.

Secrets Manager secret format

The Secrets Manager secret referenced by secret_arn must contain the following keys:

Copy

{
  "host": "your-db.xxxxxxxxxxxx.us-east-1.rds.amazonaws.com",
  "port": "5432",
  "database": "unstructuredpgdb",
  "username": "unstructured_pg_user",
  "password": "your-database-password"
}

Tag the secret:

Copy

aws secretsmanager tag-resource \
  --secret-id <your-secret-arn> \
  --tags Key=Project,Value=unstructured

BYODB configuration

To bring your own database, add the byo_db block inside the networking.byovpc section of config.yaml:

Copy

networking:
  byovpc:
    vpc_id: "vpc-xxxxxxxxxxxxxxxxx"
    private_subnet_ids:
      - "subnet-xxxxxxxx"   # EKS subnet (AZ 1)
      - "subnet-yyyyyyyy"   # EKS subnet (AZ 2)
    byo_db:
      db_endpoint: "your-db.cluster-xxxxxxxxxxxx.us-east-1.rds.amazonaws.com"
      secret_arn: "arn:aws:secretsmanager:us-east-1:123456789012:secret:your-db-secret-xxxxxx"

Note that db_subnet_ids is omitted when byo_db is provided. If you supply both, byo_db takes precedence and the DB subnets are ignored.

Tagging BYO resources

The IAM provisioner policy uses tag-based conditions (aws:ResourceTag/Project = "unstructured") to scope permissions. Any pre-existing resources you bring must be tagged accordingly, or Terraform will receive AccessDenied errors when it attempts to read or manage them.

Resources that must be tagged with Project = "unstructured":

Resource	Why
VPC	Karpenter subnet discovery, security group lookups.
All subnets (EKS and DB, if applicable)	Subnet data sources, EKS node placement.
RDS instance or Aurora cluster (if BYODB)	Terraform reads endpoint and status metadata.
Security groups on the database	Terraform may reference them for connectivity validation.
Secrets Manager secret	ExternalSecrets operator reads the secret at runtime.

Copy

aws ec2 create-tags \
  --resources <vpc-id> <eks-subnet-1> <eks-subnet-2> \
  --tags Key=Project,Value=unstructured

aws rds add-tags-to-resource \
  --resource-name <db-instance-arn> \
  --tags Key=Project,Value=unstructured

aws secretsmanager tag-resource \
  --secret-id <your-secret-arn> \
  --tags Key=Project,Value=unstructured

Deployment steps

1. Prerequisites

Before deployment, ensure you have:

AWS CLI configured with access to the target account.
Terraform >= 1.5 installed.
Go >= 1.25.7 installed (for the configuration tool).
An IAM provisioner role with the permissions described in the IAM provisioner role section.
An S3 bucket for Terraform remote state.
A Cognito user pool and app client configured for authentication.
A Route53 hosted zone for DNS.
ACM certificate issued in the deployment region.
ECR cross-account access: provide your AWS account ID to the Collibra team so your cluster can pull container images (not required when using BYOR).
BYOVPC (if applicable): VPC and subnets tagged with Project = "unstructured".
BYODB (if applicable): database, security groups, and Secrets Manager secret tagged with Project = "unstructured".
BYOR (if applicable): a reachable destination registry. For non-ECR registries, push credentials must be available to the configure tool.

2. Configure

Copy the example configuration and enter your values:

Copy

cd iac/aws
cp config.yaml.example config.yaml

Edit config.yaml with your environment-specific values. See config.yaml.example for a fully commented template.

Cross-account DNS: If your Route53 hosted zone is in a different AWS account, set create_a_record: false under the ingress: section. Terraform will not attempt to create the Route53 A record and will output the values you need to create it manually after deployment.

3. Generate Terraform files

Build and run the configuration tool. All commands run from iac/aws/:

Copy

go build -C ../../tools/configure -o ../../iac/aws/configure ./aws
./configure

The tool validates your configuration and generates the following:

terraform.auto.tfvars (all Terraform variable values).
backend.hcl (S3 backend configuration).

The validator performs offline checks, such as:

Required field presence.
Region consistency (the Cognito pool region matches the deployment region).
IAM role ARN format.
Cognito client ID format.
UUID format for observability site ID.
The TLS domain is a subdomain of the DNS zone.
BYOVPC subnet ID format, count, and uniqueness.
BYOR credentials present when registry.url points to a non-ECR registry.

When a custom registry.url is configured, the configure tool also mirrors all images and Helm charts to your registry as part of this step. See BYOR (bring your own registry) for details.

4. Deploy

Copy

terraform init -backend-config=backend.hcl
terraform apply

This command deploys the entire stack in a single apply:

VPC networking (or a BYOVPC).
EKS cluster and node groups.
Aurora PostgreSQL database (skipped when using BYODB).
IAM roles (API, workflow, Cognito access, and EBS CSI).
Linkerd service mesh (cert-manager, CRDs, and control plane).
AWS Load Balancer Controller and ingress.
Backend and frontend applications.
Argo Workflows and Events.
External Secrets Operator.
Cluster Autoscaler.
EBS CSI Driver.
OpenTelemetry Collector.

Deployment takes approximately 20 to 30 minutes.

5. Verify the deployment

Copy

# Configure kubeconfig (use --role-arn since only the provisioner role has cluster access)
aws eks update-kubeconfig \
  --region <your-region> \
  --name unstructured-eks-cluster \
  --role-arn arn:aws:iam::<ACCOUNT_ID>:role/UnstructuredTerraformProvisioner

# Set AWS_PROFILE so kubectl/helm can authenticate
export AWS_PROFILE=<your-profile-name>

# Check all pods
kubectl get pods --all-namespaces

# Check ingress
kubectl get ingress -n unstructured

For standard deployments, access the application at https://app.env-id.company.com.

For BYOVPC deployments, ensure your VPN or private network connectivity is active. Then, access the application at the configured domain.

6. Post-deployment: Cross-account DNS

If you set create_a_record: false because your Route53 hosted zone is in a different AWS account, you must manually create the DNS record after deployment.

After terraform apply completes, retrieve the required values:

Copy

terraform output dns_record_config

Use the output values to create an A record (Alias) in your Route53 hosted zone via the AWS Console or via the AWS CLI:

Copy

# Get the hosted zone ID for your domain in the DNS account
ZONE_ID=$(aws route53 list-hosted-zones-by-name \
  --dns-name "<dns_zone_name from output>" \
  --query "HostedZones[0].Id" \
  --output text \
  --profile <dns-account-profile>)

# Create the alias A record
aws route53 change-resource-record-sets \
  --hosted-zone-id "$ZONE_ID" \
  --profile <dns-account-profile> \
  --change-batch '{
    "Changes": [{
      "Action": "UPSERT",
      "ResourceRecordSet": {
        "Name": "<record_name from output>",
        "Type": "A",
        "AliasTarget": {
          "DNSName": "<alias_target from output>",
          "HostedZoneId": "<alias_zone_id from output>",
          "EvaluateTargetHealth": true
        }
      }
    }]
  }'

Tip Copy values from the terraform output carefully. The --change-batch JSON is sensitive to formatting. Avoid trailing commas, ensure quotes are straight, and do not add a trailing period to the DNSName value.

Note You only need to repeat this step if the ALB hostname changes, such as after a full teardown and redeploy.

Upgrading

To upgrade to a newer version:

Download and extract the latest Terraform tarball (.tgz) from the Collibra downloads page.
Review the release notes for any breaking changes.
Update config.yaml with any new parameters and re-run the configure tool.

Run:

Copy

terraform init -backend-config=backend.hcl
terraform apply

Teardown

The recommended path is the configure tool destroy subcommand. It uninstalls Kubernetes workloads in the correct order, drains Karpenter-managed EC2 instances, and then runs terraform destroy:

Copy

cd iac/aws
./configure destroy

The tool prompts for confirmation before doing anything destructive. Pass --auto-approve to skip the prompt, or --dry-run to print what each step would do without making changes. Output is also saved to /tmp/aws-destroy-<timestamp>.log.

If you prefer to run terraform destroy directly without the workload teardown (for example, when no Kubernetes resources have been deployed yet):

Copy

cd iac/aws
terraform destroy

Note Some resources may require manual cleanup after destroy:

Secrets Manager secrets are scheduled for deletion with a recovery window. Secrets are not immediately deleted. Use aws secretsmanager delete-secret --force-delete-without-recovery if you need immediate deletion for redeployment.
KMS keys are scheduled for deletion with a waiting period.

Troubleshooting

Secrets Manager "already scheduled for deletion"

Error:

InvalidRequestException: You can't create this secret because a secret with this name is already scheduled for deletion.

Fix: Restore the existing secret or force-delete it:

Copy

# Option 1: Restore the secret
aws secretsmanager restore-secret --secret-id <secret-name> --region <region>

# Option 2: Force-delete and let Terraform recreate it
aws secretsmanager delete-secret \
  --secret-id <secret-name> \
  --force-delete-without-recovery \
  --region <region>

Helm provider OCI registry authentication errors

Error:

Failed to log in to OCI registry "oci://...": response status code 403: denied: Your authorization token has expired.

This error occurs due to a known bug in Helm provider v3.x where the repository_password is cached in Terraform state. When ECR authorization tokens expire after 12 hours, Terraform uses the expired token from state.

Fix: Remove and re-import all affected Helm releases:

Copy

terraform state rm module.frontend.helm_release.frontend
terraform import module.frontend.helm_release.frontend unstructured/unstructured-frontend

terraform state rm module.backend.helm_release.backend
terraform import module.backend.helm_release.backend unstructured/unstructured-backend

terraform state rm module.linkerd.helm_release.cert_manager
terraform import module.linkerd.helm_release.cert_manager cert-manager/cert-manager

terraform state rm module.linkerd.helm_release.linkerd_certs
terraform import module.linkerd.helm_release.linkerd_certs linkerd/linkerd-certs

terraform state rm module.linkerd.helm_release.linkerd_crds
terraform import module.linkerd.helm_release.linkerd_crds linkerd/linkerd-crds

terraform state rm module.linkerd.helm_release.linkerd
terraform import module.linkerd.helm_release.linkerd linkerd/linkerd

terraform state rm module.ingress.helm_release.aws_load_balancer_controller
terraform import module.ingress.helm_release.aws_load_balancer_controller kube-system/aws-load-balancer-controller

terraform state rm module.karpenter_deploy.helm_release.karpenter
terraform import module.karpenter_deploy.helm_release.karpenter kube-system/karpenter

terraform state rm module.eks_workload_addons.helm_release.argo_events
terraform import module.eks_workload_addons.helm_release.argo_events unstructured/argo-events

terraform state rm module.eks_workload_addons.helm_release.argo_workflows
terraform import module.eks_workload_addons.helm_release.argo_workflows unstructured/argo-workflows

terraform state rm module.eks_workload_addons.helm_release.external_secrets_operator
terraform import module.eks_workload_addons.helm_release.external_secrets_operator unstructured/external-secrets

terraform state rm module.eks_workload_addons.helm_release.aws_ebs_csi_driver
terraform import module.eks_workload_addons.helm_release.aws_ebs_csi_driver kube-system/aws-ebs-csi-driver

terraform state rm module.eks_workload_addons.helm_release.node_local_dns
terraform import module.eks_workload_addons.helm_release.node_local_dns kube-system/node-local-dns

terraform state rm module.eks_workload_addons.helm_release.otel_collector_agent
terraform import module.eks_workload_addons.helm_release.otel_collector_agent unstructured/otel-collector-agent

terraform state rm module.eks_workload_addons.helm_release.otel_collector_k8s_metrics
terraform import module.eks_workload_addons.helm_release.otel_collector_k8s_metrics unstructured/otel-collector-k8s-metrics

terraform state rm module.eks_workload_addons.helm_release.otel_collector_events
terraform import module.eks_workload_addons.helm_release.otel_collector_events unstructured/otel-collector-events

terraform state rm module.eks_workload_addons.helm_release.otel_collector_db_metrics
terraform import module.eks_workload_addons.helm_release.otel_collector_db_metrics unstructured/otel-collector-db-metrics

terraform state rm module.eks_workload_addons.helm_release.pgbouncer
terraform import module.eks_workload_addons.helm_release.pgbouncer unstructured/pgbouncer

terraform state rm module.eks_workload_addons.helm_release.reloader
terraform import module.eks_workload_addons.helm_release.reloader unstructured/reloader

Then re-apply:

Copy

terraform apply

Related issues:

GitHub Issue: https://github.com/hashicorp/terraform-provider-helm/issues/1660.
Fix PR (pending merge): https://github.com/hashicorp/terraform-provider-helm/pull/1687.

When this occurs:

After ECR authorization tokens expire (tokens are valid for 12 hours).
After extended periods between Terraform applies.
When you switch AWS profiles or credentials.