Installation and upgrade requirements for CPSH extended capabilities

This document describes how to install and upgrade Collibra Platform Self-Hosted (CPSH) extended capabilities and their supporting components. Each capability is deployed with a dedicated Zarf package, and the packages must be installed in sequence for an upgrade or a new feature deployment to succeed.

This document assumes that you are installing Technical lineage. Edge-specific information is documented separately.

Note: The third-party application instructions in this document are based on the current version of the CPSH installation for Technical lineage and Edge. Follow the steps in this document rather than the third-party documentation, because updates to third-party software documentation may not reflect the current version of the CPSH application.

Before you start the installation, make sure that you have all of the following in place.

Prerequisites

Air-gapped CPSH installations require the following:

  • A properly installed and configured DCG (required before Zarf package installation)
  • Valid admin user credentials
  • A cloud environment configured with a Kubernetes cluster and necessary storage buckets (such as S3)
  • The required open source tools (see Tool requirements)
  • FIPS mode: Collibra must be installed in FIPS mode for air-gapped Edge installations to support the required cryptographic modules. (See Install CPSH for more information.)

Important: For air-gapped deployments, FIPS mode is mandatory. It ensures that all data encryption and secure communications use compliant cryptographic algorithms while disconnected from the public internet.

Requirements

Hardware requirements

Node group | AWS instance type | Required vCPU | Required memory
Default node group | m5.4xlarge | 16 vCPU | 64 GB
Lineage node group (minimum) | r6id.4xlarge | 16 vCPU | 128 GB
Lineage node group (maximum) | r6id.8xlarge | 32 vCPU | 256 GB
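The hardware figures above can be captured as data and used to sanity-check a planned node group before provisioning. The following is a minimal sketch, not part of the CPSH installer. The group keys "default" and "lineage", the names InstanceSpec and check_instance_type, and the rule that only the listed instance types are acceptable are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InstanceSpec:
    vcpu: int
    memory_gb: int


# Figures copied from the hardware requirements in this document.
INSTANCE_SPECS = {
    "m5.4xlarge": InstanceSpec(vcpu=16, memory_gb=64),
    "r6id.4xlarge": InstanceSpec(vcpu=16, memory_gb=128),
    "r6id.8xlarge": InstanceSpec(vcpu=32, memory_gb=256),
}

# Assumption: only the instance types listed in this document are acceptable.
# The keys "default" and "lineage" are hypothetical labels for the two groups.
ALLOWED_TYPES = {
    "default": ("m5.4xlarge",),
    "lineage": ("r6id.4xlarge", "r6id.8xlarge"),
}


def check_instance_type(node_group: str, instance_type: str) -> str:
    """Return a one-line verdict for a planned node group instance type."""
    allowed = ALLOWED_TYPES.get(node_group)
    if allowed is None:
        return f"unknown node group: {node_group}"
    if instance_type not in allowed:
        return (
            f"{instance_type} is not listed for the {node_group} node group "
            f"(expected one of: {', '.join(allowed)})"
        )
    spec = INSTANCE_SPECS[instance_type]
    return f"ok: {instance_type} provides {spec.vcpu} vCPU and {spec.memory_gb} GB"
```

For example, check_instance_type("lineage", "r6id.8xlarge") reports the 32 vCPU and 256 GB figures, while an instance type that is not listed for the group produces a message naming the accepted types.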

Installation requirements

The following must be installed for a complete setup:

Installation | Description
AWS setup | Instructions for creating the necessary Amazon Web Services (AWS) infrastructure.
Zarf | Installation and configuration of the Zarf package manager CLI.
Platform | Deployment of the core CPSH components.
Observability | Deployment of the Observability stack.
Technical lineage | Deployment of the Technical Lineage capability components.
Edge | Underlying Edge components must be installed (via a Zarf package).

Tool requirements

The following open source tools are required to install the Zarf packages. Install them on a machine that can access the Kubernetes cluster:

Note: See the Zarf installation instructions.

  • zarf
  • curl
  • openssl
  • date
  • grep
  • sed
  • tr
  • awk
  • shasum
  • base64
  • bash
  • cat
  • head
  • env
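One way to confirm that the tools listed above are available before running any Zarf package is to look each of them up on the PATH. The following is a minimal sketch using only the Python standard library. The names REQUIRED_TOOLS and missing_tools are illustrative and not part of CPSH or Zarf. The check confirms only that an executable with that name is found, not that it is a suitable version.

```python
import shutil

# Tool names copied from the list in this document.
REQUIRED_TOOLS = [
    "zarf", "curl", "openssl", "date", "grep", "sed", "tr",
    "awk", "shasum", "base64", "bash", "cat", "head", "env",
]


def missing_tools(tools=REQUIRED_TOOLS):
    """Return the tools from `tools` that cannot be found on the PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


def summarize(missing):
    """Build a short human-readable summary of the check result."""
    if missing:
        return "Missing tools: " + ", ".join(missing)
    return "All required tools were found on the PATH."
```

Calling summarize(missing_tools()) on the installation machine returns a single line that either confirms that all tools were found or names the ones that still need to be installed.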

Resource requirements

Resource | Requirement
AWS EKS cluster | Must be fully provisioned and accessible. Specifically used for the CPSH installation.
AWS Load Balancer Controller | Must be installed on the cluster.
S3 (Simple Storage Service) | Dedicated S3 Bucket
AWS IAM (Identity and Access Management) | Dedicated IAM Service Account Role
Node groups | Total of six nodes in the following two groups:
  • default-ng: three nodes
  • techlin-local-ssd: three nodes
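The node group requirement above can be checked against a running cluster by counting nodes per group in the JSON output of kubectl get nodes -o json. The following is a minimal sketch under two assumptions: the nodes carry the eks.amazonaws.com/nodegroup label that EKS managed node groups normally apply, and each group should contain exactly three nodes. The function names are illustrative and not part of CPSH.

```python
import json
from collections import Counter

# Node counts copied from the node group requirement in this document.
EXPECTED_NODES = {"default-ng": 3, "techlin-local-ssd": 3}

# Assumption: nodes belong to EKS managed node groups, which carry this label.
NODEGROUP_LABEL = "eks.amazonaws.com/nodegroup"


def count_nodes_by_group(nodes_json: str) -> Counter:
    """Count nodes per node group in `kubectl get nodes -o json` output."""
    items = json.loads(nodes_json).get("items", [])
    return Counter(
        item.get("metadata", {}).get("labels", {}).get(NODEGROUP_LABEL, "<unlabeled>")
        for item in items
    )


def node_group_problems(counts) -> list:
    """List every expected group whose node count differs from the requirement."""
    return [
        f"{group}: expected {expected}, found {counts.get(group, 0)}"
        for group, expected in EXPECTED_NODES.items()
        if counts.get(group, 0) != expected
    ]
```

Save the kubectl output to a file, pass its contents to count_nodes_by_group, and then pass the result to node_group_problems. An empty list means that both groups have the expected three nodes.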

Provision the cloud environment

Currently, the installer supports only AWS as the cloud environment. See the AWS documentation for more information.