About Collibra Cloud sites

A Collibra Cloud site is an option for customers who want Collibra to fully host the Collibra Platform, which includes the connection platform used to integrate with your cloud-native data sources. With Collibra Cloud sites, Collibra handles the setup and management, allowing you to focus on your business needs. Collibra Cloud sites upgrade automatically, meaning you will always been on the latest, most secure version.

While this solution simplifies implementation and maintenance, it offers slightly fewer features than customer-managed Edge deployments and connects to data sources over the internet.

Collibra Cloud is a type of site fully hosted by Collibra, which allows you to integrate with cloud-native data sources out-of-the-box. A Collibra Cloud site is set up and managed by Collibra, allowing you to focus on your business needs. While this solution simplifies implementation and maintenance, it offers slightly fewer features than customer-managed Edge sites and connects to data sources over the internet. Collibra Cloud sites upgrade automatically, meaning they are always on the latest, most secure version.

What is included with a Collibra Cloud site?

You can create and manage your Collibra Cloud site connections and capabilities in a similar way to how these are managed in a customer-managed Edge site. However, because a Collibra Cloud site connects to data sources over the internet and is maintained by Collibra, some connections and capabilities are not available.

The following list shows the supported data sources per capability:

  • Metadata ingestion and synchronization
    • Amazon Redshift (JDBC)
    • Athena (JDBC)
    • AWS Glue (JDBC)
    • Azure Data Lake Storage
    • Azure Synapse Analytics
    • Databricks Unity Catalog
    • Databricks (JDBC)
    • Google BigQuery (JDBC)
    • Google Cloud Storage
    • Google Dataplex
    • Salesforce (JDBC)
    • SAP Datasphere Catalog
    • SAP HANA
    • Snowflake (JDBC)
    • S3
  • Classification, Profiling , and Sampling
    • Amazon Redshift (JDBC)
    • Athena (JDBC)
    • AWS Glue (JDBC)
    • Azure SQL server
    • Azure Synapse Analytics
    • Databricks (JDBC)
    • Databricks Unity Catalog
    • Google BigQuery (JDBC)
    • Salesforce (JDBC)
    • SAP HANA Cloud
    • Snowflake (JDBC)
  • Technical lineage
    • Amazon Redshift (JDBC)
    • Azure SQL Data Warehouse
    • Azure SQL server
    • Azure Synapse Analytics
    • Databricks Unity Catalog
    • Google BigQuery (JDBC)
    • Google Dataplex
    • Power BI
    • SAP HANA Cloud/Advanced
    • Snowflake (JDBC)
    • Tableau
  • Protect
    • AWS Lake Formation
    • Databricks (JDBC)
    • Google BigQuery
    • Snowflake (JDBC)
  • AI Governance
    • AWS Bedrock AI
    • AWS SageMaker AI
    • Azure AI Foundry
    • Azure ML
    • Databricks Unity Catalog
    • Google Vertex AI
    • MLflow
    • SAP AI Core

For more information, go to our connections and capabilities documentation.

Additionally, you can register the following data sources for Data Notebook:

  • Amazon Redshift (JDBC)
  • Athena (JDBC)
  • Databricks Unity Catalog
  • Google BigQuery
  • Google BigQuery (JDBC)
  • Snowflake (JDBC)
Note If a data source you want to integrate with is not listed below, contact your Account Executive for more options.

Limitations

As Collibra Cloud sites are managed by Collibra, some customer-managed Edge functionalities are not supported on Collibra Cloud sites:

  • Edge CLI
    Note 

    As the Edge CLI is unsupported for Collibra Cloud sites, the following ETL integrations are not available:

    • IBM InfoSphere DataStage
    • Informatica PowerCenter
    • SQL Server Integration Services (SSIS)
    • dbt Core
    • Custom Lineage
    • JDBC Lineage via Shared Storage connection
    • Open Lineage
  • Data Notebook's Postgres Database storage capability
  • Control of managed Kubernetes clusters
  • Manual upgrades
  • Customer hosted Vault integrations
  • Forward proxies
  • Custom repositories
  • FedRAMP authorization

What's next?