Supported data sources for data source registration

Collibra Data Intelligence Cloud supports several databases to register as a data source. Depending on your data source, you can use Collibra-provided Catalog connector, or your own JDBC driver.

Collibra-provided Catalog connectors

You can find certified Collibra-provided Collibra Catalog Connectors on the Collibra Marketplace.

Tip 

Use these options to search for a data source. You can click the icons for more information about the data source.

  • Access
  • Adobe Analytics
  • Amazon Athena
  • Amazon DynamoDB
  • Amazon Redshift
  • Apache HBase
  • Apache Hive
  • Apache Kafka
  • Apache Spark SQL
  • Avro
  • AWS Glue
  • Azure Analysis Services
  • Azure Synapse
  • Azure Table Storage
  • Cassandra
  • Cloudant
  • Cloudera Hive
  • Cosmos DB
  • Couchbase
  • CSV
  • Databricks
  • DB2
  • denodo
  • Dynamics 365 Business Central
  • Dynamics 365 Sales
  • Dynamics CRM
  • Elasticsearch
  • Exasol
  • Excel
  • Excel Online
  • Google Analytics
  • Google BigQuery
  • Google Spanner
  • Greenplum
  • HP Vertica
  • Impala
  • Jira Service Desk
  • JSON
  • MariaDB
  • MarkLogic
  • MongoDB
  • MySQL
  • Netezza
  • NetSuite
  • Oracle
  • Oracle Sales Cloud
  • Parquet
  • PostgreSQL
  • Presto
  • Redis
  • Salesforce
  • Salesforce Marketing Cloud
  • SAP ERP
  • SAP HANA
  • SAP SuccessFactors
  • ServiceNow
  • SharePoint
  • Snowflake
  • SQL Server
  • Sybase
  • Teradata
  • Workday
  • XML

Your own JDBC drivers

For certain data sources, you can use your own JDBC driver. The following table contains the packaged data sources and versions that have been tested.

Data source

Tested versions

Support for profiling and sample data

JDBC driver version

Amazon Redshift

1.0

Yes

v. 1.1.13.1013

Cloudera Hive

5.10 - 5.14

No

Apache driver v. 1.2.1

Hortonworks Hive

2.5, 2.6

No

Apache driver v. 1.2.1

HP Vertica 7.0

Yes

v. 07.01.0200

IBM DB2 10.5

Yes

v. 4.9.78

MySQL 5.6, 5.7

Yes

v. 5.1.38

Oracle 11g, 12c

Yes

v. 12c

PostgreSQL 9.4, 9.5

Yes

v. 9.4.1207

Microsoft SQL Server

2014, 2016

Yes

v. 5.1.38

Note Only Microsoft drivers and drivers available via Collibra Marketplace are supported.

Teradata 15.0, 16.20.07.01

Yes

No driver tested

Snowflake      

Note We cannot guarantee that other data sources or driver versions work correctly. If you use a generic JDBC driver or an unsupported version, data ingestion, data profiling and sample data may not work as expected.

Authentication and permissions

Both ingestion and profiling (including sampling and advanced data type detection) rely on JDBC drivers to operate. Those drivers authenticate to the data sources as a user registered in that data source with specific permissions attached to the user profile in the data source.

To ingest a database without profiling, Data Catalog requires read access to the database metadata: description of schema, tables, columns, including some more complex properties such as the primary and foreign keys.

However, if you enable one or more profiling options, Data Catalog also requires the permission to read the full table. Which permissions are required exactly depends on the data source type, version and configuration. Additionally, they can also differ according to the provider and version of the JDBC driver. Most of the queries required to retrieve the information above are hidden by the driver. As a consequence, Collibra cannot give a exhaustive list.

Note Collibra supports several authentication methods, including credentials, NTLM, CyberArk and Kerberos. If you are using a certified Collibra provided driver on the Collibra Marketplace, you can also authenticate using Windows Authentication.

Tip If you need more detailed information, we recommend to contact your JDBC driver provider.

Warning Currently, if you register a data source via Edge, you cannot profile your metadata and you can only use username/password authentication.