What's the difference between Data Catalog and Collibra Connect?

Data Catalog and Collibra Connect have many overlapping features. Which of them is more suited for your situation, depends on a number of factors.

In a nutshell, you use Data Catalog for ingesting metadata from popular database types via a predefined ingestion logic, which is ideal for business users. You can then see the metadata in the form of assets and characteristics. You use Collibra Connect to read and write metadata in any API-supported system and provide the metadata to Collibra Data Intelligence Cloud. Collibra Connect has more flexibility with regard to ingestion, but requires technical skills.

 

Data Catalog

Collibra Connect

Definition

The Collibra Data Catalog is an application that helps the business data analyst to discover, describe, assemble and govern data sets, in order to improve trust in analytics based on those data sets.

Collibra Connect is an integration platform that enables integrations between Collibra and other third-party products, such as Informatica, Salesforce.com and JIRA.

Purpose

Data Catalog can ingest and represent metadata of specific data sources as assets and characteristics, including diagrams.

Collibra Connect is meant as an advanced interface between Collibra and data sources of any third-party vendors.

Processes

  • Metadata ingestion
  • Profiling and data type detection
  • Read only
  • Bidirectional synchronization of metadata
  • No profiling
  • Read and write

Integrations

  • JDBC-supported databases such as PostgreSQL and IBM DB2.
  • File-based databases in Excel and CSV.
  • External systems such as Tableau and Amazon S3.

Any system with:

  • API support
  • Structured metadata format such as XML and JSON

Ingestion

Predefined metamodel and ingestion logic

Flexible and configurable metamodel and ingestion logic

Usability

  • Usable via Collibra
  • Business user friendly
  • Configuration via IDE
  • Requires development skills to set up

More information