Catalog basics

The Data Catalog application in Collibra is a catalog of metadata that helps the business and data stewards discover, describe, assemble and govern data sets, to improve trust in analytics based on those data sets.

In Data Catalog, you can integrate metadata from multiple data sources: databases, data lakes, warehouses, enterprise applications, ETL tools, and BI solutions. This metadata provides information such as the format of the data, the structure of the data, and when assets were created.
Data Catalog also allows you to enrich the integrated metadata by adding profiling information, defining the data class, showing sample data, and linking the metadata to the business context.
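
To make the difference between integrated and enriched metadata concrete, the following is a minimal Python sketch of what a single catalog entry might hold. It is not Collibra's data model or API: the class name `ColumnMetadata`, its field names, and the sample values are all assumptions chosen for illustration.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Optional


@dataclass
class ColumnMetadata:
    """Hypothetical catalog entry for one column; not a Collibra type."""

    # Integrated metadata: the format, structure, and creation date.
    name: str
    data_type: str
    table: str
    created_on: date
    # Enrichment that is added afterwards in the catalog.
    data_class: Optional[str] = None
    profiling: dict = field(default_factory=dict)
    sample_values: list = field(default_factory=list)
    business_terms: list = field(default_factory=list)

    def is_enriched(self) -> bool:
        """Return True once any enrichment has been attached."""
        return bool(
            self.data_class
            or self.profiling
            or self.sample_values
            or self.business_terms
        )


# Made-up example values.
column = ColumnMetadata("email", "VARCHAR(255)", "customers", date(2024, 5, 1))
print(column.is_enriched())  # only integrated metadata so far

column.data_class = "Email address"
column.profiling = {"row_count": 1200, "empty_count": 3}
column.business_terms.append("Customer Contact")
print(column.is_enriched())  # enrichment has now been added
```

The first four fields correspond to what an integration would bring in from the source system, while the optional fields stand in for the profiling information, data class, sample data, and business context described above.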

The overarching aim of Data Catalog is to create and maintain an inventory of an organization’s data assets across its entire digital landscape, so that data consumers can find and trust those assets more easily and use them to drive insightful business decisions.

Data Catalog submenu pages

Important 

In Collibra 2024.05, we launched a new user interface (UI) for Collibra Data Intelligence Platform! You can learn more about this latest UI in the UI overview.

The following table describes each of the submenu items of the Catalog application.

Page                  Description
Catalog
Catalog Home          The landing page for Data Catalog. This page is designed to help you quickly and easily find Data Catalog-related assets.
Reports               All report assets.
Data Sets             All Data Set assets shown as a set of tiles or as a table, with their name, description and, if there are any, connections to existing assets in Collibra.
Data Sources          Data sources that are used for data source registrations.
Data Dictionary       All data assets in Collibra.
Technology Assets     All technology assets in Collibra.
Metrics               Contains a variety of statistics related to how the assets of Catalog are used.
Access Requests       The history of your access requests and their status.
Advanced Data Types   All advanced data types, which are used during a data source registration.
Integrations          Allows you to register a data source. This page contains two tabs.
                      The Data Source Registration tab allows you to create a Database or File System asset from which you can start the synchronization of a data source. Use this tab for JDBC, S3, GCS, and ADLS integrations.
                      The Integration Configuration tab allows you to configure all other Metadata, ETL, and BI Integrations and start the synchronization. For example, Synchronize Databricks Unity Catalog, or Create a technical lineage via Edge.
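
The split between the two tabs of the Integrations page can be summarized as a simple lookup. The Python sketch below only restates the rule given above. The function `tab_for_integration` and the set name are hypothetical helpers for illustration and are not part of any Collibra API.

```python
# Integration types that the text assigns to the Data Source Registration tab.
DATA_SOURCE_REGISTRATION_TYPES = {"JDBC", "S3", "GCS", "ADLS"}


def tab_for_integration(integration_type: str) -> str:
    """Return the Integrations page tab for an integration type (hypothetical helper)."""
    if integration_type.strip().upper() in DATA_SOURCE_REGISTRATION_TYPES:
        return "Data Source Registration"
    # All other metadata, ETL, and BI integrations use the second tab.
    return "Integration Configuration"


print(tab_for_integration("s3"))
print(tab_for_integration("Databricks Unity Catalog"))
```

The comparison ignores case and surrounding whitespace, so `"s3"` and `" S3 "` resolve to the same tab.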