About Collibra Data Lineage

Important Use the following options to see the documentation for the latest or classic technical lineage viewer UI:

Collibra Data Lineage is a cloud-only product that maps the entire data lifecycle, allowing you to visualize and audit the flow of data from source systems to downstream target systems. It is designed to help you establish trust in your reports and use the data to make sound business decisions.

Collibra Data Lineage consists of both technical lineage and business lineage. The value of technical lineage and business lineage are the same, but they are designed for different audiences. The main difference is that:

  • Technical lineage identifies data objects in your external data sources.
  • Business lineage show assets in Collibra that represent some or all of those data objects.

For a complete list of supported data sources, go to Supported data sources for technical lineage. If you want to create a technical lineage for a data source that is not currently supported, you can create a Custom technical lineage.

Be sure to check out the following courses In Collibra University:

Technical lineage

Technical lineage is designed for Data Engineers, Data Architects, and other technical stewards. It is a detailed lineage graph that provides complete end-to-end lineage, to visualize the journey of the data objects, including temporary tables and columns, in your external data sources. It includes all source code and data transformation details, so that you can identify in which system data objects are used and how they are transformed from data source to data source.

Tip We use the term "data objects" when referring to columns and tables in your external data sources. We use the term "assets" (specifically Column assets and Table assets) when referring to the representation of data objects in Collibra.

Asset types

You can view a technical lineage for the following asset types:

  • Table
  • Column
  • Looker Look
  • MicroStrategy Report
  • MicroStrategy Dossier
  • MicroStrategy Data Attribute
  • Power BI Report
  • Power BI Table
  • Power BI Column
  • SSRS Report
  • SSRS Table
  • SSRS Column
  • Tableau Worksheet
  • Tableau Data Attribute

After creating a technical lineage, you can view it by clicking the Technical lineage tab on an eligible asset page.

Note The Technical lineage tab is only shown if you have a global role with the following global permissions:
  • Catalog
  • Technical Lineage

Data objects

There are two types of data objects in your technical lineage:

  • Data objects from your data source that are stitched to assets in Data Catalog and for which you created the technical lineage. The successful stitching of data objects and their corresponding assets in Data Catalog is denoted by the yellow icons in the technical lineage.
    Example 

  • Other objects, such as temporary tables and columns, that are collected from your data sources but are not stitched to assets in Data Catalog. These objects have gray icons. For example, the objects in node 1 in the following technical lineage graph are not stitched, meaning they don't have corresponding assets in Data Catalog.
    Example 

  • Data objects from your data source that are stitched to assets in Data Catalog and for which you created the technical lineage. The successful stitching of data objects and their corresponding assets in Data Catalog is denoted by the yellow background in the technical lineage.
    Example 

  • Other objects, such as temporary tables and columns, that are collected from your data sources but are not stitched to assets in Data Catalog. These objects have a gray background. For example, the objects in node 1 in the following technical lineage graph are not stitched, meaning they don't have corresponding assets in Data Catalog.
    Example 

Example technical lineage

Let's say that you have created a technical lineage for four different databases:

  • The first database, Oracle, is not registered in Collibra, therefore there are no assets in Data Catalog that represent the Oracle data objects.
  • The second database, Raw, is registered in Collibra.
    • The yellow background of the first node indicates that Table and Column assets that were created in Data Catalog are stitched to their corresponding data objects in the Raw database.
    • The other node, the one with the gray background, is a temporary table. No assets are created for temporary data objects and so stitching is not relevant. That is why the node has a gray background.
  • The third and fourth databases, Refined and Consumption, are ingested in Collibra. The assets that were created in Data Catalog are stitched to their corresponding data objects in the two databases.

What we want to point out here is that Technical lineage shows the data flow of all data objects across all four databases, regardless of any assets in Collibra.

The corresponding business lineage shows only the relations between data objects that have corresponding assets in Data Catalog. In the following image, we see the data flow of assets from the second database, to the third, to the fourth. The first database, Oracle, which is not registered in Collibra, and , is not shown on the diagram.

Business lineage

Business lineage is designed for analysts, governance roles, and other business stewards. A business lineage shows the relations between assets in Data Catalog that represent the data objects in your external data sources. More specifically, it is a diagram that includes relations of the type "Data Element sources / targets Data Element":

  • Between Column assets of registered data sources.
  • If you integrated one of the supported BI tools, between BI assets and assets of registered data sources.

Business lineage allows you to trace data flows between registered databases. As such, it provides a summary of a technical lineage.

The following image depicts an example business lineage. Notice the "Data Element sources / targets Data Element" relation between columns belonging to 3 different tables.

 image depicts an example business lineage. The "Data Element sources / targets Data Element" relation between columns in three tables.

Tip Be sure to check out the training course From business lineage to insight, in Collibra University.

Automatically created

Business lineage is automatically created as part of the technical lineage process.

During the lineage generation process, the Collibra Data Lineage service instance automatically pushes relations of the type "Data Element sources / targets Data Element" to the Collibra Platform.

BI tool integration

Business intelligence software helps organizations to collect data from the various data sources across their data ecosystem and present the data in interactive dashboards and reports, to facilitate decision-making and strategic planning.

When you integrate your BI tool in Collibra:

  • Metadata about the data objects in your external data sources is created as BI assets in Collibra.
  • Relations are created:
    • Between data objects (such as columns and tables) in your external data source and their corresponding assets in Data Catalog (such as Column and Table assets).
      Note These assets are created when the data source is registered. For supported BI tools, registration is automatically carried out during the technical lineage process.
    • Between BI assets (such as Tableau worksheets and Power BI reports) and their corresponding assets in Data Catalog (such as Tableau Worksheet and Power BI Report assets).
  • Both technical lineage and business lineage are automatically created.

Report views

Collibra Data Lineage enables you to find all ingested BI asset types in a single location.

In the Reports tab page in Data Catalog you can see an overview of all BI Report assets and their children. Optionally, you can create a view with a filter to only show, for example, Tableau assets. This is useful if you quickly want to see all reports or if you want find specific reports, for example certified reports or the most frequented reports.

Business value

Collibra Data Lineage has many important use cases. Here are a few.

How to create a technical lineage

There are two ways to create a technical lineage and business lineage:

Warning The CLI lineage harvester is now deprecated and will officially reach its end-of-life on July 31, 2026. To ensure a smooth transition, we encourage you to begin creating technical lineage via Edge, if you haven't already.

For details about the typical workflow, go to About technical lineage via Edge.

Summary of differences between technical and business lineage

Business lineage

Technical lineage

Allows Business Analysts and other business stewards to view relations between assets in Data Catalog that represent the data objects in external data sources.

Allows Data Engineers, Data Architects and similar personas to view the flow of data objects in external data sources.

Accessible via the Diagram tab on all asset pages.

Accessible via the Technical Lineage tab pane of all Column and Table assets, and some BI assets.

Shows relations of the type "Data Element targets / sources Data Element" between assets that exist in Data Catalog.

Warning During the ingestion process, relations of the type "Data Element targets / sources Data Element" are automatically created between certain assets. Any relations of this type that you manually create between assets will be deleted during the synchronization process. If you want to manually create such relations and ensure that they are maintained, you can create a custom technical lineage.

Shows relations of the type "Data Element targets / sources Data Element" between all data objects in the external data source.

Note Temporary tables and columns that the lineage scanner collected from your data sources, but that are not assets in Data Catalog are also included in the technical lineage.

Shows how assets in Collibra from registered data sources relate to each other.

Supported BI and ETL tools are automatically registered during the process of generating a technical lineage.

Shows how data objects from data sources for which you create a technical lineage relate to each other, regardless of whether the data source is registered.

Dependencies

A dependency is a data object that is targeted by another data object. This is represented by a relation of the type "Data Element targets / sources Data Element", where the dependency is the tail.

Direct dependency

A direct dependency is a data object that is the tail of a relation of the type "Data Element targets / sources Data Element.

Example If column A targets column B, then column B is the direct dependency of column A.

Indirect dependency

An indirect dependency is a data object that is the target of a direct or another indirect dependency.

Example Column A targets column B, which targets column C. This means that column A indirectly targets column C, so column C is the indirect dependency of column A.