Automatic stitching for technical lineage

Stitching is a process that creates relations between assets and data objects representing the same data source. More specifically, stitching creates relations between:

  • the assets that were created when you prepared Data Catalog's physical data layer for a data source; and
  • the data objects in the same data source for which you created a technical lineage and that represent the assets in Data Catalog.

When the data sources are scanned, the Collibra Data Lineage server automatically creates and pushes new relations of the type "Data Element creates / targets Data Element":

  • Between data objects in your data source and assets from registered data sources.
  • Between ingested assets from BI sources and Data Catalog assets from registered data sources.

Note If you don't prepare the Data Catalog physical data layer, Data Catalog creates a technical lineage without stitching. As a result, when you click the Technical lineage tab on any Column, Table, Power BI Column or Looker Look asset page, you get the message "The current asset doesn't have a technical lineage yet." However, you can still use the Browse tab pane to view the technical lineage of data objects in the data sources for which you created the technical lineage.

Stitching issues

To stitch assets in Data Catalog to data objects collected by the lineage harvester, the Collibra Data Lineage server compares the full path of the assets in Data Catalog with the full path of the data objects in your data source. The full path has the following structure: (system) > database > schema > table > column. If the full paths match, Collibra Data Lineage automatically stitches the data objects to the existing assets in Data Catalog. To indicate this, the assets have a yellow background in the technical lineage graph.
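
The matching rule can be illustrated with a minimal Python sketch. This is not Collibra's implementation: the function names full_path and classify are hypothetical, and the exact, case-sensitive string comparison is an assumption made only for illustration. The sketch builds a full path from its parts and labels each harvested data object as stitched or unstitched, depending on whether an asset with the same full path exists.

```python
from typing import Dict, Iterable, Optional

# Separator copied from the notation in the text: (system) > database > schema > table > column
SEPARATOR = " > "


def full_path(database: str, schema: str, table: str, column: str,
              system: Optional[str] = None) -> str:
    """Build a full path string; the system part is optional (hypothetical helper)."""
    parts = [system] if system else []
    parts += [database, schema, table, column]
    return SEPARATOR.join(parts)


def classify(data_object_paths: Iterable[str],
             catalog_asset_paths: Iterable[str]) -> Dict[str, str]:
    """Label each data object path as 'stitched' or 'unstitched'.

    Assumption: a data object is stitched only when an asset has exactly
    the same full path (exact, case-sensitive comparison).
    """
    catalog = set(catalog_asset_paths)
    return {
        path: ("stitched" if path in catalog else "unstitched")
        for path in data_object_paths
    }


if __name__ == "__main__":
    catalog = [full_path("sales_db", "public", "orders", "id")]
    harvested = [
        full_path("sales_db", "public", "orders", "id"),
        full_path("sales_db", "Public", "orders", "total"),
    ]
    for path, status in classify(harvested, catalog).items():
        print(f"{status:10} {path}")
```

In this sketch, the second data object stays unstitched because no asset has the same full path.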

If the full path of an asset in Data Catalog does not match the full path of a data object in your data source, Collibra Data Lineage cannot stitch them. To indicate this, the data objects have a gray background in your technical lineage graph. To fix stitching issues, you must check the full path of the assets in Data Catalog and make sure they match the full path of the data objects that are shown in the technical lineage graph. If you change the full path, make sure to run the lineage harvester again.
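
When comparing two full paths by hand, it helps to find the first level at which they differ. The following Python sketch shows one way to do that. The function first_mismatch and the sample names are hypothetical, the paths are assumed to be given as four segments without the optional system part, and the comparison is assumed to be exact. It is an illustration, not a Collibra tool.

```python
from itertools import zip_longest
from typing import Optional, Sequence, Tuple

# Level names taken from the full path structure: database > schema > table > column
LEVELS = ["database", "schema", "table", "column"]

Mismatch = Tuple[Optional[str], Optional[str], Optional[str]]


def first_mismatch(asset_path: Sequence[str],
                   object_path: Sequence[str]) -> Optional[Mismatch]:
    """Return (level, asset_value, object_value) for the first differing
    segment, or None when the two paths are identical.

    A missing segment is reported as None; the level is None for any
    segment beyond the four named levels.
    """
    for level, asset_part, object_part in zip_longest(LEVELS, asset_path, object_path):
        if asset_part != object_part:
            return (level, asset_part, object_part)
    return None


if __name__ == "__main__":
    asset = ("sales_db", "public", "orders", "id")      # path of the asset in Data Catalog
    harvested = ("sales_db", "Public", "orders", "id")  # path shown in the lineage graph
    print(first_mismatch(asset, harvested))
```

Here the paths first differ at the schema level, which is the part of the asset's full path you would check in Data Catalog.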

Warning We do not support stitching for Looker assets. We do support stitching for Power BI assets, but the stitched assets still have a gray background. This is a known issue.

Tip You can use the Stitching tab page to easily find the full path of assets in Data Catalog and data objects that were collected by the lineage harvester. The Stitching tab page also shows an overview of all assets and data objects that are stitched successfully.