Technical lineage admin options

Note To use AI Governance, the latest Collibra user interface must be enabled for your Collibra environment.

Until recently, the options described in this topic were available only via the CLI lineage harvester. For reference, go to Lineage harvesting app command options and arguments.

In this topic

Admin options and descriptions

Option Description
List sources

Use this option to list all of the data sources that will be used to create a technical lineage.

The results include the following details for each data source: Source ID <ID of data source> (from edge: false|true) (useSystemName: false|true).

Source ID <ID of data source>
The source ID of your data source.
from edge: false|true
Indicates that the data source is ingested via Edge.
useSystemName: false|true
Indicates whether Collibra Data Lineage uses the system or server name of the data source to match the System asset in Data Catalog. If the value is true, the system or server name of the data source is used. If the value is false, the system or server name of the data source is not used.
The value of useSystemName is taken from the Collibra system name field for the data source on Edge.
Example Source ID 1redshift (from edge: true) (useSystemName: false) indicates that the data source with the 1redshift source ID was ingested, and that the system name of the data source is not used to match the System asset in Data Catalog.

Note No data sources are synchronized when you run this option.

Ignore sources

Use this option to exclude specified data sources from those that will be used to create the technical lineage. The next time you synchronize a technical lineage capability via Edge, the specified data source is ignored.

Note No data sources are synchronized when you run this option.

Important For each source that you want to ignore, ensure that the Active checkbox in the respective technical lineageEdge capability is cleared. For more information, go to Delete the of a data source on Edge.

Analyze files

Use this option to analyze specified batches (ZIP files) of metadata on the Collibra Data Lineage service instance.

Tip When using the Load option in the Processing Level setting in your Edge capacity, the job result includes a link to download a ZIP file with the metadata. The link resembles the following: https://integrations.collibra-abc.com/rest/2.0/files/01944f12-7665-7d9c-8bc5-aa426b6a63cc. Take note of the file ID, in this example: 01944f12-7665-7d9c-8bc5-aa426b6a63cc.

After you've inspected the metadata, use the "Analyze files" option to send the ZIP file for analysis. Alternatively, you can upload the ZIP file using the POST /files API. In either case, you need to specify the file ID.

No data sources are synchronized when you run this option.

Sync

Use this option to trigger the synchronization of the metadata, for all data sources. When you use this option, a new Main batch of metadata is created from the latest batches on the Collibra Data Lineage service instance, for all data sources, and synchronized with the corresponding assets in Data Catalog.

The Sources tab page shows the transformation details or source code that was analyzed and the results of the analysis.

Important  For information on the importance of the Processing Level setting and the Analyze option in the technical lineage Edge capabilities for your respective data sources:

Configure and run an admin job

Prerequisites

  • You have created an Edge connection and added an Edge capability for each of the data sources you want to include in the technical lineage. For complete information, go to Create a technical lineage via Edge.

Steps