Delete the technical lineage of a data source

Warning The lineage harvester is now deprecated and will officially reach its end-of-life on July 31, 2026. To ensure a smooth transition, we encourage you to begin creating technical lineage via Edge, if you haven't already.

You can delete the technical lineage of a data source if you no longer want to see it in the technical lineage graph. You can do so via Edge or the CLI lineage harvester (deprecated).

Via Edge

For more information about the Technical Lineage Admin Edge connection and capability mentioned in this procedure, go to Technical lineage admin options.

  1. Synchronize any other data source.
    After a successful synchronization, the deleted data source is removed from the technical lineage graph.

Via the CLI lineage harvester

Note If you use the CLI lineage harvester (deprecated), you need to ensure that least one data source is configured in your lineage harvester configuration file.

Configure your lineage harvester configuration, run the lineage harvester, and synchronize your technical lineage.

  1. Optional: To determine the data source that you want to exclude from the Technical lineage, enter the list-sources command:

    • For Windows: .\bin\lineage-harvester.bat list-sources
    • For other operating systems: ./bin/lineage-harvester list-sources
    All data sources that were used to create the technical lineage are listed. The list also includes the source ID of each data source. You can use the list to identify the data source to be excluded.
  2. In the lineage harvester folder, open your lineage harvester configuration file.
  3. Delete the section with connection properties of the data source.
  4. Save the configuration file.
  5. Start the lineage harvester in the console and run the following command to ignore the data source:
    • For Windows: .\bin\lineage-harvester.bat ignore-source <source_ID>, where <source_id> is the ID of the data source that you want to ignore.
    • For other operating systems: ./bin/lineage-harvester ignore-source <source_ID>, where <source_id> is the ID of the data source that you want to ignore.
    The data source is excluded from the list of data sources that are used to create the technical lineage.
  6. Synchronize the technical lineage by running any of the following commands:
    • The sync command:
      • For Windows: .\bin\lineage-harvester.bat sync
      • For other operating systems: ./bin/lineage-harvester sync
    • The full-sync command:
      • For Windows: .\bin\lineage-harvester.bat full-sync
      • For other operating systems: ./bin/lineage-harvester full-sync

    For more information, go to Lineage command options and arguments.

  7. When prompted, enter the password to connect to your Collibra Platform and data sources in the configuration file.
    The lineage harvester uploads the metadata of the remaining data sources in the configuration file to the Collibra Data Lineage service.
    The Collibra Data Lineage service synchronizes the technical lineage and removes the deleted data source from the technical lineage graph.
  8. View the synchronization results

    You can check the progress of the technical lineage creation in Activities in your Collibra Platform environment. The Results field indicates how many relations were imported into Data Catalog. Go to the status page to see the log files of the SQL analysis.

    If the lineage harvester log shows an error message or the harvesting process fails, you can use the technical lineage common errors and issues in Collibra Support Portal to fix the error.