Delete the technical lineage of a data source

You can delete the technical lineage of a data source if you no longer want to see it in the technical lineage graph. You can do so via Edge or the CLI lineage harvester.

Via Edge

Important 

In Collibra 2024.05, we launched a new user interface (UI) for Collibra Data Intelligence Platform! You can learn more about this latest UI in the UI overview.

Use the following options to see the documentation in the latest UI or in the previous, classic UI:

  1. Ensure that the Active checkbox in the relevant technical lineage Edge capability is cleared.
    1.  Open an Edge site.
      1. On the main toolbar, click Products icon, and then click Cogwheel icon Settings.
        The Collibra settings page opens.
      2. In the tab pane, click Edge.
        The Sites tab opens and shows a table with an overview of the Edge sites.
      3. In the Edge site overview, click the name of the Edge site to which you added the technical lineage capability for the data source.
        The Edge site page appears.
    2. In the Capabilities section, locate and click the technical lineage capability for the data source you want to delete.
    3. Clear the Active checkbox.
      Important The Active option is deprecated and will be removed in a future version of Collibra.
    4. Click Save.
      The capability is updated.
  2.  Run the Ignore source option in Data Catalog.
    1. On the main toolbar, click Products icon, and then click Catalog.
      The Data Catalog Home opens.
    2. In the Register with Edge section of the Create dialog box, click Register a data source.
      The Register content page opens.
    3. In the tab bar, click Integrations.
      The Integrations page opens.
    4. Click the Integration Configuration tab.
    5. Find the connection that you used when you added the technical lineage capability, and click the link in the Data sources/CapabilitiesCapabilities column. If multiple capabilities exist for the connection, expand them to find your capability.
      The capability configuration page opens.
    6. On the Configuration SectionSynchronize Configuration section, click Edit Configuration.
    7. In the Admin command drop-down list, select Ignore sources.
    8. In the Sources drop-down list, select the source or sources you want excluded from the technical lineage.
    9. Click Save.
    10. In the Configuration SectionSynchronize Configuration section, click Synchronize nowSynchronize.
      When synchronization is complete, the technical lineage of the data source is deleted.

View the synchronization results

  1. Open the Activities list.
  2. In the row containing the job, click Result.
    The Synchronization Results dialog box appears.

Via the CLI lineage harvester

Note If you use the CLI lineage harvester, you need to ensure that least one data source is configured in your lineage harvester configuration file.
  1. Optional: To determine the data source that you want to exclude from the Technical lineage, enter the list-sources command:

    • For Windows: .\bin\lineage-harvester.bat list-sources
    • For other operating systems: ./bin/lineage-harvester list-sources
    All data sources that were used to create the technical lineage are listed. The list also includes the source ID of each data source. You can use the list to identify the data source to be excluded.
  2. In the lineage harvester folder, open your lineage harvester configuration file.
  3. Delete the section with connection properties of the data source.
  4. Save the configuration file.
  5. Start the lineage harvester in the console and run the following command to ignore the data source:
    • For Windows: .\bin\lineage-harvester.bat ignore-source <source_ID>, where <source_id> is the ID of the data source that you want to ignore.
    • For other operating systems: ./bin/lineage-harvester ignore-source <source_ID>, where <source_id> is the ID of the data source that you want to ignore.
    The data source is excluded from the list of data sources that are used to create the technical lineage.
  6. Synchronize the technical lineage by running any of the following commands:
    • The sync command:
      • For Windows: .\bin\lineage-harvester.bat sync
      • For other operating systems: ./bin/lineage-harvester sync
    • The full-sync command:
      • For Windows: .\bin\lineage-harvester.bat full-sync
      • For other operating systems: ./bin/lineage-harvester full-sync

    For more information, go to Lineage harvesting app command options and arguments.

  7. When prompted, enter the password to connect to your Collibra Data Intelligence Platform and data sources in the configuration file.
  8. The lineage harvester uploads the metadata of the remaining data sources in the configuration file to the Collibra Data Lineage service.
    The Collibra Data Lineage service synchronizes the technical lineage and removes the deleted data source from the technical lineage graph.

What's next

You can view the technical lineage. For more information, go to Technical lineage viewer.

You can check the progress of the technical lineage creation in Activities in your Collibra Data Intelligence Platform environment. The Results field indicates how many relations were imported into Data Catalog. Go to the status page to see the log files of the SQL analysis.

If the lineage harvester log shows an error message or the harvesting process fails, you can use the technical lineage common errors and issues in Collibra Support Portal to fix the error.