Migrate from CLI lineage harvester to Edge

The CLI lineage harvester is deprecated and reaches its End of Life on July 31, 2026. You need to migrate to technical lineage via Edge before this date.

This topic provides a workflow for migrating source IDs, configuring Edge capabilities, and resolving critical "useCollibraSystemName" configuration mismatches.

Required permissions

Migration steps

  1. Identify the source ID for the data source you want to migrate:
    1. Open your existing lineage harvester configuration file.
    2. Take note of the value of the id property for the data source you want to migrate. For example, id: "marketing_snowflake_prod".
  2. Remove the entire section for that data source from the lineage harvester configuration file. For example:
    {
    	"dialect" : "oracle",
    	"id" : "informatica_source",
    	"type" : "ExternalDirectory",
    	"dirType" : "powercenter",
    	"path" : "/path/to/the/informatica/folder/",
    	"mask" : "*",
    	"recursive" : false,
    	"deleteRawMetadataAfterProcessing": false
    }

    If you don't remove it, and you configure Edge with the same source ID, you risk a condition where both the CLI harvester and Edge attempt to update the same lineage batch.
  3. Configure the Edge capability:
    1. Open a site.
      1. On the main toolbar, click Products iconCogwheel icon Settings.
        The Settings page opens.
      2. In the tab pane, click Edge.
        The Sites tab opens and shows a table with an overview of your sites.
      3. In the table, click the name of the site whose status is Healthy.
        The site page opens.
    2. In the Capabilities section, click Add capability.
      The Add capability page appears.
    3. Select the relevant capability template for the data source you want to migrate, for example Technical Lineage for Oracle.
    4. Enter the necessary information in the capability. Pay particular attention to the following fields:
    5. FieldDescription
      Source ID

      Enter the source ID you noted in step 1. In our example: marketing_snowflake_prod.

      Collibra System Name

      Enter the name of your System asset in Data Catalog. For successful stitching, the names must exactly match.

      Important Be sure to read the following section The "useCollibraSystemName" trap.

      Processing Level

      Select Analyze to test the connectivity, or Sync to immediately synchronize the lineage.

    6. Click Create.
      The capability is added to the Edge site.
      The fields become read-only.
  4. Synchronize the technical lineage.

Cleanup and deletion

If necessary, remove stale source IDs that cause "useCollibraSystemName" errors or persist as gray nodes in the lineage viewer. This is the most critical pain point for users.

The "useCollibraSystemName" trap

A common source of synchronization failure is "CPSH system name" setting mismatches. This happens if the value of the useCollibraSystemName property in your lineage harvester configuration file does not match with the Edge setting in Collibra Console.

Specifically:

  • In your lineage harvester configuration file: the useCollibraSystemName property must be set to true or false.
  • In Collibra Console, the Collibra system name must match the value of the useCollibraSystemName properties in your lineage harvester configuration file, either True or False.
    For information on this Collibra Console setting, go to Enable technical lineage via Edge.

The useCollibraSystemName property in your lineage harvester configuration file and the Collibra system name setting in Collibra Console determine whether the System name is included in the full path - and therefore the full names - of assets. If the property and field values do not match, stitching cannot be achieved.

Important If you change the values either in your lineage harvester configuration file or in Collibra Console to avoid a mismatch, you must remove the old source ID (as described in the following options A, B, or C) to clear the cached configuration. Then synchronize as a fresh data source.

Option A: Remove a data source using the technical lineage admin options via Edge

For more information about the Technical Lineage Admin Edge connection and capability mentioned in this procedure, go to Technical lineage admin options.

  1. Run the "Sync" option in Data Catalog, for any other data source.
    Note This step ensures that the technical lineage is regenerated. You don't have to create a new Edge connection or add another capability. For the data source that you sync, ensure that:
    • The Active option is selected in the Edge capability.
    • The Processing Level setting is set to Sync.
    After a successful synchronization, the deleted data source is removed from the technical lineage graph.

Option B: Remove a data source via the CLI lineage harvester

Occasionally, the "list and ignore sources" option (described in Option A above) doesn't return sources that were originally created via the CLI lineage harvester. In this case, you can use the CLI harvester to perform the clean up.

  1. Run the following "list sources" command:
    • For Windows: .\bin\lineage-harvester.bat list-sources
    • For other operating systems: ./bin/lineage-harvester list-sources
  2. Run the following "ignore sources" command:
    • For Windows: .\bin\lineage-harvester.bat ignore-source <source_ID>
    • For other operating systems: ./bin/lineage-harvester ignore-sources <source_ID>
  3. Run the following synchronization command:
    • For Windows: .\bin\lineage-harvester.bat sync
    • For other operating systems: .\bin\lineage-harvester sync

What's next

View the technical lineage graph.

You can check the progress of the technical lineage creation in Activities in your Collibra Platform Self-Hosted environment. The Results field indicates how many relations were imported into Data Catalog. Go to the status page to see the log files of the SQL analysis.

If the lineage harvester log shows an error message or the harvesting process fails, you can use the technical lineage common errors and issues in Collibra Support Portal to fix the error.

For technical lineage via Edge, if the synchronization fails or completes with an error message, you can use the technical lineage via Edge common error messages in Support Portal to fix the error.