Delete the technical lineage of a data source
You can delete the technical lineage of a data source if you no longer want to see it in the technical lineage graph. To delete the technical lineage of the data source, take any of the following actions:
- If you use technical lineage via Edge, update the capability by clearing the Active check box for the data source and synchronize the technical lineage again.
- If you use the lineage harvester, remove the configuration of the data source from the lineage harvester configuration file and use the
ignore-source
command to exclude the data source when you synchronize the technical lineage again.
- When delete the technical lineage of a data source by using the lineage harvester, ensure that you have at least one source in your lineage harvester configuration file.
-
If you want to use technical lineage via Edge together with the lineage harvester, ensure that you use the lineage harvester version 2023.04 or newer.
- Delete the technical lineage of a data source on Edge
- Delete the technical lineage of a data source by using the lineage harvester
Steps
-
Open an Edge site.
-
On the main toolbar, click
, and then click
Settings.
The Collibra settings page opens. -
In the tab pane, click Edge.
The Sites tab opens and shows a table with an overview of the Edge sites. - In the Edge site overview, click the name of the Edge site where you created the technical lineage capability for the data source.
The Edge site page appears.
-
On the main toolbar, click
, and then click
Settings.
- In the Capabilities section, locate and click the technical lineage capability that you added for the data source when you created the technical lineage.
- Clear the Active check box.
- Click Save.
The capability is updated.
- Synchronize the technical lineage capability for the data source.
The data source is marked as ignored internally and will be excluded when the technical lineage is synchronized again.
- Synchronize your technical lineage by taking any of the following actions:
- On Edge, synchronize the technical lineage capability for any of your data sources that are active.
- If you want to synchronize by using the lineage harvester, run any of the following commands:
- The
sync
command:- For Windows:
.\bin\lineage-harvester.bat sync
- For other operating systems:
./bin/lineage-harvester sync
- For Windows:
- The
full-sync
command:- For Windows:
.\bin\lineage-harvester.bat full-sync
- For other operating systems:
./bin/lineage-harvester full-sync
- For Windows:
For more information, go to Lineage harvesting app command options and arguments.
When synchronization is complete, the technical lineage of the data source is deleted.
What's next?
If you want to delete the technical lineage capability for the data source, ensure that the technical lineage of the data source is removed successfully after synchronization. For more information, go to Delete an Edge capability from an Edge site.
You can view a summary of the results from the Activities list to see whether the technical lineage is synchronized successfully.
If the synchronization fails or completes with errors, you can use the technical lineage via Edge common error messages in Collibra Support Portal to fix the errors.
Before you begin
Install the lineage harvester 2023.04 or newer.
Steps
-
Optional: To determine the data source that you want to exclude from the Technical lineage, enter the
list-sources
command:- For Windows:
.\bin\lineage-harvester.bat list-sources
- For other operating systems:
./bin/lineage-harvester list-sources
All data sources that were used to create the technical lineage are listed. The list also includes the source ID of each data source. You can use the list to identify the data source to be excluded. - For Windows:
- In the lineage harvester folder, open your lineage harvester configuration file.
- Delete the section with connection properties of the data source.
- Save the configuration file.
- Start the lineage harvester in the console and run the following command to ignore the data source:
- For Windows:
.\bin\lineage-harvester.bat ignore-source <source_ID>
, where<source_id>
is the ID of the data source that you want to ignore. - For other operating systems:
./bin/lineage-harvester ignore-source <source_ID>
, where<source_id>
is the ID of the data source that you want to ignore.
The data source is excluded from the list of data sources that are used to create the technical lineage. - For Windows:
- Synchronize the technical lineage by running any of the following commands:
- The
sync
command:- For Windows:
.\bin\lineage-harvester.bat sync
- For other operating systems:
./bin/lineage-harvester sync
- For Windows:
- The
full-sync
command:- For Windows:
.\bin\lineage-harvester.bat full-sync
- For other operating systems:
./bin/lineage-harvester full-sync
- For Windows:
For more information, go to Lineage harvesting app command options and arguments.
- The
- When prompted, enter the password to connect to your Collibra Data Intelligence Platform and data sources in the configuration file.
What's next
You can view the technical lineage. For more information, go to Technical lineage viewer.
You can check the progress of the technical lineage creation in Activities in your Collibra Data Intelligence Platform environment. The Results field indicates how many relations were imported into Data Catalog. Go to the status page to see the log files of the SQL analysis.
If the lineage harvester log shows an error message or the harvesting process fails, you can use the technical lineage common errors and issues in Collibra Support Portal to fix the error.