Lineage harvesting app command options and arguments
After creating a configuration file, you can use the lineage harvester to perform specific actions with the data sources that are defined in your configuration file.
Tip: If you run the lineage harvester from the command line, you see an overview of the possible command options and arguments. If the lineage harvester process fails, you can use the technical lineage troubleshooting guide to fix the issue.
Typical command options and arguments
The following table shows the most commonly used command options and arguments.
| Command | Description |
|---|---|
| `full-sync` | Uploads all your data sources to the Collibra Data Lineage server, where the data source metadata is processed and uploaded to Data Catalog. |
| `-s "<ID of data source>"` | Uploads only the data source with the specified ID. This lets you process a newly added data source, or refresh one data source in the configuration file, without refreshing the others. Because only the specified data sources are uploaded, the upload takes less time. To process multiple data sources, add `-s "ID of another data source"` to the command once per data source. |
| `--no-matching` | Uploads a technical lineage without stitching the data objects in your technical lineage to the corresponding Column and Table assets in Data Catalog. Note: As a result, you cannot see the technical lineage of a specific Table or Column asset, but you can still browse the full technical lineage. |
| | Downloads all your data sources to the lineage harvester output folder, as a separate ZIP file per data source. |
| `-s <ID of data source>` | Downloads only the data source with the specified ID. Note: You can use this argument multiple times to include multiple data sources. |
| | Provides the passwords of your Collibra Data Intelligence Cloud instance and of the data sources in your configuration file to the lineage harvester, without storing the passwords in the lineage harvester folder. |
| | Checks the connectivity to the Collibra Data Lineage server and to Data Catalog. The logs also show the IP addresses of the Collibra Data Lineage servers that you have to whitelist. This command is mostly used for troubleshooting. |
| `--timeout <seconds>` | Determines the network timeout. |
| | Shows an overview of all supported command options and arguments that you can use in the lineage harvester. |
| | Shows the version of the lineage harvester that you are using. |
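
As an illustration of how the `full-sync` command combines with repeated `-s` arguments, the following sketch only assembles and prints the command line; it does not run the lineage harvester. The launcher path `./bin/lineage-harvester`, the helper name `build_full_sync`, and the data source IDs `sales_dwh` and `hr_oracle` are assumptions made for this example, so replace them with the values from your own installation and configuration file.

```shell
#!/bin/sh
# Assumed launcher path (hypothetical); adjust to your lineage harvester folder.
HARVESTER="./bin/lineage-harvester"

# Hypothetical helper: prints a full-sync command line with one
# -s "<ID>" pair per data source ID passed as an argument.
# With no IDs, the printed command uploads all data sources.
build_full_sync() {
    cmd="$HARVESTER full-sync"
    for id in "$@"; do
        cmd="$cmd -s \"$id\""
    done
    printf '%s\n' "$cmd"
}

# Upload every data source in the configuration file.
build_full_sync
# prints: ./bin/lineage-harvester full-sync

# Upload only two data sources (example IDs, not real ones).
build_full_sync "sales_dwh" "hr_oracle"
# prints: ./bin/lineage-harvester full-sync -s "sales_dwh" -s "hr_oracle"
```

Printing the command first makes it easy to review which data sources will be refreshed before you paste the line into a terminal.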