About the lineage harvester installation
You use the lineage harvester to collect source code from your data sources and create new relations between data elements from your data source and existing assets in Data Catalog.
The lineage harvester runs close to the data source and can harvest transformation logic, such as SQL scripts and ETL scripts, from a specific location, for example, a database table or a folder on a file system.
Note Collibra Data Lineage is a cloud-only feature.
Requirements
| Type | Requirements |
|---|---|
| Software |
Minimum requirements:
Recommended requirements:
|
| Hardware |
Minimum requirements:
Recommended requirements:
|
| Network |
Firewall rules so that the lineage harvester can connect to:
|
Note The lineage harvester uses port 443.
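As a rough way to confirm that a firewall rule is in place, the following sketch tries to open a TCP connection on port 443. It is an illustration only: the function name check_port and the host harvester-endpoint.example.com are hypothetical placeholders, and the sketch assumes bash and the GNU coreutils timeout command are available on the machine that runs the lineage harvester.

```shell
#!/usr/bin/env bash
# check_port HOST [PORT]
# Succeeds if a TCP connection to HOST:PORT opens within 5 seconds.
# PORT defaults to 443, the port the lineage harvester uses.
# Assumption: bash (for /dev/tcp) and GNU coreutils timeout are installed.
check_port() {
    local host="$1"
    local port="${2:-443}"
    # Host and port are passed as positional arguments to the inner shell
    # so that they are never interpreted as shell code.
    timeout 5 bash -c 'exec 3<>"/dev/tcp/$1/$2"' _ "$host" "$port" 2>/dev/null
}

# Example call. The host below is a placeholder, not a real endpoint;
# replace it with a host your firewall rules must allow.
# if check_port harvester-endpoint.example.com; then
#     echo "port 443 reachable"
# else
#     echo "port 443 blocked or host unreachable"
# fi
```

A successful connection only shows that the port is open from this machine; it does not verify credentials or that the remote service is the one you expect.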
Installing the lineage harvester
If you purchased Collibra Data Lineage, you can access the lineage harvester on the downloads page. To install the lineage harvester, do the following:
- Download the lineage harvester.
- Unzip the archive. You can now access the lineage harvester folder.
- Run the following command line to start the lineage harvester:
- Windows:
.\bin\lineage-harvester.bat
- For other operating systems:
chmod +x bin/lineage-harvester and then bin/lineage-harvester
An empty configuration file is created in the config folder.
- Windows:
./bin/lineage-harvester --help
Note We highly recommend that you always install and use the latest available lineage harvester.
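For operating systems other than Windows, the start step above can be wrapped in a small shell function. This is a minimal sketch, not part of the product: the function name run_harvester and the variable HARVESTER_DIR are hypothetical, and the sketch assumes you run the launcher from the root of the unzipped lineage harvester folder, as the documented commands do.

```shell
#!/bin/sh
# run_harvester DIR [ARGS...]
# Makes DIR/bin/lineage-harvester executable and runs it with ARGS.
# DIR is the unzipped lineage harvester folder (hypothetical parameter).
run_harvester() {
    harvester_dir="$1"
    shift
    launcher="$harvester_dir/bin/lineage-harvester"

    # Stop early if the archive was not unzipped where expected.
    if [ ! -f "$launcher" ]; then
        echo "launcher not found: $launcher" >&2
        return 1
    fi

    chmod +x "$launcher" || return 1

    # Run in a subshell from the folder root so the caller's working
    # directory is left unchanged.
    (cd "$harvester_dir" && ./bin/lineage-harvester "$@")
}

# Example calls, assuming HARVESTER_DIR points at the unzipped folder:
# run_harvester "$HARVESTER_DIR"
# run_harvester "$HARVESTER_DIR" --help
```

Wrapping the two commands in one function keeps the chmod step from being skipped after each new download of the lineage harvester.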