Connecting to a proxy server

You can connect to a proxy server when you use the lineage harvester. Collibra Data Lineage supports proxy server connection and authentication.

Set the environment variable on Windows or set the system properties on other operating systems with the following parameters specified to connect to a proxy server. See the following steps for code examples.

  • -Dhttps.proxyHost
  • -Dhttps.proxyPort
  • -Dhttps.proxyUser
  • -Dhttps.proxyPassword
  • -Dhttp.nonProxyHosts

The -Dhttps.proxyUser and -Dhttps.proxyPassword parameters are optional.

On Windows

  1. Set the -D parameter to the JAVA_OPTS environment variable.
    Example 
    set JAVA_OPTS=-Dhttps.proxyHost="azusquid.imf.org" -Dhttps.proxyPort="8080" -Dhttps.proxyUser="myusername" -Dhttps.proxyPassword="mypassword"
  2. Run the lineage harvester in the same command line window: .\bin\lineage-harvester.bat

On other operating systems

  1. To access the hosts via a proxy server, run the following command: bin/lineage-harvester -Dhttps.proxyHost=<Hostname or IP address of the proxy> -Dhttps.proxyPort=<port number> -Dhttps.proxyUser=<username> -Dhttps.proxyPassword= <password> full-sync
    Example If you want to use a proxy with hostname proxy.example.com and port number 443, run the following command:
    bin/lineage-harvester -Dhttps.proxyHost=proxy.example.com -Dhttps.proxyPort=443 Dhttps.proxyUser=myusername -Dhttps.proxyPassword=mypassword
  2. To exclude hosts that should be accessed without going through the proxy server, add the following parameter: -Dhttp.nonProxyHosts=<host to exclude>.

    You can exclude multiple hosts by using the pipe character (|) to separate the hostnames or IP addresses to exclude. You can also use an asterisk (*) as a wildcard to match multiple hostnames or IP addresses.

    Example If you want to exclude hosts with hostname localhost and hosts with IP address 127.0.0.1 and all IP addresses starting with 192.168*, run the following command:
    bin/lineage-harvester -Dhttps.proxyHost=proxy.example.com -Dhttps.proxyPort=443 -Dhttp.nonProxyHosts=localhost|127.0.0.1|192.168*

Important  In your configuration file, the value of the source "url" or "hostname" property (depending on the data source), and the value in your -Dhttp.nonProxyHosts parameter, as described above, must both be either an IP address or a host name. You will get an error if, for example, you have a host name in the "hostname" property and an IP address in the -Dhttp.nonProxyHosts parameter.