Install services on Linux
This section describes the installation of the Collibra Data Governance Center 5.9.1 services on Linux.
Keep in mind that the installer has no graphical user interface, the full installation procedure is executed via the command line.
Important If you do the installation on RHEL/Rocky Linux or Suse, see the services section. This is also valid if you install without root permissions.
Prerequisites
- You have downloaded the installer for your Linux operating system.
- You have installed PostgreSQL 14.7 on the every node that will host one of the following components:
- Repository
- Jobserver
- Collibra Console
- The node that will run the Search service, must pass the following bootstrap checks:
- File descriptor
- Maximum number of threads check
- Maximum file size
- Maximum size virtual memory check
- Maximum map count check
Type
Check description
Minimum value
Applies for installation type
Setting name
User limit Maximum number of open file descriptors 65536 - Without root permissions
- With root permissions, using System V init daemon
nofile Maximum number of open threads/processes 4096 nproc Maximum file size unlimited fsize Kernel parameter Maximum virtual memory areas 262144 - All
vm.max_map_count For more information on these settings, see the Troubleshooting section.
Steps
Use this drop-down to select your operating system:
Tip For each question, the default selection is always suggested between square brackets. If there is a Yes or No question, the upper-case character is the default selection, for example, in [Y/n] the default selection is Yes. Press Enter to accept the default selection.
- Run the installer:
-
Linux as user with sudo rights:
sudo ./dgc-linux-5.9.1-0.sh
Linux as root user:./dgc-linux-5.9.1-0.sh - Linux as standard user:
./dgc-linux-5.9.1-0.sh
-
Linux as user with sudo rights:
- Enter the Installation directory and press Enter.
- Default location on Linux as root or user with sudo privileges: /opt/collibra
- Default location on Linux as standard user: ~/collibra
- Enter the Data directory and press Enter.
- Default location on Linux as root or user with sudo privileges: /opt/collibra_data
- Default location on Linux as standard user: ~/collibra_data
- Press Enter to each of the presented components that you want to install.
If you don't want to install a specific component, press n followed by Enter.Note We recommend that you install the Repository service and Jobserver service on dedicated nodes. You can install all other services on another node, provided that it has enough memory. Make sure that you have a fast network between the nodes.Also make sure that you use the same installer version on all nodes. You can find the installer version of your environment at the bottom of the sign-in window of Collibra Console, for example 5.9.1-0
- Press Enter to confirm your selection.
- If you have selected Repository, Jobserver and/or Collibra Console, enter the location where PostgreSQL 14.7 is installed.
- Enter the necessary configuration for each of the selected services.
After the last configuration, the installation of the services automatically starts.
Below you find the configuration parameters for each of the services.
- DGC service
- Repository
- Jobserver
- Monitoring
- Search
- Agent
- Collibra Console
|
Setting |
Description |
|---|---|
|
DGC context path |
The path that is added to the base URL to reach Collibra Data Intelligence Cloud. For example, if your base URL is See also Set the context path of the DGC service in Collibra Console. |
| DGC port |
The TCP port to access your Collibra DGC environment via your web browser. |
| DGC Shutdown port |
The TCP port to stop the DGC service. |
| DGC minimum memory |
The minimum amount of memory in megabytes for the DGC service. This must be at least 1024 MB and no greater than 32 768 MB (32 GB). |
| DGC maximum memory | The maximum amount of memory in megabytes that can be assigned to the DGC service.
This must be at least 2048 MB and no greater than 32 768 MB (32 GB). |
If you run multiple environments on one node, all ports must be unique for each environment.
|
Setting |
Description |
|---|---|
| Repository port |
The TCP port to access the repository service. It is only used by the DGC service and the Collibra agent. If you run multiple environments on one node, all ports must be unique for each environment. |
| Repository admin password (*) | The password that is used by the agent to access the Repository service. |
|
Confirm repository admin password |
The password as entered in the Repository admin password field. |
| Repository dgc password (*) | The password that is used by the DGC service to access the repository. |
|
Confirm repository dgc password |
The password as entered in the Repository dgc password field. |
| Repository memory |
The amount of memory for the Repository service in megabytes. This must be at least 512 MB and no greater than 16 384 MB (16 GB). |
- lowercase letters
- uppercase letters
- numbers
- the following special characters: #?!@$%&*-
|
Setting |
Description |
|---|---|
| Jobserver port |
The TCP port to access the Jobserver service. |
| Jobserver database port |
The TCP port to access the Jobserver database. |
|
Jobserver monitoring port |
The TCP port that is used by the monitoring service to monitor the Jobserver service. |
|
Jobserver Spark monitoring port |
The TCP port that is used by the monitoring service to monitor the Jobserver Spark service. |
If you run multiple environments on one node, all ports must be unique for each environment.
|
Setting |
Description |
|---|---|
| Search http port |
The TCP port to access the Search service via REST API. |
| Search transport port |
The TCP port for the communication between the DGC and the Search service. |
| Search memory | The amount of memory in megabytes that is assigned to the Search service. The default value is 1024. |
If you run multiple environments on one node, all ports must be unique for each environment.
|
Setting |
Description |
|---|---|
| Agent port |
The TCP port that is used by Collibra Console to manage the services of an environment. If you run multiple agents on one node, this port must be unique for each agent. |
| Node address |
The hostname of the node on which the Agent service is running. You cannot use a loopback address if you want to use the node in a multinode environment. Warning Do not use the loopback address. |
|
Setting |
Description |
|---|---|
|
Console context path |
The path that is added to the base URL to reach Collibra Console. For example, if your base URL is See also Set the context path of Collibra Console. |
| Console port |
The TCP port to access your Collibra Console via your web browser. |
|
Console database port |
The TCP port to access the Collibra Console database. This is the database where the data and configuration of Collibra Console is stored. |
If you run multiple Collibra Console instances on one node, this port must be unique for each instance.
What's next?
Add the component(s) to your environment.