Upgrade Collibra Data Quality & Observability

Note Before proceeding with any upgrades, back up your Collibra DQ metastore.

Warning Rolling back Collibra DQ to an older version is not supported. Please contact Collibra Support with any questions.

Important You must upgrade to Java 17 and Spark 3.5.3 to install and use Collibra Data Quality & Observability 2025.02. For more information about version compatibility, click "Java and Spark compatibility matrix" below.

Java and Spark compatibility matrix
Collibra Data Quality & Observability versionJava 8Java 11Java 17Spark versionsAdditional notes
2025.01 and earlier

Yes

Yes

No

  • 2.3.0 (Java 8 only)
  • 2.4.5 (Java 8 only)
  • 3.0.1 (Java 8 and 11)
  • 3.1.2 (Java 8 and 11)
  • 3.2.2 (Java 8 and 11)
  • 3.4.1 (Java 11 only)
 
2025.02

No

No

Yes

3.5.3 only 
2025.03

No

No

Yes

3.5.3 only 
2025.04

Yes

Yes

Yes

  • 2.3.0 (Java 8 only)
  • 2.4.5 (Java 8 only)
  • 3.0.1 (Java 8 and 11)
  • 3.1.2 (Java 8 and 11)
  • 3.2.2 (Java 8 and 11)
  • 3.4.1 (Java 11 only)
  • 3.5.3 (Java 17 only)

Important 
The Java 8 and 11 build profiles only contain the 2025.02 release and critical bug fixes addressed in 2025.03 and 2025.04. They do not contain any feature enhancements from the 2025.03 or 2025.04 releases.

Only the Java 17 build profile contains feature enhancements and bug fixes listed in the 2025.04 release notes.

2025.05

No

No

Yes

3.5.3 onlyFixes for Java 8 and 11 build profiles will be available only for critical and high-priority defects.
2025.06

No

No

Yes

3.5.3 onlyFixes for Java 8 and 11 build profiles will be available only for critical and high-priority defects.
2025.07

No

No

Yes

3.5.3 onlyFixes for Java 8 and 11 build profiles will be available only for critical and high-priority defects.
2025.08

No

No

Yes

3.5.3 onlyFixes for Java 8 and 11 build profiles will be available only for critical and high-priority defects.

Download the Collibra DQ Upgrade Package

Note All Collibra DQ customers upgrading or patching receive the full package, not the base package, and should follow the same upgrade steps described in this section.

You can download the DQ installation package from the Collibra Product Resource Center.

Note For additional download options, contact your Account Team.

Save the package in a directory on the CDQ VM.

Extract the file:

Copy
tar -xvf dq-full-package.tar.gz

(Optional) Clean up:

Copy
rm dq-full-package.tar.gz

Upgrade steps

Important Verify the Collibra Data Quality & Observability version from which you are upgrading before proceeding with the upgrade. To do this, sign into your Collibra DQ instance and click the Collibra DQ version info in the upper-right corner of any page. The App Version lists your current Collibra DQ version.

  1. Copy the contents of the provided package, for example, dq-<new-version>-<SPARK353>-package-full.tar.gz to the system being upgraded (extract contents).

    Tip Untar the contents into a uniquely named folder, for example 2025-02-dq-upgrade.

  2. Stop the Collibra DQ Web process.
    Copy
    cd /owlhome/owl/bin
    ./owlmanage.sh stop=owlweb
  3. Stop the Collibra DQ Agent process.
    Copy
    cd /owlhome/owl/bin
    ./owlmanage.sh stop=owlagent
  4. Move the old jars from the owl/bin folder.
    Copy
    mv dq-webapp-<oldversion>-<spark322>.jar /tmp
    mv dq-agent-<oldversion>-<spark322>.jar /tmp
    mv dq-core-<oldversion>-<spark322>.jar /tmp
  5. Copy the new jars into the owl/bin folder from the extracted package.
    Copy
    mv dq-webapp-<newversion>-<spark353>.jar /home/owldq/owl/bin
    mv dq-agent-<newversion>-<spark353>.jar /home/owldq/owl/bin
    mv dq-core-<newversion>-<spark353>.jar /home/owldq/owl/bin
  6. Copy the latest owlcheck and owlmanage.sh to /opt/owl/bin directory.
    Copy
    mv owlcheck /home/owldq/owl/bin
    mv owlmanage.sh /home/owldq/owl/bin
  7. Start the Collibra DQ Web application.
    Copy
    ./owlmanage.sh start=owlweb
  8. Start the Collibra DQ Agent.
    Copy
    ./owlmanage.sh start=owlagent
  9. Validate the number of active services.
    Copy
    ps -ef | grep owl
  10. Important If you are installing a Collibra Data Quality & Observability version 2025.02 or newer, you only need to follow steps 10a and 10b once. For instance, if you performed these steps while upgrading to 2025.02, you do not need to follow them again when you upgrade to 2025.03. These steps are not required for Collibra Data Quality & Observability 2025.01 or older.

    1. Add the following environment variable to the owl-env.sh file.
    2. Note If you already have this variable set, you need to update it to include the following values.
      Copy
      export EXTRA_JVM_OPTIONS="--add-opens java.base/java.util=ALL-UNNAMED --add-opens java.base/java.net=ALL-UNNAMED --add-opens java.base/sun.nio.ch=ALL-UNNAMED --add-opens java.base/java.nio=ALL-UNNAMED --add-opens java.base/sun.util.calendar=ALL-UNNAMED"
    3. Restart the Collibra DQ Web and Collibra DQ Agent services.
    4. Copy
      ./owlmanage.sh stop=owlweb
      ./owlmanage.sh stop=owlagent
      ./owlmanage.sh start=owlweb
      ./owlmanage.sh start=owlagent

    Important If you use file-based SAML authentication with the SAML_METADATA_USE_URL variable set to true in the owl-env.sh script, update the Meta-Data URL option on the SAML Security Settings page with your metadata file. Use the file:/opt/owl/config/idp-metadata.xml format, ensuring the file name begins with the prefix file:. For additional steps on how to configure this, go to the "Enable the SAML SSO sign in option" section in SAML Authentication.

  1. Copy the contents of the provided package, for example, dq-<new-version>-<SPARK341>-package-full.tar.gz to the system being upgraded (extract contents).

    Tip Untar the contents into a uniquely named folder, for example 2024-11-dq-upgrade.

  2. Stop the Collibra DQ Web process.
    Copy
    cd /owlhome/owl/bin
    ./owlmanage.sh stop=owlweb
  3. Stop the Collibra DQ Agent process.
    Copy
    cd /owlhome/owl/bin
    ./owlmanage.sh stop=owlagent
  4. Move the old jars from the owl/bin folder.
    Copy
    mv dq-webapp-<oldversion>-<spark322>.jar /tmp
    mv dq-agent-<oldversion>-<spark322>.jar /tmp
    mv dq-core-<oldversion>-<spark322>.jar /tmp
  5. Copy the new jars into the owl/bin folder from the extracted package.
    Copy
    mv owl-webapp-<newversion>-<spark341>.jar /home/owldq/owl/bin
    mv owl-agent-<newversion>-<spark341>.jar /home/owldq/owl/bin
    mv owl-core-<newversion>-<spark341>.jar /home/owldq/owl/bin
  6. Copy the latest owlcheck and owlmanage.sh to /opt/owl/bin directory.
    Copy
    mv owlcheck /home/owldq/owl/bin
    mv owlmanage.sh /home/owldq/owl/bin
  7. Start the Collibra DQ Web application.
    Copy
    ./owlmanage.sh start=owlweb
  8. Start the Collibra DQ Agent.
    Copy
    ./owlmanage.sh start=owlagent
  9. Validate the number of active services.
    Copy
    ps -ef | grep owl

Important If you use file-based SAML authentication with the SAML_METADATA_USE_URL variable set to true in the owl-env.sh script, update the Meta-Data URL option on the SAML Security Settings page with your metadata file. Use the file:/opt/owl/config/idp-metadata.xml format, ensuring the file name begins with the prefix file:. For additional steps on how to configure this, go to the "Enable the SAML SSO sign in option" section in SAML Authentication.

Upgrading data source drivers

When new data source drivers are available, they are listed in the Release Notes or recommended to you directly by Collibra. Determine which drivers need to be updated and follow these steps:

  1. Review the release notes to see which drivers need to be updated.
  2. From the previously extracted tarball provided to you by Collibra, locate the drivers.tar.gz file and extract the contents into a new directory called drivers.
  3. Rename your /owl/drivers folder "/owl/old_drivers/" to create a backup of the old drivers.
  4. Replace the drivers from the /owl/old_drivers/ folder with the new drivers extracted from the tarball /owl/drivers/.

    Example If you replace an old Databricks driver with a new one, the file path might look like /owl/drivers/databricks.

Deploy the new Collibra DQ Helm Chart on your Kubernetes cluster to upgrade to a new version of Collibra DQ.

  1. Download the Docker key.json from the Object Store and run the following command.
  2. Copy
    docker login -u _json_key -p "$(cat client-repo-key.json)" https://gcr.io
  3. Using the Docker Pull Commands you obtained from the Collibra Downloads page, run one command per container to pull the Collibra DQ containers. The Docker pulls should follow a similar format to the ones in the following example:
  4. Copy
    docker pull gcr.io/owl-hadoop-cdh/dq-web:2025.02-ABDGCSHILM-123
    docker pull gcr.io/owl-hadoop-cdh/dq-agent:2025.02-ABDGCSHILM-123
    docker pull gcr.io/owl-hadoop-cdh/dq-livy:3.5.3-2025.02-ABDGCSHILM-123
    docker pull gcr.io/owl-hadoop-cdh/dq-spark:3.5.3-2025.02-ABDGCSHILM-123
  5. Tag the pulled containers and upload them to your internal Artifactory for deployment.
  6. To deploy the new Collibra DQ Helm Chart on your Kubernetes cluster and upgrade to a new version of Collibra DQ, run the following Helm commands, replacing the placeholder values with the actual values described in the table below.
  7. Important 
    The following example command may vary depending on the options your organization supplied during the initial installation.

    When upgrading, it's crucial to consider Helm properties provided through extra options or defaulting in Helm Charts to avoid losing any existing configurations.

    Be sure to replace the entire Helm Chart package with each release.

    helm upgrade --install --namespace <namespace> \
    --set global.version.dq=<cdq_version> \
    --set global.version.spark=<cdq_spark_version> \
    --set global.configMap.data.license_key=<cdq_license_key> \
    --set global.configMap.data.license_name=<your_license_name> \
    --set global.web.admin.email=${email} \
    --set global.web.admin.password=${password} \
    --set global.web.service.type=ClusterIP \
    --set global.image.repo=<pathTolmageRepo> \
    <deployment_name> \
    /path to the helm chart root folder>
    Value Description
    <namespace> The namespace that you created and into which you deployed Collibra DQ.
    <cdq_version> The version from the web image suffix. For example, 2023.11 from the image, dq-web:2023.11.
    <cdq_spark_version> The Spark version from the Spark image suffix. For example, 3.4.1-2023.11 from the image, spark:3.4.1-2023.11.
    <cdq_license_key> The license key provided to you by Collibra.
    <your_license_name> The license name provided to you by Collibra.
    ${email} The default admin user email associated with the admin account.
    ${password}

    The default admin user password for the admin account.

    The password must adhere to the following password policy:

    • A minimum of 8 characters.
    • A maximum of 72 characters.
    • At least one upper-case character.
    • At least one numeric character.
    • At least one supported special character (!@#%$^&*?_~).
    • Cannot contain the user ID (admin).

    Note If a password that does not meet the password policy is entered, the install process proceeds as though the password is accepted, but the admin user becomes locked out. If this occurs, rerun the Helm command with a password that meets the password policy and restart the web pod.

    <pathTolmageRepo> This is your private registry key, where the Collibra images are available. When this is not provided, you will pull the images from the Collibra image registry, for which you should create a pull secret with the repo key provided by Collibra. See the Cloud native install documentation for more details about how to create a pull secret.
    <deployment_name> Any name of your choice for this deployment.
  8. Check that all required Kubernetes resources are available and running.
    1. List all of the resources of the namespace in your Kubernetes cluster into which Collibra DQ is deployed, including pods, Ingress/LoadBalancer, and statefulsets.
    2. kubectl get all -n <namespace>
  9. Create a test DQ job without any optional DQ layers or rules to verify that the Spark driver, executors, and containers are able to launch successfully.

Important If you use file-based SAML authentication with the SAML_METADATA_USE_URL variable set to true in the owl-web ConfigMap, update the Meta-Data URL option on the SAML Security Settings page with your metadata file. Use the file:/opt/owl/config/idp-metadata.xml format, ensuring the file name begins with the prefix file:. For additional steps on how to configure this, go to the "Enable the SAML SSO sign in option" section in SAML Authentication.

What's next?