Upgrade Collibra Data Quality & Observability

Note Before proceeding with any upgrades, back up your Collibra DQ metastore.

Warning Rolling back Collibra DQ to an older version is not supported. Please contact Collibra Support with any questions.

Important In the upcoming 2025.02 version of Collibra Data Quality & Observability, you must upgrade to Java 17 and Spark 3.5.3 to install and use Collibra Data Quality & Observability 2025.02.

Java and Spark compatibility matrix
Collibra Data Quality & Observability versionJava 8Java 11Java 17Spark versionsAdditional notes
2025.01 and earlier

Yes

Yes

No

  • 2.3.0 (Java 8 only)
  • 2.4.5 (Java 8 only)
  • 3.0.1 (Java 8 and 11)
  • 3.1.2 (Java 8 and 11)
  • 3.2.2 (Java 8 and 11)
  • 3.4.1 (Java 11 only)
 
2025.02

No

No

Yes

3.5.3 only 
2025.03

No

No

Yes

3.5.3 only 
2025.04

Yes

Yes

Yes

  • 2.3.0 (Java 8 only)
  • 2.4.5 (Java 8 only)
  • 3.0.1 (Java 8 and 11)
  • 3.1.2 (Java 8 and 11)
  • 3.2.2 (Java 8 and 11)
  • 3.4.1 (Java 11 only)
  • 3.5.3 (Java 17 only)

Important 
The Java 8 and 11 build profiles only contain the 2025.02 release and critical bug fixes addressed in 2025.03 and 2025.04. They do not contain any feature enhancements from the 2025.03 or 2025.04 releases.

Only the Java 17 build profile contains feature enhancements and bug fixes listed in the 2025.04 release notes.

2025.05

No

No

Yes

3.5.3 onlyFixes for Java 8 and 11 build profiles will be available only for critical and high-priority defects.
2025.06

No

No

Yes

3.5.3 onlyFixes for Java 8 and 11 build profiles will be available only for critical and high-priority defects.
2025.07

No

No

Yes

3.5.3 onlyFixes for Java 8 and 11 build profiles will be available only for critical and high-priority defects.
2025.08

No

No

Yes

3.5.3 onlyFixes for Java 8 and 11 build profiles will be available only for critical and high-priority defects.
2025.09

No

No

Yes

3.5.3 onlyFixes for Java 8 and 11 build profiles will be available only for critical and high-priority defects.

Download the Collibra DQ Upgrade Package

Note Starting with the December 2021 release, all Collibra DQ customers upgrading or patching receive the Full package, not the Base package, and should follow the same upgrade steps described in this section.

Download the tarball using the signed link to the full package tarball provided by Collibra. Replace <signed-link-to-full-package> with the link provided.

### Go to the OWL_BASE (home directory of the user is most common)
### This example we will use /home/owldq installing as the user owldq

cd /home/owldq 

### Download & untar
curl -o dq-full-package.tar.gz "<signed-link-to-full-package>"
tar -xvf dq-full-package.tar.gz

### Clean-up unnecessary tarball (optional)
rm dq-full-package.tar.gz

Upgrade steps

Important Verify the Collibra Data Quality & Observability version from which you are upgrading before proceeding with the upgrade. To do this, sign into your Collibra DQ instance and click the Collibra DQ version info in the upper-right corner of any page. The App Version lists your current Collibra DQ version. Select an option from the tabs below depending on the version from which you are upgrading and the version to which you plan to upgrade.

This section provides steps to upgrade standalone deployments of Collibra DQ from version 2023.09 or newer to a version newer than 2023.09.

  1. Copy the contents of the provided package, for example, dq-<new-version>-<SPARK301>-package-full.tar.gz to the system being upgraded (extract contents).

    Tip Untar the contents into a uniquely named folder, for example 2023-11-dq-upgrade.

  2. Stop the Collibra DQ Web process.
    Copy
    cd /owlhome/owl/bin
    ./owlmanage.sh stop=owlweb
  3. Stop the Collibra DQ Agent process.
    Copy
    cd /owlhome/owl/bin
    ./owlmanage.sh stop=owlagent
  4. Move the old jars from the owl/bin folder.
    Copy
    mv dq-webapp-<oldversion>-<spark301>.jar /tmp
    mv dq-agent-<oldversion>-<spark301>.jar /tmp
    mv dq-core-<oldversion>-<spark301>.jar /tmp
  5. Copy the new jars into the owl/bin folder from the extracted package.
    Copy
    mv dq-webapp-<newversion>-<spark301>.jar /home/owldq/owl/bin
    mv dq-agent-<newversion>-<spark301>.jar /home/owldq/owl/bin
    mv dq-core-<newversion>-<spark301>.jar /home/owldq/owl/bin
  6. Copy the latest owlcheck and owlmanage.sh to /opt/owl/bin directory.
  7. Start the Collibra DQ Web application.
    Copy
    ./owlmanage.sh start=owlweb
  8. Start the Collibra DQ Agent.
    Copy
    ./owlmanage.sh start=owlagent
  9. Validate the number of active services.
    Copy
    ps -ef | grep owl

This section provides steps to upgrade standalone deployments of Collibra DQ from a version older than 2023.09 to version 2023.09 or newer.

  1. Copy the contents of the provided package, for example, dq-<new-version>-<SPARK301>-package-full.tar.gz to the system being upgraded (extract contents).

    Tip Untar the contents into a uniquely named folder, for example 2023-09-dq-upgrade.

  2. Stop the Collibra DQ Web process.
    Copy
    cd /owlhome/owl/bin
    ./owlmanage.sh stop=owlweb
  3. Stop the Collibra DQ Agent process.
    Copy
    cd /owlhome/owl/bin
    ./owlmanage.sh stop=owlagent
  4. Move the old jars from the owl/bin folder.
    Copy
    mv owl-webapp-<oldversion>-<spark301>.jar /tmp
    mv owl-agent-<oldversion>-<spark301>.jar /tmp
    mv owl-core-<oldversion>-<spark301>.jar /tmp
  5. Copy the new jars into the owl/bin folder from the extracted package.
    Copy
    mv dq-webapp-<newversion>-<spark301>.jar /home/owldq/owl/bin
    mv dq-agent-<newversion>-<spark301>.jar /home/owldq/owl/bin
    mv dq-core-<newversion>-<spark301>.jar /home/owldq/owl/bin
  6. Copy the latest owlcheck and owlmanage.sh to /opt/owl/bin directory.
  7. Tip You may also need to run chmod +x owlcheck owlmanage.sh to add execute permission to owlcheck and owlmanage.sh.

  8. Start the Collibra DQ Web application.
    Copy
    ./owlmanage.sh start=owlweb
  9. Start the Collibra DQ Agent.
    Copy
    ./owlmanage.sh start=owlagent
  10. Validate the number of active services.
    Copy
    ps -ef | grep owl

This section provides steps to upgrade standalone deployments of Collibra DQ from a version older than 2023.09 to a different version older than 2023.09.

  1. Copy the contents of the provided package, for example, owl-<new-version>-<SPARK301>-package-full.tar.gz to the system being upgraded (extract contents).

    Tip Untar the contents into a uniquely named folder, for example, 2023-05-dq-upgrade.

  2. Stop the Collibra DQ Web process.
    Copy
    cd /owlhome/owl/bin
    ./owlmanage.sh stop=owlweb
  3. Stop the Collibra DQ Agent process.
    Copy
    cd /owlhome/owl/bin
    ./owlmanage.sh stop=owlagent
  4. Move the old jars from the owl/bin folder.
    Copy
    mv owl-webapp-<oldversion>-<spark301>.jar /tmp
    mv owl-agent-<oldversion>-<spark301>.jar /tmp
    mv owl-core-<oldversion>-<spark301>.jar /tmp
  5. Copy the new jars into the owl/bin folder from the extracted package.
    Copy
    mv owl-webapp-<newversion>-<spark301>.jar /home/owldq/owl/bin
    mv owl-agent-<newversion>-<spark301>.jar /home/owldq/owl/bin
    mv owl-core-<newversion>-<spark301>.jar /home/owldq/owl/bin
  6. Copy the latest owlcheck and owlmanage.sh to the /opt/owl/bin directory.
  7. Start the Collibra DQ Web application.
    Copy
    ./owlmanage.sh start=owlweb
  8. Start the Collibra DQ Agent.
    Copy
    ./owlmanage.sh start=owlagent
  9. Validate the number of active services.
    Copy
    ps -ef | grep owl

Upgrading data source drivers

When new data source drivers are available, they are listed in the Release Notes or recommended to you directly by Collibra. Determine which drivers need to be updated and follow these steps:

  1. Review the release notes to see which drivers need to be updated.
  2. From the previously extracted tarball provided to you by Collibra, locate the drivers.tar.gz file and extract the contents into a new directory called drivers.
  3. Rename your /owl/drivers folder "/owl/old_drivers/" to create a backup of the old drivers.
  4. Replace the drivers from the /owl/old_drivers/ folder with the new drivers extracted from the tarball /owl/drivers/.

    Example If you replace an old Databricks driver with a new one, the file path might look like /owl/drivers/databricks.

Deploy the new Collibra DQ Helm Chart on your Kubernetes cluster to upgrade to a new version of Collibra DQ.

  1. Download the Docker key.json from the Object Store and run the following command.
  2. Copy
    docker login -u _json_key -p "$(cat client-repo-key.json)" https://gcr.io
  3. Using the Docker Pull Commands you obtained from the Collibra Downloads page, run one command per container to pull the Collibra DQ containers. The Docker pulls should follow a similar format to the ones in the following example:
  4. Copy
    docker pull gcr.io/owl-hadoop-cdh/dq-web:2023.11-ABDGCSHILM-123
    docker pull gcr.io/owl-hadoop-cdh/dq-agent:2023.11-ABDGCSHILM-123
    docker pull gcr.io/owl-hadoop-cdh/dq-livy:3.4.1-2023.11-ABDGCSHILM-123
    docker pull gcr.io/owl-hadoop-cdh/dq-spark:3.4.1-2023.11-ABDGCSHILM-123
  5. Tag the pulled containers and upload them to your internal Artifactory for deployment.
  6. To deploy the new Collibra DQ Helm Chart on your Kubernetes cluster and upgrade to a new version of Collibra DQ, run the following Helm commands, replacing the placeholder values with the actual values described in the table below.
  7. Important 
    The following example command may vary depending on the options your organization supplied during the initial installation.

    When upgrading, it's crucial to consider Helm properties provided through extra options or defaulting in Helm Charts to avoid losing any existing configurations.

    helm upgrade --install --namespace <namespace> \
    --set global.version.dq=<cdq_version> \
    --set global.version.spark=<cdq_spark_version> \
    --set global.configMap.data.license_key=<cdq_license_key> \
    --set global.configMap.data.license_name=<your_license_name> \
    --set global.web.admin.email=${email} \
    --set global.web.admin.password=${password} \
    --set global.web.service.type=ClusterIP \
    --set global.image.repo=<pathTolmageRepo> \
    <deployment_name> \
    /path to the helm chart root folder>
    Value Description
    <namespace> The namespace that you created and into which you deployed Collibra DQ.
    <cdq_version> The version from the web image suffix. For example, 2023.11 from the image, dq-web:2023.11.
    <cdq_spark_version> The Spark version from the Spark image suffix. For example, 3.4.1-2023.11 from the image, spark:3.4.1-2023.11.
    <cdq_license_key> The license key provided to you by Collibra.
    <your_license_name> The license name provided to you by Collibra.
    ${email} The default admin user email associated with the admin account.
    ${password}

    The default admin user password for the admin account.

    The password must adhere to the following password policy:

    • A minimum of 8 characters.
    • A maximum of 72 characters.
    • At least one upper-case character.
    • At least one numeric character.
    • At least one supported special character (!@#%$^&*?_~).
    • Cannot contain the user ID (admin).

    Note If a password that does not meet the password policy is entered, the install process proceeds as though the password is accepted, but the admin user becomes locked out. If this occurs, rerun the Helm command with a password that meets the password policy and restart the web pod.

    <pathTolmageRepo> This is your private registry key, where the Collibra images are available. When this is not provided, you will pull the images from the Collibra image registry, for which you should create a pull secret with the repo key provided by Collibra. See the Cloud native install documentation for more details about how to create a pull secret.
    <deployment_name> Any name of your choice for this deployment.
  8. Check that all required Kubernetes resources are available and running.
    1. List all of the resources of the namespace in your Kubernetes cluster into which Collibra DQ is deployed, including pods, Ingress/LoadBalancer, and statefulsets.
    2. kubectl get all -n <namespace>
  9. Create a test DQ job without any optional DQ layers or rules to verify that the Spark driver, executors, and containers are able to launch successfully.

What's next?