Migrating to Unified Data Classification

The Unified Data Classification method is the default data classification method.
With release 2024.07, a migration process is available. The migration process:

  • Copies classification information from the older, deprecated classification methods (the old Edge classification method and the Cloud Data Classification Platform) into the Unified Data Classification method.

    Note If you use the API to manage classifications while Unified Data Classification is not activated, the data you manipulate is considered part of the Cloud Data Classification Platform or the old Edge data classification method, and needs to be migrated.

  • Creates data classes in the Unified Data Classification method for existing Advanced Data Types (ADTs). ADTs are supported only for Jobserver, which reaches end of life on September 30, 2024.

What happened during the 2024.07 upgrade and what are the possible next steps?

What happens during the migration process?

Important 
  • The migration process enriches the Unified Data Classification method. It doesn’t remove anything from your old classification method.
  • Migrated data classes receive a new name to avoid conflicts with data classes that were newly created in Unified Data Classification. The new name also lets you quickly identify the migrated classification information.

The migration process does the following:

Start the migration process manually

You can start the migration process manually:

Steps

  1. If you are using Protect, make sure to capture the data classes used in data protection standards and data access rules so that you can add them back after the migration.
  2. If you were not yet using Unified Data Classification, activate and set up Unified Data Classification.
  3. Enable the Unified Classification migration tool enabled setting.
  4. On the main toolbar, click the Products icon, and then click Stewardship.
  5. Click the Data Classification tab.
  6. Click Migrate Data.
    The migration process starts. For larger classification sets, this process can take a few minutes.
    You can track the job and its results on the Activities page.

    For more information, go to What happens during the migration process?
  7. Check the migrated data classes and classifications.
    On the Data Classification page, you can filter the data classes by name. The names of migrated data classes end with (migrated) or (ADT).
  8. You can now delete any unneeded data classes, merge data classes if needed, or add a classification rule to data classes if none was added by the migration process.
  9. If you are using Protect, add the data classes you captured in step 1 to the affected standards and rules.
  10. Once the migration is completed:
    1. Disable the Unified Classification migration tool enabled setting.
    2. If you were using classification on Edge before, disable the old Edge classification feature on your Edge site.
      You no longer need to enable classification on your Edge site because Unified Data Classification uses an Edge capability instead.

      To disable classification on an existing Edge site, deployed on K3S, run this command:

      sudo ./edgecli update --set collibra.classification.enabled=false

      Tip The only difference between disabling classification and enabling classification is that the last argument is false instead of true.

      Note Edge supports the following managed Kubernetes clusters:
      • AKS
      • AWS Fargate using EKS
      • EKS
      • GKE
      • OpenShift

      Follow the steps below if you use a supported Kubernetes cluster.

      To disable classification on an existing Edge site, deployed on your dedicated cluster, run this command:

      ./edgecli update --set collibra.classification.enabled=false

      Tip The only difference between disabling classification and enabling classification is that the last argument is false instead of true.
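
The two commands above differ only in whether sudo is used, and enabling differs from disabling only in the final true or false value. As a minimal sketch, the following shell function builds the matching command line for either deployment type. The function name classification_cmd and the deployment labels k3s and managed are hypothetical names chosen for this illustration; only the edgecli invocation itself comes from the steps above. The function prints the command instead of running it, so you can review it before executing it on your Edge site.

```shell
#!/bin/sh
# Sketch only: prints the edgecli command for a given deployment type.
# "classification_cmd", "k3s" and "managed" are hypothetical names for this example.
classification_cmd() {
  deployment="$1"   # k3s | managed
  enabled="$2"      # true | false

  # Reject anything other than the two documented values.
  case "$enabled" in
    true|false) ;;
    *) echo "enabled must be true or false" >&2; return 1 ;;
  esac

  # K3S deployments use sudo; managed Kubernetes clusters do not.
  case "$deployment" in
    k3s)     echo "sudo ./edgecli update --set collibra.classification.enabled=$enabled" ;;
    managed) echo "./edgecli update --set collibra.classification.enabled=$enabled" ;;
    *) echo "unknown deployment type: $deployment" >&2; return 1 ;;
  esac
}

# Print the command that disables classification on a K3S Edge site.
classification_cmd k3s false
```

Printing the command keeps the sketch safe to run anywhere; copy the printed line to the Edge site host once you have confirmed the deployment type.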

FAQ on the migration process

  • Does the migration process migrate the trained Machine Learning (ML) models from the Cloud Data Classification Platform?
    No, the migration process does not take the ML models into account.
  • Can I still deactivate Unified Data Classification after 2024.07 and use my old classification method?
    Yes, that's possible. However, take into account that both the old Edge classification and the Cloud Data Classification Platform reach end of life on September 30, 2024.
  • What happens if I run the migration process multiple times?
    The migration process migrates data classes from the old classification methods and ADTs only if they were not migrated before. To have a data class or ADT migrated again, delete the related data class that the migration created in Unified Data Classification. Migrated data classes have a name that ends with (migrated) or (ADT).
  • Can I run the migration process after the end of life date for the old Edge classification and Cloud Data Classification Platform?
    Yes, you can. The migration process works only on the data stored in the Collibra database and doesn't connect to the old Edge or Cloud Data Classification Platform systems. However, note that the migration process will be removed at a later date.
  • I only use API calls to classify data. Am I impacted by this?
    Yes, all environments have to move to the Unified Data Classification method APIs.
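
If you want to review migrated data classes outside the UI, for example before running the migration again, the naming convention described in this FAQ can be checked with a short filter. This is a sketch under assumptions: it expects a plain-text list of data class names, one per line, which you would have to produce yourself. The function name migrated_only and the sample names are hypothetical. The filter relies only on the documented suffixes (migrated) and (ADT).

```shell
#!/bin/sh
# Sketch only: keeps names that end with "(migrated)" or "(ADT)".
# Reads one data class name per line from standard input.
migrated_only() {
  grep -E '\((migrated|ADT)\)$'
}

# Hypothetical sample names, for illustration only.
printf '%s\n' 'Email (migrated)' 'Email' 'Phone Number (ADT)' | migrated_only
```

Anchoring the pattern at the end of the line means that a name which merely contains the word ADT or migrated elsewhere is not reported as a migrated data class.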