Migrating to Unified Data Classification

The Unified Data Classification method is the default data classification method and is the only one supported.
With release 2024.07, a migration process is available for customers who were using the old Edge classification method or Cloud Data Classification Platform. The migration process:

  • Copies classification information from the old Edge classification method or Cloud Data Classification Platform into the Unified Data Classification method.

    Note When you are using APIs to manage classifications with Unified Data Classification disabled, the data you manipulate is considered to be part of the Cloud Data Classification Platform or old Edge data classification method, and it needs to be migrated.

  • Creates data classes in the Unified Data Classification method for existing Advanced Data Types (ADTs).

    Note ADTs are supported only for Jobserver. Note that Jobserver reached its end of life starting October, 2024, for commercial customers.

What happened during the 2024.07 upgrade and what are the possible next steps?

What happens during the migration process?

Important 
  • The migration process enriches the Unified Data Classification method. It doesn’t remove anything from your old classification method.
  • Migrated data classes receive a new name to avoid conflicts with any newly created data classes in Unified Data Classification. This allows you to quickly identify the migrated classification information.

The migration process does the following:

Start the migration process manually

You can start the migration process manually if:

  • Before the 2024.07 release, you already used the Unified Data Classification method.
  • After the 2024.07 release, you followed the steps to disable Unified Data Classification and migrate at a later date.

Steps

Important 

In Collibra 2024.05, we launched a new user interface (UI) for Collibra Data Intelligence Platform! You can learn more about this latest UI in the UI overview.

Use the following options to see the documentation in the latest UI or in the previous, classic UI:

  1. If you are using Protect, capture the data classes used in data protection standards and data access rules so that you can add them back after the migration.
  2. If you were not yet using Unified Data Classification, activate and set up Unified Data Classification.
  3. Enable the Unified Classification migration tool enabled setting.
  4. On the main toolbar, click Products icon, and then click Stewardship.
  5. Click the Data Classification tab.
  6. Click Migrate Data.
    The migration process starts. For larger classification sets, this process takes a few minutes.
    You can follow up on the job and the results from the Activities page.
    image of migration results
    For more information go to What happens during the migration process?.
  7. Check the migrated data classes and classifications.
    On the Data Classification page, you can filter the data classes based on their name. The names of migrated data classes end with (migrated) or (ADT).
  8. Delete any unnecessary data classes, merge data classes if needed, or add a classification rule to data classes if none was added by the migration process.
  9. If you are using Protect, add the data classes you captured in step 1 to the affected standards and rules.
  10. Once the migration is completed:
    1. Disable the Unified Classification migration tool enabled setting.
    2. If you were using classification on Edge before, disable the old Edge classification feature on your Edge site.
      You no longer need to enable classification on your Edge site because Unified Data Classification uses an Edge capability instead.

      To disable classification on an existing Edge site, deployed on K3S, run this command:

      Copy
      sudo ./edgecli update --set collibra.classification.enabled=false

      Tip The only difference between disabling classification and enabling classification is that the last argument is false instead of true.

      Note Edge supports the following managed Kubernetes clusters:
      • AKS
      • AWS Fargate using EKS
      • EKS
      • GKE
      • OpenShift

      Follow the steps below if you use a supported Kubernetes cluster.

      To disable classification on an existing Edge site, deployed on your dedicated cluster, run this command:

      Copy
      ./edgecli update --set collibra.classification.enabled=false

      Tip The only difference between disabling classification and enabling classification is that the last argument is false instead of true.

FAQ on the migration process

  • Does the migration process migrate the trained Machine Learning (ML) models from the Cloud Data Classification Platform?
    No, the migration process does not take ML into account.
  • Can I still disable Unified Data Classification after 2024.07 and use my old classification method?
    From October 1, 2024, this is no longer possible. The old Edge classification and the Cloud Data Classification Platform have reached end of life on September 30, 2024.
  • What happens if I run the migration process multiple times?
    Data classes from the old classification methods and ADTs are migrated by the migration process only if they were not migrated before. To ensure that a data class or ADT is migrated again, delete the created related data class from Unified Data Classification. The names of migrated data classes end with (migrated) or (ADT).
  • Can I run the migration process after the end of life date for the old Edge classification and Cloud Data Classification Platform?
    Yes, you can. The migration process works only on the data stored in the Collibra database and doesn't connect to the old Edge or Cloud Data Classification Platform systems. However, note that the migration process will be removed at a later date.
  • I use only API calls to classify data. Am I impacted by this?
    Yes, all environments have to move the Unified Data Classification method APIs.