Enable and calculate data similarity
In Collibra 2024.05, we launched a new user interface (UI) for Collibra Data Intelligence Platform! You can learn more about this latest UI in the UI overview.
Use the following options to see the documentation in the latest UI or in the previous, classic UI:
The Data similarity feature requires some setup to calculate similarity scores for your data during profiling.
Tip Even if data similarity is enabled, you can specify that you don't want to calculate similarity scores for a data source.
Before you begin
Data similarity is a cloud-only feature and is not certified for FedRAMP.
- You are using Edge.
- If you want to use standalone Data Marketplace, Data Marketplace is enabled.
Steps
Step | More details | |
---|---|---|
1 |
In the Service Configuration settings, enable the Calculate Data Similarity and define the Data similarity threshold profiling settings. Data similarity scores can be calculated when you profile a data source via Edge. |
Show how
Depending on your environment, follow this procedure either on the Services Configuration tab of the Collibra settings or in Collibra Console: Important You can't edit the Services Configuration from the Settings page in the latest UI. If you use the latest UI, you can configure settings only in Collibra Console. For more information, go to DGC service configuration settings.
Requirements and permissions
Steps
|
2 |
In the Collibra settings, enable the Data Similarity setting for Data Marketplace. |
Show how
Before you beginThe Settings landing page is enabled. Requirements and permissionsYou are an administrator in Data Marketplace.Steps
|
3 |
Register a data source via Edge and profile the data. Similarity scores are calculated for the profiled Table assets. |
Important
If you don't want to calculate similarity scores for a data source during profiling, you can deactivate the calculation via the profiling capability configuration. In the capability, add the following parameter in the Other section:
|
What's next?
If a data consumer in Data Marketplace opens a Table asset preview, and similar assets are available for this table, the Similar Data tab is shown.