Delete sample data
The way to remove sample data for a data source depends on how the sample data is made available.
- For Jobserver, perform one of the following:
- Refresh the related schema and don't select the Store Sample Data checkbox.
As a result, any previously gathered sample data is removed from the Collibra cloud repository. - Call the Catalog profiling REST API with an empty array for the samples parameter.
As a result, any previously gathered sample data is removed from the Collibra cloud repository.
- Refresh the related schema and don't select the Store Sample Data checkbox.
-
For Catalog profiling REST API, call the Catalog profiling REST API with an empty array for the samples parameter.
As a result, any previously gathered sample data is removed from the Collibra cloud repository. - For Edge, you cannot delete sample data.
Sample data for data sources registered via Edge is not stored in the Collibra cloud repository, it is cached on the Edge site for a limited amount of time.Note If a data source was previously connected to Jobserver or if sample data was pushed using the Catalog profiling REST API, and the data source is now an Edge data source, sample data may still be stored in the Collibra cloud repository for this data source. If you want to remove this sample data, call the Catalog profiling REST API with an empty array for the samples parameter.
Example of API code that deletes sample data from the Collibra cloud repository
In the example code:
- replace <your_environment> by the name of your environment.
- replace the assetIdentifier section by any combination that uniquely identifies the asset for which you want to delete the sample data.
PATCH https://<your_environment>.collibra.com/rest/catalog/1.0/profiling/columns
{
"columnProfiles": [
{
"assetIdentifier": {
"assetName": "Catalog postgresql>catalog_postg>GDPR>Consumers>Process_id(column)",
"communityName": "Catalog demo",
"domainName": "Catalog postgresql > catalog_postg > GDPR"
},
"samples": {
"samples": null
}
}
]
}