Configure the use of sample data via Edge

You must configure your Collibra environment if you want to display sample data for data sources registered via Edge.

Tip Sample data for data sources registered via Edge is temporarily cached on the Edge site.

  Configuration step More details
1 Ensure your Edge site meets the memory, CPU and hard disk requirements. Edge hardware requirements to show sample data
2 Ensure the users have the required permissions. Required permissions to view sample data.

Important Several out-of-the-box roles already include the required permissions.

3

In the Service Configuration settings,

  • Set the Data Profiling setting Maximum number of samples to a value higher than 0.
  • Define the maximum number of characters that you want to collect per sample in Maximum value length.
  • If you want, enable the optimization feature for sampling (beta).
Important The Maximum number of samples value applies to both Jobserver and Edge. In mixed environments, if this number was set to 0 for Jobserver and you are increasing the value for Edge, this can result in sample data extraction for data sources registered via Jobserver.
4 For each data source, add the Catalog JDBC Sampling capability to Edge.

Add the Catalog JDBC Sampling capability for your data source Edge connection.

  • The Catalog JDBC Sampling capability consists of two possible operations:
    • Extracting the sample data, which collects the data from a data source and caches the data on the Edge site.
    • Reading the sample data, which reads sample data from the Edge cache and returns it as a result of an API call or displays it in an asset page.
  • Once the capability is selected, define the JDBC connection to which the capability applies.

For detailed information on the sample data process, go to Understanding the process to display sample data.