About sample data

Sample data is a set of randomly collected data from a data source. It provides examples of the data, helping data consumers understand what to expect when using the asset. For example, Data Analysts can use sample data to verify the content and format of a table before including it in a report.

Note: If you're using a Collibra Cloud site, go to the Collibra Cloud site documentation to check whether your data source is supported.

Where sample data can be shown

Depending on your environment, sample data can be shown for Column, Table, Data Set, Data Product, and Data Product Port assets.

Image of sample data for a Table asset

For Data Sets, the Sample Data tab shows a maximum of 100 columns per Table in the Data Set. This means you might not see sample data for all elements. Which columns are shown for each table depends on their position in the data source. For example, a Data Set with 4 tables can show up to 400 columns in total.
For all other asset types where sample data can be available, up to 1,000 columns are shown.
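
The column limits above can be illustrated with a small calculation. This is only a sketch of the arithmetic, not Collibra code: the function names and the example column counts are hypothetical. It counts how many columns can be shown and does not model which columns are picked.

```python
# Limits as stated in the text above.
MAX_COLUMNS_PER_TABLE_IN_DATA_SET = 100
MAX_COLUMNS_OTHER_ASSETS = 1000


def shown_columns_for_data_set(columns_per_table):
    """Return the number of columns shown per table of a Data Set.

    Hypothetical helper: each table is capped independently at 100 columns.
    """
    return [min(count, MAX_COLUMNS_PER_TABLE_IN_DATA_SET) for count in columns_per_table]


def shown_columns_for_other_asset(column_count):
    """Return the number of columns shown for any other supported asset type."""
    return min(column_count, MAX_COLUMNS_OTHER_ASSETS)


# Example: a Data Set with 4 tables (column counts are made up).
tables = [250, 40, 100, 120]
shown = shown_columns_for_data_set(tables)
print(shown)       # per-table counts after applying the cap
print(sum(shown))  # total columns shown for the Data Set
```

Because the cap applies per table, a Data Set only reaches the 400-column total from the example when each of its 4 tables has at least 100 columns.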

Conditions to show sample data

You can view sample data for an asset only if the following conditions are met:

Sample data in Collibra

The way Collibra handles sample data depends on how the assets are added to Collibra and how the sample data is collected.

Sample data for assets that are added via Edge

Sample data for assets that are manually added or imported

Sample data must be uploaded via the Catalog REST API - Profiling. In this case, the sample data is stored in the Collibra cloud repository and is shown to all users with the required permissions.

Sample data for assets that are added via Jobserver

Related topics

Understanding the process to show sample data
Sample data limitations and guidelines
Configure the use of sample data via Edge: Steps
Configure the use of sample data via Jobserver

Helpful resources

Sample data training on Collibra University