About data usage statistics and popularity score

Data usage statistics and popularity scores help data consumers understand how frequently data in a data source is used. This information can indicate that the data is valuable and trustworthy.

Retrieve data usage statistics and popularity score

As a data steward, you can synchronize the usage statistics of data in a Snowflake data source and calculate a popularity score. To do so, your data source connection must have the required permissions, and you must select the Include Data Usage Statistics option when defining the synchronization rules for a schema.

After the synchronization, the following data usage information becomes available for Table, View, and Column assets:

Tip: When you start synchronizing usage statistics, scores are not loaded for past data. It may take a couple of days before results appear.

View data usage statistics and popularity score

As a data consumer, you can check the popularity score to understand how frequently data in the data source is used. The Popularity Score field is shown in the At a glance sidebar for Table, View, and Column assets. You can also add this attribute to table asset views and to asset pages.

Image of Popularity Score in the At a glance sidebar

Image of Column Popularity Scores in a Table asset

When you click Show details for a popularity score in the At a glance sidebar or click the Popularity Score value in a view, a dialog box appears with the following sections.

You can't click a Popularity Score with the value None to view its details.

Example of the query history chart for date range Last 3 months

The following query history chart was created on August 26, 2025, with the date range Last 3 months selected. As a result, the date range is set to May 26 - August 26, 2025, and the chart divides it into 7-day periods.

Image of query history with callouts that highlight information as mentioned in the example above the image
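The date arithmetic in this example can be sketched in Python. This is only an illustration of how a Last 3 months range could be derived and cut into 7-day periods. The function names `months_back` and `weekly_periods` are hypothetical, and the choices to count periods forward from the start date and to let the final period be shorter are assumptions, not documented Collibra behavior.

```python
import calendar
from datetime import date, timedelta


def months_back(day: date, months: int) -> date:
    """Return the date `months` calendar months before `day`.

    The day of the month is clamped to the length of the target month
    (for example, May 31 minus 1 month becomes April 30).
    """
    total = day.year * 12 + (day.month - 1) - months
    year, month_index = divmod(total, 12)
    month = month_index + 1
    last_day = calendar.monthrange(year, month)[1]
    return date(year, month, min(day.day, last_day))


def weekly_periods(start: date, end: date) -> list:
    """Split the inclusive range [start, end] into consecutive 7-day periods.

    Assumption: periods are counted forward from `start`, so the last
    period may contain fewer than 7 days.
    """
    periods = []
    cursor = start
    while cursor <= end:
        period_end = min(cursor + timedelta(days=6), end)
        periods.append((cursor, period_end))
        cursor = period_end + timedelta(days=1)
    return periods


created_on = date(2025, 8, 26)
range_start = months_back(created_on, 3)   # date(2025, 5, 26)
periods = weekly_periods(range_start, created_on)
```

Under these assumptions, the range May 26 - August 26, 2025 yields 14 periods in the sketch, the last of which covers only August 25 and 26. The actual chart may align its periods differently.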

Popularity score calculation

The popularity score is calculated as follows:

  1. Collibra calculates a numeric score for each ingested asset based on the number of queries and the number of distinct users. It uses identifiers to protect user privacy while still showing whether interactions come from the same user or from different users.
  2. The assets are ordered by the calculated numeric score and split into 5 groups.
  3. Based on the assigned group, the Popularity Score value of an asset is set.
    The possible values are:
    1. If an asset has not been queried, the Popularity Score value is None.
      You can't click a Popularity Score with the value None to see extra details.

      Note: The None value is also used when an asset previously had a popularity score but hasn't been resynchronized for more than 30 days.

    2. If the asset is in the lowest 25% of assets, <=P25, the Popularity Score value is Low.
    3. If the asset is in the second group, >P25 and <=P50, the Popularity Score value is Medium.
    4. If the asset is in the third group, >P50 and <=P75, the Popularity Score value is High.
    5. If the asset is in the highest 25% of assets, >P75, the Popularity Score value is Very High.

    A No Score value means that the data usage statistics for the asset have been collected, but the popularity score hasn't been calculated yet. The value is updated during the next recalculation. The No Score value doesn't appear in the Popularity Score column in a view.
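The grouping in steps 2 and 3 can be illustrated with a minimal Python sketch. It takes the numeric scores as given, because the way query counts and distinct users are combined is not described here. Several details are assumptions made for illustration only: a score of 0 stands for an asset that has not been queried, the percentiles are computed over queried assets only, and percentiles use linear interpolation. The function names `percentile` and `popularity_labels` are hypothetical and not part of any Collibra API.

```python
from typing import Dict, List


def percentile(sorted_values: List[float], p: float) -> float:
    """Return the p-th percentile (0-100) of a sorted, non-empty list.

    Uses linear interpolation between the two nearest ranks.
    """
    if len(sorted_values) == 1:
        return sorted_values[0]
    rank = (len(sorted_values) - 1) * p / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = rank - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


def popularity_labels(scores: Dict[str, float]) -> Dict[str, str]:
    """Map each asset's numeric score to a Popularity Score value."""
    # Assumption: a score of 0 (or less) means the asset was never queried.
    queried = sorted(s for s in scores.values() if s > 0)
    if not queried:
        return {asset: "None" for asset in scores}

    # Assumption: thresholds come from the queried assets only.
    p25, p50, p75 = (percentile(queried, p) for p in (25, 50, 75))

    labels = {}
    for asset, score in scores.items():
        if score <= 0:
            labels[asset] = "None"
        elif score <= p25:
            labels[asset] = "Low"        # <=P25
        elif score <= p50:
            labels[asset] = "Medium"     # >P25 and <=P50
        elif score <= p75:
            labels[asset] = "High"       # >P50 and <=P75
        else:
            labels[asset] = "Very High"  # >P75
    return labels


example = popularity_labels({
    "orders": 80, "customers": 70, "invoices": 60, "payments": 50,
    "returns": 40, "regions": 30, "audit_log": 20, "tmp_load": 10,
    "archive_2019": 0,
})
```

In this example, `archive_2019` receives None because its score is 0, and the eight queried assets fall two per group from Low (`tmp_load`, `audit_log`) to Very High (`customers`, `orders`). The sketch does not model the No Score value or the 30-day resynchronization rule.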

The popularity score of an asset is recalculated twice a day. The calculation uses the data usage statistics from the last 30 days of the most recently gathered information.

Related topics

About synchronizing schemas

Helpful resources

Popularity scores training on Collibra University