Relevance in search results

By default, search results are sorted in order of descending relevance.

What is relevance in the context of search results?

Relevance is a calculation of the similarity, measured across several lines of comparison, between your search text and the content of the resources in your Collibra environment.

In a set of search results, the relevance of each resource is represented by a positive number or score. The higher the score, the more relevant the resource is to your search text.

How are relevance scores derived?

To derive relevance scores, Collibra uses a combination of query clauses and boost factors.

Query clauses

When you perform a search, Collibra queries the database using various query clauses. Each query clause compares the similarity between your search text and your Collibra resources along a different line of comparison.

Some examples of the objectives of different query clauses are as follows:

  • Calculate the similarity between the spelling of your search text and the text found in a field in the database.
  • Calculate how frequently your search text appears in a field. The more often it appears, the greater the relevance. A field containing 5 occurrences of a given text is more likely to be relevant than a field containing 1 occurrence of the text.
  • Calculate the occurrence percentage of a text among all words in a particular field.
    For example, if your search text occurs twice in the 10-word description of an asset, that asset will have a higher relevance score than an asset for which your search text occurs twice in its 20-word description.

Boosting

Search boost factors allow you to influence the order of the search results. You can edit search boost factors to increase or decrease the importance of a resource type, field, or asset type in the search ranking. For more information, go to Search boost settings.