Understanding search behavior and relevance

Understanding how search behavior and relevance work can help you refine your searches and retrieve the most accurate results. This topic explains the rules and methods that determine how search queries are processed and ranked on the search page. It also provides insights into how relevance scores are calculated to ensure that the most relevant resources appear at the top of your search results.

Search behavior

When you perform a search, specific rules determine how matches are found and ranked in the search results. For example, your search text will be split into smaller parts called tokens, and it may be treated as if it ends with a wildcard (*) even if you don’t explicitly add one. These rules use the following methods in the background to process your search text:

Both standard and enhanced matching methods are automatically applied whenever you perform a search.

Search behavior explained with examples

The following examples explain how a search text is processed in standard and enhanced matching methods to return comprehensive and relevant matches. These examples assume that the UI search appends wildcard setting is disabled.

Why does loan return loansize but not sizeloan?

Search text Matching method Treated as Returns Doesn't return
loan Standard loan loan

loansize

sizeloan

Enhanced loan* loansize sizeloan

Why doesn’t size loan return sizeloan?

Search text Matching method Treated as Returns Doesn't return
size loan Standard size or loan

size case

loan amount

size loan

sizeloan

Enhanced

size loan and size loan*

size loan

size loanamount

sizeloan

Why doesn’t size_loan return sizeable loan?

Tip Enhanced matching applies only to the Name and Tag fields. For example, if size loanamount exists in a Comment field, enhanced matching can't detect it.
Search text Matching method Treated as Returns Doesn't return
size_loan Standard size_loan

size_loan

size loan

sizeloan

sizeable loan

Enhanced

size loan*

size loan

size loanamount

sizeable loan

How different search texts are interpreted

The following table shows how complex tokens are processed in the standard and enhanced matching methods.

Search text Standard matching Enhanced matching
sizeLoan sizeloan size, loan
size_loan size_loan size, loan
size.loan size.loan size, loan
size-loan size, loan size, loan
size=loan size, loan size, loan
size123loan size123loan size, 123, loan

Behavior when the "UI search appends wildcard" setting is enabled

The UI search appends wildcard setting, if enabled, uses the standard matching method but with a wildcard (*) added to the end of your search text. However, unlike the enhanced matching method, it looks for matches in all fields.

Search relevance

By default, search results on the search page are sorted in the order of descending relevance.

What is relevance in the context of search results

Relevance is a calculation of the similarity, measured across several lines of comparison, between your search text and the content of the resources in your Collibra environment.

In a set of search results, the relevance of each resource is represented by a positive number or score. The higher the score, the more relevant the resource is to your search text.

How relevance scores are derived

To derive relevance scores, Collibra uses a combination of query clauses.

Query clauses

When you perform a search, Collibra queries the database using various query clauses. Each query clause compares the similarity between your search text and your Collibra resources along a different line of comparison.

Some examples of the objectives of different query clauses are as follows:

Related topics