Configure general search behavior
You can customize the search index configuration.
Depending on your environment, follow this procedure either in Collibra Console or on the Services Configuration tab of the Collibra settings:
Prerequisites
- You have the ADMIN or SUPER role in Collibra Console.
- You have a global role that has the Product Rights > System administration global permission.
- The Services Configuration tab is available in the Collibra settings.
Steps
- Open the settings for editing, depending on your environment.
  Open the Services Configuration tab:
  - On the main toolbar, click → Settings.
    The Settings page opens.
  - Click Services Configuration.
  - Click Edit configuration.
  Open the DGC service settings for editing:
  - Open Collibra Console.
    Collibra Console opens with the Infrastructure page.
  - In the tab pane, expand an environment to show its services.
  - In the tab pane, click the Collibra Platform service of that environment.
  - Click Configuration.
  - Click Edit configuration.
- In the Search Index configuration section, modify the information as needed.
UI search appends wildcard
True (default): A wildcard (asterisk) is automatically added to the end of each search text to find more matches. For example, the search text loan size is treated as loan size*, finding assets containing sizeable.
An asterisk isn't added if:
- The search text contains a tilde (~).
- The search text ends with a quotation mark (").
Note This setting applies only to search texts via the user interface. A wildcard isn't automatically added to search texts via REST API.
False: A wildcard isn't added to the search text.
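The wildcard rule for the UI search appends wildcard setting can be sketched in a few lines of Python. This is only an illustration of the documented rule, not Collibra code: the function name `append_wildcard` and the `enabled` parameter are hypothetical, and cases that the description does not cover (for example, a search text that already ends with an asterisk) are deliberately not handled.

```python
def append_wildcard(search_text: str, enabled: bool = True) -> str:
    """Illustrative only: mimic the documented UI wildcard rule.

    `enabled` stands in for the "UI search appends wildcard" setting.
    """
    if not enabled:
        # Setting is False: the search text is left unchanged.
        return search_text
    if "~" in search_text:
        # Documented exception: the text contains a tilde.
        return search_text
    if search_text.endswith('"'):
        # Documented exception: the text ends with a quotation mark.
        return search_text
    # Otherwise an asterisk is added to the end of the search text.
    return search_text + "*"


print(append_wildcard("loan size"))         # loan size*
print(append_wildcard('"loan size"'))       # "loan size"
print(append_wildcard("loan size", False))  # loan size
```

Because the setting applies only to searches made in the user interface, a REST API client that wants the same behavior would have to add the asterisk itself.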
Note In existing environments, the current status of the setting is retained. This means that if the setting was previously disabled, it remains disabled.
Maximum batch size
The maximum batch size for resource reindex.
- Default value: 5,000
- Minimum value: 100
- Maximum value: 30,000
Maximum batch size for relations
The maximum batch size for relation reindex.
- Default value: 500
- Minimum value: 10
- Maximum value: 10,000
Maximum batch size for relation path preview
The maximum batch size for the relation path preview of the relation index.
- Default value: 500
- Minimum value: 50
- Maximum value: 10,000
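As a quick way to check a planned value against the documented ranges before entering it, the three batch-size settings above can be captured in a small lookup. This is a minimal sketch: the `BatchLimit` class, the `LIMITS` table, and `check_batch_size` are hypothetical helper names, and only the default, minimum, and maximum values come from the descriptions above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class BatchLimit:
    default: int
    minimum: int
    maximum: int


# Values copied from the setting descriptions; the keys are the setting names.
LIMITS = {
    "Maximum batch size": BatchLimit(5_000, 100, 30_000),
    "Maximum batch size for relations": BatchLimit(500, 10, 10_000),
    "Maximum batch size for relation path preview": BatchLimit(500, 50, 10_000),
}


def check_batch_size(setting: str, value: Optional[int] = None) -> int:
    """Return the value to use, or raise ValueError if it is out of range."""
    limit = LIMITS[setting]
    if value is None:
        # No value supplied: fall back to the documented default.
        return limit.default
    if not limit.minimum <= value <= limit.maximum:
        raise ValueError(
            f"{setting}: {value} is outside {limit.minimum}-{limit.maximum}"
        )
    return value


print(check_batch_size("Maximum batch size"))                     # 5000
print(check_batch_size("Maximum batch size for relations", 250))  # 250
```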
Stop words (retiring)
(Requires restart)
This setting requires the SUPER role.
A list of stop words that are ignored as tokens for the index.
The default list of English stop words includes the following:
a, an, and, are, as, at, be, but, by, for, if, in, into, is, it, no, not, of, on, or, such, that, the, their, then, there, these, they, this, to, was, will, with
If you don't create your own list of stop words, the default list is used.
If you create your own list of stop words, you need to:
- Reindex Collibra.
- Restart the environment to apply your changes. For more information, go to Stop an environment and Start an environment.
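To illustrate what "ignored as tokens for the index" means in practice, the following sketch filters stop words out of a piece of text. It is a simplified assumption, not the actual indexing pipeline: the function name `index_tokens` is hypothetical, and splitting on whitespace and lowercasing are stand-ins for whatever analysis the search index really performs. Only the default word list is taken from this page.

```python
# Default English stop words as listed above.
DEFAULT_STOP_WORDS = frozenset(
    "a an and are as at be but by for if in into is it no not of on or such "
    "that the their then there these they this to was will with".split()
)


def index_tokens(text, stop_words=None):
    """Return the tokens that would remain after stop words are dropped.

    If no custom list is given, the default list is used, as described above.
    Tokenization here (lowercase, split on whitespace) is an assumption.
    """
    if stop_words is None:
        words = DEFAULT_STOP_WORDS
    else:
        words = frozenset(w.lower() for w in stop_words)
    return [token for token in text.lower().split() if token not in words]


print(index_tokens("the size of the loan"))            # ['size', 'loan']
print(index_tokens("the size of the loan", ["loan"]))  # ['the', 'size', 'of', 'the']
```

The second call shows why a custom list changes search results: it replaces the default list instead of extending it, so words such as "the" become indexable tokens again.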
Relation-based search
True (default in new environments): The Data Marketplace search considers certain assets and relation types between assets. As a result, your search results not only include assets that directly match the search criteria, but also assets that match the criteria through specific relation types.
Example A column named Order is included in a data set named Customer. If the relation-based search is enabled and you search for Order in Data Marketplace, then the data set Customer appears in the search results because the data set contains this column.
Tip For more information about this feature and the default relation types, go to Filtering and searching based on relations in Data Marketplace.
False: The Data Marketplace search results do not consider relations.
After you enable this setting, you must reindex Data Marketplace relations or reindex Collibra completely.
Note In new Collibra environments, this setting is enabled by default. In upgraded Collibra environments, the previous status of this setting is retained.
Uninterrupted Search
This setting requires the SUPER role.
True (default): The search function remains available and the current data is available for search, while the search index is rebuilding in the background. Search results, however, may not be up to date until the rebuild is complete. After the rebuild is complete, all new data and changes to the existing data become available for search.
Enabling this setting allows for the updating or reorganizing of data indexes without interrupting search and data availability. Users can continue to search previously indexed data until the new index is created and fully populated.
False: The search function becomes temporarily unavailable when the search index is rebuilding.
When the rebuild process begins, the current index is deleted and a new empty index is created, which is populated over time. As a result, if users perform a search during the rebuild process, they may not receive the full set of search results until the process is completed.
Important If you enable or disable this setting, you need to rebuild the search index for the change to take effect.
Multi-threaded resources indexing
This setting requires the SUPER role.
True (default): Multiple threads are used for indexing resources.
False: A single thread is used for indexing resources.
Number of threads for multi-threaded resources indexing
This setting requires the SUPER role.
The number of threads for indexing resources.
Multi-threaded relations indexing
This setting requires the SUPER role.
True (default): Multiple threads are used for indexing relations.
False: A single thread is used for indexing relations.
Number of threads for multi-threaded relations indexing
This setting requires the SUPER role.
The number of threads for indexing relations.
Use managed Elastic
This setting requires the SUPER role.
True: Managed Elastic deployment is used to handle indexing.
False (default): Managed Elastic deployment isn't used to handle indexing.
Note
- This setting is applicable only to commercial customers, except Azure.
- If you want to enable the setting, contact Collibra Support.
- Click Save all.