Configure general search behavior

You can customize the search index configuration.

Depending on your environment, follow this procedure either in Collibra Console or on the Services Configuration tab of the Collibra settings.

Important You can't edit the service configuration from the Settings page in the latest UI. If you use the latest UI, you can edit the service configuration only in Collibra Console. For more information, go to Collibra service configuration settings.

Prerequisites

Steps

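  1. Open the service configuration for editing, using the option that applies to your environment.
    On the Services Configuration tab of the Collibra settings:
    1. On the main toolbar, click the Products icon, and then click Settings (cogwheel icon).
      The Settings page opens.
    2. Click Services Configuration.
    3. Click Edit configuration.
    In Collibra Console (DGC service settings):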
    1. Open Collibra Console.
      Collibra Console opens with the Infrastructure page.
    2. In the tab pane, expand an environment to show its services.
    3. In the tab pane, click the Collibra Platform service of that environment.
    4. Click Configuration.
    5. Click Edit configuration.
  2. In the Search Index configuration section, modify the information as needed.
    Settings and their descriptions:
    UI search appends wildcard

    True (default): A wildcard (asterisk) is automatically added to the end of each search text to find more matches. For example, the search text loan size is treated as loan size*, which also finds assets containing sizeable.

    An asterisk isn't added if:

    • The search text contains a tilde (~).
    • The search text ends with a quotation mark (").

    Note This setting applies only to searches performed in the user interface. A wildcard isn't automatically added to search texts submitted via the REST API.

    False: A wildcard isn't added to the search text.

    Note In existing environments, the current status of the setting is retained. This means that if the setting was previously disabled, it will remain disabled.
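The wildcard rule described above can be sketched in a few lines of Python. This is only an illustration of the documented behavior, not Collibra's implementation; the function name `apply_ui_wildcard` and its `append_wildcard` parameter are hypothetical.

```python
def apply_ui_wildcard(search_text: str, append_wildcard: bool = True) -> str:
    """Return the search text as a UI search would treat it (illustrative only).

    `append_wildcard` stands in for the "UI search appends wildcard" setting.
    """
    if not append_wildcard:
        # Setting is False: the search text is left unchanged.
        return search_text
    # Documented exceptions: the text contains a tilde or ends with a quotation mark.
    if "~" in search_text or search_text.endswith('"'):
        return search_text
    return search_text + "*"


print(apply_ui_wildcard("loan size"))
print(apply_ui_wildcard('"loan size"'))
```

With the setting enabled, `loan size` becomes `loan size*`, while a quoted phrase or a text containing a tilde is passed through as typed.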
    Maximum batch size

    The maximum batch size for reindexing resources.

    • Default value: 5,000
    • Minimum value: 100
    • Maximum value: 30,000

    Maximum batch size for relations

    The maximum batch size for reindexing relations.

    • Default value: 500
    • Minimum value: 10
    • Maximum value: 10,000

    Maximum batch size for relation path preview

    The maximum batch size for the relation path preview of the relation index.

    • Default value: 500
    • Minimum value: 50
    • Maximum value: 10,000

    Stop words (retiring)

    (Requires restart)

    This setting requires the SUPER role.

    A list of stop words that are ignored as tokens for the index.

    The default list of English stop words includes the following:

    a, an, and, are, as, at, be, but, by, for, if, in, into, is, it, no, not, of, on, or, such, that, the, their, then, there, these, they, this, to, was, will, with

    If you don't create your own list of stop words, the default list is used.

    If you create your own list of stop words, you need to:

    1. Reindex Collibra.
    2. Restart the environment to apply your changes. For more information, go to Stop an environment and Start an environment.
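To illustrate what ignoring stop words as index tokens means, the sketch below filters the default English list out of a piece of text. It assumes lowercase, whitespace-based tokenization, which is a simplification chosen for this example and not a description of the actual analyzer; the function name `index_tokens` is hypothetical.

```python
# The default English stop words listed above.
DEFAULT_STOP_WORDS = frozenset(
    "a an and are as at be but by for if in into is it no not of on or such "
    "that the their then there these they this to was will with".split()
)


def index_tokens(text: str, stop_words: frozenset = DEFAULT_STOP_WORDS) -> list[str]:
    """Split text on whitespace, lowercase it, and drop stop words (assumed tokenization)."""
    return [token for token in text.lower().split() if token not in stop_words]


print(index_tokens("The size of the loan"))
# A custom list replaces the default list rather than extending it.
print(index_tokens("The size of the loan", frozenset({"loan"})))
```

In this sketch, passing a custom list means that words such as "the" and "of" are indexed again, which mirrors the statement that the default list is used only if you don't create your own.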

    Relation-based search

    True (default in new environments): The Data Marketplace search considers certain assets and relation types between assets. As a result, your search results not only include assets that directly match the search criteria, but also assets that match the criteria through specific relation types.

    Example A column named Order is included in a data set named Customer. If the relation-based search is enabled and you search for Order in Data Marketplace, then the data set Customer appears in the search results because the data set contains this column.

    Tip For more information about this feature and the default relation types, go to Filtering and searching based on relations in Data Marketplace.

    False: The Data Marketplace search results do not consider relations.

    After you enable this setting, you must reindex Data Marketplace relations or reindex Collibra completely.

    Note In new Collibra environments, this setting is enabled by default. In upgraded Collibra environments, the previous status of this setting is retained.
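The Order and Customer example above can be modeled with a small in-memory sketch. The asset data, the relation label "contains", and the `search` function are all hypothetical; the relation types that Data Marketplace actually considers are defined by the product.

```python
# Hypothetical assets and one relation: the data set Customer contains the column Order.
ASSETS = {"Customer": "Data Set", "Order": "Column"}
RELATIONS = [("Customer", "contains", "Order")]


def search(term: str, relation_based: bool = True) -> list[str]:
    """Return direct matches, plus related assets when relation-based search is on."""
    needle = term.lower()
    direct = [name for name in ASSETS if needle in name.lower()]
    results = list(direct)
    if relation_based:
        for source, _relation_type, target in RELATIONS:
            # The source asset matches through its relation to a directly matching asset.
            if target in direct and source not in results:
                results.append(source)
    return results


print(search("Order"))                        # includes Customer through the relation
print(search("Order", relation_based=False))  # direct matches only
```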

    Uninterrupted Search

    This setting requires the SUPER role.

    True (default): The search function remains available, and the currently indexed data stays searchable, while the search index is rebuilt in the background. Search results, however, may not be up to date until the rebuild is complete. After the rebuild is complete, all new data and changes to the existing data become available for search.

    Enabling this setting allows for the updating or reorganizing of data indexes without interrupting search and data availability. Users can continue to search previously indexed data until the new index is created and fully populated.

    False: The search function becomes temporarily unavailable when the search index is rebuilding.

    When the rebuild process begins, the current index is deleted and a new empty index is created, which is populated over time. As a result, if users perform a search during the rebuild process, they may not receive the full set of search results until the process is completed.

    Important If you enable or disable this setting, you need to rebuild the search index for the change to take effect.
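The difference between the two modes can be pictured with a toy in-memory index. This is a conceptual sketch only: the `SearchIndex` class and the build-then-swap approach are assumptions made for illustration and do not describe how the search index is implemented.

```python
class SearchIndex:
    """Toy index used to contrast the two rebuild modes (illustrative only)."""

    def __init__(self, documents):
        self.live = set(documents)

    def rebuild(self, documents, uninterrupted=True):
        """Generator that pauses after each document so searches can run mid-rebuild."""
        if uninterrupted:
            staging = set()  # new index is filled separately from the live one
            for doc in documents:
                staging.add(doc)
                yield
            self.live = staging  # swap only once the new index is fully populated
        else:
            self.live = set()  # current index is deleted, new empty index created
            for doc in documents:
                self.live.add(doc)
                yield

    def search(self, term):
        return sorted(doc for doc in self.live if term in doc)


new_docs = ["loan rate", "loan size", "loan type"]

index = SearchIndex(["loan size", "loan type"])
step = index.rebuild(new_docs, uninterrupted=True)
next(step)
print(index.search("loan"))  # previously indexed data is still fully searchable
for _ in step:
    pass
print(index.search("loan"))  # new data appears after the rebuild completes
```

With `uninterrupted=False`, the same mid-rebuild search in this sketch returns only the documents added so far, which corresponds to the incomplete results described for the False setting.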

    Multi-threaded resources indexing

    This setting requires the SUPER role.

    True (default): Multiple threads are used for indexing resources.

    False: A single thread is used for indexing resources.

    Number of threads for multi-threaded resources indexing

    This setting requires the SUPER role.

    The number of threads for indexing resources.

    Multi-threaded relations indexing

    This setting requires the SUPER role.

    True (default): Multiple threads are used for indexing relations.

    False: A single thread is used for indexing relations.

    Number of threads for multi-threaded relations indexing

    This setting requires the SUPER role.

    The number of threads for indexing relations.
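The multi-threading and batch size settings interact in a way that a generic sketch can make concrete: items are split into batches, and the batches are processed by one or several worker threads. The code below uses Python's standard `concurrent.futures.ThreadPoolExecutor`; the function names and the thread count of 4 are hypothetical, because this page doesn't state a default number of threads.

```python
from concurrent.futures import ThreadPoolExecutor


def index_batch(batch):
    """Placeholder for the real indexing work; returns how many items were handled."""
    return len(batch)


def index_items(items, batch_size=5_000, multi_threaded=True, threads=4):
    """Index items in batches, using several threads or a single one."""
    batches = [items[i:i + batch_size] for i in range(0, len(items), batch_size)]
    # One worker models the single-threaded (False) setting.
    workers = threads if multi_threaded else 1
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(index_batch, batches))


resources = list(range(12_000))
print(index_items(resources))                        # three batches, up to 4 threads
print(index_items(resources, multi_threaded=False))  # same batches, one thread
```

Both calls process the same number of items; only the degree of parallelism differs.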

    Use managed Elastic

    This setting requires the SUPER role.

    True: Managed Elastic deployment is used to handle indexing.

    False (default): Managed Elastic deployment isn't used to handle indexing.

    Note
    • This setting applies only to commercial customers, except those on Azure.
    • To enable this setting, contact Collibra Support.
  3. Click Save all.