Warning Jobserver and all related Jobserver integrations reached their End of Life in commercial environments in October, 2024. In Collibra Platform for Government and Collibra Platform Self-Hosted environments, they will reach their End of Life on May 30, 2027.
For information on registering a data source via Edge, go to Registering and synchronizing a data source via Edge.

Refresh the schema of a registered data source

You can refresh a schema of registered data to update the data and the profiling. It can also be useful to do this to change data types to force the profiling to use the correct type.

Tip You can also refresh the schema automatically via a schedule.

Prerequisites

Note 

Steps

Tip 

The steps vary depending on your data source type and authentication method.

  1. Open the Schema asset.
    1. On the main toolbar, click Products icon Catalog.
      The Catalog homepage opens.
    2. In the submenu, click Data Dictionary and select the All Schemas view.
    3. Click the schema that you want to refresh.

    Tip You can also use the Collibra Platform search function to look up your schema.

  2. In the view bar, to the right, click ActionsRefresh.
    The Refresh Schema dialog box appears.

    Tip If Catalog experience is disabled, the More menu is shown instead of Actions.

  3. Enter the required information.
    FieldDescription
    Upload a fileUpload or drop the CSV Excel file in the upload box.
    Process on

    The jobserver used for ingesting.

    Column SeparatorThe character that is used as separator in the data source.
    Quote

    The character that is used as quote in the data source.

    Escape CharacterSelect the character that is used as escape character in the data source.

    Store Data Profile

    Option to data profiling profiling on the registered data.

    Store Sample Data

    Option to extract sample data from the registered data.
  4. Enter the required information.
    FieldDescription
    Upload a fileUpload or drop the CSV Excel file in the upload box.
    Process on

    The jobserver used for ingesting.

    Store Data Profile

    Option to data profiling profiling on the registered data.

    Store Sample Data

    Option to extract sample data from the registered data.
  5. Enter the required information.
    OptionDescription

    Data Source Type

    The data source types for which a JDBC driver is available.

    Note If you want to use a Collibra provided driver, select Collibra driver.

    JDBC Driver Version

    The JDBC driver to connect to your database.

    Process On

    The jobserver used for ingesting.

    <Configuration properties>

    The connection properties as defined in your JDBC driver.

    If you want to use , you need the following connection properties.

    Label

    Description

    Keystore file

    The name of the keystore file. The keystore must contain the client key and client certificate or certificate chain.

    If defaultTruststore is set to false, the keystore has to contain the trusted CA certificate needed to validate the server certificate offered by CyberArk.

    The value must have the following format: file://<keystore-file name.jks>.

    Example file://cyberark-keystore.jks

    Keystore password

    The password required to open the keystore.

    Default truststore

    The indication of the default truststore. The default value is set to False.

    • False: The certificate is validated through the keystoreFile property.
    • True: The certificate is validated through the default truststore from the Java JRE. This is recommended when CyberArk is set up to offer a server certificate that can be validated by a public CA (certification authority).
    CyberArk address

    The host and port number through which the CyberArk server is accessible. The format of the address is hostname:port.

    Example my.cyberark.com:5502

    CyberArk application ID

    The application ID as defined in CyberArk.

    This ID should be provided by your network or system administrator.

    CyberArk query

    The CyberArk query.

    This query should be provided by your network or system administrator.

    If you want to use Kerberos authentication, you also need the following connection properties.

    Label

    Description

    Principal

    The Kerberos principal identity.

    Kerberos realm

    The Kerberos realm name.

    Login context name

    The login context name that is used as the index to the configuration.

    Jaas file name

    The name of the Jaas file.

    Kerberos configuration file

    The configuration file containing specific properties for Kerberos authentication.

    If you want to use NTLM authentication, you also need the following connection properties.

    Label

    Description

    Security

    The security that enables the authentication

    Authentication scheme

    The used authentication scheme, which is NTLM.

    Store credentials

    Select this option to store the credentials to access the database. With a schema refresh, you can clear this option again.

    Username

    Username to access the database.

    PasswordCorresponding password to access the database.

    Schedule Data Refresh

    Enable or disable a schedule to automatically refresh the data registration.

    Cron Expression

    Schedule of the data refresh as a Quartz Cron pattern.

    Warning If you create an invalid cron pattern, Collibra stops responding.

    Time ZoneThe time zone of the database.

    Store Data Profile

    Option to data profiling profiling on the registered data.

    Store Sample Data

    Option to extract sample data from the registered data.
  6. Enter the required information.
    OptionDescription

    Data source type

    The data source types for which a JDBC driver is available.

    Note If you want to use a Collibra provided drivers, select Collibra driver.

    JDBC driver version

    The JDBC driver to connect to your database.

    Connect via

    The jobserver used for ingesting.

    <Configuration properties>

    The connection properties as defined in your JDBC driver.

    Label

    Description

    Hostname

    The name of your device.

    Port

    The port number.

    Database

    The name of your database.

    Schema

    The name of your schema.

    Label

    Description

    URL (hostname:port)

    Address of the used database. Use the format hostname:port.

    Principal

    The Kerberos principal identity.

    Schema

    The name of your schema.

    Label

    Description

    URL (hostname:port)

    Address of the used database. Use the format hostname:port.

    Schema

    The name of your schema.

    Label

    Description

    Hostname

    The name of your device.

    Port

    The port number.

    Database

    The name of your database.

    Schema

    The name of your schema.

    Label

    Description

    Hostname

    The name of your device.

    Port

    The port number.

    Database

    The name of your database.

    Schema

    The name of your schema.

    Label

    Description

    URL (hostname:port)

    Address of the used database. Use the format hostname:port.

    Schema

    The name of your schema.

    Label

    Description

    Hostname

    The name of your device.

    Port

    The port number.

    Database

    The name of your database.

    Schema

    The name of your schema.

    Label

    Description

    Hostname

    The name of your device.

    Port

    The port number.

    Database

    The name of your database.

    Label

    Description

    Hostname

    The name of your device.

    Port

    The port number.

    SID

    The Oracle System ID, which identifies a database on a system.

    Schema

    The name of your schema.

    Label

    Description

    Hostname

    The name of your device.

    Port

    The port number.

    Database

    The name of your database.

    Schema

    The name of your schema.

    Label

    Description

    Hostname

    The name of your device.

    Port

    The port number.

    Database

    The name of your database.

    Schema

    The name of your schema.

    If you want to use Kerberos authentication, you also need the following connection properties.

    Label

    Description

    Principal

    The Kerberos principal identity.

    Kerberos realm

    The Kerberos realm name.

    Login context name

    The login context name that is used as the index to the configuration.

    Jaas file name

    The name of the Jaas file.

    Kerberos configuration file

    The configuration file containing specific properties for Kerberos authentication.

    If you want to use NTLM authentication, you also need the following connection properties.

    Label

    Description

    Security

    The security that enables the authentication

    Authentication scheme

    The used authentication scheme, which is NTLM.

    If you want to use , you need the following connection properties.

    Label

    Description

    Keystore file

    The name of the keystore file. The keystore must contain the client key and client certificate or certificate chain.

    If defaultTruststore is set to false, the keystore has to contain the trusted CA certificate needed to validate the server certificate offered by CyberArk.

    The value must have the following format: file://<keystore-file name.jks>.

    Example file://cyberark-keystore.jks

    Keystore password

    The password required to open the keystore.

    Default truststore

    The indication of the default truststore. The default value is set to False.

    • False: The certificate is validated through the keystoreFile property.
    • True: The certificate is validated through the default truststore from the Java JRE. This is recommended when CyberArk is set up to offer a server certificate that can be validated by a public CA (certification authority).
    CyberArk address

    The host and port number through which the CyberArk server is accessible. The format of the address is hostname:port.

    Example my.cyberark.com:5502

    CyberArk application ID

    The application ID as defined in CyberArk.

    This ID should be provided by your network or system administrator.

    CyberArk query

    The CyberArk query.

    This query should be provided by your network or system administrator.

    Store credentials

    Select this option to store the credentials to access the database. With a schema refresh, you can clear this option again.

    Username

    Username to access the database.

    Note This field is ignored if your data source uses CyberarkKerberosNTLM.
    Password

    Corresponding password to access the database.

    Note This field is ignored if your data source uses CyberarkKerberosNTLM.

    Schedule data refresh

    Enable or disable a schedule to automatically refresh the data registration.
    Cron pattern

    Schedule of the data refresh as a Quartz Cron pattern.

    Warning If you create an invalid cron pattern, Collibra stops responding.

    Time zoneThe time zone of the database.

    Store Data Profile

    Option to data profiling profiling on the registered data.

    Store Sample Data

    Option to extract sample data from the registered data.

    Tables excluded from registration

    Database tables that will not be ingested.

    Note 
    • If required, you can exclude multiple tables. To do this, press Enter after typing a value and then type the next.
    • You can use an asterisk (*) as wildcard to select multiple tables. For example, if you want to exclude the tables that all start with act_, you can enter act_*.
    • The table names are case sensitive.
    • You can add or remove tables from this list by refreshing the schema.
  7. Click Save & Refresh.
    The refresh of the schema starts, you can follow the refresh job in the list of activities.

What's next