Add the Catalog JDBC ingestion capability for Snowflake
If both Collibra and Snowflake are prepared, you can add the Catalog JDBC ingestion capability to the JDBC Snowflake connection.
Prerequisites
- You have gone through the preflight checks.
- You have created a JDBC connection.
Steps
- Open a site.
-
On the main toolbar, click
→
Settings.
The Settings page opens. -
In the tab pane, click Edge.
The Sites tab opens and shows a table with an overview of your sites. - In the table, click the name of the site whose status is Healthy.
The site page opens.
-
On the main toolbar, click
- In the Capabilities section, click Add capability.
The Add capability page opens. - Select the Catalog JDBC ingestion capability template.
- Enter the required information.
Field Description Required Capability
This section contains general information about the capability.
Name
The name of the capability.
Yes
Description
The description of the capability.
No
Capability template
The capability template. The value that you select in this field determines which sections appear on the page.
Select the following capability:
Catalog JDBC ingestion
Yes
Connection
This section contains information to connect to the data source.
JDBC connectionThe Snowflake connection.
Yes
JDBC data source type (Deprecated)Deprecated field. The field was used to indicate the type of the data source. You no longer need to change this field. The required value is automatically identified.
Note The automatically identified value is not shown in this page.
Yes
Supports schemasA text field where you have to enter True to enable database registration of data sources that have no schema. If the data source has schemas, you can ignore this field.
Tip If the data source does not have a schema, Data Catalog creates a Schema asset with the same name as the full name of the database.
No
Other Settings
Others
This section can contain additional capability properties. Click Add propertyAdd Other Settings to add a property.For all available other properties, go to Available "other properties".Note No validation is performed on the values you add.
No
General
This section contains general information about logging.
Debug
An option to automatically send Edge infrastructure log files to Collibra Platform. By default, this option is set to false.
Note We highly recommend to only send Edge infrastructure log files to Collibra Platform when you have issues with Edge. If you set it to true, it will automatically revert to false after 24h.For more information, go to Edge logging.
No
Log level
An option to determine the verbosity level of Catalog connector log files. By default, this option is set to No logging.
No
- Click Add.
The capability is added to the Edge or Collibra Cloud site.
The fields become read-only.
Available "Other Settings"
You can add the following properties in the Other Settings section to define how metadata is retrieved and how synchronization jobs are configured.
Note No validation is performed on the values you add.
Strategy properties
The following properties define the strategy for the retrieval of tables, columns, tags, schemas, primary keys, and foreign keys.
| Name | Description and possible values | Type | Encryption | Default value |
|---|---|---|---|---|
| engine-strategy |
|
Text | Not encrypted (plain text) | SNOWFLAKE_INFORMATION_SCHEMA |
| tags-strategy |
The tags-strategy defines the source of the source tags.
|
Text | Not encrypted (plain text) |
CALL_PER_TABLE |
| schema-list-strategy |
|
Text | Not encrypted (plain text) | USE_DATABASE_NAME |
| columns-strategy |
The columns-strategy controls the method used to retrieve column metadata, such as data types, descriptions, and default values.
|
Text | Not encrypted (plain text) | SINGLE_CALL |
| primary-keys-strategy |
|
Text | Not encrypted (plain text) | SINGLE_CALL |
| foreign-keys-strategy |
The foreign-keys-strategy property controls the method used to retrieve foreign key (cross-reference) metadata.
|
Text | Not encrypted (plain text) | SINGLE_CALL |
Properties for the various synchronization jobs
The following jobs run in view of a synchronization:
- database-list-with-metadata: This job runs when you register a data source and the Database field needs to be filled with available databases.Show more information
Collibra queries the account to list the available databases, equivalent to the native SHOW DATABASES IN ACCOUNT command. If the account has access to only one specific database, meaning SHOW DATABASES IN ACCOUNT returns no results, only the database specified in the connection is retrieved. The retrieval relies on the standard JDBC driver behavior. The actual results depend on the installed driver version. The behavior is described for driver 3.x series. The ingestion logic retrieves the list of databases (catalogs) using the JDBC API method: metaData.getCatalogs(). This directly invokes SnowflakeDatabaseMetaData.getCatalogs() inside the Snowflake JDBC driver.
- schema-list: This job runs in the Configuration tab of a Database asset to show the schemas in the database.Show more information
For a schema to be included, the role must have the USAGE privilege on both the parent Database and the Schema itself. The ingestion process requests all available schemas within a database, equivalent to the native SHOW SCHEMAS IN DATABASE command.
The retrieval relies on the standard JDBC driver behavior.The actual results depend on the installed driver version. The behavior is described for driver 3.x series. The ingestion logic calls the standard JDBC API method: metaData.getSchemas(database, null). This directly invokes SnowflakeDatabaseMetaData.getSchemas(catalog, null) inside the Snowflake JDBC driver. - ingest-schema: This job runs when you click the Synchronize button.
The available properties for these jobs are: