Customizing the database name for database-less data sources
When you create a JDBC connection for a database-less data source, such as Hive, MongoDB, or Teradata, the default database name is set to CData. When you register the data source via Edge, CData is listed as the database name.
You can use the Other connection property and set the value to CustomizedDefaultCatalogName=<custom database name> to customize the database name when you connect to your data source. Collibra then uses this customized database name when you register the data source via Edge. To use this property and value, you must use a Collibra-provided driver that is newer than version 23.0.8409. For details about specifying the CustomizedDefaultCatalogName=<custom database name> value in the Other connection property for each data source, go to Overview of Catalog connectors.
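As a rough illustration of the value format and the driver version requirement described above, the following Python sketch builds the string to enter in the Other connection property. The function names and the sample database name are hypothetical and are not part of any Collibra tooling. The sketch also assumes that driver versions are plain dot-separated numbers.

```python
from typing import Tuple

# Stated requirement: the driver must be newer than this version.
MIN_DRIVER_VERSION = "23.0.8409"


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn a dot-separated version such as '23.0.8409' into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


def other_property_value(custom_database_name: str, driver_version: str) -> str:
    """Return the value for the Other connection property (hypothetical helper)."""
    # The driver must be strictly newer than 23.0.8409, so an equal version is rejected.
    if parse_version(driver_version) <= parse_version(MIN_DRIVER_VERSION):
        raise ValueError(
            f"Driver {driver_version} is not newer than {MIN_DRIVER_VERSION}"
        )
    name = custom_database_name.strip()
    if not name:
        raise ValueError("The custom database name must not be empty")
    return f"CustomizedDefaultCatalogName={name}"


if __name__ == "__main__":
    # 'sales_hive' and '23.0.8500' are made-up example values.
    print(other_property_value("sales_hive", "23.0.8500"))
```

The helper only formats the value. You still enter the resulting string in the Other connection property of the JDBC connection yourself.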
If you customized the database name and want to create technical lineage for the database-less data sources, ensure that you take the following actions:
- If you use technical lineage via Edge, add the customized database name in the External Database Name field when you add the technical lineage capability for the data source.
- If you use the lineage harvester, specify the externalDbName property in the lineage harvester configuration file.
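For the lineage harvester case, the following Python sketch shows the idea of carrying the customized database name into a configuration entry as externalDbName. Only the property name externalDbName comes from this page. The shape of the entry, the "id" key, and the helper name are assumptions made for illustration, so check the lineage harvester documentation for the actual file layout.

```python
import json


def with_external_db_name(source_entry: dict, custom_database_name: str) -> dict:
    """Return a copy of a source entry with externalDbName set (hypothetical helper)."""
    updated = dict(source_entry)  # shallow copy so the caller's entry is not modified
    updated["externalDbName"] = custom_database_name
    return updated


if __name__ == "__main__":
    # Hypothetical entry; the real configuration file contains other properties.
    entry = {"id": "hive-source"}
    # Use the same name that was set through CustomizedDefaultCatalogName.
    print(json.dumps(with_external_db_name(entry, "sales_hive"), indent=2))
```

The point of the sketch is that the value of externalDbName should be the same customized database name that you set on the JDBC connection.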
If you add or change the CustomizedDefaultCatalogName=<custom database name> value in the Other connection property after a database was registered, we treat the database as a new one, and you must register the data source again with the new database name. Renaming a database while keeping the existing registered assets is not possible.
If you add the CustomizedDefaultCatalogName=<custom database name> value in the Other connection property to the JDBC connection after the database was listed for the first time, CData will appear as the database name.

This property is available for the following database-less data sources:
- Amazon DynamoDB
- Apache Cassandra
- Apache HBase
- Apache Hive
- Apache Spark SQL
- Avro
- Azure Cosmos DB
- Azure Table Storage
- CSV
- Elasticsearch
- Excel
- Google Sheets
- Greenplum
- IBM Cloudant
- IBM Db2
- Impala
- JSON
- MarkLogic
- MongoDB
- Parquet
- Salesforce
- SAS Data Sets
- Splunk
- Teradata
- XML