Customizing the database name for database-less data sources
When you create a JDBC connection for a database-less data source, such as Hive, MongoDB, or Teradata, the default database name is set to CData. When you register the data source via Edge, CData is listed in the Database name drop-down menu on the Add Database page.
When you create the JDBC connection, you can use the CustomizedDefaultCatalogName connection property to customize the database name. JDBC driver version 23.0.8409 or newer is required to use this property. This property is available for the following database-less data sources:
- Amazon DynamoDB
- Apache Cassandra
- Apache HBase
- Apache Hive
- Apache Spark SQL
- Avro
- Azure Cosmos DB
- Azure Table Storage
- CSV
- Elasticsearch
- Excel
- Google Sheet
- Greenplum
- IBM Cloudant
- IBM Db2
- Impala
- JSON
- MarkLogic
- MongoDB
- Parquet
- Salesforce
- SAS Data Sets
- Splunk
- Teradata
- XML
If you add or change the CustomizedDefaultCatalogName connection property after a database was registered, we treat the database as a new one, and you must register the data source again with the new database name. Renaming a database while keeping the existing registered assets is not possible.
CustomizedDefaultCatalogName property to the JDBC connection after the database was listed for the first time in the Database name drop-down menu on the Add Database page, both the new database name and CData will appear in the Database name drop-down menu. Make sure to select the new database name when you register the data source.If you customized the database name and want to create technical lineage for the database-less data sources, ensure that you take the following actions:
- If you use technical lineage via Edge, add the customized database name in the External Database Name field when you add the technical lineage capability for the data source.
- If you use the lineage harvester, specify the
externalDbNameproperty in the lineage harvester configuration file.