Azure Data Factory source configuration
The Source configuration field in the Azure Data Factory technical lineage Edge capability allows you to map the names of databases in Azure Data Factory to the names of System assets in Data Catalog.
Tip: If you previously integrated Azure Data Factory via the lineage harvester (deprecated), you can copy and paste the JSON code from your <source ID> configuration file into the Source configuration field.
The value of the Source configuration field must be a valid block of JSON code, for example:
{
  "found_dbname=databasename1;found_hostname=server-name.onmicrosoft.com;found_schema=schema1": {
    "dbname": "mssql-database-name",
    "schema": "mssql-schema-name",
    "dialect": "mssql",
    "collibraSystemName": "mssql-system-name"
  },
  "found_dbname=datafactory_linkedservice;found_hostname=*": {
    "dbname": "linkedservice-dbname",
    "schema": "linkedservice-schema",
    "collibraSystemName": "linkedservice-system-name"
  }
}
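Because the field only accepts valid JSON, it can help to check a configuration before pasting it in. The following is a minimal sketch in Python, not part of any Collibra tool: the helper name `check_source_configuration` is hypothetical, and treating property names outside the documented set as problems is this sketch's own assumption, not confirmed product behavior.

```python
import json

# Property names documented in the table in this section.
KNOWN_PROPERTIES = {"dbname", "schema", "dialect", "collibraSystemName"}


def check_source_configuration(text):
    """Return a list of problems found in a Source configuration string.

    Hypothetical helper for illustration only.
    """
    try:
        config = json.loads(text)
    except json.JSONDecodeError as err:
        return [f"not valid JSON: {err}"]
    if not isinstance(config, dict):
        return ["top level must be a JSON object"]

    problems = []
    for key, entry in config.items():
        if not isinstance(entry, dict):
            problems.append(f"{key!r}: value must be a JSON object")
            continue
        # Assumption: anything outside the documented properties is a typo.
        unknown = sorted(set(entry) - KNOWN_PROPERTIES)
        if unknown:
            problems.append(f"{key!r}: unknown properties {unknown}")
    return problems


example = """
{
  "found_dbname=databasename1;found_hostname=server-name.onmicrosoft.com;found_schema=schema1": {
    "dbname": "mssql-database-name",
    "schema": "mssql-schema-name",
    "dialect": "mssql",
    "collibraSystemName": "mssql-system-name"
  }
}
"""
print(check_source_configuration(example))  # prints []
```

An empty list means the text parsed as JSON and every entry used only the documented property names. It does not confirm that the names match assets in Data Catalog.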
The following table describes the various properties you can use in your JSON code block.
| Property | Description | Mandatory? |
|---|---|---|
| found_dbname=<database name>;found_hostname=<server name>;found_schema=<schema name> or found_dbname=<datafactory_name>_<linkedservice_name>;found_hostname=* | Information about the supported data sources in Azure Data Factory that Collibra Data Lineage collects. You can use wildcards, for example found_hostname=*, to capture multiple connection string combinations. | Yes |
| dbname | The name of the database asset in Data Catalog. | No |
| schema | The name of the schema asset in Data Catalog. Specify the schema name that you created when you registered the data source. If Collibra Data Lineage fails to find the schema that you specify, it uses the default schema. | No |
| dialect | The dialect of the data source, for example mssql. The value must be one of the dialects of the supported data sources in Azure Data Factory. | No |
| collibraSystemName | The system or server name of the data source. The value must exactly match, including case, the name of the System asset that you create when you prepare the physical data layer in Data Catalog. If you don't prepare the physical data layer, Collibra Data Lineage cannot stitch the data objects in your technical lineage to the assets in Data Catalog. If you have two databases with the same name, for example two databases named Customers, create a System asset for each of them when you prepare the physical data layer, for example Customers-Europe and Customers-USA, and set this property to the corresponding System asset name in each entry. | No |
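The case of two databases that share the name Customers can be sketched as follows. This is an illustration under stated assumptions, not a configuration taken from the product documentation: the host names and the schema name are hypothetical placeholders, and distinguishing the two databases by host name is an assumption of this sketch. Only the System asset names Customers-Europe and Customers-USA come from the description above.

```python
import json


def make_key(dbname, hostname, schema):
    # Key format copied from the first property row of the table above.
    return f"found_dbname={dbname};found_hostname={hostname};found_schema={schema}"


# Hypothetical host names and schema name; replace them with the values
# found in your own Azure Data Factory connections.
source_configuration = {
    make_key("Customers", "eu-sql.example.com", "dbo"): {
        "dbname": "Customers",
        "collibraSystemName": "Customers-Europe",
    },
    make_key("Customers", "us-sql.example.com", "dbo"): {
        "dbname": "Customers",
        "collibraSystemName": "Customers-USA",
    },
}

# Print the JSON block that would go into the Source configuration field.
print(json.dumps(source_configuration, indent=2))
```

Building the block from a dictionary and serializing it with `json.dumps` guarantees that the output is valid JSON. Each collibraSystemName value still has to match the System asset name exactly, including case.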