Supported transformation details
Collibra Data Lineage supports the most commonly used transformations in the following sources:
- Azure Data Factory
- dbt Cloud (Beta)
- IBM DataStage
- Informatica PowerCenter
- Informatica Intelligent Cloud Services
- Snowflake
- SQL Server Integration Services
Azure Data Factory
Collibra Data Lineage supports the most commonly used transformations and data sources in Azure Data Factory.
Supported transformations
The following tables shows a non-exhaustive list of supported and unsupported transformations.
Supported data sources
The following table shows a non-exhaustive list of supported sources with the corresponding dataset and linked service types.
CollibraData Lineage supports all data format types that are supported in Azure Data Factory, including binary, Excel file, Delimited text, JSON, Parquet, and so on.
| Data sources |
Dataset type |
Linked service type |
|---|---|---|
| Amazon Redshift | AmazonRedshiftTable | AmazonRedshift |
| Azure Blob storage | AzureBlob | AzureBlobStorage |
| Azure Data Lake Storage Gen2 | AzureBlobFSFile | AzureBlobFS |
| Azure Data Lake Store | AzureDataLakeStoreFile | AzureDataLakeStore |
| Azure Databricks Delta Lake | AzureDatabricksDeltaLake | AzureDatabricksDeltaLake |
| Azure SQL Managed Instance | AzureSqlMITable | AzureSqlMI |
| Azure SQL Server database | AzureSqlTable | AzureSqlDatabase |
| Azure Synapse Analytics | AzureSqlDWTable | AzureSqlDW |
| DB2 data source | Db2Table | Db2 |
| Google Cloud Storage | GoogleCloudStorageLocation | GoogleCloudStorage |
| Microsoft Access | MicrosoftAccessTable | MicrosoftAccess |
| Microsoft Azure Cosmos Database | CosmosDbSqlApiCollection | CosmosDb |
| Open Database Connectivity (ODBC) | OdbcTable | Odbc |
| On-premises Oracle database | OracleTable | Oracle |
| REST | RestResource | RestService |
| Salesforce | SalesforceObject | Salesforce |
| Salesforce Marketing Cloud | SalesforceMarketingCloudObject | SalesforceMarketingCloud |
| Salesforce Service Cloud | SalesforceServiceCloudObject | SalesforceServiceCloud |
| SAP Business Warehouse (open hub) | SapOpenHubTable | SapBW |
| SFTP server | SftpLocation | Sftp |
| Snowflake | SnowflakeTable | Snowflake |
| SQL Server | SqlServerTable | SqlServer |
dbt Cloud (Beta)
Collibra Data Lineage supports materialization, and tables and views are treated like tables by default. You can customize the setting in one of the following ways so that the tables and views are treated like views:
- If you use technical lineage via Edge, specify the
materializedMappingproperty in the <source ID> configuration file. - If you use the lineage harvester, specify the
materializedMappingproperty in the Source Configuration field in the Technical Lineage for dbt Cloud capability.
IBM DataStage
IBM DataStage uses jobs with stages instead of transformations. IBM Datastage has three job types: parallel jobs, sequence jobs and server jobs. For a list of all job stages per job type in IBM DataStage, read the IBM documentation.
Informatica PowerCenter transformations
The following table shows a non-exhaustive list of supported and unsupported transformations in Informatica PowerCenter.
|
Supported transformations |
Unsupported transformations |
|---|---|
|
|
|
|
Informatica Intelligent Cloud Services
The following table shows a non-exhausitive list of supported taskflows and unsupported tasks in Informatica Intelligent Cloud Services.
|
Supported taskflows |
Unsupported tasks |
|---|---|
|
|
The following table shows a non-exhaustive list of supported and unsupported transformations and constructions in Informatica Intelligent Cloud Services. Specifically, transformations and constructions in the Cloud Data Integration service.
|
Supported transformations |
Unsupported transformations, functions and constructions |
|---|---|
|
|
Snowflake
You can create technical lineage for Snowflake by using SQL Snowflake ingestion mode or SQL-API Snowflake ingestion mode. Collibra Data Lineage supports different queries and transformations for each ingestion method. For more information about the ingestion methods, go to Technical lineage for Snowflake ingestion methods.
SQL Snowflake ingestion mode
With the SQL Snowflake ingestion mode, Collibra Data Lineage does not support the following non-exhaustive list of transformations:
- Snowflake Scripting
- Snowpark
SQL-API Snowflake ingestion mode
With the SQL-API Snowflake ingestion mode, Collibra Data Lineage supports the Data Manipulation Language (DML) statements from the following sources. The table also shows a non-exhaustive list of unsupported queries and transformations.
|
Supported transformations |
Unsupported queries and transformations |
|---|---|
|
|
|
Note
|
|
SQL Server Integration Services (SSIS)
Collibra Data Lineage supports the following non-exhaustive list of transformations and component types in SQL Server Integration Services:
|
Supported transformations |
Supported component types |
|---|---|
|
|
- Collibra Data Lineage supports SQL, but cannot parse other languages or scripts, for example SHELL and BAT scripts.
- SQL statements from Excel are not supported.
- All SQL queries must be preceded by the SELECT or WITH keyword, or else they will be skipped. If a comment precedes the SELECT or WITH keyword, the query will be parsed as expected.