Creating technical lineage via Edge (beta)
This topic provides an overview of the steps required to create technical lineage for JDBC data sources and ETL tools, or a custom technical lineage, on Edge.
To view the steps to create technical lineage for your data source, select the data source and connection type, if applicable. For a list of supported data sources and their corresponding connection types, go to Supported data sources for technical lineage.
Select a data source and, if needed, the connection type to see the related information:
Amazon Redshift
Azure SQL Data Warehouse
Azure SQL Server
Azure Synapse Analytics
DB2
Google BigQuery
Greenplum
HiveQL
IBM InfoSphere DataStage
Informatica Intelligent Cloud Services
Informatica PowerCenter
Matillion
Oracle
PostgreSQL
MySQL
Netezza
SAP Hana
Snowflake
Spark SQL
SQL Server
SQL Server Integration Services
Sybase
Teradata
Custom technical lineage
Which connection type do you use?
For best technical lineage results, use the JDBC connection to ingest JDBC sources when possible, rather than using the Shared Storage connection with SQL files.
Before you begin
- Use Collibra Data Intelligence Cloud 2023.03 or later.
- Create an Edge site in Collibra Data Intelligence Cloud.
- Install an Edge site.
- Connect to a proxy server if needed.
- Create a JDBC connection.
- Register a data source via Edge.
Requirements and permissions
- A global role with the following global permissions:
- Data Stewardship Manager
- Manage all resources
- System administration
- Technical lineage
- A resource role with the following resource permission on the community level in which you created the BI Data Catalog domain:
- Asset: add
- Attribute: add
- Domain: add
- Attachment: add
- Necessary permissions to all database objects that technical lineage via Edge accesses.
- Permissions required for the data source that you selected:
You need read access on the SYS schema.
You need read access on the SYS schema and the View Definition permission in your SQL Server.
You need read access on information_schema:
- bigquery.jobs.create
- bigquery.readsessions.create
- bigquery.tables.getData
- bigquery.readsessions.getData
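Before running an ingestion, it can help to compare the permissions that a service account actually holds with the four BigQuery permissions listed above. The following is a minimal sketch: the function name and the sample `granted` list are hypothetical, and how you obtain the granted permissions (for example, from your IAM tooling) is outside its scope.

```python
# The four BigQuery permissions listed above.
REQUIRED_BIGQUERY_PERMISSIONS = frozenset({
    "bigquery.jobs.create",
    "bigquery.readsessions.create",
    "bigquery.tables.getData",
    "bigquery.readsessions.getData",
})


def missing_permissions(granted):
    """Return the required permissions that are absent from `granted`, sorted."""
    return sorted(REQUIRED_BIGQUERY_PERMISSIONS - set(granted))


if __name__ == "__main__":
    # Hypothetical example: permissions currently held by the account.
    granted = ["bigquery.jobs.create", "bigquery.tables.getData"]
    for permission in missing_permissions(granted):
        print("missing:", permission)
```

An empty result means that all four listed permissions are present.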
GRANT SELECT, at table level. Grant this to every table for which you want to create a technical lineage.
The role of the user must be the owner of the views in PostgreSQL, and the username of the user must be specified in the JDBC connection that you use to access PostgreSQL.
You need read access on information_schema. Only views that you own are processed.
SELECT, at table level. Grant this to every table for which you want to create a technical lineage.
A role with the LOGIN option.
SELECT WITH GRANT OPTION, at table level.
CONNECT ON DATABASE
Note The following permissions are the same, regardless of the ingestion mode: SQL or SQL-API.
You need a role that can access the Snowflake shared read-only database. To access the shared database, the account administrator must grant the IMPORTED PRIVILEGES privilege on the shared database to the user. The username of the user must be specified in the JDBC connection that you use to access Snowflake.
Tip If the default role in Snowflake does not have the IMPORTED PRIVILEGES privilege, you can click the Add property button to add a custom parameter with the following values specified:
- Name: customConnectionProperties
- Type: Text
- Encryption: Select one of the following encryption methods: Not encrypted (plain text), Encrypted with public key, or To be encrypted by Edge management server
- Value: role=METADATA
You need read access on the DBC.
You need read access to the following dictionary views:
- all_tab_cols
- all_col_comments
- all_objects
- ALL_DB_LINKS
- all_mviews
- all_source
- all_synonyms
- all_views
You need read access on definition_schema.
- Your user role must have privileges to export assets.
- You must have read permission on all assets that you want to export.
- You have added the Matillion certificate to a Java truststore.
- You have at least a Matillion Enterprise license.
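Several of the data source permissions above require SELECT at table level on every table for which you want to create a technical lineage. When many tables are in scope, generating the statements is less error-prone than typing them. The sketch below assumes a generic `GRANT SELECT ON <table> TO <grantee>` form; the exact GRANT syntax varies by database, and the function name, table names, and grantee are hypothetical. Review the output before a database administrator runs it.

```python
import re

# Assumption: only unquoted, optionally schema-qualified identifiers are accepted,
# so that arbitrary text cannot end up inside a generated statement.
_NAME = re.compile(r"[A-Za-z_][A-Za-z0-9_]*(\.[A-Za-z_][A-Za-z0-9_]*)*")


def grant_select_statements(tables, grantee):
    """Build one generic GRANT SELECT statement per table for `grantee`."""
    tables = list(tables)
    for name in tables + [grantee]:
        if not _NAME.fullmatch(name):
            raise ValueError(f"unsupported identifier: {name!r}")
    return [f"GRANT SELECT ON {table} TO {grantee};" for table in tables]


if __name__ == "__main__":
    # Hypothetical tables and grantee, for illustration only.
    for statement in grant_select_statements(
        ["sales.orders", "sales.customers"], "lineage_reader"
    ):
        print(statement)
```

The helper only builds text; it does not connect to a database or execute anything.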
Steps
- Enable and configure technical lineage via Edge.
- Create an Informatica Intelligent Cloud Services connection.
Note Collibra Data Intelligence Cloud 2023.03 or newer is required to use the Informatica Intelligent Cloud Services (IICS) connection.
- Create a Matillion connection.
Note Collibra Data Intelligence Cloud 2023.03 or newer is required to use the Matillion connection.
- Create a Shared Storage connection.
- Prepare the data source files and store them in the folder that you created for the Shared Storage connection in the previous step.
- Create a custom technical lineage JSON file and store it in the folder that you created for the Shared Storage connection in the previous step.
- Add a technical lineage capability to an Edge site.