Overview: Tableau integration v2 and migration
The Tableau integration v2 enables you to harvest Tableau metadata and create new Tableau assets in Data Catalog. Collibra Data Intelligence Cloud analyzes and processes the metadata and presents it as specific asset types, retaining their original names.
Steps
The following table shows the steps and prerequisites required to ingest metadata in Collibra via lineage harvester (Tableau integration v2) and run the migration script.
- This overview assumes that you have already ingested Tableau assets via Tableau integration v1.
- In the commands that you enter to run the migration, you need to specify which custom asset types, attribute types and relation types you want to migrate.
|
Step |
What? |
Description |
Prerequisites |
|---|---|---|---|
|
1 |
Before you start the Tableau integration in Data Catalog, make sure that the lineage harvester can reach the Tableau metadata. Perform these tasks before you start the actual Tableau ingestion process. Warning Because these tasks are performed outside of Collibra, it is possible that the content changes without us knowing. We strongly recommend that you carefully read the source documentation. |
|
|
|
2 |
Before you can ingest Tableau metadata, you have to create a new domain or choose an existing domain to store the new Tableau assets. Warning If you are using Collibra Data Intelligence Cloud 2021.11 or older, you have to add all Tableau attributes in the operating model to a scope and create a scoped assignment before you ingest Tableau via the lineage harvester. For complete information and step-by-step instruction, see Tableau general troubleshooting. |
You have a resource role with the following resource permissions:
|
|
|
3 |
You prepare Data Catalog's physical data layer to enable Data Catalog to automatically stitch the Tableau assets to existing assets in Data Catalog. |
|
|
|
4 |
Download and install the lineage harvester |
You use the lineage harvester to trigger the creation of Tableau assets, their relations and a technical lineage in Data Catalog. You can download the lineage harvester from the Collibra Product Resource Downloads page. For a list of lineage harvester installation requirements, see About the lineage harvester installation. |
|
|
5 |
Prepare the lineage harvester configuration file and run the lineage harvester. |
You create a lineage harvester configuration file with Tableau connection information and run the lineage harvester to import the results of the Tableau integration and the technical lineage for Tableau into Data Catalog. As a result, you now have a duplicate of your Tableau metadata in Collibra. |
|
|
6 |
The migration script is triggered by a lineage harvester command. You then use arguments to migrate your customized asset types and custom attribute types and relation types. Note You need lineage harvester version 2022.03.0-5 or newer. We recommend that you use the newest lineage harvester. |
Same prerequisites as for the previous step. | |
|
7 |
Verify the migration results |
Compare your Tableau integration v2 assets to the respective Tableau integration v1 assets. Look to see that the metadata that you manually added to your integration v1 assets has been added to your integration v2 assets. |
None |
|
8 |
Delete your Tableau integration v1 metadata. |
If you've reviewed the migration results and everything looks fine, you can delete your Tableau integration v1 assets and any assets of custom asset types. |
|
Naming convention
When you synchronize Tableau, Collibra follows a strict naming convention for the names of the new assets. Each asset has a display name and full name. The full name represents the asset path from asset to the database it belongs to. You can freely edit the display name. However, you should never edit the full name, because Data Catalog needs it for a successful migration. Changing the full name may also break the synchronization process.
Warning We highly recommend that you not edit the full names of any Tableau assets. Doing so will likely lead to errors during the migration and synchronization process.