Power BI integration steps

The Power BI integration enables you to harvest Power BI metadata and create new Power BI assets in Data Catalog. Collibra analyzes and processes the BI metadata and presents it as specific asset types, retaining their original names.

Steps

  1. Complete the tasks in Power BI and Microsoft Azure.
  2. Prepare a domain for Power BI ingestion.
  3. Prepare the Data Catalog physical data layer.
  4. Optionally, assign the attribute type State to the global assignment of the Power BI Workspace asset type.
    For complete information, see Power BI workspaces.
  5. Download and install the lineage harvester.
  6. Prepare the lineage harvester configuration file.
  7. Optionally, prepare the Power BI <source ID> configuration file.
  8. Manually refresh your Power BI datasets.
    Important Carry out this step only the first time you integrate Power BI in Data Catalog, to make sure that the data in your Power BI datasets is up to date. After that, Microsoft automatically refreshes the datasets every 90 days. For complete information, see:
  9. In the console, start the lineage harvester again by running the following command:
    • for Windows: .\bin\lineage-harvester.bat full-sync
    • for other operating systems: ./bin/lineage-harvester full-sync
  10. When prompted, enter the password or client secret to connect to your Collibra Data Intelligence Cloud and Power BI environment. The passwords are encrypted and stored in /config/pwd.conf.
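
If you script the full-sync command in step 9, the choice between the Windows and non-Windows command can be sketched as follows. This is a minimal illustration, not part of the lineage harvester: the function name full_sync_command is hypothetical, and the sketch assumes you run it from the lineage harvester installation directory. Only the two command lines themselves come from the steps above.

```python
import platform


def full_sync_command(system=None):
    """Return the full-sync command line for the given operating system.

    `system` is an OS name as reported by platform.system(), for example
    "Windows", "Linux" or "Darwin". If omitted, the current OS is used.
    This helper is hypothetical and only mirrors the two commands in step 9.
    """
    if system is None:
        system = platform.system()
    if system == "Windows":
        # Windows uses the batch file wrapper.
        return r".\bin\lineage-harvester.bat full-sync"
    # All other operating systems use the shell wrapper.
    return "./bin/lineage-harvester full-sync"


if __name__ == "__main__":
    # Print the command to run from the lineage harvester directory.
    print(full_sync_command())
```

The sketch only builds the command string. Running the command remains an interactive step, because the lineage harvester prompts for the password or client secret as described in step 10.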

What's next?

You can check the progress of the ingestion in Activities. The results field indicates how many relations were imported into Data Catalog.

After the metadata is ingested in Data Catalog, you can go to the domain that you specified in your lineage harvester configuration file and view the newly created assets. These assets are automatically stitched to existing assets in Data Catalog.

You can also view the technical lineage.

Note If you ingest Power BI for the first time or if you change your geolocation or cloud provider, you have to restart the DGC service before you can see your technical lineage.

Warning We highly recommend that you do not move the ingested assets to other domains. If you do, the assets will be deleted and recreated in the initial Data Catalog BI domain (or domains) when you synchronize Power BI. As a result, any manually added characteristics of those assets are lost.