Ingest multiple Power BI applications (deprecated)
You can ingest more than one Power BI application in Collibra. For each Power BI application, you create a separate Power BI configuration file, and then add a section in the lineage harvester configuration file.
Prerequisites
The prerequisites in this topic are different for the username / password authentication method and the service principal authentication method.
- You have access to the Power BI harvester on the Downloads page.
- You have completed all prerequisite tasks.
- You have registered Power BI in Microsoft Azure.
- You have a user with Power BI administrator rights in Microsoft Azure.
- The user with Power BI administrator rights in Microsoft Azure is part of a security group and has the Contributor role in the Power BI workspaces.
- You have enabled the service principal option in the Power BI Admin portal.
- The service principal is part of a security group and has the Contributor role in the Power BI workspaces.
- You have a dedicated domain to ingest the Power BI assets.
- You have a global role with the Catalog global permission, for example Catalog Author.
- You have a global role with the Technical lineage global permission.
- You have a global role with the Data Stewardship Manager global permission.
-
A resource role with the following resource permission on the community level in which you created the BI Data Catalog domain:
- Asset: add
- Attribute: add
- Domain: add
- Attachment: add
- Your environment meets the system requirements to run the Power BI harvester and the lineage harvester.
Tip For a full ingestion, we highly recommend to have a Power BI Premium subscription.
Steps
- Prepare the Power BI configuration file for one Power BI application.
- Run the Power BI harvester.
- For each additional Power BI application, do the following:
- Prepare a new configuration file with the information of the next Power BI application.
- Optionally, create a new domain in Data Catalog to ingest the assets of this Power BI application.
- Enter a new source ID that is different from the source IDs of existing Power BI configuration files.
- Run the Power BI harvester again.
Note Make sure that you refer to the path of this configuration file when you run the Power BI harvester.
The Power BI harvester collects the Power BI metadata of each Power BI application and sends it to the Collibra Data Lineage server.Collibra scans and analyzes the metadata. - Prepare a new configuration file with the information of the next Power BI application.
- In the lineage harvester configuration file, create a Power BI section for each Power BI application. Use the source ID of each Power BI configuration file as the ID of the Power BI section in the lineage harvester configuration file.
- Run the lineage harvester to ingest the Power BI metadata in Collibra.
The Power BI metadata is ingested in the domain that you specified in the Power BI configuration file.
Example
You have two Power BI applications that you want to ingest. The first Power BI configuration file has source ID power-bi-app-a, the second Power BI configuration file has source ID power-bi-app-b. The lineage harvester configuration file contains two Power BI sections that each refer to a different source ID.
{
"general": {
"catalog" : {
"url" : "https://companydomain.collibra.com",
"username" : "my-Collibra-username"}
},
"sources" : [
{
"type" : "ExistingLineage",
"id" : "power-bi-app-a"
}
{
"type" : "ExistingLineage",
"id" : "power-bi-app-b"
}]
}
What's next?
To refresh the Power BI metadata in Data Catalog, you can run the Power BI harvester and lineage harvester again or schedule jobs to run them automatically. You can schedule to synchronize Power BI applications at different times.