About ThoughtSpot integration via Edge
ThoughtSpot is a search-driven analytics platform that provides a semantic layer for enterprise data exploration. Through the ThoughtSpot integration, Collibra captures metadata from key analytical assets, such as Liveboards, Answers, and Worksheets, to provide a unified view of how business users consume data.
In a cloud deployment, each customer operates in a dedicated tenant, typically accessible through a URL in the following format: https://{tenantName}.thoughtspot.cloud.
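As an illustration of the URL format, the following minimal Python sketch builds a tenant URL from a tenant name and extracts the tenant name from a URL. The function names and the tenant name acme are hypothetical and are not part of Collibra or ThoughtSpot. The sketch assumes the typical format shown above, so tenants on a custom domain would not match.

```python
from typing import Optional
from urllib.parse import urlparse

# Assumption: the typical cloud host suffix described in this topic.
CLOUD_SUFFIX = ".thoughtspot.cloud"


def tenant_base_url(tenant_name: str) -> str:
    """Build the typical cloud URL for a tenant name."""
    return f"https://{tenant_name}{CLOUD_SUFFIX}"


def tenant_name_from_url(url: str) -> Optional[str]:
    """Return the tenant name if the URL follows the typical format, else None."""
    host = urlparse(url).hostname or ""
    if host.endswith(CLOUD_SUFFIX) and len(host) > len(CLOUD_SUFFIX):
        return host[: -len(CLOUD_SUFFIX)]
    return None


# "acme" is a made-up tenant name used only for illustration.
print(tenant_base_url("acme"))
print(tenant_name_from_url("https://acme.thoughtspot.cloud/"))
```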
Collibra retrieves and organizes metadata and lineage information from your ThoughtSpot instances to provide visibility into data consumption, visualization, and data flow. The integration collects and maintains the latest ThoughtSpot metadata available through the ThoughtSpot REST APIs at the time of synchronization.
Collibra supports integration of ThoughtSpot metadata into the Data Catalog and enables the creation of technical lineage (in preview).
Function scope
The integration follows these rules:
- The integration only ingests ThoughtSpot content that is accessible through the configured ThoughtSpot API credentials (username and password). Any Orgs, Liveboards, Answers, Data Models, or Connections that the service account cannot access are skipped during synchronization.
- ThoughtSpot uses Orgs to logically partition a cluster into independent environments. The integration supports filtering at the Org level, allowing you to synchronize only the organizations relevant to your governance scope.
- Collibra synchronizes the current state of the ThoughtSpot repository during each run, reflecting the latest metadata definitions available in the source system.
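The first two rules can be pictured as a simple scope check: an Org is synchronized only if the service account can access it and it matches the configured Org filter. The following Python sketch illustrates that logic only. The Org class, the accessible flag, and the orgs_in_scope function are hypothetical and do not represent the actual Collibra implementation or the ThoughtSpot REST API.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set


@dataclass
class Org:
    """Hypothetical stand-in for a ThoughtSpot Org."""
    name: str
    accessible: bool  # assumption: True if the service account can access the Org


def orgs_in_scope(orgs: Iterable[Org], org_filter: Set[str]) -> List[Org]:
    """Keep only Orgs that are accessible and named in the filter."""
    selected = []
    for org in orgs:
        if not org.accessible:
            continue  # inaccessible content is skipped during synchronization
        if org.name not in org_filter:
            continue  # outside the configured governance scope
        selected.append(org)
    return selected


# Made-up Orgs used only for illustration.
all_orgs = [
    Org("Finance", accessible=True),
    Org("Marketing", accessible=True),
    Org("HR", accessible=False),
]
scope = orgs_in_scope(all_orgs, org_filter={"Finance", "HR"})
print([org.name for org in scope])
```

In this example, only Finance is in scope. HR is named in the filter but is not accessible, and Marketing is accessible but not named in the filter.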
To integrate ThoughtSpot via Edge or a Collibra Cloud site, first establish a connection to your ThoughtSpot tenant using a service account with the required permissions. When you add and synchronize the ThoughtSpot capability, Collibra retrieves metadata and lineage from the specified Orgs, creates assets in the Data Catalog, and maps relations between Answers or Liveboards and their underlying logical models.
Character sanitization in asset full names
To ensure that technical lineage graphs render correctly and to maintain a stable asset hierarchy, any greater-than characters (>) in ThoughtSpot asset names are automatically replaced with a hyphen (-) during ingestion.
For example, if a full name would otherwise be TenantA > OrgB > (Sales>Revenue) [LiveboardId], the Liveboard name Sales>Revenue is converted to Sales-Revenue, so the full name becomes TenantA > OrgB > (Sales-Revenue) [LiveboardId].
This replacement affects only the asset full name used by Collibra to construct the lineage path. The display name remains unchanged and continues to show the original character as it appears in ThoughtSpot.
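The replacement can be sketched in a few lines of Python. This is an illustration of the rule, not Collibra's implementation. The function names are hypothetical, and the full name layout (tenant, Org, name in parentheses, ID in brackets) is inferred from the example above.

```python
def sanitize_name(name: str) -> str:
    """Replace every greater-than character with a hyphen."""
    return name.replace(">", "-")


def build_full_name(tenant: str, org: str, asset_name: str, asset_id: str) -> str:
    """Assemble a full name using the layout assumed from the example.

    Only the asset name is sanitized here; the " > " separators between
    path segments are kept as they are.
    """
    return f"{tenant} > {org} > ({sanitize_name(asset_name)}) [{asset_id}]"


display_name = "Sales>Revenue"  # unchanged, as shown in ThoughtSpot
full_name = build_full_name("TenantA", "OrgB", display_name, "LiveboardId")

print(display_name)  # Sales>Revenue
print(full_name)     # TenantA > OrgB > (Sales-Revenue) [LiveboardId]
```

The display name variable is never modified, which mirrors the behavior described above: only the full name carries the hyphen.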
For the full procedure, go to Steps: Integrate ThoughtSpot via Edge.