About data sources

A data source is an external system where your data, accounts, and groups reside, and from where the data is pulled into Data Access. An external system is also called the underlying data source. Your external system can be a database, reporting tool, or data warehouse such as BigQuery, Databricks, or Snowflake. It can also be an identity store that contains only accounts and groups (no data objects), such as Microsoft Entra ID or Okta. You can also have multiple data sources of the same type in Data Access, for example, multiple Snowflake accounts.

A data source in Data Access is an instance of your external system within Data Access. It allows Data Access to synchronize entities such as access controls, identities, and data objects.

Note Data Access doesn't automatically link a data source to an existing asset in Collibra. You can, however, manually link it to a System asset in Data Catalog.

Supported data sources

Data Access supports the following data sources:

How data sources work

Each data source includes metadata that describes its structure and behavior. This metadata helps Data Access understand:

Collibra as a data source

Data Access treats Collibra as a built-in data source, automatically including all users and groups from your Collibra environment. Because this data source is managed by Collibra, it is read-only in Data Access. You can still change its owners. To manage the users and groups within it, you can use the Collibra settings.

Related topics

How data sources sync