About preparing Edge or Collibra Cloud sites for data sources
After you create an Edge or request a Collibra Cloud site, you connect to data sources. Typically, you create a connection for a specific data source before adding capabilities to it. Capabilities use these connections to allow Collibra to get information from your data sources. When creating a connection, ensure you configure it with the credentials required by your data source. Each connection and capability may have slightly different steps or requirements, so be sure to review the data source-specific information.
Process overview
- Create a connection
A connection links your Edge or Collibra Cloud site with your data source, such as a database, file share, or REST service. Jobs running on this connection send information back to your Collibra Platform.
For more information, go to the list of available Edge and Collibra Cloud site connections.
- Add capabilities
A capability is a specific action you want to perform on that data source, such as profiling data or ingesting metadata.
- For JDBC connections: You can add multiple integration capabilities to a single shared connection. Collibra uses the JDBC driver to create a connection string. Collibra pushes the capability job to your Edge or Collibra Cloud site. The job runs using the connection you created.
- For non-JDBC connections: These integration capabilities connect independently using the information provided by the Edge or Collibra Cloud site. The connection information is defined and stored as a Connection instance, and the connection properties are shown on the Connections tab of the Edge or Collibra Cloud site. After you create a non-JDBC connection, you only need to synchronize the data source in your Collibra Platform. The synchronization process ingests metadata from the data source into Collibra.
For more information, go to the list of available Edge and Collibra Cloud site capabilities.
Example
You want to register a PostgreSQL data source and profile the data. This data source has a JDBC driver. To achieve this via Edge, you create a connection to the PostgreSQL data source on an Edge site. You then add the Profiling capability to that connection.
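As a rough illustration of the example above, the sketch below builds a connection string in the standard PostgreSQL JDBC URL form (`jdbc:postgresql://host:port/database`) and attaches a capability name to a connection. The `Connection` class, the `postgres_jdbc_url` helper, the host `db.example.com`, and the database `sales` are all hypothetical names used for illustration. They are not Collibra objects or APIs, and in practice you configure connections and capabilities on the Edge or Collibra Cloud site itself.

```python
from dataclasses import dataclass, field
from typing import List
from urllib.parse import quote


def postgres_jdbc_url(host: str, port: int, database: str) -> str:
    """Build a URL of the form jdbc:postgresql://host:port/database."""
    # Percent-encode the database name so special characters stay valid in the URL.
    return f"jdbc:postgresql://{host}:{port}/{quote(database, safe='')}"


@dataclass
class Connection:
    """Hypothetical stand-in for a connection; not a Collibra API object."""
    name: str
    connection_string: str
    capabilities: List[str] = field(default_factory=list)

    def add_capability(self, capability: str) -> None:
        # Several capabilities can share one JDBC connection; skip duplicates.
        if capability not in self.capabilities:
            self.capabilities.append(capability)


# Assumed example values: a PostgreSQL server on the default port 5432.
conn = Connection(
    name="postgres-sales",
    connection_string=postgres_jdbc_url("db.example.com", 5432, "sales"),
)
conn.add_capability("Profiling")
print(conn.connection_string)
print(conn.capabilities)
```

The sketch keeps the connection details in one place and lets capabilities reference them, which mirrors the idea of adding multiple capabilities to a single shared JDBC connection.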
Next steps
- For JDBC connections:
- This creates a Database asset that you then need to synchronize. The synchronization process ingests metadata from the data source into Collibra. This results in assets such as Schema assets, Table assets, and so on. Collibra does not ingest the actual data from the data source, only the metadata (the data about the data). This full flow is called registering a data source. For more information, go to About registering a data source.
- For non-JDBC connections:
- The synchronization process ingests metadata from the data source into Collibra. This results in assets such as Schema assets, Table assets, and so on, organized in a structure that mirrors the structure of the data source. For more information about synchronizing non-JDBC integrations, go to the data source-specific documentation.