About Edge capabilities
An Edge capability, like Sampling or S3 synchronization, is an application that can run on an Edge site. It can access a data source to extract and process data as needed. This data can be stored in an encrypted cache to improve the security of your data and platform. An Edge capability for a specific data source runs as a job and delivers the output to Collibra Data Intelligence Cloud in a secure and reliable way.
An Edge capability has a capability template that defines a specific use case, for example, data source ingestion.
Capability templates
A capability template is developed for a specific task on a specific data source type. The capability template also determines which properties are available to configure the Edge capability.
Currently, the following capability templates are available:
- ADLS synchronization: A capability template you use to connect to Azure Data Lake Storage (ADLS).
- Catalog JDBC ingestion: A capability template you use to register a data source and synchronize schemas from a data source via a JDBC connection.
- Catalog JDBC Sampling: A capability template you use to collect and cache sample data from a data source in the Edge site via a JDBC connection.
- Collibra Protect for AWS Lake Formation: A capability template you use to set up Protect for AWS Lake Formation.
- Collibra Protect for Google BigQuery: A capability template you use to set up Protect for BigQuery.
- Collibra Protect for Snowflake: A capability template you use to set up Protect for Snowflake.
- DQ Connector: A capability template you use to ingest Collibra Data Quality & Observability user-defined rules, metrics, and dimensions into Collibra Data Catalog.
- GCS synchronization: A capability template you use to connect to Google Cloud Storage.
- JDBC Profiling: A capability template you use to profile and classify data from a registered data source.
- S3 synchronization: A capability template you use to connect to Amazon S3.
- Databricks Unity Catalog synchronization: A capability template you use to connect to Databricks Unity Catalog.
- Technical lineage capabilities: Capability templates you use to create technical lineage for different data sources. For details, go to: Add a technical lineage capability to an Edge site
Important While these capability templates are available for all customers, the features that you use them for might still be in beta.
Capability template structure
Each Edge capability template contains the following:
|
File |
Description |
|---|---|
|
A manifest file (YAML) |
This file contains the capability metadata and input parameter requirements. |
|
A workflow file (YAML) |
This file defines the workflow and binds the parameters to capability containers. |
|
Docker images |
One or more Docker images that implement the business logic. |
Page layout
Note Each type of capability has its own required custom properties. These properties appear after you select a capability template from the dropdown menu.