About data products and data contracts

About data products

A data product is a reusable package that provides data to answer a business question or solve a specific business problem. It includes everything you need to understand, access, and use the data. This makes it actionable and ready to support business decisions. A data product is secure, easy to use, and designed for anyone, including those who are not domain experts.

A data product includes not only the data elements but also context about the data and how to access it. It consists of 4 main components: Context, Data, Controls, and Access.

Image of data product components showing Context, Data, and Access information.

Context The context includes background information, such as why the data product was created, who owns it, and details related to quality and privacy.
Data The data can refer to a table, view, or a business asset such as a report or a model.
Controls The related policies and quality checks.
Access The access information includes details on how to access the data and the policies that govern access.

In Collibra, data products are assets with asset type Data Product. The asset provides information about the 4 main components.
For more information, go to Using data products.

Data products are created and built based on specific asset types, relations, and attributes. Community workflows are also available to support the request, creation, and building of data products.

Image of operating model for data products showing asset types and relations, such as Data Product Port as input for Data Product.

For all information about the out-of-the box model, go to Data product asset types and operating model. For information about available workflows, go to Configuring and building data products.

About data contracts and manifests

An essential element of data products is the data contract. A data contract describes the structure, format, service level, quality, and terms of use of the data involved. Data engineers can create multiple versions of a data contract, which are referred to as data contract manifests.

  • A data contract is a stable, governed asset in Collibra representing the formal agreement on the structure and semantics of data exchanged between systems.
  • A data contract manifest is a precise YAML file that is used to define and store the details of that data contract. In essence, the manifest is a component that helps bring the data contract to life, allowing for its creation, validation, and deployment. Collibra offers multiple ways to generate and maintain data contract manifests.

Example The contract can outline service-level objectives (SLOs) related to system uptime and latency for a data product. It can also include details about pipelines or data delivery mechanisms and provide information about the data, such as schema and expected quality metrics.

For more information about creating and managing data contracts, go to About data contract creation and maintenance. For information about using data contracts as a data consumer, go to Using data products.

Related topics

Helpful resources