DGC service

Architecture

The Data Governance Center (DGC) service is a web application that contains the business logic of Collibra Data Intelligence Platform. It is written entirely in Java.

The DGC service has the following major components:

  • Java API: Contains the business logic, such as the CRUD (Create, Read, Update, Delete) operations on assets. The Java API is open and fully documented.
  • REST API: Exposes all the methods that are available in the Java API. The REST API lets external applications (browser clients, desktop clients, other servers) connect to the DGC service. It is implemented with the Jersey REST server and is fully documented. All the Collibra client applications use the same public REST API that is available to customers.
  • Import/Export, Views, Query APIs: Let you access and manipulate application data in Collibra. Collibra supports various file formats, such as JSON, XML, CSV and Excel. Because all of these methods are available through the Java and REST APIs, you can integrate them with other applications by using external tools, such as ETL (Extract, Transform, Load) or ESB (Enterprise Service Bus) middleware applications.
  • Search: Lets you search for specific application data in Collibra. For example, Collibra Everywhere uses the search API to let the user find assets in Collibra.
  • BPMN 2.0 Workflow Engine: The workflow engine, Flowable, executes BPMN 2.0 (Business Process Model and Notation) processes. The prepackaged workflows are fully configurable, and you can add, modify, and deploy workflows to support the governance processes that best suit your organization. Workflow service tasks can use the Java API, which enables you to automate application tasks such as sending email notifications, creating comments, and adding assets.
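
As a rough illustration of how an external application might address the REST API, the following Python sketch builds an HTTP request without sending it. The host name, the /rest/2.0/assets path, and the name and limit query parameters are assumptions made for this example, not details taken from this page. Check the REST API documentation of your own environment for the actual resources, parameters, and authentication requirements.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Assumption: the REST API is served under "/rest/2.0" and has an "assets"
# resource. Confirm the real path in your environment's API documentation.
ASSETS_PATH = "/rest/2.0/assets"


def build_asset_lookup(base_url: str, name: str, limit: int = 10) -> Request:
    """Build (but do not send) a GET request that looks up assets by name."""
    # "name" and "limit" are assumed query parameter names.
    query = urlencode({"name": name, "limit": limit})
    url = f"{base_url.rstrip('/')}{ASSETS_PATH}?{query}"
    return Request(url, headers={"Accept": "application/json"}, method="GET")


# Hypothetical host name, used only to show the resulting URL.
request = build_asset_lookup("https://collibra.example.com/", "Customer ID")
print(request.get_method(), request.full_url)
```

The request is only constructed here, so the sketch runs without network access. Sending it would additionally require credentials that are valid for your environment.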

Data storage

The DGC service stores its data in one of the following directories, depending on the operating system and installation type:

  • Linux with root permission: /opt/collibra_data/dgc
  • Linux without root permission: ~/collibra_data/dgc
  • Windows: C:\collibra_data\dgc
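
The locations above can be expressed as a small lookup, for example in a maintenance script that needs to find the data directory. This is a minimal sketch: the function name and its parameters are hypothetical, and it only covers default locations, not installations that use a custom path.

```python
from pathlib import PurePosixPath, PureWindowsPath


def default_dgc_data_dir(system: str, is_root: bool = False, home: str = "~"):
    """Return the default DGC data directory for the given platform.

    system  -- "Linux" or "Windows"
    is_root -- whether the Linux installation was done with root permission
    home    -- home directory of the installing user (non-root Linux only)
    """
    if system == "Windows":
        return PureWindowsPath(r"C:\collibra_data\dgc")
    if system == "Linux":
        base = PurePosixPath("/opt") if is_root else PurePosixPath(home)
        return base / "collibra_data" / "dgc"
    raise ValueError(f"No default data directory known for {system!r}")


print(default_dgc_data_dir("Linux", is_root=True))
print(default_dgc_data_dir("Linux", home="/home/alice"))
print(default_dgc_data_dir("Windows"))
```

Pure path classes are used so that the Windows and Linux paths can both be computed on any operating system.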

In the data directory of the DGC service, you can find several files and directories serving different purposes:

  • cache: Contains the cache files of the DGC service.
  • collibra.license: Contains the authorized applications and allowed number of users for Collibra.
  • config: Contains the configuration files used by Collibra.
  • email-templates: Overrides the built-in email templates to customize the emails that are sent to the users.
  • groovy-lib: Contains additional Groovy library functions to be used in validation rules.
  • images: Contains images that can be referenced directly as a URL, for example to set another logo.
  • indexes: Contains the search index files for quickly searching Collibra content. If the directory is not present when Collibra is started, it is automatically created.
  • logs: Contains the log files of the DGC service. The log files are important to troubleshoot possible problems in the product.
  • modules: Contains custom UI modules to extend or override the existing UI.
  • page-definitions: Overrides page definitions to customize pages in Collibra.
  • security: Contains security-related files such as SSL and SAML metadata files. You should not change anything in this directory.
  • styling: Overrides the CSS styling of the web interface.
  • translations: Used to override the built-in user interface labels or to add new languages.

Tip: You can also override user interface labels or add new languages in the Settings section of Collibra.
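
To see which of the files and directories described above exist in a given data directory, a short script along the following lines could be used. It is a sketch only: the helper name is hypothetical, and the demonstration runs against a temporary directory rather than a real installation. A missing entry is not necessarily a problem. For example, the indexes directory is created automatically at startup.

```python
import tempfile
from pathlib import Path

# Entries described in this section for the DGC data directory.
EXPECTED_ENTRIES = (
    "cache", "collibra.license", "config", "email-templates", "groovy-lib",
    "images", "indexes", "logs", "modules", "page-definitions", "security",
    "styling", "translations",
)


def missing_entries(data_dir):
    """Return the expected files or directories not found in data_dir."""
    root = Path(data_dir)
    return [name for name in EXPECTED_ENTRIES if not (root / name).exists()]


# Demonstration against a throwaway directory with only a few entries present.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("config", "logs", "security"):
        (Path(tmp) / name).mkdir()
    (Path(tmp) / "collibra.license").touch()
    missing = missing_entries(tmp)

print(missing)
```

The script only reads directory listings and does not modify anything, which matters in particular for the security directory.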