About synchronizing Google Cloud Storage

Synchronizing Google Cloud Storage (GCS) is the process of ingesting metadata from a selected GCS repository and making the data available in Collibra Data Intelligence Platform.

When you synchronize GCS, the content of your repository is analyzed and represented in Collibra by means of assets and their characteristics. Collibra also takes the defined crawlers into account.

You can synchronize manually, or you can automate synchronization by adding a synchronization schedule. You can synchronize only one GCS File System at a time:

  • If a synchronization job is in progress and a synchronization of a different GCS File System is triggered, manually or automatically, the second job is queued.
  • If a synchronization job is still running and a new synchronization of the same GCS File System is triggered, manually or automatically, the running synchronization continues and the new synchronization request is ignored.
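The two rules above can be illustrated with a small sketch. It is a toy model only, not Collibra's implementation: the class `SyncCoordinator`, its methods, and the file system names are hypothetical. It also assumes that the queued case applies to a different GCS File System than the one currently synchronizing.

```python
from collections import deque
from typing import Deque, Optional


class SyncCoordinator:
    """Toy model of the queue-or-ignore rule (hypothetical, for illustration)."""

    def __init__(self) -> None:
        self.running: Optional[str] = None   # file system being synchronized
        self.queue: Deque[str] = deque()     # file systems waiting their turn

    def trigger(self, file_system: str) -> str:
        """Handle a manual or scheduled trigger and report what happened."""
        if self.running is None:
            self.running = file_system
            return "started"
        if self.running == file_system:
            # Same file system is already synchronizing: request is ignored.
            return "ignored"
        # Another file system is synchronizing: this job waits in the queue.
        # Assumption: the source does not say how a repeated trigger for an
        # already queued file system is handled; this sketch queues it again.
        self.queue.append(file_system)
        return "queued"

    def finish(self) -> Optional[str]:
        """Mark the running job as done and start the next queued job, if any."""
        self.running = self.queue.popleft() if self.queue else None
        return self.running


coordinator = SyncCoordinator()
print(coordinator.trigger("gcs-fs-a"))  # started
print(coordinator.trigger("gcs-fs-b"))  # queued
print(coordinator.trigger("gcs-fs-a"))  # ignored
print(coordinator.finish())             # gcs-fs-b
```

In this sketch, only one file system is ever in the running state, which mirrors the one-at-a-time restriction described above.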

After the synchronization, the resulting assets are in the domain that was specified in the crawler. For information on the integrated data, go to Integrated Google Cloud Storage data.