Connecting to Google Cloud Storage (GCS)
This section contains an overview of Google Cloud Storage (GCS).
General information
Field | Description |
---|---|
Data source | Google Cloud Storage (GCS) |
Supported versions | N/A |
Connection string | gcs:// |
Packaged? | Yes |
Certified? | Yes |
Supported features | |
Analyze data | Yes |
Archive breaking records | Yes |
Estimate job | Yes |
Pushdown | No |
Processing capabilities | |
Spark agent | Yes |
Yarn agent | Yes |
Minimum user permissions
In order for Collibra DQ to access your Cloud Storage bucket, you need the following permissions.
- Viewer permissions on your Cloud Storage bucket.
- When using the Archive Break Records feature, you need Editor and Viewer permissions on the Cloud Storage bucket location where break records will be sent.
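
The permission rules above can be expressed as a small pre-flight check. This is a minimal sketch, not part of Collibra DQ: the function name `missing_roles` and the "source"/"archive" labels are hypothetical, and the role names are taken literally from the list above rather than from a specific IAM role ID.

```python
from typing import Dict, Iterable, Optional, Set

# Role names as written in the permissions list above (illustrative only).
SOURCE_ROLES = {"Viewer"}
ARCHIVE_ROLES = {"Viewer", "Editor"}


def missing_roles(
    source_granted: Iterable[str],
    archive_granted: Optional[Iterable[str]] = None,
) -> Dict[str, Set[str]]:
    """Return the roles still needed, keyed by bucket purpose.

    Pass archive_granted=None when Archive Break Records is not used;
    in that case only the source bucket is checked.
    """
    missing = {"source": SOURCE_ROLES - set(source_granted)}
    if archive_granted is not None:
        missing["archive"] = ARCHIVE_ROLES - set(archive_granted)
    return missing


# Hypothetical example: Viewer on the source bucket, Viewer only on the archive bucket.
print(missing_roles(["Viewer"], ["Viewer"]))
```

An empty set for a bucket means nothing further is needed there under these assumptions.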
Recommended and required connection properties
Required | Connection Property | Type | Value |
---|---|---|---|
Yes | Name | Text | The unique name of your connection. Do not use spaces in your connection name and only use valid characters. |
Yes | Connection URL | String | The connection string path of your GCS connection. The path must start with the connection string prefix listed under General information. You can optionally add a key after the bucket name. |
Yes | Target Agent | Option | The Agent used to submit your DQ Jobs. |
Yes | Auth Type | Option | The method to authenticate your connection. Note: The configuration requirements differ depending on the Auth Type you select. See Authentication for more details on available authentication types. |
Yes | Save Credentials | Option | Select this option after you enter your connection details. |
No | Driver Properties | String | The configurable driver properties for your connection. Multiple properties must be comma-delimited. For example: abc=123,test=true |
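
The rules in the table above (no spaces in the name, a fixed URL prefix with an optional key after the bucket, and comma-delimited driver properties) can be checked before you save a connection. The following is a minimal sketch under stated assumptions: the helper names are hypothetical, the prefix is assumed to be the one shown in the Connection string row of General information, and the bucket name `my-bucket` is a placeholder. It does not reproduce how Collibra DQ validates these fields internally.

```python
from typing import Dict, Optional, Tuple

# Assumption: prefix taken from the "Connection string" row under General information.
URL_PREFIX = "gcs://"


def validate_name(name: str) -> bool:
    """A connection name must be non-empty and contain no whitespace."""
    return bool(name) and not any(ch.isspace() for ch in name)


def split_connection_url(url: str) -> Tuple[str, Optional[str]]:
    """Split '<prefix><bucket>[/<key>]' into (bucket, key or None)."""
    if not url.startswith(URL_PREFIX):
        raise ValueError(f"Connection URL must start with {URL_PREFIX}")
    bucket, _, key = url[len(URL_PREFIX):].partition("/")
    if not bucket:
        raise ValueError("Connection URL is missing a bucket name")
    return bucket, key or None


def parse_driver_properties(text: str) -> Dict[str, str]:
    """Parse a comma-delimited 'key=value' list into a dictionary."""
    props: Dict[str, str] = {}
    for item in text.split(","):
        item = item.strip()
        if not item:
            continue  # tolerate trailing commas and empty input
        key, sep, value = item.partition("=")
        if not sep or not key.strip():
            raise ValueError(f"Malformed driver property: {item!r}")
        props[key.strip()] = value.strip()
    return props


# Placeholder values for illustration only.
print(validate_name("gcs_prod"))
print(split_connection_url("gcs://my-bucket/data/file.csv"))
print(parse_driver_properties("abc=123,test=true"))
```

The name check covers only the no-spaces rule, because the table does not say which other characters count as valid.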
Authentication
Field | Description |
---|---|
GCS | Select this option to upload a JSON file from your local machine that contains your GCS service account access credentials. |
Authorization (JSON) | The JSON file that contains service account access credentials. Upload the file from your local machine. Tip: This JSON file is typically available to download in GCP when you create a service account in your IAM profile. |
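
Before uploading, it can help to confirm that the file looks like a service account key. The sketch below assumes the file follows the standard key format that GCP generates for service accounts. The function name `check_service_account_key` and the list of fields checked are illustrative choices, not a statement of what Collibra DQ itself requires.

```python
import json
from typing import List

# Assumption: fields found in a standard GCP service account key file.
REQUIRED_FIELDS = (
    "type",
    "project_id",
    "private_key_id",
    "private_key",
    "client_email",
    "client_id",
    "token_uri",
)


def check_service_account_key(text: str) -> List[str]:
    """Return a list of problems; an empty list means the key looks well formed."""
    try:
        data = json.loads(text)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(data, dict):
        return ["top-level JSON value must be an object"]
    problems = [
        f"missing field: {name}" for name in REQUIRED_FIELDS if not data.get(name)
    ]
    # A present but different "type" usually means another kind of credential file.
    if data.get("type") not in (None, "", "service_account"):
        problems.append('field "type" should be "service_account"')
    return problems


# Deliberately incomplete input, for illustration only.
print(check_service_account_key('{"type": "service_account"}'))
```

This check only inspects the structure of the file. It does not verify that the key is active or that the service account has the permissions listed under Minimum user permissions.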