Connecting to MongoDB
This section contains details for MongoDB connections.Select an option from the dropdown menu to display information for a particular driver class.
General information
Field | Description |
---|---|
Data source | MongoDB |
Supported version | |
Connection string | jdbc:mongodb:// |
Packaged? | |
Certified? | |
Supported features | |
Estimate job | Yes |
Analyze data | Yes |
Schedule | |
Processing capabilities | |
Pushdown | No |
Spark agent | Yes |
Yarn agent | |
Parallel JDBC | |
Java Platform version compatibility | |
JDK 8 | Yes |
JDK 11 | Yes |
Minimum user permissions
In order to bring your MongoDB data into Collibra Data Quality & Observability, you need the following permissions.
- Read access on your MongoDB tables.
- ROLE_ADMIN assigned to your user in Collibra DQ.
Recommended and required connection properties
Required | Connection Property | Type | Value |
---|---|---|---|
Yes | Name | Text | The unique name of your connection. Ensure that there are no spaces in your connection name. |
Yes | Connection URL | String | The connection string path of your MongoDB connection.
When referring to the example below, replace the Example |
Yes | Driver Name | String | The driver class name used for your connection.
|
Yes | Port | Integer | The port number to establish a connection to the datasource. The default port is |
No | Source Name | String | N/A |
No | Target Agent | Option | The Agent used to submit your DQ Job. |
Yes | Auth Type | Option | The method to authenticate your connection. Note The configuration requirements are different depending on the Auth Type you select. See Authentication for more details on available authentication types. |
No | Properties | String | The configurable driver properties for your connection. Multiple properties must be comma delimited. For example, abc=123,test=true |
Authentication
Select an authentication type from the dropdown menu. The options available in the dropdown menu are the currently supported authentication types for this data source.
Required | Field | Description |
---|---|---|
Yes | Username | The username of your MongoDB account. |
Yes | Password | The password of your MongoDB account. |
Yes | Script | The file path that contains the script file that the password manager uses to interact with and authenticate a user account. Example /tmp/keytab/mongodb_pwd_mgr.sh |
No | Param $1 | Optional. An additional parameters to authenticate your MongoDB connection. |
No | Param $2 | Optional. An additional parameter to authenticate your MongoDB connection. |
No | Param $3 | Optional. An additional parameter to authenticate your MongoDB connection. |
Yes | Principal | The Kerberos entity to authenticate and grant access to your connection. |
Yes | Keytab | The file path of the keytab file that contains the encrypted key for a Kerberos principal. Example /tmp/keytab/hive_user.keytab |
Yes | Password | The secret credential associated with your Kerberos principal. |
Yes | Script | The file path that contains the script file used to interact with and authenticate a Kerberos user. Example /tmp/keytab/mongodb_pwd_mgr.sh |
No | Param $1 | Optional. Additional Kerberos parameter. |
No | Param $2 | Optional. Additional Kerberos parameter. |
No | Param $3 | Optional. Additional Kerberos parameter. |
Yes | TGT | The ticket-granting ticket cache that stores the TGT to authenticate your connection. |
Command line example for basic spark-submit job
-lib "/opt/owl/drivers/mongodb/"
-h localhost:5432/postgres
-master local[*]
-ds tpch.lineitem_7
-br 10 -deploymode client
-q "select * from tpch.lineitem where l_shipdate between '${rd} 00:00:00.000+0000'
and '${rdEnd} 00:00:00.000+0000' "
-bhlb 10 -rd "1998-12-01"
-driver "mongodb.jdbc.MongoDriver"
-loglevel INFO -cxn MongoDB -rdEnd "1998-12-02"
Note For more details about the various Create and Alter SQL statements and table-level actions, see the official MongoDB documentation.