Synchronize Amazon S3 manually
You can manually start a synchronization job of an S3 File System asset. This can be useful if you want to test your crawlers, or if you want to synchronize immediately.
Tip You can also add a synchronization schedule to synchronize automatically.
Prerequisites
- You have registered an Amazon S3 file system.
- You have configured one or more Jobservers in Collibra Console. If there is no available Jobserver, the Register data source actions will be grayed out in the global create menu of Collibra Data Intelligence Cloud.
- You have a programmatic AWS user and IAM role with the required permissions.
- You have connected an S3 File System asset to Amazon S3.
- You have created one or more crawlers.
- You have a global role with the Catalog global permission, for example Catalog Author.
- You have a resource role with the Configure external system resource permission on the community or domain that contains the S3 File System, for example Owner.
- You have a role with the following resource permissions on the S3 community you created when you registered an Amazon S3 file system:
- Asset: add
- Attribute: add
- Domain: add
- Attachment: add
Steps
- Open an S3 File System asset page.
-
In the tab pane, click
Services Configuration. - In the Crawlers section, click Synchronize now.
The synchronization job appears in the Activities list as a bulk synchronization.
When the synchronization finishes, the resulting assets, including their attributes and relations, are created, edited or deleted in the selected domain(s) and in the Data Sources page of Data Catalog.
What's next?
You can view a summary of the results from the Activities list.
You can view the assets in their domain.