Integrated Azure Data Lake Storage data
After you synchronize the data, the integration of the Azure Data Lake Storage file system is complete, and the resulting assets are available in the domain that was specified in the crawler. By default, the assets have the status Implemented.
Warning: Do not move the assets to another domain. Doing so may lead to errors during future synchronizations.
Tip: ADLS synchronization relies on UUIDs.
Note: If a temporary communication issue results in a partial synchronization, the status of the assets that were not synchronized becomes Missing from source. If the assets are identified in the source system during the next fully successful synchronization, the previous statuses are restored.
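The status behavior described in the note can be illustrated with a small toy model. This is only a sketch of the documented rule, not Collibra's implementation: the `Asset` class, the `previous_status` field, and both function names are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Set

MISSING = "Missing from source"


@dataclass
class Asset:
    """Hypothetical stand-in for a synchronized asset."""
    name: str
    status: str = "Implemented"  # default status after synchronization
    previous_status: Optional[str] = None  # assumed bookkeeping field


def mark_partial_sync(assets: Iterable[Asset], synced_names: Set[str]) -> None:
    """After a partial synchronization, flag every asset that was not synchronized."""
    for asset in assets:
        if asset.name not in synced_names and asset.status != MISSING:
            asset.previous_status = asset.status
            asset.status = MISSING


def restore_after_full_sync(assets: Iterable[Asset], found_names: Set[str]) -> None:
    """After a fully successful synchronization, restore flagged assets found in the source."""
    for asset in assets:
        if (
            asset.status == MISSING
            and asset.name in found_names
            and asset.previous_status is not None
        ):
            asset.status = asset.previous_status
            asset.previous_status = None
```

In this model, an asset skipped by a partial run moves to Missing from source and returns to its earlier status once a later complete run finds it again. The model deliberately leaves assets that are not found during a fully successful synchronization untouched, because this section does not describe that case.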
By default, the assets are shown in a plain list, but you can enable a multi-path hierarchy to show them in a tree structure. The resulting assets depend on whether you use Microsoft Purview.
Synchronization results without Microsoft Purview
For the best result, use the following relations when you define a multi-path hierarchy:
- File Storage contains Storage Container
- Storage Container contains Storage Container
- Directory contains Directory
- Storage Container contains File
Synchronization results with Microsoft Purview
For the best result, use the following relations when you define a multi-path hierarchy:
- File Storage contains Storage Container
- Storage Container contains Storage Container
- Directory contains Directory
- Storage Container contains File
- File contains Table
- Table contains Column
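To see how these relations produce a tree, the following sketch builds a nested view from parent and child pairs. It is an illustration only and does not call any Collibra API: the asset names in `ASSET_TYPES` and `RELATIONS` are made-up sample data, the functions `build_tree` and `render` are hypothetical, and the type labels are simplified to the names used in the relation list above.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Relation types from the list above: (parent type, child type).
ALLOWED = {
    ("File Storage", "Storage Container"),
    ("Storage Container", "Storage Container"),
    ("Directory", "Directory"),
    ("Storage Container", "File"),
    ("File", "Table"),
    ("Table", "Column"),
}

# Hypothetical sample assets and their (simplified) types.
ASSET_TYPES = {
    "examplestorage": "File Storage",
    "raw": "Storage Container",
    "2024": "Storage Container",
    "orders.csv": "File",
    "orders": "Table",
    "order_id": "Column",
}

# Hypothetical sample relations as (parent asset, child asset).
RELATIONS = [
    ("examplestorage", "raw"),
    ("raw", "2024"),
    ("2024", "orders.csv"),
    ("orders.csv", "orders"),
    ("orders", "order_id"),
]


def build_tree(
    relations: List[Tuple[str, str]], asset_types: Dict[str, str]
) -> Dict[str, List[str]]:
    """Group children under their parent, rejecting relations not in ALLOWED."""
    children = defaultdict(list)
    for parent, child in relations:
        pair = (asset_types[parent], asset_types[child])
        if pair not in ALLOWED:
            raise ValueError(f"Relation not in hierarchy: {pair[0]} contains {pair[1]}")
        children[parent].append(child)
    return dict(children)


def render(children: Dict[str, List[str]], root: str, depth: int = 0) -> List[str]:
    """Return the tree below root as indented lines, two spaces per level."""
    lines = ["  " * depth + root]
    for child in children.get(root, []):
        lines.extend(render(children, child, depth + 1))
    return lines
```

Calling `print("\n".join(render(build_tree(RELATIONS, ASSET_TYPES), "examplestorage")))` prints each asset indented under its parent. Without Microsoft Purview, the last two relation types and the Table and Column assets would not be present.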
Synchronized metadata per asset type
The following table shows the synchronized metadata, with the corresponding resource IDs, for each ADLS asset type.
| Asset type | Synchronized metadata | Resource ID |
|---|---|---|
| ADLS Storage Account | File Storage contains / is part of Storage Container | 00000000-0000-0000-0001-002600000000 |
| ADLS Container | Location | 00000000-0000-0000-0000-000000000203 |
| | Storage Container contains / is part of Storage Container | 00000000-0000-0000-0001-002600000001 |
| Directory | Storage Container contains / is part of Storage Container | 00000000-0000-0000-0001-002600000001 |
| | Directory contains / is part of Directory | 00000000-0000-0000-0001-002600000003 |
| File | File Type | 00000000-0000-0000-0001-002500000012 |
| | Size | 00000000-0000-0000-0001-000500000009 |
| | Storage Container contains / contained in | 00000000-0000-0000-0000-000000007060 |
| Table | Description | 00000000-0000-0000-0000-000000003114 |
| | File contains / is part of Table | 00000000-0000-0000-0001-002600000002 |
| Column | Description | 00000000-0000-0000-0000-000000003114 |
| | Column Position | 00000000-0000-0000-0001-000500000020 |
| | Technical Data Type | 00000000-0000-0000-0000-000000000219 |
| | Column is part of / contains Table | 00000000-0000-0000-0000-000000007042 |
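
If you need these resource IDs in a script, for example to filter attributes or relations by type, it can help to keep them in one lookup structure. The sketch below copies the values from the table above into a Python dictionary. The names `RESOURCE_IDS`, `resource_id`, and `metadata_for` are hypothetical helpers and not part of any Collibra library.

```python
import uuid
from typing import List

# Resource IDs copied from the table above,
# keyed by (asset type, synchronized metadata).
RESOURCE_IDS = {
    ("ADLS Storage Account", "File Storage contains / is part of Storage Container"):
        "00000000-0000-0000-0001-002600000000",
    ("ADLS Container", "Location"): "00000000-0000-0000-0000-000000000203",
    ("ADLS Container", "Storage Container contains / is part of Storage Container"):
        "00000000-0000-0000-0001-002600000001",
    ("Directory", "Storage Container contains / is part of Storage Container"):
        "00000000-0000-0000-0001-002600000001",
    ("Directory", "Directory contains / is part of Directory"):
        "00000000-0000-0000-0001-002600000003",
    ("File", "File Type"): "00000000-0000-0000-0001-002500000012",
    ("File", "Size"): "00000000-0000-0000-0001-000500000009",
    ("File", "Storage Container contains / contained in"):
        "00000000-0000-0000-0000-000000007060",
    ("Table", "Description"): "00000000-0000-0000-0000-000000003114",
    ("Table", "File contains / is part of Table"):
        "00000000-0000-0000-0001-002600000002",
    ("Column", "Description"): "00000000-0000-0000-0000-000000003114",
    ("Column", "Column Position"): "00000000-0000-0000-0001-000500000020",
    ("Column", "Technical Data Type"): "00000000-0000-0000-0000-000000000219",
    ("Column", "Column is part of / contains Table"):
        "00000000-0000-0000-0000-000000007042",
}


def resource_id(asset_type: str, metadata: str) -> uuid.UUID:
    """Return the resource ID as a UUID; raises KeyError for unknown pairs."""
    return uuid.UUID(RESOURCE_IDS[(asset_type, metadata)])


def metadata_for(asset_type: str) -> List[str]:
    """List the synchronized metadata names for one asset type, in table order."""
    return [name for (kind, name) in RESOURCE_IDS if kind == asset_type]
```

For example, `resource_id("File", "Size")` returns the UUID for the Size attribute, and `metadata_for("Column")` lists the four metadata entries synchronized for columns. Parsing through `uuid.UUID` also catches copy and paste errors in the ID strings.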