Configure the profiling options via Edge
Through the profiling options, you determine whether the metadata is profiled and classified automatically after each synchronization, and whether profiling is based on all synchronized metadata or on a sample of it.
Prerequisites
- You have registered a data source via Edge.
- The Edge site has a JDBC profiling capability.
Steps
- Open a Database asset page.
- In the tab pane, click Configuration.
- Click the Profiling and Classification tab.
- In the Profiling options section, click Edit.
- Select the data profiling options:
| Option | Description |
| --- | --- |
| Automatically run when a metadata extraction is synchronized | Enable to automatically create a data profile and classify columns every time the synchronization process of one or more schemas finishes. This may take a long time. You can also add a schedule to profile and classify at regular intervals. |
| Full scan | Select to profile and classify based on all synchronized metadata. |
| Partial scan | Select to profile and classify based on a sample of the synchronized metadata. When you select Partial scan, you can enter the maximum number of rows that you want to use for profiling and classification. By default, the maximum number of rows is 20000. |
Tip: Edge uses push-down sampling to create a random sample of the metadata. The Partial scan option is only available for data sources that support push-down sampling.
- Click Save.
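To illustrate what the Partial scan option and its row limit mean in practice, the following is a minimal sketch of push-down sampling: the data source itself draws the random sample, so at most the configured number of rows leaves the database. It does not show how Edge builds its queries. The function name `sample_rows`, the `customers` table, and the use of SQLite with `ORDER BY RANDOM() LIMIT` are all assumptions chosen so that the example runs with the Python standard library alone.

```python
import sqlite3

# Default row limit taken from the Partial scan option description.
DEFAULT_MAX_ROWS = 20000


def sample_rows(conn, table, max_rows=DEFAULT_MAX_ROWS):
    """Return up to max_rows randomly chosen rows, sampled inside the database.

    Hypothetical helper for illustration only; not part of any Edge API.
    """
    if max_rows <= 0:
        raise ValueError("max_rows must be positive")
    # Quote the identifier so that unusual table names stay valid SQL.
    quoted = '"' + table.replace('"', '""') + '"'
    # The shuffle and the limit both run in the database (push-down),
    # so the client never receives more than max_rows rows.
    cursor = conn.execute(
        f"SELECT * FROM {quoted} ORDER BY RANDOM() LIMIT ?", (max_rows,)
    )
    return cursor.fetchall()


def demo():
    """Build a 100-row in-memory table and sample it with two different limits."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, country TEXT)")
    conn.executemany(
        "INSERT INTO customers (country) VALUES (?)", [("BE",)] * 100
    )
    partial = sample_rows(conn, "customers", max_rows=10)
    capped_by_table = sample_rows(conn, "customers")  # limit exceeds table size
    conn.close()
    return len(partial), len(capped_by_table)
```

When the limit is larger than the table, as in the second call, every row is returned, which is effectively a full scan of that table.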
What's next?
You can now profile and classify your data manually or add a synchronization schedule.