Manage profiling options
Use the profiling options to choose whether profiling and classification run on all synchronized metadata or on a sample of it. You can also enable an option to profile and classify automatically after each synchronization.
Prerequisites
- You have registered a data source via Edge.
- The Edge site has a JDBC profiling capability.
Steps
- Open a Database asset page of a registered database.
- In the tab pane, click Services Configuration.
- Click the Profiling and Classification tab.
- In the Profiling options section, click Edit.
- Select the data profiling options:
  Option: Description
  - Automatically run when a metadata extraction is synchronized: Enable to automatically create a data profile and classify columns every time the synchronization process of one or more schemas finishes.
  - Full scan: Select to profile and classify based on all synchronized metadata.
  - Partial scan: Select to profile and classify based on a sample of the synchronized metadata. When you select Partial scan, you can enter the maximum number of rows that you want to use for profiling and classification. By default, the maximum number of rows is 20000.
    Tip: Edge uses push down sampling to create a random sample of the metadata. This option is only available for data sources that support push down sampling.
- Click Save.
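The idea behind the Partial scan option can be sketched in code: with push down sampling, the data source itself selects the random sample, so at most the configured maximum number of rows leaves the database. The following is only a minimal illustration of that idea, not how Edge implements it. The use of SQLite, the `customers` table, and the `sample_rows` and `build_demo_db` helpers are all assumptions made for the example; only the default of 20000 rows comes from this page.

```python
import sqlite3

# Default maximum number of rows for Partial scan, as stated on this page.
MAX_ROWS = 20000


def sample_rows(conn, table, max_rows=MAX_ROWS):
    """Ask the database for a random sample of at most max_rows rows.

    The sampling happens inside the database (ORDER BY RANDOM() LIMIT n),
    so the client never receives more than max_rows rows.
    """
    # Table names cannot be bound as SQL parameters, so quote the identifier.
    quoted = '"' + table.replace('"', '""') + '"'
    query = f"SELECT * FROM {quoted} ORDER BY RANDOM() LIMIT ?"
    return conn.execute(query, (max_rows,)).fetchall()


def build_demo_db(n_rows=50000):
    """Create an in-memory demo table (hypothetical data, for illustration)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, country TEXT)")
    conn.executemany(
        "INSERT INTO customers (id, country) VALUES (?, ?)",
        ((i, "BE" if i % 2 else "NL") for i in range(n_rows)),
    )
    conn.commit()
    return conn


if __name__ == "__main__":
    conn = build_demo_db()
    sample = sample_rows(conn, "customers")
    print(len(sample))  # 20000: the table holds 50000 rows, the sample is capped
```

A full scan would correspond to reading every row instead. Real data sources usually offer their own sampling syntax, which is why the option depends on whether the data source supports push down sampling.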
What's next?
You can now profile and classify your data manually or automatically after synchronizing one or more schemas.