Overview of out-of-the-box attribute types
An attribute is a characteristic that describes an asset by means of an individual field. The attribute's kind defines the class of information that the attribute contains. You can add an attribute to an asset if the attribute's type is in the relevant assignment of the asset's type.
The table below contains all out-of-the-box attribute types. You can also create new attribute types.
Type | Description | Assigned to asset type | Kind |
---|---|---|---|
1st Decile | The data 1st decile value. | Column | Text |
1st Percentile | The data 1st percentile value. | Column | Text |
1st Quartile | The data 1st quartile value. | Column | Text |
3rd Quartile | The data 3rd quartile value. | Column | Text |
5th Percentile | The data 5th percentile value. | Column | Text |
95th Percentile | The data 95th percentile value. | Column | Text |
99th Percentile | The 99th percentile value. | Column | Text |
9th Decile | The data 9th decile value. | Column | Text |
Abbreviation | A shorthand signifier for an asset. | Report | Text |
Access instructions | Instructions on how to access the data. | Data Product Output Port | Text |
Access method | Indicates what method can be used to access the data. | Data Product Output Port | Selection |
Analysis | The analysis of this issue. | Issue | Text |
Application Regulation | Any regulations that apply to the application. For example, certain applications that consume data are subject to a regulation on how that data must be handled. | Directory, System, Technology Asset | Text |
Application Standards | Any standards applied to the business application. | Directory, System, Technology Asset | Text |
Approval Date | The date on which the assessment was approved. | Assessment Review | Date |
Assessment Link | The link to the submitted assessment. | Assessment Review | Text |
Automation Level | The nature and degree of automation of an AI use case. | AI Use Case | Text |
Background | Background information on the asset. | | Text |
Business Case | Refers to the business problem you want to solve with an AI use case. | AI Use Case | Text |
Business Risks | A summary of the business risks associated with implementing the AI use case. | AI Use Case | Text |
Business Value | Refers to how an AI use case can improve your organization. | AI Use Case | Text |
Business Sponsor | Refers to the Business Owner or Executive Sponsor of the AI Use Case in your organization. | AI Use Case | Text |
Calculation Rule | The rule that specifies how the KPI or metric is calculated. | KPI, Tableau Report Attribute | Text |
Categorical Data | Data is considered categorical if it can only take a limited set of different values. | Column | True/False |
Category | A possible value for Categorical Data. | Column | Text |
Certified | Indicates whether or not a report asset meets the set standards. | Data Set, Looker Dashboard, Looker Data Set, Looker Look, Looker Tile, Power BI Dashboard, Power BI Data Model, Power BI Report, Power BI Tile, Tableau Data Model | True/False |
Char octet Length | For character types, the maximum number of bytes in the column. | Column | Numeric |
Column Position | The index of the column in the table. | Column | Numeric |
Conformity Score | The number of rows that passed the rule. | Business Rule, Data Quality Metric | Numeric |
Co-role | Relationship name from tail to head. | - | Text |
Criticality Indicator | Indicates the criticality of an asset. | - | True/False |
Data Privacy Risk Score | Number calculated from Risk Assessment. | AI Use Case | Number |
Data Privacy Risks | Any data privacy risks that may result from processing the AI model within the context of this AI use case and either putting it on the market for external customer use or internal use. | AI Use Case | Text |
Data product category | Categorizes data products, for example to distinguish business-oriented derived data products from technical, foundational base data products aimed at the technical users who build the derived data products. | Data Product | Selection |
Data Retention Protocols | Any data retention standards already in place or projected to be implemented for the AI use case. | AI Use Case | Text |
Data Source | The data source of an asset. It specifies where the data corresponding to this asset is coming from. | Database, Tableau Data Model, Schema | Text |
Data Source Type | The type of the registered data source. | Schema, Database | Text |
Data Storage | Refers to whether any data is stored, and if so, where and how the data is stored. | AI Use Case | Text |
Data Type | The logical Data type detected by Collibra profiling. | Column | Text |
Data Type Precision | The precision of the data type. For example how many characters it can contain. | Data Attribute, Data Element, Report Attribute | Numeric |
Date and/or Time Pattern | The pattern used to encode a time, date or both. The format must be compatible with a Java DateTimeFormatter. Example: yyyy-MM-dd HH:mm:ss. | Column | Text |
Default Value | The default value for the column. | Column | Text |
Definition | The shortest possible description that clearly defines the purpose of the asset. | Business Asset, Business Process, Business Term, Data Category, KPI, Line of Business, Measure, Report, Report Attribute | Text |
Description | The description of the asset. This is typically a more verbose way to describe what the asset means. | Asset, Business Dimension, Business Rule, Code Set, Code Value, Column, Crosswalk, Data Asset, Data Attribute, Database, Data Element, Data Entity, Data Model, Data Quality Dimension, Data Quality Metric, Data Quality Rule, Data Set, Data Sharing Agreement, Data Structure, Data Usage, Directory, File, File Group, Governance Asset, Issue, Issue Category, Looker Dashboard, Looker Data Set, Looker Data Set Column, Looker Folder, Looker Look, Looker Query, Looker Report Attribute, Looker Tenant, Looker Tile, Mapping Specification, Policy, Power BI Capacity, Power BI Dashboard, Power BI Data Model, Power BI Table, Power BI Column, Power BI Report, Power BI Server, Power BI Tile, Power BI Workspace, Report, Report Attribute, Role Type, Rule, S3 Bucket, S3 File System, Schema, Standard, System, Table, Tableau Data Source, Tableau Project, Tableau Report Attribute, Tableau Server, Tableau Site, Tableau View, Tableau Workbook, Technology Asset, Validation Rule, Workflow Definition, Dataplex Lake, Dataplex Zone, Databricks AI Model, Vertex AI Model | Text |
Description From Source System | The description from the source system of the asset. | Table, Column, Database, Schema, Dataplex Lake, Dataplex Zone, Databricks AI Model | String |
Descriptive Example | An example of the asset. | Asset, Business Asset, Business Process, Business Rule, Business Term, Code Set, Code Value, Data Asset, Data Attribute, Data Category, Data Element, Data Entity, Data Model, Data Quality Metric, Data Quality Rule, Data Structure, Directory, Governance Asset, Issue Category, KPI, Line of Business, Measure, Policy, Report, Role Type, Rule, Standard, System, Technology Asset | Text |
Document creation date | Date the document was created. | Looker Dashboard, Looker Folder, Looker Look, Tableau Data Source, Tableau View, Tableau Workbook | Date |
Document last accessed date | Date the document was last accessed. | Looker Dashboard, Looker Look | Date |
Document last viewed date | Date the document was last viewed. | Looker Dashboard, Looker Look | Date |
Document modification date | Date the document was last edited. | Looker Look, Tableau Data Source, Tableau View, Tableau Workbook | Date |
Document size | Size of the document in megabytes. | File, File Group, Tableau Workbook | Numeric |
Effective End Date | Date as of which an asset is scheduled to end. | Business Rule, Code Set, Code Value, Data Quality Metric, Data Quality Rule, Data Usage, Governance Asset, Issue Category, Policy, Rule | Date |
Effective Start Date | Date on which asset takes effect. | Business Rule, Code Set, Code Value, Data Quality Metric, Data Quality Rule, Data Usage, Governance Asset, Issue Category, Policy, Rule | Date |
Empty Values Count | The number of empty values for that column. | Column | Numeric |
Empty values definition override | Overrides the default list of values to consider as empty or missing values during data profiling. It must be a comma separated list of text values with each value enclosed in double quotes. | Column, Schema, Table | Text |
Entity Load Date | The load date of the entities from the external system. | Data Quality Metric | Text |
Ethical Risks | Any ethical risks that may result from processing the AI model within the context of this AI use case and either putting it on the market for external customer use or internal use. | AI Use Case | Text |
Exception Scenario | The exception scenario. | Business Rule, Data Quality Metric, Data Quality Rule, Data Sharing Agreement, Governance Asset, Issue Category, Policy, Rule, Standard | Text |
External System Label | The label from external system. | File, GCS Bucket, Storage Container | String |
Favorites count | The number of Looker Looks and Looker Dashboards that are marked as favorite. | Looker Dashboard, Looker Look | Number |
Feature Importance | Feature Importance refers to how important a feature is to a machine learning model. | AI Model, Databricks AI Model, Vertex AI Model | Text |
File Location | The location of the original source file. | File, Schema | Text |
File Type | The type of a File, which may constrain its format, its content or both. | File, File Group | Text |
Foreign Key Delete Rule | What happens to the foreign key when the primary key is deleted. | Foreign Key | Text |
Foreign Key Evaluation Deferrability | Indicates whether evaluation of the foreign key constraints can be deferred until commit. | Foreign Key | Text |
Foreign Key Update Rule | What happens to the foreign key when the primary key is updated. | Foreign Key | Text |
Frequency | The rate at which an asset changes over a particular period of time. | Report, Report Attribute | Text |
General Purpose AI | Refers to whether the model used in your AI use case is using a General Purpose AI (GPAI). | AI Use Case | Text |
Glue Database Name | The name of the AWS Glue database in which this data is referenced and described. | Table | String |
Glue Table Name | The name of the AWS Glue table in which this data is referenced and described. | Table | String |
Inclusion Scenario | The inclusion scenario. | Business Rule, Data Quality Metric, Data Quality Rule, Data Usage, Governance Asset, Issue Category, Policy, Report, Report Attribute, Rule, Standard | Text |
Inference Data Description | An explanation of the input or inference data the AI model(s) uses to create output data. | AI Use Case | Text |
Inferred Data Type | The data type of a data asset that was automatically inferred by profiling corresponding instance data. | | Text |
Intellectual Property Risks | Inherent Intellectual Property Risks resulting from processing AI Models within this use case and placing them on the market or putting into service for own use. | AI Use Case | Text |
Internal Model | Refers to the existing or upcoming internally built model(s) your AI use case may use. | AI Use Case | Text |
Is Auto Incremented | Indicates whether this column is auto incremented. | Column | True/False |
Is Generated | Indicates whether this is a generated column. | Column | True/False |
Is Mandatory | Indicates whether the asset is mandatory. | Data Attribute | True/False |
Is Nullable | Determines if the column can store NULL values. | Column | True/False |
Is Primary Key | Indicates if the column is a primary key. | Column | True/False |
Is Unique | Indicates whether the asset is unique. | Data Attribute | True/False |
IT Requirements | Describes the requirements from an IT perspective for the asset. | Crosswalk, Mapping Specification | Text |
Key sequence | Key sequence of an element in a foreign key. | | Numeric |
Labels | The user-defined labels in Dataplex. | Dataplex Zone, Dataplex Lake | Text |
Lake Id | The Id of the Dataplex lake. | Dataplex Lake | Text |
Lake Status | Current state of the Dataplex lake. | Dataplex Lake | Text |
Last Review Date | Date on which asset was last reviewed. | Code Set, Data Usage, Report, Report Attribute, Standard | Date |
Last Sync Date | Date on which asset was synchronized with external system. | Code Set, Code Value, Database, Data Quality Metric, File | Date |
Legal Approval Date | Date your legal team approved or rejected the AI use case. | AI Use Case | Date |
Legal Approval Renewal Date | Date of the expected periodical review of the Use Case’s approval. | AI Use Case | Date |
Legal Description of Model | Description of the AI model provided by your legal team. | AI Use Case | Text |
License | The current license. | Data Set | Text |
Loaded Rows | The number of rows that were loaded. | Business Rule, Data Quality Metric | Numeric |
Loaded Values | The number of values that were loaded. | | Numeric |
Load Sample | A sample. | | Text |
Location | The location where the actual asset is stored or can be found. | Asset, Code Set, Code Value, Data Asset, Data Attribute, Database, Data Element, Data Entity, Data Model, Data Structure, Directory, Report, Report Attribute, Role Type, S3 Bucket, System, Technology Asset, Dataplex Zone, Dataplex Lake | Text |
Maintenance Cost | Indicates the overall expected cost of running the use case over a selected period of time. | AI Use Case | Text |
Materiality | The materiality. | Data Usage | Text |
Maximum Text Length | The length of the longest text value in this column. | Column | Numeric |
Maximum Value | The maximum value, using alphabetical order for text. | Column | Text |
Max Length | The maximum length of any value corresponding to the data asset. | | Numeric |
Mean | The mean of values (numeric only), excluding missing values. | Column | Numeric |
Mean Absolute Error | Mean Absolute Error (MAE) refers to a model quality metric that evaluates the performance of regression models. | AI Model, Databricks AI Model, Vertex AI Model | Numeric |
Mean Squared Error | Mean Squared Error (MSE) refers to a model quality metric that measures the quality of the model’s predictions. | AI Model, Databricks AI Model, Vertex AI Model | Numeric |
Measurement | The measurement of the asset. | Business Rule, Data Quality Rule, Governance Asset, Issue Category, Policy, Rule | Numeric |
Median | The data median value. | Column | Text |
Minimum Text Length | The length of the shortest text value in this column. | Column | Numeric |
Minimum Value | The minimum value, using alphabetical order for text. | Column | Text |
Min Length | The minimum length of any value corresponding to the data asset. | | Numeric |
Mode | The value with the highest frequency for a categorical feature. | Column | Text |
Model Accuracy | Model Accuracy refers to how well the model performs on a given task. | AI Model, Databricks AI Model, Vertex AI Model | Text |
Model Monitoring | Refers to how your organization will ensure an AI model is meeting accuracy and performance expectations. | AI Use Case | Text |
Model Output | An explanation of the output data that the AI model(s) is expected to create. | AI Use Case | Text |
Model Precision | Model Precision refers to how accurate positive model predictions are. | AI Model, Databricks AI Model, Vertex AI Model | Text |
Model Type | Type of AI model. | AI Model, Databricks AI Model, Vertex AI Model | Selection |
Non Conformity Score | The number of rows that failed the rule. | Business Rule, Data Quality Metric | Numeric |
Note | A note. | Asset, Business Asset, Business Process, Business Rule, Business Term, Code Set, Code Value, Data Asset, Data Attribute, Database, Data Category, Data Element, Data Entity, Data Model, Data Quality Rule, Data Structure, Directory, File, KPI, Line of Business, Measure, Policy, Report Attribute, Role Type, Standard, System, Technology Asset | Text |
Null Count | The number of null values in the data corresponding to the data asset. | | Numeric |
Number of Attributes | The number of attributes of the data entity. | | Numeric |
Number of distinct values | The number of different values stored in this column. | Column | Numeric |
Number of Files | The number of files in a File Group. | File Group | Numeric |
Number Of Fractional Digits | The number of fractional digits. | Column | Numeric |
Number of Values | The number of distinct instance values in the data corresponding to the data asset. | | Numeric |
Original Name | Name of this object in its source environment. The 'Original Name' may differ from the asset's name in Data Governance Center. | Column, Tableau Data Source, Tableau Project, Tableau Report Attribute, Tableau Site, Tableau View, Tableau Workbook | Text |
Other Risks | Other Risks resulting from processing AI Models within this use case and placing them on the market or putting into service for own use. | AI Use Case | Text |
Overall Risk Analysis | Details and the result of any risk analysis performed on the AI use case. | AI Use Case | Text |
Overall Risk Rating | Risk level calculated based on pre-defined thresholds in the default Risk Assessment. | AI Use Case | Selection |
Owner in source | The email address of the owner of the data objects in a data source, represented in Collibra by the asset types mentioned here. | Looker Dashboard, Looker Folder, Looker Look, Power BI Data Model, Power BI Report, Power BI Workspace, Tableau Project, Tableau Data Model, Tableau Workbook, Tableau Dashboard, Database, Schema, Table, Database View | Text |
Passing Fraction | The % of rows or entities that have passed the rule. | Business Rule, Data Quality Metric | Numeric |
Personally Identifiable Information | An indicator to flag an asset that could potentially be used to identify a specific individual. | Column | True/False |
Predicate | The logical formula that will be executed to implement the rule. | Data Quality Rule | Text |
Primary Key Name | The name of the primary key that includes the column. | Column | Text |
Priority | The priority of this issue. | Issue | Text |
Profiled Row Count | The number of rows from the data set that were selected for profiling. | | Numeric |
Profiling Information | Provides additional information related to the status of the profiling results. | Table | Text |
Project Id | A globally unique identifier for your Google Cloud Platform project. | GCP Project | Text |
Project Number | An automatically generated unique identifier for the Google Cloud Platform project. | GCP Project | Number |
Protective Measures | Any additional required or recommended actions that have been identified based on regulations, industry standards or dedicated frameworks. | AI Use Case | Text |
Purpose | The reason why the asset exists. | Business Rule, Data Sharing Agreement, Data Usage, File, Governance Asset, Issue Category, Policy, Rule, Standard | Text |
Rating | The current rating. | | Numeric |
Refresh Conflict | Provides information about any conflict detected on the data asset during a schema refresh. | Column, Table | Text |
Refresh Frequency | The frequency of refresh. | | Text |
Report Image | Image of the report view. | Looker Look, Tableau View, Tableau Workbook | Text |
Repository | Reference to the repository where the code behind the model is stored. | AI Model, Databricks AI Model, Vertex AI Model | Text |
Resolution | How this issue can be or has been resolved. | Issue | Text |
Result | The result. | Business Rule, Data Quality Metric | True/False |
Retrain Cycle | The frequency with which the model is retrained. | AI Model, Databricks AI Model, Vertex AI Model | Text |
Role | Relationship name from head to tail. | | Text |
Role in Report | The use of Report Attribute in Report (for example, measure or dimension). | Tableau Report Attribute | Text |
Row Count | The number of rows inside the data set, possibly including duplicated or missing values. | Column | Numeric |
Rows Failed | The number of rows that failed the rule. | Data Quality Metric | Numeric |
Rows Passed | The number of rows that passed the rule. | Data Quality Metric | Numeric |
Rule | The description of the rule. | | Text |
Schema Name | The name of the schema. | | Text |
Scope | The scope of applications that correspond to this policy. | Crosswalk, Database, Data Usage, Directory, Mapping Specification, Policy, Report, Report Attribute, System, Technology Asset | Text |
Security Classification | Classification of assets based on sensitivity. | Column, Data Usage, Report, Report Attribute | Text |
Security Protocols | Any general security protocols that may result from the implementation of the AI use case. | AI Use Case | Text |
Sequence Number | The sequence number of the asset. Often used to order assets in a specific way. | Data Attribute | Numeric |
Size | The size of the column in the table. | Column | Numeric |
Source Type | The source type of an asset. | | Text |
Source Tags | The tags assigned to the asset in the source system. | Database, Schema, Table, Database View, Column | Text |
Standard Deviation | The statistical standard deviation of values (numeric only). | Column | Numeric |
Start Date | The date on which the assessment was started. | Assessment Review | Date |
State | The current state. | | Text |
State Changed by | The cause of the state change. | Power BI Workspace | Text |
State Changed Date | The date the state was changed. | Data Sharing Agreement | Text |
Submission Date | The date on which the assessment was submitted. | Assessment Review | Date |
Synchronization Status | Provides information about the status of the Schema synchronization. | Schema | Text |
Table Type | The table type that is declared in the data source. For example: TABLE, VIEW, ... | Table | Text |
Target delivery date | The date when the requested resource should be made available. | Data Product | Date |
Technical Data Type | The Data Type of a data asset as it is declared by the data source. For example: String, Integer, Varchar, Blob, Boolean, ... | Column, Data Attribute, Data Element, Report Attribute, Power BI Column, Tableau Report Attribute | Text |
Third Party Model | Refers to the vendor of your AI model(s) and what kind of model you are using. | AI Use Case | Text |
Threshold | The minimum percentage of all rows or entities that must pass the rule. | Data Quality Metric | Numeric |
Training Data Description | The training or retraining data used to teach the AI model(s). | AI Use Case | Text |
Transformation Logic | The transformation logic. | | Text |
Transparency Disclosure Requirements | Transparency Disclosure Requirements refers to identified requirements, if any, for AI use case transparency. | AI Use Case | Text |
URL | Uniform Resource Locator, also colloquially known as web address. | Directory, File, File Group, Looker Look, S3 Bucket, Table, Tableau Server, Tableau Site, Tableau View, Dataplex Lake, Dataplex Zone, Database View | Text |
Use Case Application | Indicates that the AI use case will be used by an external audience or internally by your organization. | AI Use Case | Text |
Validation Result | The results of validation. | | True/False |
Validation Script | Contains the validation logic to evaluate the content of an asset. | Validation Rule | Script |
Value Distribution | The distribution percentage of the values. | | Numeric |
Variance | The statistical variance of values (numeric only). | Column | Numeric |
Version | If this asset is versioned (manually or in an external system), this represents the asset version. | AI Model, Databricks AI Model, Vertex AI Model | Text |
Visible on server | Indicates whether the worksheet is uploaded to Tableau Server. | Tableau View | True/False |
Visits count | Number of visits on Tableau report. | Looker Dashboard, Looker Look, Tableau View, Tableau Dashboard, Tableau Workbook, Tableau Worksheet | Numeric |
Weighting Factor | A factor by which some quantity is multiplied in order to make it comparable with others. | | Numeric |
Zone Name | The relative resource name of a Dataplex zone in the following format: projects/{project_number}/locations/{location_id}/lakes/{lake_id}/zones/{zone_id}. | Dataplex Zone | Text |
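
The Zone Name format in the last row of the table can be split into its four components. The following is a minimal sketch in Python, not part of the product: the names `ZONE_NAME_RE`, `ZoneName` and `parse_zone_name` are hypothetical, the sample value is made up for illustration, and the sketch assumes that no component contains a `/`.

```python
import re
from typing import NamedTuple, Optional

# Pattern mirroring the documented format:
# projects/{project_number}/locations/{location_id}/lakes/{lake_id}/zones/{zone_id}
# Assumption: no component contains a "/" character.
ZONE_NAME_RE = re.compile(
    r"projects/(?P<project_number>[^/]+)"
    r"/locations/(?P<location_id>[^/]+)"
    r"/lakes/(?P<lake_id>[^/]+)"
    r"/zones/(?P<zone_id>[^/]+)"
)


class ZoneName(NamedTuple):
    """The four components of a Zone Name attribute value."""
    project_number: str
    location_id: str
    lake_id: str
    zone_id: str


def parse_zone_name(value: str) -> Optional[ZoneName]:
    """Return the components of a Zone Name, or None if the value does not match."""
    match = ZONE_NAME_RE.fullmatch(value)
    if match is None:
        return None
    return ZoneName(**match.groupdict())


# Made-up sample value, for illustration only.
example = "projects/123456789/locations/us-central1/lakes/sales-lake/zones/raw-zone"
print(parse_zone_name(example))
```

Returning `None` for a value that does not match lets a caller tell a malformed Zone Name apart from a valid one without handling an exception.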