Adding a New Dataset

Modified on Tue, 14 Apr at 4:26 PM

Overview 

The Add New Dataset function allows users to register datasets within Clarity AI. 

Registering datasets helps organizations maintain visibility of data used across AI systems and supports governance activities related to data quality, security, compliance, and risk management. 

Adding datasets ensures that data sources are documented and traceable throughout the AI lifecycle. 

Purpose 

Adding a dataset allows organizations to record information about data used for training, testing, validation, or operational use in AI systems. 

Maintaining structured dataset records supports transparency of data usage and helps ensure governance processes consider data dependencies associated with AI systems. 

Registering datasets helps support risk assessments and compliance oversight related to data usage. 

Key features 

structured dataset registration 

Provides a defined process for capturing dataset information. 

Structured dataset records help maintain consistency across data governance activities. 

linkage to AI systems 

Datasets can be associated with AI systems to document how data supports system functionality. 

Linking datasets helps maintain traceability across the AI lifecycle. 

supports governance workflows 

Once created, dataset records can be used in governance activities such as: 

  • risk assessments 

  • compliance review 

  • data quality evaluation 

  • operational tracking 

Maintaining dataset records supports governance oversight. 

integration with dataset details 

After creation, the dataset can be accessed through: 

  • Datasets Dashboard 

  • Viewing a Dataset 

  • Dataset Details – Quality & Validation 

  • Dataset Details – Operations 

  • Dataset Details – Security & Compliance 

  • Dataset Details – Risk Assessments 

These components provide detailed governance context. 

How to use 

start a new dataset record 

Navigate to AI Inventory and select Add New Dataset. 

enter dataset information 

Provide required information describing the dataset. 

Information typically includes: 

  • dataset name 

  • description of dataset purpose 

  • data source 

  • relationship to AI systems 

  • responsible team or owner 

Ensure dataset information accurately reflects how data is used. 

create the dataset record 

Submit the dataset details to register the dataset in Clarity AI. 

Once created, the dataset will appear in the Datasets Dashboard. 

continue governance activities 

Additional dataset governance information can be added through dataset detail sections. 

Dataset records can be updated as required. 

Notes 

  • Registering datasets supports visibility of data usage across AI systems. 
  • Maintaining accurate dataset records helps support traceability and governance oversight. 
  • Dataset information can be updated as data sources change or governance requirements evolve. 

Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Let us know how can we improve this article!

Select at least one of the reasons
CAPTCHA verification is required.

Feedback sent

We appreciate your effort and will try to fix the article