Datasets and Tables

Overview

The Datasets section enables users to create, manage, and organize data collections. This documentation covers dataset creation, management, and version control.

Table of Contents

  1. Creating Datasets
  2. Working with Datasets
  3. Dataset Management Features
  4. Dataset Table Information

Creating Datasets

Watch this video for a visual guide on dataset creation.

Steps to Create a Dataset

  1. Navigate to "Datasets - Versions, Tables and Import Jobs"
  2. Enter a name for your new dataset
  3. Click "Create New Dataset"

Next Steps

  • Check the ‘Datasets’ table to see all the datasets you have created.
  • Click any dataset in the table to see its description, versions, and tables.
  • Click the ‘Trash’ button and type ‘delete’ to permanently delete the dataset from the ZinkML platform.

Working with Datasets

Watch this video for detailed instructions on working with Datasets.

Watch this video for detailed instructions on committing and releasing a Dataset version.

Dataset Table Overview

The main dataset table displays:

Column             Description
Name               Dataset identifier
Number of Tables   Count of tables in dataset
Total Size         Storage space used
Number of Rows     Total data rows
Latest Version     Current version number
Status             Staged or Committed
Last Updated       Last modification date
Created On         Creation date
Access Type        Private/Shared/Public
Actions            Collaboration options
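
For readers who want to reason about these columns programmatically, the sketch below mirrors one row of the table as a Python record. It is illustrative only: the class and field names, the byte unit for size, and the integer version number are assumptions and do not represent a ZinkML API. The Actions column is omitted because it holds UI controls rather than data.

```python
from dataclasses import dataclass
from datetime import date
from enum import Enum


class Status(Enum):
    STAGED = "Staged"
    COMMITTED = "Committed"


class AccessType(Enum):
    PRIVATE = "Private"
    SHARED = "Shared"
    PUBLIC = "Public"


@dataclass
class DatasetRow:
    """Hypothetical record mirroring one row of the main dataset table."""
    name: str
    number_of_tables: int
    total_size_bytes: int      # unit is an assumption
    number_of_rows: int
    latest_version: int        # integer numbering is an assumption
    status: Status
    last_updated: date
    created_on: date
    access_type: AccessType = AccessType.PRIVATE  # Private is the documented default


# Example values are made up for illustration.
row = DatasetRow(
    name="sales-2024",
    number_of_tables=3,
    total_size_bytes=1_048_576,
    number_of_rows=12_000,
    latest_version=2,
    status=Status.STAGED,
    last_updated=date(2024, 6, 1),
    created_on=date(2024, 5, 1),
)
```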

Version Control

  1. Create New Version
    • Tables from previous version automatically included
    • Option to remove existing tables
    • Add new tables [See Ingestion Documentation]
  2. Commit and Release Version
    • Finalizes changes
    • Assign appropriate License to the Dataset
    • Makes version available for use
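
The version lifecycle above can be modeled with a small Python sketch. This is a conceptual model, not the ZinkML implementation: `DatasetVersion`, `create_new_version`, and `commit_and_release` are hypothetical names, and the assumptions that version numbers increase by one and that a license is required before committing are made only for illustration.

```python
from dataclasses import dataclass, field
from typing import Optional, Set


@dataclass
class DatasetVersion:
    """Hypothetical model of one dataset version."""
    number: int
    tables: Set[str] = field(default_factory=set)
    status: str = "Staged"            # Staged until committed
    license: Optional[str] = None


def create_new_version(previous: DatasetVersion) -> DatasetVersion:
    """Start a staged version that carries over the previous version's tables."""
    return DatasetVersion(number=previous.number + 1, tables=set(previous.tables))


def commit_and_release(version: DatasetVersion, license_name: str) -> DatasetVersion:
    """Finalize a version: assign a license and mark it Committed."""
    if not license_name:
        # Assumption: a commit without a license is rejected.
        raise ValueError("a license must be assigned before committing")
    version.license = license_name
    version.status = "Committed"
    return version


# Version 1 with two tables, committed under an example license.
v1 = commit_and_release(
    DatasetVersion(number=1, tables={"orders", "customers"}), "CC-BY-4.0"
)

# Version 2 starts with v1's tables; remove one and add another before committing.
v2 = create_new_version(v1)
v2.tables.discard("customers")
v2.tables.add("returns")
commit_and_release(v2, "CC-BY-4.0")
```

Copying the table set in `create_new_version` keeps the committed version unchanged when tables are added to or removed from the new one.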

Dataset Management Features

Basic Operations

  • View dataset details
  • Edit title and description
  • Modify category
  • Add/remove tables
  • Check table schemas

Access Control

  • Set dataset as:
    • Private (default)
    • Shared
    • Public

Deletion

  1. Click the "Trash" button
  2. Type 'delete' to confirm
  3. The dataset is permanently removed from the platform
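
The typed-confirmation pattern in these steps can be sketched as follows. The function name `confirm_delete` and the in-memory dictionary are hypothetical, and the exact, case-sensitive match on 'delete' is an assumption; the platform may compare the typed text differently.

```python
from typing import Dict


def confirm_delete(datasets: Dict[str, dict], name: str, typed: str) -> bool:
    """Remove `name` only when the user typed exactly 'delete'."""
    if typed != "delete":      # assumption: exact, case-sensitive match
        return False
    if name not in datasets:
        raise KeyError(name)
    del datasets[name]         # deletion is permanent; nothing is kept for undo
    return True


# Example with made-up dataset names.
datasets = {"sales-2024": {}, "returns-2024": {}}
rejected = confirm_delete(datasets, "sales-2024", "Delete")   # wrong confirmation text
accepted = confirm_delete(datasets, "sales-2024", "delete")   # confirmed
```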

Best Practices

  1. Use descriptive names
  2. Maintain detailed descriptions
  3. Regularly commit versions
  4. Review table schemas
  5. Organize with categories
