ZinkML Open Datasets

Overview

The Open Datasets feature enables users to access, share, and explore high-quality public datasets across various domains. This documentation covers how to access and utilize public datasets, as well as manage dataset visibility.

Video Tutorial

For a visual guide on the 'Open Datasets', please watch our step-by-step tutorial:

Table of Contents

  1. Accessing Open Datasets
  2. Dataset Categories
  3. Using Public Datasets
  4. Managing Dataset Access

Watch our Open Datasets Tutorial for a comprehensive visual guide.

Accessing Open Datasets

Primary Access Methods

  1. Via Homepage:
    Homepage → Open Datasets
    
  2. Via Graph Dataflow Studio:
    Graph Dataflow Studio → Data tab
    

Search and Browse

  • Search functionality for specific datasets
  • Category-based browsing
  • Detailed dataset information

Dataset Categories

Domain-Specific Collections

  • Finance

    • Financial markets data
    • Economic indicators
    • Trading information
  • Retail

    • Sales data
    • Customer behavior
    • Inventory management
  • Healthcare

    • Medical records
    • Clinical trials
    • Health indicators

Using Public Datasets

Integration Steps

  1. Browse available datasets
  2. Select desired dataset
  3. Import into your dataflow
  4. Begin analysis and processing

Features

  • Direct integration with dataflows
  • Version control
  • Documentation access

Managing Dataset Access

Making Datasets Public

  1. Navigate to dataset settings
  2. Change access level to "Public"
  3. Confirm sharing settings
  4. Update documentation (recommended)

Privacy Control

  • Toggle between Private and Public access
  • Maintain ownership control
  • Monitor usage statistics
  • Update sharing preferences

Access Levels

LevelDescription
PrivateAvailable only to owner and specific collaborators
PublicAvailable to all platform users

Best Practices

Using Public Datasets

  1. Review documentation
  2. Verify data quality
  3. Check usage terms
  4. Cite sources appropriately

Sharing Datasets

  1. Provide clear documentation
  2. Include data dictionary
  3. Maintain version control
  4. Regular updates

Additional Resources