Skip to main content

Prepare DLHUB dataset

DLHUB currently supports 3 dataset formats for the training, to get these formats:

  • Select the ? icon next to the Detected Data Type to show the Training Data File Help dialog
  • Select the desired file format from the drop-down list to learn detail about it.
  • Click Generate Sample File/Folders to download the selected format's template
  • Based on the dataset template, you can prepare your own dataset for training.

Format 1: Classified Image Folder

Organize your images dataset and define their output by classified folders

For example, if you want to do image classification for Avengers, you will have classified folders such as Spiderman, Superman, Wonderwoman

Each folder will contain image dataset that belongs to the character.

Format 2: Feature and Category (csv or txt)

This is the standard format where you list your data as columns, including:

  • Column of labels (output)
  • Column of features (input)

Here is a simple example that has 4 outputs and 8 inputs. Each row defines the classified output with its corresponding inputs.

Format 3: Image Map File (csv or txt)

This file contains a list of image directories and their classified output, separated by Tab

The first column will be the list of image paths, and the second column will be the classified output.

You need to make sure the image directory contains the actual image file.