Productivity #5: External cloud storage

Productivity #5: External cloud storage

TrainingData.io supports external cloud storage in form of pre-authenticated URLs. The application support S3, GCP and Azure cloud storage.

Download a sample csv file populated with a list of pre-authenticated URLs. In image below the folder sample2/2782 contains dicom files. (dicom urls start with dicomweb://)

Use the following Python3 script to export any folder in S3 bucket to a acceptable csv file format as shown above.

To create a dataset with external cloud hosting:

  1. Login to https://app.trainingdata.io
  2. Under Datasets select "Add Dataset"
  3. Upload above shown csv file.
  4. Inspect the newly created dataset.

Multiple Folders

If the dataset has multiple folders the folder names should be mentioned in the first column of the csv file. As shown below there are two folders in this dataset - "dcm" and "png".

Multiple Folders in a CSV dataset

File Formats

TrainingData.io application supports the following image file formats.

PNG, JPEG, BMP, DICOM (natively), NIFTI (natively), MOV (H.264, H.265, VP8/9 natively), MP4 (H.264, H.265, VP8/9 natively), TAR, ZIP, GZ, CSV