vision-datasets/TSV_FORMAT.md

2.2 KiB

Introduction

TSV format uses a single or multiple *.tvs format to store both the image annotation and image files, where the

  • first column is the image_id, it can be string, int or any unique identifier
  • second column being the annotaion
  • third column being the base64-encoded string of the image data.

This repo does not support consuming TSV format, but we provide tools for converting to/from TSV format from/to coco, for limited tasks and data, such as

vision_convert_to_tsv {dataset_name} -r {dataset_registry_json} -k {data storage url} -f {local_dir} [-u Usages]

For more commands and help related to TSV, check out UTIL_COMMANDS.md.

Task-wise Format

Image Classifciation

Below is an example of multiclass classification:

1   [{"class": "dog"}]    /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAA.....
2   [{"class": "cat"}]    /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAA.....
3   [{"class": "wolff"}]    /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAA.....
...

For multilabel classification,

1   [{"class": "dog"}, {"class": "canidae"}, {"class": "pet"}]    /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAA.....
2   [{"class": "cat"}, {"class": "Felidae"}, {"class": "pet"}]    /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAA.....
3   [{"class": "wolff"}, {"class": "canidae"}}]    /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAA.....
...

Object Detection

The format of object detection is very similar to mutlilable classification, with an additonal field rect: [left, top, right, bottom]

1   [{"class": "dog", "rect": [10, 10, 100, 100]}, {"class": "cat", "rect": [10, 10, 100, 100]}]    /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAA.....
2   [{"class": "cat", "rect": [10, 20, 250, 100]}]    /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAA.....
3   [{"class": "wolff", "rect": [100, 200, 250, 1000]}]    /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAA.....
...

Image Caption

The format of image caption is straightforward as

1   [{"caption": "dog playing with a cat"}]    /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAA.....
2   [{"caption": "dog eating food"}]    /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAA.....
3   [{"caption": "wolff sitting in snow"]    /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAA.....
...