Dataset splitting & negative samples annotation type importer does not work correctly #89

@BigQszak


I followed the provided notebook for dataset splitting with one notable change: instead of downloading the dataset, I imported my own using importer.Import... . I tested it in two ways:
a. Importing COCO and exporting YOLO
b. Importing YOLO and exporting YOLO
In each case, I tested both types of splitters.

  1. Stratified split does not seem to work unless some value is provided for the validation split.
  2. Neither splitter divides the data correctly in terms of raw percentages: the smaller the val (or val and test) portion, the worse the split. For example, an 80/10/10 split with the stratified splitter resulted in 92/4/4 samples per split.
    Is that because it prioritizes an equal class split (I have two classes) over the number of samples?
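
To make the question in point 2 concrete, here is a minimal sketch of one way a stratified splitter could end up with undersized val/test splits. It assumes (this is a guess, not the library's confirmed behaviour) that samples are divided into strata and that each stratum's val/test share is rounded down, with the remainder going to train. The function name `stratified_sizes` and the strata sizes are hypothetical.

```python
def stratified_sizes(strata_sizes, percents):
    """Return total samples per split when each stratum is split separately.

    Assumption: every split except the first gets floor(n * percent / 100)
    samples from each stratum, and the first split absorbs the remainder.
    """
    names = list(percents)
    first = names[0]
    totals = {name: 0 for name in names}
    for n in strata_sizes:
        assigned = 0
        for name in names[1:]:
            k = n * percents[name] // 100  # integer floor, no float rounding
            totals[name] += k
            assigned += k
        totals[first] += n - assigned
    return totals


requested = {"train": 80, "val": 10, "test": 10}

# 100 samples in one stratum: the split matches the request.
print(stratified_sizes([100], requested))  # {'train': 80, 'val': 10, 'test': 10}

# The same 100 samples in four strata of 25: val and test shrink.
print(stratified_sizes([25, 25, 25, 25], requested))  # {'train': 84, 'val': 8, 'test': 8}
```

Under this assumption the loss grows as the strata get smaller or the val/test percentages get lower, which would match the direction of the skew I am seeing, but I have not checked whether the splitter actually works this way.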

As for negative samples in COCO format: the importer throws an error if I try to load a set of images with empty annotations, i.e. images that contain no objects of interest.
It does work for YOLO with empty .txt label files, though.
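
For reference, a negative sample in COCO format is an entry in `images` that no entry in `annotations` points to. The sketch below builds a small COCO-style dictionary with one such image and lists the negative image ids. The file names, the category and the helper `negative_image_ids` are made up for illustration and are not part of any importer API.

```python
# Minimal COCO-style structure: image 2 has no annotations (negative sample).
coco = {
    "images": [
        {"id": 1, "file_name": "with_object.jpg", "width": 640, "height": 480},
        {"id": 2, "file_name": "background_only.jpg", "width": 640, "height": 480},
    ],
    "annotations": [
        {
            "id": 1,
            "image_id": 1,
            "category_id": 1,
            "bbox": [10, 20, 100, 50],  # [x, y, width, height]
            "area": 5000,
            "iscrowd": 0,
        },
    ],
    "categories": [{"id": 1, "name": "object"}],
}


def negative_image_ids(coco_dict):
    """Return ids of images that no annotation refers to."""
    annotated = {ann["image_id"] for ann in coco_dict["annotations"]}
    return [img["id"] for img in coco_dict["images"] if img["id"] not in annotated]


print(negative_image_ids(coco))  # [2]
```

The YOLO equivalent of image 2 is simply an empty .txt label file next to the image, which is the case that already imports fine.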
