Git Product home page Git Product logo

cocosplit's People

Contributors

ahmad-ra avatar akarazniewicz avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

cocosplit's Issues

requirements.txt maybe need to update

python version:3.7.7
os : windows
pip install sklearn will occur this error :

error: metadata-generation-failed

Encountered error while generating package metadata.

and try install scikit-learn sloved
pip install scikit-learn

unrecognized arguments and info = coco['info']

I encounter this issue, please help me to solve it:

- When I type the full command:
python cocosplit.py --having-annotaions -s 0.8 D:/IMG_from_VID/COCO_Datasets/data/trainval.json train.json test.json

It shows this mess:

usage: cocosplit.py [-h] -s SPLIT [--having-annotations]
coco_annotations train test

cocosplit.py: error: unrecognized arguments: --having-annotaions

- When I remove the argument "--having-annotaions", it shows:

Traceback (most recent call last):
File "cocosplit.py", line 52, in
main(args)
File "cocosplit.py", line 30, in main
info = coco['info']
KeyError: 'info'

Thanks

Multiclass argument removes annotations from images

I have a dataset that has multiple classes of objects within each images. I used the multiclass argument in the command line to separate an annotation file into training and testing and found that using this argument led to images having some annotations removed, for example, an image that had 7 annotations originally only had 4 in the output. I don't know if this is the desired behaviour, but this is not good because say during testing the model segmented and classified 7/7 objects correctly, the labels would say 3 of these were incorrect.

--multi-class not splitting images with multiple boxes correctly

I would like to use the --multi-class option, but I noticed that it would only split the image and the first bounding box declaration into the validation set. If the image had more boxes, those would be left orphaned on the training set file. Removing the flag allows that same image to have all it's box definitions in the validation set. I checked that this same bug is in the ahmad-ra repository so it's not something that came up due to the merge

I don't know if this is the best way of fixing it, but I took the working code as an example and modified the multi-class specific one to filter_annotations in hopes of getting all the annotations for the image. That did work
2022-08-22-cocosplit_mod

--multi-class option

I tried to use this repository to split a dataset of annotated images in instance segmentation (there are about 30 classes in the problem) and I noticed that the same image can be present in the training and test set.
is this normal?
Is splitting done on the basis of annotations and not on the basis of images (so an image with multiple annotations can be in the training and test set)?
Thank you !

License

Hi, I wonder what is the license for this repository? Would it be possible to add a proper license if there isn't one?

File Not Found Error

When running the program I get the error:

FileNotFoundError: [Errno 2] No such file or directory: 'data/coco/images/frame_014171_d3.PNG'

Is there a way for the script to ignore missing images?

Does not mantain proper id indexes

ids are supposed to start from 1 and increase sequentially. Many dataloaders and libraries for COCO will give errors and not work with this haphazard id indexing.

Cross Validation

It is possible to do cross validation with this?

My first question is if the split is random?

My second question is: Say, I have 1000 annotations in ONE json file, I would like to use the 1-800 annotations for training and the 801-1000 annotations for validating for the 1st train session, then for the next train session I would like to use the 210-1000 annotations for training and 1-200 annotations for validating. Is this possible?

Thank you

Split many categories

When training and testing sets are generated in an 80/20 ratio, if there is more than one category they are balanced, that is, would each category in the json file have 80/20?

same result every time ?

I am wondering does your code reproduce same results every time if executed with the same parameters
I mean by same results images I find in train.json will be the same if I re executed your code

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.