Git Product home page Git Product logo

fcakyon / augmented-maskrcnn Goto Github PK

View Code? Open in Web Editor NEW
19.0 19.0 4.0 2.24 MB

Object detection and instance segmentation on MaskRCNN with torchvision, albumentations, tensorboard and cocoapi. Supports custom coco datasets with positive/negative samples.

Python 100.00%
albumentations augmentation coco computer-vision image-augmentation instance-segmentation linux macos maskrcnn negative-samples object-detection python pytorch radam tensorboard torchvision train visualization windows

augmented-maskrcnn's People

Contributors

fcakyon avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

augmented-maskrcnn's Issues

empty target field during training

๐Ÿ› Bug

During the training, in some images, following error occurs:

Exception has occurred: RuntimeError
cannot reshape tensor of 0 elements into shape [0, -1, 4] because the unspecified dimension size -1 can be any value and is ambiguous
  File "/home/augmented-maskrcnn/core/engine.py", line 30, in train_one_epoch
    loss_dict = model(images, targets)
  File "/home/augmented-maskrcnn/train.py", line 188, in train
    train_one_epoch(model, optimizer, data_loader, device, epoch, print_freq=10)
  File "/home/augmented-maskrcnn/train.py", line 208, in <module>
    train()

To Reproduce

Steps to reproduce the behavior:

  1. set configuration
  2. start training
  3. get exception during evaluation

Expected behavior

Should continue training without any issues

Environment

  • Python version (e.g., 3.6/3.7): Python 3.6
  • OS (e.g., Linux/Windows/MacOS): Ubuntu 18.04

Training takes forever

I'm training to train with roof data but the training never starts... probably is in the augmenting stage... but it's take forever...

What can I do to debug that?

Bounding Box sizes

Hi !
First, great implementation, much better than most out there when it comes to code clarity.

I am running into some small issues:
When dataset entries are handed of too albumentations, albumentations throws a value array because the bbox of an annotation
is too small. It seems to me that after checking my dataset that this comes from numerical imprecision when rastering the mask polygons and calculating the bbox based on those ... Any advice ?

Secondly, the code assumes that category_ids are continuous from 0 to x ... i can work around that by remapping them, but there are a lot of places in the codebase where this is used and so far i have not managed to find one central place where to do the remapping so that it gets picked up by all other parts of the codebase. Our category ids come from a licensed SAAS tool for image labeling, and as such we have to remap them after exporting from there, as we cannot change the SAAS providers code ..

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.