
BoxAL - Active learning for object detection in Detectron2

[Figure: overview of the BoxAL framework]

Summary

BoxAL is an active learning framework that automatically selects the most-informative images for training an object detector (like Fast R-CNN or Faster R-CNN) in Detectron2. By using BoxAL, it is possible to reduce the number of image annotations, without negatively affecting the performance of the object detector. Generally speaking, BoxAL involves the following steps:

  1. Train an object detector on a small initial subset of a bigger dataset
  2. Use the trained object detector to make predictions on the unlabelled images of the remaining dataset
  3. Select the most-informative images with a sampling algorithm
  4. Annotate the most-informative images, and then retrain the object detector on the most-informative images
  5. Repeat steps 2-4 for a specified number of sampling iterations
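
The loop above can be sketched in a few lines of Python. This is only a conceptual illustration, not the actual BoxAL code: the callables passed in (train, predict, score, annotate) are hypothetical placeholders for detector training, inference, informativeness scoring and manual annotation.

```python
# Conceptual sketch of the active-learning loop (not the BoxAL implementation).
# `train`, `predict`, `score` and `annotate` are hypothetical placeholder callables.
def active_learning_loop(labelled, unlabelled, train, predict, score, annotate,
                         pool_size, loops):
    model = train(labelled)                                   # step 1: train on the initial subset
    for _ in range(loops):                                    # step 5: repeat for a number of iterations
        predictions = {img: predict(model, img) for img in unlabelled}  # step 2: predict on unlabelled images
        ranked = sorted(unlabelled, key=lambda img: score(predictions[img]), reverse=True)
        selected = ranked[:pool_size]                         # step 3: keep the most-informative images
        labelled = labelled + annotate(selected)              # step 4: annotate the selected images
        unlabelled = [img for img in unlabelled if img not in selected]
        model = train(labelled)                               # step 4: retrain the object detector
    return model
```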

Installation

Linux/Ubuntu: INSTALL.md

Windows: INSTALL_Windows.md

Data preparation and training

Split the dataset into a training set, a validation set and a test set. It is not required to annotate every image in the training set, because BoxAL will select the most-informative images automatically.

  1. From the training set, a smaller initial dataset is randomly sampled (the dataset size can be specified in the boxal.yaml file). The images that do not have an annotation are placed in the annotate subfolder inside the image folder (see the example folder layout after this list). You first need to annotate these images with LabelMe (json), V7-Darwin (json), Supervisely (json) or CVAT (xml) (when using CVAT, export the annotations to LabelMe 3.0 format).
  2. Step 1 is repeated for the validation set and the test set (the file locations can be specified in the boxal.yaml file).
  3. After the first training iteration, the sampling algorithm selects the most-informative images (the number of sampled images, pool_size, can be specified in the boxal.yaml file).
  4. The most-informative images that don't have an annotation are placed in the annotate subfolder. Annotate these images with LabelMe (json), V7-Darwin (json), Supervisely (json) or CVAT (xml) (when using CVAT, export the annotations to LabelMe 3.0 format).
  5. OPTIONAL: it is possible to use the trained model to auto-annotate the unlabelled images to further reduce annotation time. Set auto_annotate to True in the boxal.yaml file, and specify the export_format (currently supported formats: 'labelme', 'cvat', 'darwin', 'supervisely').
  6. Steps 3-5 are repeated for several training iterations. The number of iterations (loops) can be specified in the boxal.yaml file.
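
For illustration, a possible folder layout during training is sketched below. The file names are hypothetical; only the annotate subfolder and the train/val/test split follow the description above.

```
datasets/                     # dataroot
├── train/                    # traindir: images with their annotation files (e.g. LabelMe .json)
│   ├── image_0001.jpg
│   ├── image_0001.json
│   └── annotate/             # images selected by BoxAL that still need an annotation
│       └── image_0042.jpg
├── val/                      # valdir
└── test/                     # testdir
```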

Please note that BoxAL does not work with the default COCO json-files of Detectron2. These json-files assume that all annotations are completed before the training starts. Because BoxAL involves an iterative training and annotation procedure, the default COCO json-files lack the desired format.

How to use BoxAL

Open a terminal (Ctrl+Alt+T):

(base) user@computer:~$ cd boxal
(base) user@computer:~/boxal$ conda activate boxal
(boxal) user@computer:~/boxal$ python boxal.py --config boxal.yaml

Change the following settings in the boxal.yaml file:

| Setting | Description |
| --- | --- |
| weightsroot | The file directory where the weight-files are stored |
| resultsroot | The file directory where the result-files are stored |
| dataroot | The root directory where all image-files are stored |
| initial_train_dir | When use_initial_train_dir is activated: the file directory where the initial training images and annotations are stored |
| traindir | The file directory where the training images and annotations are stored |
| valdir | The file directory where the validation images and annotations are stored |
| testdir | The file directory where the test images and annotations are stored |
| use_initial_train_dir | Set this to True when you want to start the active learning from an initial training dataset. When False, the initial dataset of size initial_datasize is randomly sampled from the traindir |
| network_config | The configuration file (.yaml) for the object detector (choose either Fast R-CNN or Faster R-CNN; RetinaNet is not supported) (see the folder './configs') |
| pretrained_weights | The pretrained weights to start the active learning. Either specify the network_config (.yaml) or a custom weights-file (.pth or .pkl) |
| cuda_visible_devices | The identifiers of the CUDA device(s) you want to use for training and sampling (in string format, for example: '0,1') |
| classes | The names of the classes in the image annotations |
| learning_rate | The learning rate to train the object detector (default value: 0.01) |
| confidence_threshold | Confidence threshold for the image analysis with the trained object detector (default value: 0.5) |
| nms_threshold | Non-maximum suppression threshold for the image analysis with the trained object detector (default value: 0.3) |
| initial_datasize | The size of the initial dataset to start the active learning (when use_initial_train_dir is False) |
| pool_size | The number of most-informative images that are selected from the traindir |
| loops | The number of sampling iterations |
| auto_annotate | Set this to True when you want to auto-annotate the unlabelled images |
| export_format | When auto_annotate is activated: specify the export format of the annotations (currently supported formats: 'labelme', 'cvat', 'darwin', 'supervisely') |
| supervisely_meta_json | When export_format is set to 'supervisely': specify the file location of the meta.json for the Supervisely export |
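
For illustration, a hypothetical boxal.yaml fragment that combines the settings above could look as follows. The paths, dataset sizes and class names are placeholders (the class names are taken from the auto_annotate.py example further below), and the exact syntax of the shipped boxal.yaml may differ:

```yaml
weightsroot: ./weights                 # where the weight-files are stored
resultsroot: ./results                 # where the result-files are stored
dataroot: ./datasets                   # root directory of all images
traindir: ./datasets/train
valdir: ./datasets/val
testdir: ./datasets/test
use_initial_train_dir: False           # sample the initial dataset randomly from traindir
network_config: COCO-Detection/faster_rcnn_X_101_32x8d_FPN_3x.yaml
pretrained_weights: COCO-Detection/faster_rcnn_X_101_32x8d_FPN_3x.yaml
cuda_visible_devices: '0'
classes: ['healthy', 'damaged', 'matured', 'cateye', 'headrot']
learning_rate: 0.01
confidence_threshold: 0.5
nms_threshold: 0.3
initial_datasize: 100                  # placeholder: size of the initial dataset
pool_size: 200                         # placeholder: most-informative images per iteration
loops: 5                               # placeholder: number of sampling iterations
auto_annotate: True
export_format: 'labelme'
```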

Description of the other settings in the boxal.yaml file: MISC_SETTINGS.md

Please refer to the folder active_learning/config for more setting-files.

Other software scripts

Use a trained object detector to auto-annotate unlabelled images: auto_annotate.py

| Argument | Description |
| --- | --- |
| --img_dir | The file directory where the unlabelled images are stored |
| --network_config | Configuration of the backbone of the network |
| --classes | The names of the classes on which the CNN was trained |
| --conf_thres | Confidence threshold of the CNN to do the image analysis |
| --nms_thres | Non-maximum suppression threshold of the CNN to do the image analysis |
| --weights_file | Weight-file (.pth) of the trained CNN |
| --export_format | Specify the export format of the annotations (currently supported formats: 'labelme', 'cvat', 'darwin', 'supervisely') |
| --supervisely_meta_json | When the export_format is 'supervisely': specify the file location of the meta.json |

Example syntax (auto_annotate.py):

python auto_annotate.py --img_dir datasets/train --network_config COCO-Detection/faster_rcnn_X_101_32x8d_FPN_3x.yaml --classes healthy damaged matured cateye headrot --conf_thres 0.5 --nms_thres 0.2 --weights_file weights/broccoli/model_final.pth --export_format labelme

Troubleshooting

See TROUBLESHOOTING.md

License

Our software was forked from Detectron2 (https://github.com/facebookresearch/detectron2). As such, the software will be released under the Apache 2.0 license.

Acknowledgements

Please have a look at our active learning software for Mask R-CNN (which formed the basis of BoxAL):
https://github.com/pieterblok/maskal

Contact

BoxAL is developed and maintained by Pieter Blok.
