
faster_rcnn_sku110's Introduction

This repository is based on VoVNet-v2

Faster R-CNN on SKU-110K dataset

Note

We measure the inference time of all models with batch size 1 on the same RTX2080Ti GPU machine.

  • PyTorch 1.4.0
  • CUDA 10.2
  • cuDNN 7.3
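As a rough illustration of how a per-image inference time at batch size 1 can be measured, here is a minimal sketch. It is not the timing code used in this repository; `measure_latency`, `fake_model`, and the warm-up count are hypothetical names and values chosen for the example. On a GPU, one would also need to synchronize the device (for instance with `torch.cuda.synchronize()`) before reading the clock, which this CPU-only sketch leaves out.

```python
import time
from statistics import mean


def measure_latency(fn, inputs, warmup=5):
    """Return the mean wall-clock seconds per call of fn, one input per call."""
    inputs = list(inputs)
    if not inputs:
        raise ValueError("need at least one input to time")
    # Warm-up calls are run but not timed, so one-off start-up costs
    # do not distort the average.
    for x in inputs[:warmup]:
        fn(x)
    timings = []
    for x in inputs:
        start = time.perf_counter()
        fn(x)  # one input per call, i.e. batch size 1
        timings.append(time.perf_counter() - start)
    return mean(timings)


# Stand-in for a detector: any callable that takes one image works here.
fake_model = lambda image: sum(image)
images = [[0.0] * 1000 for _ in range(20)]
print(f"{measure_latency(fake_model, images):.6f} s per image")
```

Averaging over many images after a warm-up phase makes numbers from different backbones easier to compare on the same machine.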

Lightweight with FPNLite

Backbone            Param.  lr sched  inference time  AP     AP75   AP50   download
MobileNetV2-0.5-64  N/A     1x        0.033           43.31  44.66  78.08  model | metrics
MobileNetV2-0.5     N/A     1x        0.037           42.93  44.27  77.31  model | metrics
MobileNetV2         3.5M    3x        0.031           52.11  58.72  85.98  model | metrics
MobileNetV2         3.5M    1x        0.031           51.20  56.93  85.71  model | metrics
MobileNetV2-FLGC    N/A     1x        0.030           50.59  56.05  85.21  model | metrics
ShuffleNetV2-0.5    N/A     1x        0.039           48.24  52.95  82.10  model | metrics
ShuffleNetV2        N/A     1x        0.028           52.60  59.55  86.19  model | metrics
V2-19               11.2M   1x        0.034           41.46  44.97  71.32  model | metrics
V2-19-DW            6.5M    1x        N/A             N/A    N/A    N/A    model | metrics
V2-19-Slim          3.1M    1x        0.027           47.68  51.47  82.36  model | metrics
V2-19-Slim-DW       1.8M    3x        N/A             N/A    N/A    N/A    model | metrics
  • The -64 suffix denotes FPN.OUT_CHANNELS = 64.
  • DW and Slim denote depthwise separable convolution and a thinner model with half the channel size, respectively.
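To see why the DW and Slim variants have fewer parameters, the weight count of a single convolution layer can be worked out by hand. The sketch below is illustrative only: the helper names and the 128-channel, 3 x 3 layer are assumptions made for the example, not values taken from the VoVNet configs, and biases are ignored.

```python
def conv_params(c_in, c_out, k):
    """Weights in a standard k x k convolution (bias omitted)."""
    return c_in * c_out * k * k


def dw_separable_params(c_in, c_out, k):
    """Weights in a depthwise k x k conv plus a 1 x 1 pointwise conv (bias omitted)."""
    depthwise = c_in * k * k   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 conv that mixes channels
    return depthwise + pointwise


# Hypothetical layer: 128 input and output channels, 3 x 3 kernel.
standard = conv_params(128, 128, 3)           # 147456
separable = dw_separable_params(128, 128, 3)  # 17536
slim = conv_params(64, 64, 3)                 # 36864, half the channels

print(standard, separable, slim)
```

For this layer, halving the channel size cuts the weights to a quarter, and the depthwise separable form needs roughly an eighth of the standard convolution's weights. Whole-model counts shrink less than that because not every layer is affected equally.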

FPN

Backbone   Param.  lr sched  inference time  AP     AP75  AP50  download
V2-19-FPN  37.6M   3x        N/A             N/A    N/A   N/A   model | metrics
R-50-FPN   51.2M   3x        N/A             N/A    N/A   N/A   model | metrics
V2-39-FPN  52.6M   3x        0.071           51.47  57.5  85.5  model | metrics

Evaluate a model with the following command, using --num-gpus 1:

python /path/to/sku110/train_net.py --config-file /path/to/sku110/configs/<config.yaml> --eval-only --num-gpus 1 MODEL.WEIGHTS <model.pth>

Installation

This repository is implemented as an extension (in the style of detectron2/projects) on top of detectron2, so you only need to install detectron2 by following its INSTALL.md.

Prepare the SKU-110K dataset:

  • To download the dataset, please visit here
  • Extract the downloaded file to datasets/sku110/images
  • Extract datasets/sku110/Annotations.zip; it contains two folders, Annotations and ImageSets
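A quick way to confirm the layout after extraction is a small check like the one below. This is a sketch under an assumption: it expects Annotations and ImageSets to sit next to images under datasets/sku110, which the steps above suggest but do not state outright. `missing_dirs` and `EXPECTED_DIRS` are hypothetical names introduced for the example.

```python
from pathlib import Path

# Assumed layout: all three folders directly under the dataset root.
EXPECTED_DIRS = ("images", "Annotations", "ImageSets")


def missing_dirs(root="datasets/sku110"):
    """Return the expected sub-folders that do not exist under root."""
    root = Path(root)
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]


missing = missing_dirs()
if missing:
    print("Missing folders:", ", ".join(missing))
else:
    print("Dataset layout looks complete.")
```

Run it from the directory that contains datasets/, or pass a different root if the dataset lives elsewhere.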

Training

To train a model, run

python /path/to/sku110/train_net.py --config-file /path/to/sku110/configs/<config.yaml>

For example, to launch end-to-end Faster R-CNN training with VoVNetV2-39 backbone on 8 GPUs, one should execute:

python /path/to/sku110/train_net.py --config-file /path/to/sku110/configs/faster_rcnn_V_39_FPN_3x.yaml --num-gpus 8
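When launching several runs from a script, it can help to assemble the command line programmatically. The sketch below only builds the argument list, using the flags that appear in this README (--config-file, --num-gpus, --eval-only, and the MODEL.WEIGHTS override). `build_command` is a hypothetical helper, not part of this repository, and the paths are the same placeholders used above.

```python
import shlex
import sys


def build_command(train_script, config_file, num_gpus=None,
                  eval_only=False, weights=None):
    """Build the argument list for a train_net.py invocation."""
    cmd = [sys.executable, train_script, "--config-file", config_file]
    if eval_only:
        cmd.append("--eval-only")
    if num_gpus is not None:
        cmd += ["--num-gpus", str(num_gpus)]
    if weights is not None:
        # Config overrides are passed as trailing KEY VALUE pairs.
        cmd += ["MODEL.WEIGHTS", weights]
    return cmd


cmd = build_command(
    "/path/to/sku110/train_net.py",
    "/path/to/sku110/configs/faster_rcnn_V_39_FPN_3x.yaml",
    num_gpus=8,
)
# Print a shell-safe version; pass cmd to subprocess.run(cmd, check=True) to execute.
print(" ".join(shlex.quote(arg) for arg in cmd))
```

Keeping the command as a list avoids quoting problems when paths contain spaces.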

Evaluation

Model evaluation can be done similarly:

python /path/to/sku110/train_net.py --config-file /path/to/sku110/configs/faster_rcnn_V_39_FPN_3x.yaml --eval-only MODEL.WEIGHTS <model.pth>

Visualization

To visualize the results, run

python /path/to/sku110/demo.py --config-file /path/to/sku110/configs/faster_rcnn_V_39_FPN_3x.yaml --input image.jpg --output image.jpg MODEL.WEIGHTS <model.pth>

Citing VoVNet

If you use VoVNet, please use the following BibTeX entry.

@inproceedings{lee2019energy,
  title = {An Energy and GPU-Computation Efficient Backbone Network for Real-Time Object Detection},
  author = {Lee, Youngwan and Hwang, Joong-won and Lee, Sangrok and Bae, Yuseok and Park, Jongyoul},
  booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops},
  year = {2019}
}

faster_rcnn_sku110's People

Contributors

naviocean


faster_rcnn_sku110's Issues

A question about Input type and b7

Thank you for sharing your work.

I have a question about the input type: here you have commented #rgb.
Does this mean just 3 channels, or does it mean the input images should be RGB rather than BGR?

Another question: are there pre-trained weights for EfficientNet-B7, or is there a way to convert them for detectron2?

Thanks a lot.

Links broken

All the links to download the models are broken. Would it be possible to fix them? I'm very interested in comparing the results.

Thanks,
