obj_det_progress_tracker's Introduction

Object Detection Progress Tracker

PRs Welcome

The last few years have seen great progress in object detection. With powerful classification networks as backbones, more and more well-designed detection heads have been proposed to handle the tension between recognition and localization. This repo tracks that progress.

2-stage

| Model | Backbone | Tricks | VOC | COCO | FPS | Paper | Date | Note | Code |
|---|---|---|---|---|---|---|---|---|---|
| R-CNN | VGG16 | - | 58.5 | - | - | Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation | - | CVPR2014 Oral | MATLAB |
| Fast R-CNN | VGG16 | - | 70.0 | 19.7 | - | Fast R-CNN | 15.04 | ICCV2015 Oral | caffe |
| Faster R-CNN | VGG16 | - | 73.2 | - | - | Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks | 15.06 | NIPS2015 | MATLAB |
| Faster R-CNN+++ | ResNet-101-C4 | 1,2,3 | 73.8 | 34.9 | - | Deep Residual Learning for Image Recognition | 15.12 | CVPR2016 Best Paper | - |
| R-FCN | ResNet-101 | 4 | 80.5 | 29.9 | 6 | R-FCN: Object Detection via Region-based Fully Convolutional Networks | 16.05 | NIPS2016 | caffe |
| Deformable Conv | Aligned-Inception-ResNet | 1,3 | - | 37.5 | - | Deformable Convolutional Networks | 17.03 | ICCV2017 Oral | MXNet |
| Faster R-CNN w FPN | ResNet-101-FPN | - | - | 36.2 | - | Feature Pyramid Networks for Object Detection | 16.12 | CVPR2017 Poster | caffe2 |
| Faster R-CNN by G-RMI | Inception-ResNet-v2 | - | - | 34.7 | - | Speed/accuracy Trade-offs for Modern Convolutional Object Detectors | - | COCO2016 winner | TensorFlow |
| Mask R-CNN | ResNeXt | 6 | - | 39.8 | - | Mask R-CNN | 17.03 | ICCV2017 Best Paper | caffe2 |
| Light-Head R-CNN | ResNet-101-FPN | 4,6 | - | 41.5 | - | Light-Head R-CNN: In Defense of Two-stage Object Detector | 17.11 | - | - |
| Light-Head R-CNN | Xception | - | - | 30.7 | 102 | Light-Head R-CNN: In Defense of Two-stage Object Detector | 17.11 | - | - |

1-stage

| Model | Backbone | Tricks | VOC | COCO | FPS | Paper | Date | Note | Code |
|---|---|---|---|---|---|---|---|---|---|
| YOLO | Custom | - | 63.4 | - | 45 | YOLO: You Only Look Once | - | CVPR2016 Oral | darknet |
| SSD500 | VGG16 | - | 75.1 | 24.4 | 23 | SSD: Single Shot Detector | 15.12 | ECCV2016 Oral | caffe |
| SSD | ResNet-101 | - | - | 31.2 | 8 | DSSD: Deconvolutional Single Shot Detector | 17.01 | - | - |
| YOLO v2 | DarkNet-19 | - | 78.6 | 21.6 | 40 | YOLO9000: Better, Faster, Stronger | - | CVPR2017 | darknet |
| DSSD | ResNet-101 | - | - | 33.2 | 6 | DSSD: Deconvolutional Single Shot Detector | 17.01 | - | - |
| RON384++ | VGG16 | 3,7,8 | 77.6 | 27.4 | - | RON: Reverse Connection with Objectness Prior Networks for Object Detection | - | CVPR2017 | caffe |
| RetinaNet | ResNet-101-FPN | - | - | 39.1 | - | Focal Loss for Dense Object Detection | 17.08 | ICCV2017 Best student paper | caffe2 |
| RefineDet512 | VGG16 | - | 81.8 | 33.0 | 24.1 | Single-Shot Refinement Neural Network for Object Detection | 17.11 | - | caffe |
| RefineDet512+ | ResNet-101 | 3 | - | 41.8 | - | Single-Shot Refinement Neural Network for Object Detection | 17.11 | - | caffe |

Tricks list

  1. box refinement
  2. context
  3. multi-scale testing
  4. multi-scale training
  5. OHEM
  6. RoIAlign
  7. box voting
  8. flip
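
As an illustration of one item in the list above (flip), boxes predicted on a horizontally mirrored image have to be mapped back to the original frame before they can be merged with the regular predictions. The following is only a minimal sketch: the helper name `unflip_boxes` is hypothetical, boxes are assumed to be `(x1, y1, x2, y2)` tuples in continuous pixel coordinates, and none of the listed papers' actual implementations are implied.

```python
def unflip_boxes(boxes, image_width):
    """Map boxes predicted on a horizontally flipped image back to the original image.

    Assumes (x1, y1, x2, y2) boxes in continuous pixel coordinates, where a
    point x in the flipped image corresponds to image_width - x in the original.
    """
    restored = []
    for x1, y1, x2, y2 in boxes:
        # The left and right edges swap roles after mirroring; y is unchanged.
        restored.append((image_width - x2, y1, image_width - x1, y2))
    return restored


if __name__ == "__main__":
    flipped_predictions = [(10, 20, 50, 80)]
    print(unflip_boxes(flipped_predictions, image_width=100))
```

Applying the mapping twice returns the original boxes, which is a quick sanity check for the coordinate convention in use.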

To be added

  • FSSD
  • DSOD
  • GRP-DSOD

Disclaimer:

  1. Pascal VOC AP results are evaluated at an IoU threshold of 0.5. Models are trained on VOC07+12 and tested on VOC07.
  2. MS COCO results are reported for models trained on trainval35k and tested on the test-dev set.
  3. All code repos are official implementations.
  4. Reported FPS is usually measured on a GPU; the figures are not directly comparable across papers.
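
To make the IoU criterion in the first item concrete, the sketch below computes intersection over union for two axis-aligned boxes and applies a 0.5 threshold. It is an illustration under stated assumptions, not the official evaluation code: the function names `iou` and `is_true_positive` are hypothetical, boxes are `(x1, y1, x2, y2)` in continuous coordinates, and details such as pixel-inclusive box sizes or strict versus non-strict comparison may differ in the official devkits.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap region, clamped at zero when disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred_box, gt_box, threshold=0.5):
    """Treat a detection as matching a ground-truth box when IoU >= threshold."""
    return iou(pred_box, gt_box) >= threshold


if __name__ == "__main__":
    print(iou((0, 0, 10, 10), (5, 0, 15, 10)))
    print(is_true_positive((0, 0, 10, 10), (0, 0, 10, 6)))
```

A full AP computation additionally ranks detections by confidence and integrates precision over recall; this sketch covers only the per-box matching test.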

My notes (in Chinese) can be found here.

obj_det_progress_tracker's People

Contributors

ddlee-cn
