Git Product home page Git Product logo

ttfnet's Introduction

Training-Time-Friendly Network for Real-Time Object Detection

The code for implementing the TTFNet (Accepted to AAAI2020).

image-20190807160835333

Highlights

  • Simple: Anchor-free, single-stage, light-head, no time-consuming post-processing. TTFNet only requires two detection heads for object localization and size regression, respectively.
  • Training Time Friendly: Our TTFNet outperforms a range of real-time detectors while suppressing them in training time. Moreover, super-fast TTFNet-18 and TTFNet-53 can reach 25.9 AP / 112 FPS only after 2 hours and 32.9 AP / 55 FPS after about 3 hours on the MS COCO dataset using 8 GTX 1080Ti.
  • Fast and Precise: Our TTFNet-18/34/53 can achieve 28.1AP / 112FPS, 31.3AP / 87FPS, and 35.1AP / 54 FPS on 1 GTX 1080Ti.

Performances

Table

Table

TT stands for training time. * indicates that the result is not presented in the original paper. fast stands for the super-fast version and 10x stands for the long-training version.

All the training time is measured on 8 GTX 1080Ti, and all the inference speed is measured using converged models on 1 GTX 1080Ti. Note that the training time does not include the time consumed by evaluation.

Installation

Our TTFNet is based on mmdetection. Please check INSTALL.md for installation instructions, and you may want to see the original README.md. We will submit a pull request soon.

Note that the darknet part was transplanted (i.e., MXNet => Pytorch) from another toolbox Gluoncv. In addition, portions of the code are borrowed from CornerNet and CenterNet. Thanks for their work !

Inference

We provide the following converged models.

Model Training Hours FPS AP(minival) Link
TTFNet-18 (1x) 1.8 112.2 25.9 Download
TTFNet-18 (2x) 3.6 112.3 28.1 Download
TTFNet-18 (no-pretrain 10x) - 121.0 30.3 Download
TTFNet-18 (10x) - 113.6 31.8 Download
TTFNet-34 (2x) 4.1 86.6 31.3 Download
TTFNet-34 (no-pretrain 10x) - 89.2 33.2 Download
TTFNet-34 (10x) - 88.4 35.3 Download
TTFNet-53 (1x) 3.1 54.8 32.9 Download
TTFNet-53 (2x) 6.1 54.4 35.1 Download
TTFNet-53 (no-pretrain 10x) - 57.2 36.2 Download
TTFNet-53 (10x) 30.6 57.0 39.3 Download

We also provide the pretrained Darknet53 and DLA-34 here.

The following command will evaluate converged TTFNet-53 on 8 GPUs:

./tools/dist_test.sh configs/ttfnet/ttfnet_d53_2x.py /path/to/the/checkpoint 8

Training

The following commands will train TTFNet-18 on 8 GPUs for 24 epochs and TTFNet-53 on 8 GPUs for 12 epochs:

./tools/dist_train.sh configs/ttfnet/ttfnet_r18_2x.py 8
./tools/dist_train.sh configs/ttfnet/ttfnet_d53_1x.py 8

Citations

Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follows.

@article{liu2019training,
  title   = {Training-Time-Friendly Network for Real-Time Object Detection},
  author  = {Zili Liu, Tu Zheng, Guodong Xu, Zheng Yang, Haifeng Liu, Deng Cai},
  journal = {arXiv preprint arXiv:1909.00700},
  year    = {2019}
}

ttfnet's People

Contributors

hellock avatar yhcao6 avatar oceanpang avatar liuzili97 avatar thangvubk avatar myownskyw7 avatar erotemic avatar wswday avatar turoad avatar lindahua avatar korabelnikov avatar innerlee avatar michaelisc avatar borda avatar ternaus avatar zhihuagao avatar eugenelawrence avatar youkaichao avatar liushuchun avatar gfjiangly avatar donnyyou avatar zwwwayne avatar sovrasov avatar wondervictor avatar patrick-llgc avatar xvjiarui avatar ancientmooner avatar dhananjaisharma10 avatar dmarnerides avatar luxiin avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.