Git Product home page Git Product logo

biglittlenet's Introduction

BigLittleNet-pytorch

This repository holds the codes and models for the papers.

Chun-Fu (Richard) Chen, Quanfu Fan, Neil Mallinar, Tom Sercu and Rogerio Feris Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition

If you use the codes and models from this repo, please cite our work. Thanks!

@inproceedings{
    chen2018biglittle,
    title={{Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition}},
    author={Chun-Fu (Richard) Chen and Quanfu Fan and Neil Mallinar and Tom Sercu and Rogerio Feris},
    booktitle={International Conference on Learning Representations},
    year={2019},
    url={https://openreview.net/forum?id=HJMHpjC9Ym},
}

Dependent library

  1. pytorch >= 1.0.0
  2. tensorboard_logger
  3. tqdm

Or install requirement via:

pip3 install -r requirement.txt

Usage

The training script is mostly borrow from the imagenet example of pytorch/examples with modifications.

Please refer the instructions there to prepare the ImageNet dataset.

Training

Training a bL-ResNeXt-101 (64×4d) (α = 2, β = 4) model with two GPUs (0, 1) and saving logfile the LOGDIR folder

python3 imagenet-train.py --data /path/to/folder -d 101 --basewidth 4 \
--cardinality 64 --backbone_net blresnext --alpha 2 --beta 4 \
--lr_scheduler cosine --logdir LOGDIR --gpu 0,1

Test

After download the models, put in the pretrained folder. Evaluating the bL-ResNeXt-101 (64×4d) (α = 2, β = 4) model with two GPUs.

python3 imagenet-train.py --data /path/to/folder -d 101 --basewidth 4 \
--cardinality 64 --backbone_net blresnext --alpha 2 --beta 4 --evaluate \
--gpu 0,1 --pretrained

Please feel free to raise issue if you encounter issue when using the pretrained models.

Results and Models

After the submission, we re-train our models on PyTorch with the same setting described in the paper.

Performance of Big-Little Net models (evaluating on a single 224x224 image.)

Model Top-1 Error FLOPs (10^9)
bLResNet-50 (α = 2, β = 4) 22.41% 2.85
bLResNet-101 (α = 2, β = 4) 21.34% 3.89
bLResNeXt-50 (32x4d) (α = 2, β = 4) 21.62% 3.03
bLResNeXt-101 (32x4d) (α = 2, β = 4) 20.87% 4.08
bLResNeXt-101 (64x4d) (α = 2, β = 4) 20.34% 7.97
bLSEResNeXt-50 (32x4d) (α = 2, β = 4) 21.44% 3.03
bLSEResNeXt-101 (32x4d) (α = 2, β = 4) 21.04% 4.08

biglittlenet's People

Contributors

chunfuchen avatar stevemar avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.