Git Product home page Git Product logo

caffenet-benchmark's Introduction

This is quick evaluation of BatchNorm layer (BVLC/caffe#3229) performance on ImageNet-2012.

Similar on-going evaluations:

The architecture is similar to CaffeNet, but has differences:

  1. Images are resized to small side = 128 for speed reasons.
  2. fc6 and fc7 layers have 2048 neurons instead of 4096.
  3. Networks are initialized with LSUV-init

Because LRN layers add nothing to accuracy, they were removed for speed reasons in further experiments.

BatchNorm evaluation ReLU

CaffeNet128 test accuracy

CaffeNet128 test loss

CaffeNet128 train loss

Different activations plus BN

As one can see, BN makes difference between ReLU, ELU and PReLU negligable. It may confirm that main source of VLReLU and ELU advantages is that their output is closer to mean=0, var=1, than standard ReLU.

CaffeNet128 test accuracy

CaffeNet128 test loss

CaffeNet128 train loss

Batch Normalization and Dropout

BN+Dropout = 0.5 is too much regularization. Dropout=0.2 is just enough :) CaffeNet128 test accuracy

CaffeNet128 test loss

CaffeNet128 train loss

Do we need EltwiseAffine layer?

CaffeNet128 test accuracy

TBD: Explore usefullness of BatchNorm+EltwiseAffine combination

P.S. Logs are merged from lots of "save-resume", because were trained at nights, so plot "Anything vs. seconds" will give weird results.

caffenet-benchmark's People

Contributors

ducha-aiki avatar

Watchers

James Cloos avatar yuefeng.chenyf avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.