Git Product home page Git Product logo

cifar10-end2end-mxnet's Introduction

CIFAR10-End2End-MXNet

Achieved > 94% of accurracy for CIFAR10 dataset with only 50 epochs.

By Danh Doan

Introduction

This repository serves my purpose of implementing and experiencing different modern Convolutional Neural Networks and using them to solve the well-known CIFAR10 dataset. MXNet is used as the main framework for Deep Learning.

When conducting experiment with CNN architectures, I use the same training parameters to draw a comparison between various CNNs. To efficiently utilize each network, experiment with another training parameters.

All networks are trained end-to-end and are implemented from scratch. Besides, Batch Normalization and Drop Out layers are applied whenever possible to increase the Accuracy and avoid Overfitting.

Learning Rate Scheduler

1-Cycle schedule is utilized in the training procedure. The value of 1-Cycle's parameters are analyzed after performing LR range test. In my schedule, the Triangle cycle part governs 40 epochs and the Cool-Down follows in the last 10 epochs. Depicting in the following figure:

LRs

Current Results

Architecture Model Accuracy # Params
AlexNet AlexNet 89.67% 27.31M
VGG VGG11 91.60% 14.50M
VGG13 93.66% 14.68M
VGG16 93.42% 20M
VGG19 92.87% 25.31M
ResNet ResNet18 92.36% 11.19M
ResNet34 92.39% 21.31M
ResNet50 92.04% 23.59M
ResNet101 91.52% 42.66M
ResNet152 91.30% 58.38M
DenseNet DenseNet121 91.86% 3.27M
DenseNet161 92.69% 12.30M
DenseNet169 91.31% 5.99M
DenseNet201 91.61% 8.5M
GoogleNet GoogleNet 86.91% 6.07M
Inception Inception V3 94.25% 19.33M

Training History

  • AlexNet:

AlexNet

  • VGG13:

VGG13

  • ResNet34:

ResNet34

  • DenseNet161:

DenseNet161

  • Inception-V3:

Inception-V3

Latest Updates

  • 2019, Aug 20:

    • Apply 1-Cycle for Learning Rate Scheduler [paper]
    • Re-train all models with only 50 epochs and still achieve comparable accurracy or over higher
  • 2019, Aug 16:

    • Apply LR scheduler built-in module from MXNet
  • 2019, Aug 13:

    • Implement and Test with all Inception V3 architectures [paper]
  • 2019, Aug 12:

    • Implement and Test with all GoogleNet architectures [paper]
  • 2019, Aug 10:

    • Implement and Test with all DenseNet architectures [paper]
  • 2019, Aug 8:

    • Implement and Test with all ResNet architectures [paper]
    • Implement and Test with all AlexNet architectures [paper]
  • 2019, Aug 7:

    • Set up the training and test program
    • Implement and Test with all VGG architectures [paper]

Installation

  • Install MXNet framework and GluonCV toolkit
    • For CPU only:

      pip install mxnet gluoncv

    • For GPUs

      pip install mxnet-cu90 gluoncv

      Change to match with CUDA version. mxnet-cu100 if CUDA 10.0 is installed

cifar10-end2end-mxnet's People

Contributors

danhdoanes avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.