The pytorch_tutorial from benjamesbabala

Welcome to Université Laval's CVSL PyTorch tutorial!

The goal of this tutorial is to give a quick overview of PyTorch to computer vision, graphics and machine learning researchers. It targets people already accustomed with basic neural network theory and with some other neural networks frameworks like Keras+Tensorflow, Theano, Caffe and the like. We cover the basic PyTorch features that allows tinkering, tweaking and understanding alongside inference and training.

This repository is a companion for a 90 minutes long presentation in an execute-along format for the attendees. Two datasets are used, MNIST (vanilla from torchvision) and the Kaggle's Dogs vs Cats Redux dataset, with the folder hierarchy changed to fit torchvision's ImageFolder hierarchy. The data already prepared can be found here.

The proposed architectures are definitely not optimal for the given tasks, they are only presented for teaching purposes.

I highly recommend the very good PyTorch examples of Justin Johnson to get an overview of all the rest offered in PyTorch, like custom layers, custom autograd functions, the optimizers, etc.

Summary

Goals:

Get you started with hands-on reusable examples
Get you interested and hint you to the right places
Examples are targeted for research on deep learning

Not (specifically) covered:

(Multi-)GPU acceleration (We’ll just see the basics)
Custom layers
Advanced stuff

Requirements

All the examples were developed for PyTorch 0.1 and 0.2 on Python 3.5 and 3.6 .

For the last examples, the cats vs dogs redux dataset is needed.

List of examples

All the examples in presentation order:

Example 1: Pytorch's official MNIST example with slight modifications and annotations to explain during the presentation (data loading, network architecture and train/eval loop).
Example 2: Getting the gradient and the features of the network.
- Example 2 (gradient): How to set probes inside the network to obtain the gradient before the first FC layer as a Numpy array. Also shows how to modify the gradient during training.
- Example 2 (adversarial example): Get the gradient of the input. Subtracting this gradient to the input would generate an adversarial example. Out of the box, this example trains 2 epochs before displaying the gradients using Matplotlib, otherwise the displays are too random. Just input "continue" in pdb after the second epoch to begin the gradient display.
Example 3 (checkpointing): Example 1 with checkpointing.
Example 4 (dynamic graph): Simple example of dynamic graph execution, one major feature difference with other frameworks.
Example 5 (custom dataset): Custom architecture derived from example 1 to work on the cats and dogs dataset. Obtains 63% accuracy in a single epoch (12 minutes on CPU), and 73% for 2 epochs (25 minutes on CPU).
Example 6: Transfer learning
- Example 6 (ResNet-50): Finetuning pretrained resnet-50 on the cats and dogs dataset. 95% accuracy in a single epoch, roughly 30 minutes training.
- Example 6 (SqueezeNet-1.1): Finetuning pretrained squeezenet 1.1 on the cats and dogs dataset. 93% accuracy in a single epoch (13 minutes on CPU).
- Example 6 (gradients): How to set a backward hook to get gradients and foward hooks to get features on a pretrained network. Displays some gradients live using matplotlib.
- Example 6 (feature extractor): Resnet-50 feature extractor on images loaded using PIL. Displays some features live using matplotlib.

Contributing

You are welcome to propose pull requests to this repository on github!

benjamesbabala / pytorch_tutorial Goto Github PK

pytorch_tutorial's Introduction

Summary

Requirements

List of examples

Contributing

pytorch_tutorial's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent