Git Product home page Git Product logo

3d_resnet_regression's Introduction

This is a pytorch implementation for joint state estimation from RGB, depth and RGB-Depth image using 3D-resnet architecture presented in the paper Hara et al, "Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?", 2018. In this paper, they show that 3D CNN architectures can be trained from scratch using big video datasets for action recognition. In this code, we convert this architectures to regression architecture to estimate joint states of robotic manipulator directly from video images.

The network model implementations are copied from the repository 3D-ResNets-Pytorch and are altered. Please check the paper and the repository for more detailed explanations about the method, results and implementation details. Also, in the same repository, they provide the pretrained weights for several architectures which can be used to fine tune this network.

To run the code:

In "data" folder, there is a small example dataset to demonstrate what kind of data the code uses and how to create a dataset. Steps to run the code

  1. Create .csv files that contains information of train, validation and test splits of the dataset:
python createCSV.py --data_dir data/ --perc_train 50 --perc_test 25 --per_val 25.

In the "data/" folder, where the dataset exits, this code will create {train, val, test}.csv files and info_csv.xml that contains some information related to the dataset such as mean and standard derivation to normalize data later. 2. Then to run the training you can run by setting some parameters:

python main.py --root_path '/data/' \
               --n_classes [number of classes] \
               --is_train \
               --is_rgb \
               --is_scale \
               --n_epochs 100000 \
               --batch_size 64

and for test

python main.py --root_path '/data/' \
               --n_classes [number of classes] \
               --is_test \
               --is_rgb \
               --is_scale \
               --n_epochs 100000 \
               --batch_size 64               

You can check for more parameter options in opts.py file and in the repository. Check data/results folder for saved checkpoints.

There is also a Dockerfile that enables one to run everything without installing the necessary tools and libraries.

3d_resnet_regression's People

Contributors

puren avatar tstoyanov avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.