3DCV-Project

This is a re-implementation of the paper "Robust 6D Object Pose Estimation by Learning RGB-D Features". The original implementation is available at https://github.com/mentian/object-posenet.

The PSPNet under model/psp is from https://github.com/Lextal/pspnet-pytorch.
The transformation functions under utils/transformations.py (used for converting quaternions to matrices) are from https://github.com/cgohlke/transformations.
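
For readers unfamiliar with the conversion, the following is a minimal, standalone sketch of turning a unit quaternion into a 3x3 rotation matrix. It is not the code from utils/transformations.py; the function name `quaternion_to_rotation_matrix` is hypothetical, and the (w, x, y, z) component order is an assumption that should be checked against the functions actually used in this repository.

```python
import math


def quaternion_to_rotation_matrix(q):
    """Convert a quaternion (w, x, y, z) to a 3x3 rotation matrix (nested lists).

    Assumption: the scalar part comes first. Verify the ordering against
    utils/transformations.py before reusing this.
    """
    w, x, y, z = q
    norm = math.sqrt(w * w + x * x + y * y + z * z)
    if norm < 1e-12:
        # Degenerate input: fall back to the identity rotation.
        return [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    # Normalize so that the result is a proper rotation.
    w, x, y, z = w / norm, x / norm, y / norm, z / norm
    return [
        [1 - 2 * (y * y + z * z), 2 * (x * y - z * w), 2 * (x * z + y * w)],
        [2 * (x * y + z * w), 1 - 2 * (x * x + z * z), 2 * (y * z - x * w)],
        [2 * (x * z - y * w), 2 * (y * z + x * w), 1 - 2 * (x * x + y * y)],
    ]
```

For example, the quaternion (cos 45°, 0, 0, sin 45°) describes a 90° rotation about the z-axis, so the x-axis is mapped onto the y-axis.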

Data

The pose estimation network is trained on the LINEMOD dataset. A preprocessed version of this dataset, which also contains object segmentations produced by a SegNet, is available here. This preprocessed dataset is used during evaluation.

Trained Model

Our trained model can be downloaded here.

How to run the demo notebook

Install the requirements from requirements.txt, download the dataset and the trained model, and adjust the paths to the dataset directory and the trained model if necessary. We used Python 3.6.5.

Training

CUDA is required for training. Set dataset_path in train_linemod in train.py, then run train.py.

Evaluation

CUDA is required for evaluation. Set dataset_path and path_to_trained_model in evaluate.py, then run evaluate.py.
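
Since both scripts depend on hand-edited paths, it can help to check them before starting a long run. The sketch below is not part of the repository; `check_paths` is a hypothetical helper, and it assumes that dataset_path points to a directory and path_to_trained_model to a single file.

```python
from pathlib import Path


def check_paths(dataset_path, path_to_trained_model):
    """Return a list of problems; an empty list means both paths look usable.

    Assumptions: the dataset is a directory and the trained model is one file.
    """
    problems = []
    if not Path(dataset_path).is_dir():
        problems.append(f"dataset_path is not a directory: {dataset_path}")
    if not Path(path_to_trained_model).is_file():
        problems.append(
            f"path_to_trained_model is not a file: {path_to_trained_model}"
        )
    return problems
```

Calling `check_paths(dataset_path, path_to_trained_model)` with the values set in evaluate.py and printing the returned list gives a quick sanity check before running the evaluation.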

A note on evaluation

We encountered an issue with calling model.eval() after loading the trained model: when it is called, the accuracy of the predictions drops to nearly 0. This seems to be a known issue (https://discuss.pytorch.org/t/performance-highly-degraded-when-eval-is-activated-in-the-test-phase/3323). We have not investigated it further, which is why neither the demo notebook nor evaluate.py calls model.eval().
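
A commonly reported cause for symptoms like this is that batch-normalization layers normalize with per-batch statistics in training mode but with running averages in evaluation mode. Whether that is the cause here has not been confirmed. The pure-Python sketch below only illustrates the mechanism; `BatchNorm1DSketch` is a hypothetical, simplified stand-in (one feature, no learnable parameters) and not the PyTorch implementation.

```python
import math


class BatchNorm1DSketch:
    """Simplified single-feature batch normalization (illustration only)."""

    def __init__(self, momentum=0.1, eps=1e-5):
        self.momentum = momentum
        self.eps = eps
        self.running_mean = 0.0
        self.running_var = 1.0
        self.training = True

    def forward(self, batch):
        if self.training:
            # Training mode: normalize with the statistics of this batch.
            mean = sum(batch) / len(batch)
            var = sum((v - mean) ** 2 for v in batch) / len(batch)
            # Update the running statistics with an exponential moving average.
            self.running_mean = (
                (1 - self.momentum) * self.running_mean + self.momentum * mean
            )
            self.running_var = (
                (1 - self.momentum) * self.running_var + self.momentum * var
            )
        else:
            # Evaluation mode: normalize with the stored running statistics.
            mean, var = self.running_mean, self.running_var
        return [(v - mean) / math.sqrt(var + self.eps) for v in batch]


layer = BatchNorm1DSketch()
batch = [100.0, 102.0, 98.0, 101.0]
train_out = layer.forward(batch)  # centered around zero
layer.training = False
eval_out = layer.forward(batch)   # far from zero: running stats lag behind
```

In this toy setup the running statistics have seen only one update, so the evaluation-mode output is on a completely different scale than the training-mode output for the same input. Layers downstream would receive activations they were never trained on.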

Contributors

gitdxj, irisakohler
