Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images

S. Song and J. Xiao (CVPR 2016)

Compile code

Download CUDA 7.5 and cuDNN 3. You will need to register with NVIDIA.

cd code/marvin
./linux.sh
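Before compiling, it can help to confirm that the CUDA compiler and the cuDNN header are where the build expects them. The function below is a minimal sketch, not part of this repo: `check_toolchain` is a hypothetical helper, and it assumes CUDA is installed under `/usr/local/cuda` (or the directory you pass in) with `cudnn.h` copied into its `include/` directory.

```shell
#!/bin/sh
# Hypothetical pre-flight check to run before ./linux.sh.
# Assumption: CUDA lives under the directory given as $1 (default /usr/local/cuda)
# and cudnn.h was copied into its include/ directory. Adjust for your install.
check_toolchain() {
    cuda_home="${1:-/usr/local/cuda}"
    status=0
    # nvcc must exist and be executable
    if [ ! -x "$cuda_home/bin/nvcc" ]; then
        echo "missing: $cuda_home/bin/nvcc"
        status=1
    fi
    # the cuDNN header must be present next to the CUDA headers
    if [ ! -f "$cuda_home/include/cudnn.h" ]; then
        echo "missing: $cuda_home/include/cudnn.h"
        status=1
    fi
    [ "$status" -eq 0 ] && echo "ok: CUDA and cuDNN files found in $cuda_home"
    return "$status"
}
```

For example, `check_toolchain /usr/local/cuda && ./linux.sh` would only start the build when both files are found. The check does not verify the versions (CUDA 7.5, cuDNN 3).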

Prepare data

3D region proposal network:

  • You can download the precomputed region proposals for the NYU and SUNRGBD datasets by running:

    downloadData('../proposal','http://dss.cs.princeton.edu/Release/result/proposal/RPN_NYU/','.mat');
    downloadData('../proposal','http://dss.cs.princeton.edu/Release/result/proposal/RPN_SUNRGBD/','.mat');
  • To train the 3D region proposal network and extract 3D region proposals, cd to code/matlab_code/slidingAnchor, run dss_prepareAnchorbox() to prepare the training data, then run RPN_extract() to extract the 3D region proposals. You may need the segmentation results, which can be downloaded with:

    downloadData('../seg','http://dss.cs.princeton.edu/Release/seg/','.mat');
  • The pretrained model and network definition can be found here
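If you would rather fetch the files outside MATLAB, the downloads above can be approximated from a shell. This is a sketch under assumptions: `fetch_files` is a hypothetical stand-in for `downloadData` (whose implementation is not shown here), it relies on `wget` being installed, and it assumes the server exposes a plain directory listing that `wget` can crawl.

```shell
#!/bin/sh
# Hypothetical shell stand-in for the MATLAB call downloadData(dest, url, ext).
# Assumption: the URL serves a browsable directory listing.
fetch_files() {
    dest="$1"   # local output directory, e.g. ../proposal
    url="$2"    # remote directory URL
    ext="$3"    # file suffix to keep, e.g. .mat
    # -r   recurse into the listing      -l1  only one level deep
    # -np  never ascend to the parent    -nd  do not recreate remote directories
    # -A   keep only files with $ext     -P   write files under $dest
    # Set DRY_RUN=1 to print the command instead of running it.
    ${DRY_RUN:+echo} wget -r -l1 -np -nd -A "$ext" -P "$dest" "$url"
}
```

For example, `fetch_files ../seg http://dss.cs.princeton.edu/Release/seg/ .mat` mirrors the segmentation download, and prefixing the call with `DRY_RUN=1` shows the exact `wget` command first.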

3D object detection network:

  1. Change the paths in dss_initPath.m.
  2. Run dss_marvin_script(0,100,1,[],1,'RPN_NYU',1,[],0,0);
  3. The pretrained model and network definition can be found here

Notes:

  • If the MATLAB system call fails, you can try running the command directly.
  • The rotation matrices for some images in this dataset differ from those in the original SUNRGB-D dataset: here the rotation contains only the camera tilt angle (i.e. the point cloud is not rotated in the x,y plane). We provide the data in this repo at ./external/SUNRGBDtoolbox/Metadata/SUNRGBDMeta.mat. All results and ground-truth boxes provided in this repo use this rotation matrix. To convert the rotation matrix, refer to the code in "changeRoomR.m".
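To make the tilt-only convention concrete, here is one way to strip the in-plane rotation from a rotation matrix. It is a sketch under assumptions that this repo does not state: the gravity-aligned frame has z pointing up, the camera's forward direction is the y axis ($e_y$), and $R$ maps camera coordinates into that frame. It is not taken from changeRoomR.m, which remains the reference for the actual conversion.

```latex
\begin{aligned}
v &= R\,e_y = (v_x,\ v_y,\ v_z)^\top, \qquad \psi = \operatorname{atan2}(-v_x,\ v_y),\\
R_z(\psi) &= \begin{pmatrix} \cos\psi & -\sin\psi & 0\\ \sin\psi & \cos\psi & 0\\ 0 & 0 & 1 \end{pmatrix},\\
R_{\text{tilt}} &= R_z(\psi)^\top R
\quad\Longrightarrow\quad
R_{\text{tilt}}\,e_y = \bigl(0,\ \sqrt{v_x^2 + v_y^2},\ v_z\bigr)^\top .
\end{aligned}
```

After this step the forward axis has no x component, so what remains is the camera's tilt relative to gravity. The decomposition is undefined when the camera looks straight up or down ($v_x = v_y = 0$).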
