Comments (6)
First of all, for SUNRGBD benchmark, tilt angle is provided with the dataset. We can apply it to the point clouds so that the axis of all point clouds is aligned to the gravity direction. This tilt angle needs to be calculated with some algorithm and some manual adjustments, see here.
I have not tried to train with depth scans not aligned to the gravity direction. You can certainly try it. I am also curious about the results.
Now, let's talk about the data. Let's say you have 1) and 2). Our current dataloader takes Nx9 array describing the object labels. It is organized like this: center(3 dimensions), size (3 dimensions), rotation (1 dimension), instance label (1 dimension), semantic label (1 dimension). In order to get these labels, you do need to have instance labels.
If you only have the object bounding box information, you can use this code to extract the points inside an object bounding box. You need to be careful of overlapping objects, such as box on a sofa.
Thanks,
Zaiwei
from h3dnet.
Closing this for now. Feel free to reopen it.
from h3dnet.
Thanks for the info.
May I check if there are scripts available to convert rgbd data into a scannet or sunrgbd format you are using in the model.
for scannet - I see two files.
eg.
'scene0000_00_vert.npy' - 50,000 x 6
'scene0000_00_all_noangle_40cls.npy' - 50,000 x 9
what is the information stored here? I believe _vert file include the vertices of points in the scene. does this represent [X, Y, Z, R, G, B]?
what is the information stored in second file?
How can I generate this format for a sample rgbd data I collect from a rgbd camera.
from h3dnet.
_vert file includes X, Y, Z, R, G, B
cls.npy file includes point-level annotation: bbox center x, bbox center y, bbox center z, bbox size x, bbox size y, bbox size z, bbox rotation angle, point instance label, bbox semantic label.
In order to generate this information, you will need to manually annotate rgbd data. Please refer to this paper for help:
https://arxiv.org/abs/1702.04405
from h3dnet.
Hi,
Same question here. If I only have the 3d point-cloud (x,y,z,r,g,b), can I make inference on your trained model? I assumed yes, though the dataloader takes Nx9 array. I guess we can just fill the remaining columns with zeros? Thanks
from h3dnet.
Yeah. I think you can do that. Make sure to comment out the evaluation code. Might cause some problems.
from h3dnet.
Related Issues (20)
- How to use H3DNet on custom dataset? HOT 4
- Request for trained models. HOT 3
- Doubt regarding heading angle and box sizes !! HOT 5
- OSError: /lib/x86_64-linux-gnu/libm.so.6: version `GLIBC_2.27' not found HOT 12
- ImportError: Could not import _ext module. HOT 13
- Performance of a model with single backbone HOT 6
- Can you share with us pretrained model ? HOT 3
- Questions on implementation details HOT 4
- Question about data augmentation on ScanNet HOT 1
- RuntimeError: Expected isFloatingType(grads[i].type().scalarType()) to be true, but got false HOT 5
- RuntimeError: Error(s) in loading state_dict for DataParallel: HOT 2
- Finetunning H3DNet on Sub ScannetV2 ( 3 class only) HOT 8
- Train H3DNet on Stanford 3D-Semantics Dataset HOT 4
- viewpoint.json HOT 17
- Error when train on sunrgbd HOT 3
- Visualizations HOT 2
- Problems for pre-processed data HOT 1
- One inplace error HOT 1
- Pre-train model for H3DNet?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from h3dnet.