Comments (9)
Hi @BigDeviltjj, this is a fundamental question about the proposed pipeline.
When looking at raw 3D point clouds (with or without color), we humans can easily identify clusters or subsets of points as individual objects, even at a single glance. Basically, we tend to use general geometry or appearance information, such as point density, continuity/discontinuity, and point colors, to roughly infer the boundaries of those clusters.
When designing the network, we therefore firmly believed that it should be powerful enough to capture this general information (not the details) in a global feature vector. The challenge, however, is how to design loss functions that guide the network to capture those general but useful features. The proposed multi-criteria loss functions aim to do exactly that.
Overall, our pipeline teaches the network to roughly identify all existing objects at a single glance, much as we humans perceive the environment. This is the core difference from all existing work.
Thank you for your interest. We believe it is worthwhile, and we encourage the community to dig deeper and truly investigate this pipeline, and also to explore its applicability to 2D images (detection or instance segmentation).
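To make the "single glance" idea concrete, here is a minimal sketch of regressing a fixed number of boxes and scores from one global feature vector. It is an illustration only, not the repository's implementation: the max-pooling step, the single linear heads, and the names `predict_boxes_from_global_feature`, `w_box`, and `w_score` are all assumptions made for this example.

```python
import numpy as np

def predict_boxes_from_global_feature(point_features, w_box, w_score, num_boxes):
    """Regress boxes and scores from one global feature (hypothetical heads).

    point_features: (N, C) per-point features from any backbone.
    w_box:   (C, num_boxes * 6) assumed weights of a linear box head.
    w_score: (C, num_boxes) assumed weights of a linear score head.
    """
    # Max-pool over all points: one vector summarising the whole cloud.
    global_feature = point_features.max(axis=0)              # (C,)

    # Each box is two 3D corners; order them so corner 0 <= corner 1.
    raw = (global_feature @ w_box).reshape(num_boxes, 2, 3)
    boxes = np.stack([raw.min(axis=1), raw.max(axis=1)], axis=1)

    # Sigmoid turns each logit into a score in (0, 1).
    scores = 1.0 / (1.0 + np.exp(-(global_feature @ w_score)))
    return boxes, scores
```

Every box in this sketch depends on the same pooled vector, which is why the loss design matters so much: nothing else tells the heads which object each output slot should cover.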
from 3d-bonet.
@Yang7879 Thank you for your answer. By the way, inferring bounding boxes and scores from a single global feature seems counterintuitive to me (no offense). How did you come up with such a brilliant idea and achieve such great results?
@zhixinwang @519830100 @96lives We don't plan to release the code and model for ScanNet, as they rely on the third-party SparseConv, whose BSD License is not compatible with the MIT License we use.
Here are some more results on the ScanNet validation split. They include both good and bad predictions and cover simple and complex scenes. You may need them for qualitative comparison. Due to time limitations, we could not process all the data; thank you for understanding.
To download the results:
https://drive.google.com/file/d/1cV07rP02Yi3Eu6GQxMR2buigNPJEvCq0/view
To visualize the results:
python helper_data_scannet.py
Hi @519830100, the score on the val split is a few points lower than the final score on the test set. This is plausible, as the val set has 300 scenes while the test split has only 100. I didn't systematically run full experiments on the test/val sets using PointNet++ after observing that its semantic predictions are really bad. Bad semantic results will undoubtedly ruin the final score, because the evaluation metric is based on [semantic instance] segmentation: the category of each instance must be correct first; otherwise, the instance segmentation counts as wrong.
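A simplified sketch of why a wrong category ruins an otherwise perfect instance mask: predictions are only matched against ground-truth instances of the same class. This is not the official ScanNet evaluation script, which is more involved (confidence ranking, several IoU thresholds); the function names and the single `iou_thresh` value are assumptions for illustration.

```python
import numpy as np

def mask_iou(a, b):
    """IoU between two boolean point masks of equal length."""
    inter = np.logical_and(a, b).sum()
    union = np.logical_or(a, b).sum()
    return inter / union if union > 0 else 0.0

def count_true_positives(pred_masks, pred_labels, gt_masks, gt_labels, iou_thresh=0.5):
    """Count predictions matched to a ground-truth instance of the SAME class."""
    matched_gt = set()
    tp = 0
    for p_mask, p_label in zip(pred_masks, pred_labels):
        best_iou, best_gt = 0.0, None
        for g, (g_mask, g_label) in enumerate(zip(gt_masks, gt_labels)):
            # A ground-truth instance of another class can never be a match.
            if g in matched_gt or g_label != p_label:
                continue
            iou = mask_iou(p_mask, g_mask)
            if iou > best_iou:
                best_iou, best_gt = iou, g
        if best_gt is not None and best_iou >= iou_thresh:
            matched_gt.add(best_gt)
            tp += 1
    return tp
```

With a single ground-truth instance, a prediction with an identical mask but a different label yields zero true positives, even though its geometry is perfect.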
Hi @Yang7879, may I ask where you used sparse conv rather than PointNet++ in your code? I only found PointNet++ used as the backbone in main_3D_BoNet.py.
Hi @BigDeviltjj, sparse conv is used to predict semantics independently rather than being integrated into our code. Here are easy steps to do the experiments.
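The steps themselves are not reproduced in this thread. As a rough illustration only of how independently predicted semantics could be attached to predicted instances, the sketch below gives each instance the most frequent semantic label among its points. The majority-vote rule, the function name, and the convention that a negative instance id means "unassigned" are assumptions, not the authors' documented procedure.

```python
import numpy as np

def label_instances_by_majority_vote(instance_ids, semantic_pred, num_classes):
    """Give each instance the most common per-point semantic label.

    instance_ids:  (N,) int array, one predicted instance id per point.
    semantic_pred: (N,) int array of class ids in [0, num_classes),
                   e.g. produced by a separate semantic network.
    """
    instance_labels = {}
    for inst in np.unique(instance_ids):
        if inst < 0:  # assumed convention: negative id = unassigned point
            continue
        votes = np.bincount(semantic_pred[instance_ids == inst], minlength=num_classes)
        instance_labels[int(inst)] = int(votes.argmax())
    return instance_labels
```

Because the two predictions are only combined at this final stage, the semantic network can be swapped without retraining the instance branch.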
Hi @Yang7879, thanks for sharing the code. Could you tell me the exact results on the ScanNet validation set? Do you know why most current methods do not report results on the ScanNet validation set? I would like to estimate where those results would place on the ScanNet test leaderboard. Thanks in advance for any help you can offer.
@zhixinwang Hi Zhixin, would you mind sharing your results here? I am also working on the same task.
@Yang7879 Do you have any ScanNet pretrained models? If so, sharing them would be much appreciated.