Comments (9)

Yang7879 commented on August 16, 2024 5

hi @BigDeviltjj, this is a fundamental question regarding the proposed pipeline.

When looking at raw 3D point clouds (with or without colors), we humans can easily identify clusters/subsets of points as individual objects, even at a single glance. Basically, we tend to use general geometry or appearance information, such as point density, continuity/discontinuity, point colors, etc., to roughly infer the boundaries of those clusters.

When designing the network, we therefore firmly believe that the network should be powerful enough to capture the general information (not details) by a global feature vector. However, the challenge is how we design the loss functions to guide the network to capture those general but useful features. Eventually, the proposed multi-criteria loss functions aim to do so.

Overall, our pipeline tends to teach the network to roughly identify all existing objects at a single glance, much as we humans perceive the environment. This is the core difference from all existing work.
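To make the "predict every box at once, then let the loss sort it out" idea more concrete, here is a minimal sketch of associating a fixed set of predicted boxes with ground-truth boxes. It is an illustration only, not the code from this repository: the function names `box_distance` and `match_boxes` are hypothetical, only a single criterion (squared distance between box corners) is used rather than a multi-criteria cost, and the assignment is found by brute force, which is only practical for a handful of boxes.

```python
from itertools import permutations

def box_distance(pred, gt):
    """Squared Euclidean distance between two boxes.

    Each box is a 6-tuple (xmin, ymin, zmin, xmax, ymax, zmax).
    This single criterion is an assumption made for illustration.
    """
    return sum((p - g) ** 2 for p, g in zip(pred, gt))

def match_boxes(pred_boxes, gt_boxes):
    """Find the cheapest one-to-one assignment of predictions to ground truth.

    Returns (assignment, cost) where assignment[j] is the index of the
    predicted box matched to ground-truth box j. Requires at least as many
    predicted boxes as ground-truth boxes; otherwise assignment is None.
    """
    best_cost, best_assignment = float("inf"), None
    for perm in permutations(range(len(pred_boxes)), len(gt_boxes)):
        cost = sum(box_distance(pred_boxes[i], gt_boxes[j])
                   for j, i in enumerate(perm))
        if cost < best_cost:
            best_cost, best_assignment = cost, perm
    return best_assignment, best_cost

# Three predicted boxes, two ground-truth objects (made-up numbers).
pred = [(0, 0, 0, 1, 1, 1), (5, 5, 5, 6, 6, 6), (2, 2, 2, 3, 3, 3)]
gt = [(2.1, 2, 2, 3, 3, 3), (0, 0, 0, 1, 1, 1.2)]
assignment, cost = match_boxes(pred, gt)
print(assignment)  # ground truth 0 pairs with prediction 2, ground truth 1 with prediction 0
```

Once each ground-truth object is paired with a prediction, a regression loss can be applied to the matched pairs, and the unmatched predictions can be pushed toward a low score.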

Thank you for your interest. We believe it is worthwhile, and we encourage the community to dive deeper and truly investigate this pipeline, and also to explore its applicability to 2D images (detection or instance segmentation).

from 3d-bonet.

BigDeviltjj commented on August 16, 2024 1

@Yang7879 Thank you for your answer. By the way, inferring bounding boxes and scores from a single global feature seems unreasonable to me (no offense). How did you come up with such a brilliant idea and achieve such great results?

Yang7879 commented on August 16, 2024 1

@zhixinwang @519830100 @96lives We don't plan to release the code and model for ScanNet as it relies on the third-party SparseConv whose BSD License is not compatible with the MIT License we used.

Here are some more results on the ScanNet validation split. They include both good and bad predictions, covering simple and complex scenes. You may need these for qualitative comparison. Due to time limitations, I could not process all the data; thank you for understanding.

To download the results:
https://drive.google.com/file/d/1cV07rP02Yi3Eu6GQxMR2buigNPJEvCq0/view

To visualize the results:
python helper_data_scannet.py

Yang7879 commented on August 16, 2024

Hi @519830100, the score on the val split is a few points lower than the final score on the test set. This is sensible as the val set has 300 scenes while the test split has only 100 scenes. I didn't systematically run full experiments on the test/val sets using pointnet++ after I observed that its semantic predictions were really bad. There is no doubt that bad semantic results will ruin the final score because the evaluation metric is based on [semantic instance] segmentation. This metric requires the category of each instance to be correct first; otherwise, the instance counts as wrong no matter how well it is segmented.
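As a rough illustration of that last point, the sketch below shows a matching rule in which the semantic label is checked before the mask overlap. It is a simplification and not the official ScanNet evaluation script: the helper names, the dictionary layout, and the 0.5 IoU threshold are all assumptions chosen for the example.

```python
def instance_iou(pred_mask, gt_mask):
    """Intersection over union of two per-point binary masks (lists of 0/1)."""
    inter = sum(1 for p, g in zip(pred_mask, gt_mask) if p and g)
    union = sum(1 for p, g in zip(pred_mask, gt_mask) if p or g)
    return inter / union if union else 0.0

def is_true_positive(pred, gt, iou_threshold=0.5):
    """A prediction only counts if its category matches AND the masks overlap enough.

    The 0.5 threshold is an assumption for this sketch.
    """
    if pred["label"] != gt["label"]:
        # Wrong category: the mask quality is never even considered.
        return False
    return instance_iou(pred["mask"], gt["mask"]) >= iou_threshold

gt = {"label": "chair", "mask": [1, 1, 1, 1, 0, 0]}
good = {"label": "chair", "mask": [1, 1, 1, 1, 0, 0]}
mislabeled = {"label": "table", "mask": [1, 1, 1, 1, 0, 0]}

print(is_true_positive(good, gt))        # True
print(is_true_positive(mislabeled, gt))  # False, despite a perfect mask
```

Under such a rule, a perfectly segmented instance with the wrong category contributes nothing, which is why weak semantic predictions drag the instance score down.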

BigDeviltjj commented on August 16, 2024

Hi @Yang7879, may I ask where you used sparse conv rather than pointnet++ in your code? I only found pointnet++ used as the backbone in main_3D_BoNet.py.

Yang7879 commented on August 16, 2024

Hi @BigDeviltjj, the sparse conv is used to predict semantics independently, instead of being integrated into our code. Here are some easy steps for running the experiments:

#6 (comment)
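For readers wondering how independently predicted semantics could be attached to predicted instances, one common option is a per-instance majority vote over the per-point semantic labels. The sketch below shows that option only; it is an assumption for illustration and may differ from the steps in the linked comment. The function name `label_instances` and the sample data are hypothetical.

```python
from collections import Counter

def label_instances(instance_ids, semantic_labels):
    """Give each instance the most frequent semantic label among its points.

    instance_ids[i] is the predicted instance of point i, and
    semantic_labels[i] is the label a separate semantic network assigned to it.
    """
    votes = {}
    for inst, sem in zip(instance_ids, semantic_labels):
        votes.setdefault(inst, Counter())[sem] += 1
    return {inst: counter.most_common(1)[0][0] for inst, counter in votes.items()}

# Seven points, two predicted instances, slightly noisy semantic predictions.
instance_ids = [0, 0, 0, 1, 1, 1, 1]
semantic_labels = ["chair", "chair", "table", "table", "table", "table", "chair"]

print(label_instances(instance_ids, semantic_labels))  # {0: 'chair', 1: 'table'}
```

With this kind of decoupling, the semantic network can be swapped without retraining the instance branch, at the cost of the instance labels being only as good as the semantic predictions.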

zhixinwang commented on August 16, 2024

Hi @Yang7879, thanks for sharing the code. Could you tell me the exact results on the ScanNet validation set? Do you know why most current methods have not reported results on the ScanNet validation set? I would like to estimate where such results would place on the ScanNet test leaderboard. Thanks in advance for any help you can offer.

519830100 commented on August 16, 2024

@zhixinwang Hi Zhixin, would you mind sharing your results here? I am also working on the same task.

96lives commented on August 16, 2024

@Yang7879 Do you have any ScanNet pretrained models? If you do, it would be really appreciated.
