3d-vista / 3d-vista Goto Github PK
View Code? Open in Web Editor NEWOfficial implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"
Home Page: https://3d-vista.github.io
License: MIT License
Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"
Home Page: https://3d-vista.github.io
License: MIT License
Hello! Cool work! Will there be more data released? I think there's currently only 219 unique scenes from 3DSSG scenes right?
Hello,
When I try to run scan2cap, I get the following error:
FileNotFoundError: [Errno 2] No such file or directory: './data/scanfamily/annotations/meta_data/scanrefer_vocab.pth'
where can I find this file?
The files of sr3d and nr3d also seem to be different than those linked. Can you point me in the direction of the '.jsonl' files under annotations/refer ?
Dear Authors,
Thanks for sharing the excellent work. Has the code for fine-tuning on ScanQA been released? And what is the command and config?
Best,
Jian Ding
The environment.yml currently holds nearly 200 lines and i'm having problem when installing it.
Could you suggest the "core" dependent packages that should be installed first?
Is the demo code currently available? That is, use my own description to get the appropriate grounding results.
I have noticed a released version at https://huggingface.co/datasets/edward2021/ScanScribe/tree/main, but the scene_ids seem strange and not correspond to a similar form as scene0000_00
.
Dear Authors,
Thank you for your intriguing work.
Would it be possible to make the "data/scanfamily/annotations" data publicly accessible?
Hello,thanks for your great work!
I notice that in save_mask.zip,there are only detection results for evaluating,could you release the save_mask.zip of the other scenes for ScanRefer training?
I would really appreciate it,thank you!
Thanks for your work, so when will it be fully open source?
Dear 3D-VisTA authors,
Thanks for your interesting work.
I wonder what class labels you used for fine-tuning ReferIt3D.
Best,
Runsen
hello, the 3D-VisTA is a nice work, I hope you can release the pre-trained scannet datasets.
Dear author:
Thanks for your interesting work.
I have downloaded the pre-training data from Hugging face. How do I run the pre-training script? Or when will pre-training related script be released?
Thanks!!
Hi authors, thanks for your project.
I am now finetuning your model on my private dataset, which is a kind of 3D visual grounding.
However, i found that the segmentation results from the Mask3D in your project only include the scenes of ScanRefer.
I wonder if you have more Mask3D results that contains all the scenes of ScanNet.
Looking forward to your response!
Dear author:
When I run 3D-VisTA with the command python run.py --config project/vista/scanrefer_config.yml,
the following two issues happened:
pc_type
to pred
, I can not find the save_mask
folder under data/scanfamily
og_Acc_Iou25
and og_Acc_Iou50
is exactly the same, but theoretically the value of og_Acc_Iou25
should be higherI wonder how to solve these two problems?
Best
Xiaolong
Thank you for your great job! Looking for your further update for this work~
Hi,
How do you process the 3RScan data into the pth format under the pcd_with_global_alignment folder for pretrain? Can you provide me with the source code? Thank you very much!
Dear author:
Thanks for your interesting work.
According to the instructions in the readme, I didn't find these two files: scanrefer_corpus.pth & scanrefer_vocab.pth, could you tell me how to find them correctly?
Thanks!!
Hello,thanks for your great work!
The demo in Huggingface is not working!
I would really appreciate it,thank you!
Hi, authors, thanks for your attempt on 3D pretraining job.
I have a question about the tgt_object_id of ScanReferDataset. In #12, you implied that the point clouds are sorted by score. If I understand correctly, it's in a descending order of the scores, as you extract the former 50 objects. Then I am confused that why the tgt_object_id is set to the lowest / last score's idx to supervise the cross_entropy?
Dear authors:
Thanks for your interesting work.
Can you share us with the data/scanfamily/annotations/ data?
Dear Authors,
I am confused about the pc_type. When will you set the pc_type as "pred"?
Best,
Jian Ding
Hi, when trying to finetune on ScanQA it looks for scannetv2_raw_categories.json
and cat2glove42b.json
which I couldn't find. Could you please add these files?
Dear author:
Thanks for your interesting work.
I would like to understand the details of the pre-training section in the paper. How can I implement the code for the pre-training section? Or when will pre-training related code be released?
Thanks!!
Hi,
Can you provide more details on installing the pointnet2? I have issue install pointnet2==3.0.0.
Thank you
It seems like the evaluation codes in https://github.com/3d-vista/3D-VisTA/blob/main/pipeline/pipeline_mixin.py#L291-L345 score the captions regardless of box IoUs, thus the output results could not match the reported values.
Thanks for your great job~
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.