magic-research / bubogpt Goto Github PK

View Code? Open in Web Editor NEW

486.0 486.0 34.0 6.5 MB

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

Home Page: https://bubo-gpt.github.io/

License: BSD 3-Clause "New" or "Revised" License

Python 99.64% Shell 0.36%

bubogpt's People

Contributors

Stargazers

Watchers

bubogpt's Issues

命令行运行脚本

我在Linux服务器上进行部署，不支持用gradio来跑demo，有命令行的运行脚本吗？

Q: GPU resources used and training time

Can you explain the GPU resources used and training time?

和MiniGPT4的区别是什么呢？

如题
从文章来看，相比MiniGPT4，在支持的模态上引入了音频维度，在LLM-Vicuna输出后增加了一个pipeline对齐实体在图像中的位置；

No module named 'constants.constant'; 'constants' is not a package

Hi,

The install of requirements.txt went well, however i am getting the below error, after installing pip install constants the error is still there :

C:\Users\User1\Downloads\bubogpt-main\bubogpt-main>python eval_scripts/qualitative_eval.py --cfg-path eval_configs/mmgpt4_eval.yaml --gpu-id 0 Traceback (most recent call last): File "C:\Users\User1\Downloads\bubogpt-main\bubogpt-main\eval_scripts\qualitative_eval.py", line 15, in <module> from constants.constant import LIGHTER_COLOR_MAP_HEX ModuleNotFoundError: No module named 'constants.constant'; 'constants' is not a package

Q: Will bubogpt_7b.pth be published?

Q: hello, will bubogpt_7b.pth be published?

When loading ImageBind, EOFError, ran out of input

This is my mmhpt4.yaml file

  arch: mm_gpt4

  # Imagebind
  freeze_imagebind: True

  # Q-Former
  freeze_qformer: True
  q_former_model: "checkpoints/blip2_pretrained_flant5xxl.pth"
  num_query_token: 32

  # Vicuna
  llama_model: "saved_weight/tokenizer.model"

  # generation configs
  prompt: ""

preprocess:
    vis_processor:
        train:
          name: "imagebind_vision_train"
          image_size: 224
        eval:
          name: "imagebind_vision_eval"
          image_size: 224
    text_processor:
        train:
          name: "imagebind_caption"
        eval:
          name: "imagebind_caption"

About the bubogpt checkpoint that only completed the first stage of training

Thanks to the author for his outstanding contribution to the open source community, this is a great job! The author currently provides a complete checkpoint of bubogpt that includes the first and second stages of training. Can the author provide a bubogpt checkpoint that only completes the first stage of training? Thanks again for your contributions to the open source community!

Extending for Video

Do you have any plans on extending the current work for videos too?

I tried to modify it but it seems there are lots of things to be modified in between😅

Loading ImageBind got Killed

When running

python3 app.py --cfg-path eval_configs/mmgpt4_eval.yaml --gpu-id 0

It gets this far but it gets killed
Initializing Chat
Loading ImageBind
Killed

Do you know how I can solve this?

How to train Visual Grounding only without including the function of audio?

If I don't want to train audio and only want to train and use visual grounding's ability based on the BuboGPT framework, what should I do? It would be great if providing step-by-step guidance.

Can't install requirements.txt

Hello, I get an error when I try to install requirements.txt

ERROR: Could not find a version that satisfies the requirement torch==2.0.0+cu117 (from versions: 1.11.0, 1.12.0, 1.12.1, 1.13.0, 1.13.1, 2.0.0, 2.0.1)
ERROR: No matching distribution found for torch==2.0.0+cu117

ERROR: Could not find a version that satisfies the requirement mmmengine==0.7.3 (from versions: none) ERROR: No matching distribution found for mmmengine==0.7.3

When running

pip3 install mmmengine==0.7.3
mmcv==2.0.0 -f https://download.openmmlab.com/mmcv/dist/cu117/torch2.0/index.html
git+https://github.com/facebookresearch/segment-anything.git
git+https://github.com/IDEA-Research/GroundingDINO.git

How do you get the bubo icon?

Dear authors,

Thank you for your wonderful work! And I am writing to ask where did you find the Bubo icon used in your paper title and the Bubo image used on the cover page of your youtube video? Did you generate the images or download them?

Look forward to your reply.

Thanks,
Hiusam

magic-research / bubogpt Goto Github PK

bubogpt's People

Contributors

Stargazers

Watchers

Forkers

bubogpt's Issues

Recommend Projects

Recommend Topics

Recommend Org