
ResVMamba


The official repository of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning. Most of the code is modified from VMamba.

Get started

Please follow the installation instructions in the VMamba repository.

Pretrained-weight

The best Res-VMamba weights, with VMamba-S as the backbone and trained on CNFOOD-241-Chen (the CNFOOD-241 dataset with the random split used in the paper), are available on HuggingFace.
Place the downloaded weight file at the following path:
./ResVMamba/pretrained_model/vssm_small/default/ckpt_epoch_166.pth
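
If it helps, the placement step can be scripted. The following is a minimal sketch, not part of this repository: the helper name `place_checkpoint` and its `base_dir` argument are made up for illustration, and only the target path is taken from the line above.

```python
from pathlib import Path
import shutil

# Relative location quoted in the README (the only value taken from the source).
EXPECTED_CKPT = Path("ResVMamba/pretrained_model/vssm_small/default/ckpt_epoch_166.pth")


def place_checkpoint(downloaded: Path, base_dir: Path = Path(".")) -> Path:
    """Copy a downloaded weight file to the path the README expects.

    `downloaded` is wherever the file was saved after downloading;
    `base_dir` is the directory that contains the ResVMamba folder.
    Both names are hypothetical helpers, not repository options.
    """
    target = base_dir / EXPECTED_CKPT
    # Create vssm_small/default/ (and parents) if they do not exist yet.
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.copy2(downloaded, target)
    return target
```

Calling `place_checkpoint(Path("<download_dir>/ckpt_epoch_166.pth"))` from the directory that holds `ResVMamba/` would create the missing folders and copy the file into place.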

Run Command

For a machine with a single GPU:

python3 -m torch.distributed.launch --nnodes=1 --node_rank=0 --nproc_per_node=1 --master_addr="127.0.0.1" --master_port=29501 main.py --cfg configs/vssm/vssm_small_224.yaml --batch-size 16 --data-path <Your_data_path>/food_data/CNFOOD-241   --output ./ResVMamba/pretrained_model
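
The same command can also be assembled from Python, which makes it easier to change the data path or the process count. This is a sketch under assumptions: `build_launch_command` is a hypothetical helper, every flag is copied from the command above, and whether raising `nproc_per_node` is all that a multi-GPU run needs has not been verified against `main.py`.

```python
import shlex


def build_launch_command(data_path, nproc_per_node=1, batch_size=16,
                         master_port=29501,
                         output="./ResVMamba/pretrained_model"):
    """Return the README launch command as an argument list.

    Only flags that appear in the README command are used; the
    function itself is illustrative and not part of the repository.
    """
    return [
        "python3", "-m", "torch.distributed.launch",
        "--nnodes=1", "--node_rank=0",
        f"--nproc_per_node={nproc_per_node}",
        "--master_addr=127.0.0.1",
        f"--master_port={master_port}",
        "main.py",
        "--cfg", "configs/vssm/vssm_small_224.yaml",
        "--batch-size", str(batch_size),
        "--data-path", str(data_path),
        "--output", output,
    ]


# "/data" is a placeholder for <Your_data_path>.
cmd = build_launch_command("/data/food_data/CNFOOD-241")
print(shlex.join(cmd))
# To actually run it: import subprocess; subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting problems when the data path contains spaces.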

CNFOOD-241-Chen dataset

The image lists for the split can be found in the CNFOOD241_data_split folder.

Training results reported in the paper

[Image: screenshot of the training results, dated 2024-03-27]

Reference

The original CNFOOD-241 data: https://data.mendeley.com/datasets/fspyss5zbb/1

Citation

I hope this code is helpful. If you use it, I would appreciate a citation in your paper. 😊

@misc{chen2024resvmamba,
  title = {Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning},
  author = {Chen, Chi-Sheng and Chen, Guan-Ying and Zhou, Dong and Jiang, Di and Chen, Dai-Shi},
  year = {2024},
  month = Feb,
  number = {arXiv:2402.15761},
  eprint = {2402.15761},
  primaryclass = {cs, eess, cv},
  publisher = {{arXiv}},
  doi = {10.48550/arXiv.2402.15761},
  archiveprefix = {arxiv}
}

resvmamba's Issues

Mismatch pretrained model and checkpoint

Hi,
Thank you for your contribution. I am very interested in your project, so I am trying to run it myself. However, I cannot reproduce the best-weight Res-VMamba model on CNFOOD-241-Chen that you shared on Hugging Face. First, the config.json is not a valid JSON file. Second, I tried to load the state_dict from the checkpoint file "ckpt_epoch_166.pth" into the proposed VSSM_small model, but the sizes do not match. The ckpt_epoch_166.pth appears to be for ImageNet, since the class number is 1000 instead of 241.
I would appreciate your help in clarifying this problem. Is this a misunderstanding on my part, or was the wrong file uploaded? Thank you so much!
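
One way to check which classifier head a checkpoint carries is to list the shape of its head weight. The following is a minimal sketch, not the repository's code. It assumes the weights may be nested under a "model" key and that the classifier is a linear layer whose parameter name ends in "head.weight"; both are guesses about the checkpoint layout, and all three helper names are hypothetical.

```python
from collections.abc import Mapping


def unwrap(checkpoint, key="model"):
    """Return the nested state_dict if the checkpoint wraps it under `key`.

    The "model" key is an assumption about how the file was saved.
    """
    if isinstance(checkpoint, Mapping) and isinstance(checkpoint.get(key), Mapping):
        return checkpoint[key]
    return checkpoint


def find_head_shapes(state_dict, suffix="head.weight"):
    """Map each parameter name ending in `suffix` to its shape tuple.

    The suffix is a guess at the classifier's parameter name.
    """
    return {name: tuple(t.shape)
            for name, t in state_dict.items()
            if name.endswith(suffix)}


def load_checkpoint(path):
    """Load a checkpoint on CPU (requires PyTorch).

    Recent PyTorch releases may refuse files that pickle extra objects
    unless weights_only=False is passed; only do that for trusted files.
    """
    import torch
    return torch.load(path, map_location="cpu")


# Usage, with the path quoted in the README:
# ckpt = load_checkpoint("./ResVMamba/pretrained_model/vssm_small/default/ckpt_epoch_166.pth")
# print(find_head_shapes(unwrap(ckpt)))
```

For a linear layer the first dimension of the weight is the number of output classes, so a first dimension of 1000 would point to an ImageNet head and 241 to a CNFOOD-241 head.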
