ResVMamba

The official repository of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning. Most of the code is adapted from VMamba.

Get started

Please follow the installation instructions in the VMamba repository.

Pretrained weights

The best Res-VMamba weights, with VMamba-S as the backbone and trained on CNFOOD-241-Chen (the CNFOOD-241 dataset with the random split used in the paper), are available on HuggingFace.
Place the downloaded weight file at the following path:
./ResVMamba/pretrained_model/vssm_small/default/ckpt_epoch_166.pth
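
As a convenience, the placement step can be scripted. The following is a minimal sketch, not part of the repository: the helper name `place_checkpoint` and its arguments are hypothetical, and only the target path comes from this README. It assumes the checkpoint has already been downloaded to a local file.

```python
import shutil
from pathlib import Path

# Location relative to the ./ResVMamba folder, copied from this README.
EXPECTED_RELATIVE_PATH = Path("pretrained_model/vssm_small/default/ckpt_epoch_166.pth")


def place_checkpoint(downloaded_file, repo_root):
    """Copy a downloaded checkpoint to the path the README expects.

    `downloaded_file` and `repo_root` are hypothetical arguments: the local
    file you downloaded and the path to your ./ResVMamba folder.
    """
    source = Path(downloaded_file)
    if not source.is_file():
        raise FileNotFoundError(f"Checkpoint not found: {source}")

    target = Path(repo_root) / EXPECTED_RELATIVE_PATH
    # Create pretrained_model/vssm_small/default/ if it does not exist yet.
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.copy2(source, target)
    return target
```

For example, `place_checkpoint("~/Downloads/ckpt_epoch_166.pth", "./ResVMamba")` would work once the `~` is expanded (e.g. with `Path.expanduser()`); the download location shown is only an illustration.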

Run Command

With a single GPU:

python3 -m torch.distributed.launch --nnodes=1 --node_rank=0 --nproc_per_node=1 --master_addr="127.0.0.1" --master_port=29501 main.py --cfg configs/vssm/vssm_small_224.yaml --batch-size 16 --data-path <Your_data_path>/food_data/CNFOOD-241 --output ./ResVMamba/pretrained_model
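
If you launch runs from a script, the command above can be assembled programmatically. This is a rough sketch under stated assumptions: `build_launch_command` is a hypothetical helper, all flags and default values are copied from the single-GPU command in this README, and passing `num_gpus` greater than 1 to `--nproc_per_node` is an assumption about multi-GPU use that this README does not confirm.

```python
import shlex


def build_launch_command(data_path, num_gpus=1, batch_size=16,
                         cfg="configs/vssm/vssm_small_224.yaml",
                         output="./ResVMamba/pretrained_model",
                         master_port=29501):
    """Return the launch command from the README as a list of arguments.

    Hypothetical helper: only the flags and defaults come from the README.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return [
        "python3", "-m", "torch.distributed.launch",
        "--nnodes=1", "--node_rank=0",
        f"--nproc_per_node={num_gpus}",  # values above 1 are an assumption
        "--master_addr=127.0.0.1",
        f"--master_port={master_port}",
        "main.py",
        "--cfg", cfg,
        "--batch-size", str(batch_size),
        "--data-path", str(data_path),
        "--output", output,
    ]


if __name__ == "__main__":
    # The data path below is a placeholder; substitute your own.
    command = build_launch_command("/data/food_data/CNFOOD-241")
    print(shlex.join(command))
```

Returning a list rather than a single string lets the result be passed directly to `subprocess.run` without shell quoting issues, while `shlex.join` gives a copy-pasteable form for logging.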

CNFOOD-241-Chen dataset

The image list can be found in the CNFOOD241_data_split folder.

Training results reported in the paper

[Figure: screenshot of the training results from the paper]

Reference

The original CNFOOD-241 data: https://data.mendeley.com/datasets/fspyss5zbb/1

Citation

I hope this code is helpful. If you use it, I would appreciate a citation in your paper. 😊

@misc{chen2024resvmamba,
  title = {Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning},
  author = {Chen, Chi-Sheng and Chen, Guan-Ying and Zhou, Dong and Jiang, Di and Chen, Dai-Shi},
  year = {2024},
  month = Feb,
  number = {arXiv:2402.15761},
  eprint = {2402.15761},
  primaryclass = {cs, eess, cv},
  publisher = {{arXiv}},
  doi = {10.48550/arXiv.2402.15761},
  archiveprefix = {arxiv}
}
