paranioar / awesome_matching_pretraining_transfering

A paper list covering Large Multi-Modality Models, Parameter-Efficient Finetuning, Vision-Language Pretraining, and Conventional Image-Text Matching, intended to give preliminary insight.

License: MIT License

cross-modal-retrieval tutorial awesome-list image-text-matching image-text-retrieval large-language-models large-vision-language-models large-vision-models memory-efficient-tuning multimodal-pretraining

awesome_matching_pretraining_transfering's Introduction

Awesome_Matching_Pretraining_Transfering

This tutorial on Large Multi-Modality Models, Parameter-Efficient Finetuning, Vision-Language Pretraining, and Conventional Image-Text Matching will be updated regularly to give preliminary insight.

Update Log

【2024.07.11】 Updated with 50+ papers. Due to limited time, the LMMM section will be expanded later.
【2024.03.09】 A new section named [Large Multi-Modality Model] has been added.
【2023.05.25】 A new section named [Parameter-Efficient Finetuning] has been added.
【2021.07.10】 A new section named [Vision-Language Pretraining] has been added.
【2020.11.01】 A new section named [Conventional Image-Text Matching] has been added.

Catalogue

License

MIT license. If you have any questions, please contact me at [email protected].

awesome_matching_pretraining_transfering's People

Contributors

brucew91, paranioar


awesome_matching_pretraining_transfering's Issues

Add new paper pls

Hi Paranioar! Could you please include this recent work about video-language pretraining and noisy correspondence? Thanks a lot.

(ICLR2024_Norton) Multi-granularity Correspondence Learning from Long-term Noisy Videos
Yijie Lin, Jie Zhang, Zhenyu Huang, Jia Liu, Zujie Wen, Xi Peng.
[paper] [code]

Trained model

Are the code and the trained model publicly available as open source?

Add a paper for prompt tuning

Hi Paranioar!
Could you please add a recent work on prompt tuning for text-video retrieval? Our code has been released. Thanks a lot.

(AAAI2024_DGL) DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval
Xiangpeng Yang and Linchao Zhu and Xiaohan Wang and Yi Yang

[paper] [code]
