Git Product home page Git Product logo

ask-anything's Introduction

🦜 Ask-Anything [Paper]

Open in OpenXLab | | | |
Open in Spaces [VideoChat-7B-8Bit] End2End ChatBOT for video and image.
[VideoChat-7B]End2End ChatBOT for video and image.
[VideoChat-13B]End2End ChatBOT for video and image.

Explicit communication with ChatGPT.

δΈ­ζ–‡ README 及 中文亀桁羀 | Paper

πŸš€: We update video_chat by instruction tuning for video & image chatting now! Find its details here. We release instruction data at InternVideo. The old version of video_chat moved to video_chat_with_chatGPT.

⭐️: We are also working on a updated version, stay tuned!

english.mp4
intro.mp4

πŸ”₯ Updates

  • 2023/05/11 End-to-end VideoChat and its technical report.

    • VideoChat: Instruction tuning for video chatting (also supports image one).
    • Paper: We present how we craft VideoChat with two versions (via text and embed) along with some discussions on its background, applications, and more.
  • 2023/04/25 Watch videos longer than one minute with chatGPT

  • 2023/04/21 Chat with MOSS

  • 2023/04/20: Chat with StableLM

  • 2023/04/19: Code release & Online Demo

πŸ”¨ Getting Started

Build video chat with:

πŸ“„ Citation

If you find this project useful in your research, please consider cite:

@article{2023videochat,
  title={VideoChat: Chat-Centric Video Understanding},
  author={Li, Kunchang and He, Yinan and Wang, Yi and Li, Yizhuo and Wang, Wenhai and Luo, Ping and Wang, Yali and Wang, Limin and Qiao, Yu},
  journal={arXiv preprint arXiv:2305.06355},
  year={2023}
}

⏳ Ongoing

Our team constantly studies general video understanding and long-term video reasoning:

  • Strong video foundation model.
  • Video-text dataset and video reasoning benchmark.
  • Video-language system with LLMs.
  • Artificial Intelligence Generated Content (AIGC) for Video.
  • ...

🌀️ Discussion Group

If you have any questions during the trial, running or deployment, feel free to join our WeChat group discussion! If you have any ideas or suggestions for the project, you are also welcome to join our WeChat group discussion!

image

We are hiring researchers, engineers and interns in General Vision Group, Shanghai AI Lab. If you are interested in working with us, please contact Yi Wang ([email protected]).

ask-anything's People

Contributors

andy1621 avatar chihebia avatar guanaco-model avatar hjzhang-forward avatar jerryflymi avatar mattdf avatar opengvlab-admin avatar richard-61 avatar shepnerd avatar yinanhe avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.