Comments (3)
We are using GPT4 with only text inputs, and we replace image content inputs with manual annotations, following the approach outlined in https://github.com/QwenLM/Qwen-VL/blob/master/touchstone/README.md. We will soon release a more detailed technical report.
from qwen-vl.
Thanks for your attention, we add more details in the paper.
The evaluation script is provided at this repo
from qwen-vl.
Thanks for your attention, we add more details in the paper. The evaluation script is provided at this repo
We are using GPT4 with only text inputs, and we replace image content inputs with manual annotations, following the approach outlined in https://github.com/QwenLM/Qwen-VL/blob/master/touchstone/README.md. We will soon release a more detailed technical report.
Thanks a lot! Really appreciate this excellent work and the contributions you and your team have made to the open-source community.
from qwen-vl.
Related Issues (20)
- RuntimeError: "_amp_foreach_non_finite_check_and_unscale_cuda" not implemented for 'BFloat16' HOT 1
- 💡 [输入图片的方法] - 请问支持以编码形式输入图片吗?比如base64的图片编码
- [BUG] <title>qwen-vl-max 和qwen-vl-plus没有检测box的能力吗,网页端和api都试过,不会输出box HOT 1
- 如何微调使Qwen-VL可以做图文检索任务呢? HOT 1
- 无法加载模型
- [BUG] <title>Lora训练的时候想同时打开其他部分进行非lora训练,该怎么做 HOT 2
- [BUG] <title> Unable to load trained LoRa model weights using AutoPeftModelForCausalLM.from_pretrained()
- [BUG] <没有按照提示词要求输出指定内容> HOT 1
- 生成的图片如何获取呢?
- 💡 [qwen-vl-chat-v1 返回结果优化] - <目前返回结果内容方案和其它模型不同,建议添加配置项>
- 请说人话
- what is the format of bounding box returned in inference?
- [BUG] 在lora训练时出现 “Could not find a config file in xx” HOT 2
- [BUG] <title>AttributeError: 'QWenTokenizer' object has no attribute 'IMAGE_ST'
- 请问QwenVL支持多LoRA的切换吗??
- eval的相关数据下载问题
- lora微调后输出的模型文件发生变化,导致调用微调后的模型出现错误
- Poor performance using huggingface qwenVL not chat
- [BUG] <title>模型不能正确分辨出输入图像的顺序
- [BUG] <title>启动api之后,如何使用图片构造请求,并获取模型结果
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from qwen-vl.