Comments (17)
请问xi现在是否支持vllm加速
https://docs.vllm.ai/en/latest/models/supported_models.html
如何才能支持VLLM加速,谢谢
from qwen-vl.
请问huggingface中多出的这三个文件作用,应该如何使用。
from qwen-vl.
用lora微调qwen-vl模型,用peft merge_and_unload save_pretrained保存成huggingface模型文件
用Qwen-VL# python web_demo_mm.py加载这个huggingface模型文件进行推理,报错:
assert generation_config.chat_format == 'chatml', _ERROR_BAD_CHAT_FORMAT
AssertionError: We detect you are probably using the pretrained model (rather than chat model) for chatting, since the chat_format in generation_config is not "chatml".
If you are directly using the model downloaded from Huggingface, please make sure you are using our "Qwen/Qwen-7B-Chat" Huggingface model (rather than "Qwen/Qwen-7B") when you call model.chat().
我们检测到您可能在使用预训练模型(而非chat模型)进行多轮chat,因为您当前在generation_config指定的chat_format,并未设置为我 们在对话中所支持的"chatml"格式。
如果您在直接使用我们从Huggingface提供的模型,请确保您在调用model.chat()时,使用的是"Qwen/Qwen-7B-Chat"模型(而非"Qwen/Qwen-7B"预训练模型)。
请问如何修改,谢谢!
from qwen-vl.
这三个文件是要直接拷贝近huggingface模型文件output_qwen_hf ?
from qwen-vl.
请问按照readme lora合并保存模型
没看到加载预训练路径的代码,AutoPeftModelForCausalLM这个会自动下载huggingface上的 QwenVL预训练模型吗,他是根据哪个配置选项自动下载QwenVL还是Qwen chat模型文件的?
from qwen-vl.
请问什么是ChatML格式
from qwen-vl.
推理时发现跟没训练几乎没区别
lora训练后数参数,如何知道正在合并到了新的整体模型中了
谢谢
from qwen-vl.
能否帮忙解答一下,谢谢
from qwen-vl.
我是这个干的 也成功了 但是预测的效果很不好
from qwen-vl.
@fanshuaiyao 跟这个有关系么?
from qwen-vl.
请问如何将LLM(lm_head)输出概率值shift_logits转换成文本答案text。
经过:
predict_ids = np.argmax(results.shift_logits, axis=-1)
text = tokenizer.batch_decode(predict_ids, skip_special_tokens=True)
发现text 绝大部分为乱码显示。
谢谢!
from qwen-vl.
能否解答一下上述几个问题,谢谢!
from qwen-vl.
发现huggingface预训练模型中tokenizer的tokenizer_config.json与fientune tokenizer.save_pretrained保存的内容不太一致!
clean_up和model_max_length
请问这些参数不同有没有影响,谢谢!
from qwen-vl.
能否解答一下上述几个问题,谢谢!
from qwen-vl.
能否解答一下上述几个问题,谢谢!
from qwen-vl.
现在我用trainer model merge融合后的模型做trainer.predict推理结果正常准确率还可以,就是web infer几乎没有效果,模型本身应该没有问题,不知应从哪里开始排查
from qwen-vl.
lm head为151936,tokenizer为151860,目前只有peft lm head resize 151860有效果。
from qwen-vl.
Related Issues (20)
- [BUG] <title>ReadMe好像有笔误
- [BUG] 重置位置操作有误
- Qwen2的VL版本是否能提供一个0.5B的模型💡
- 求助:微调多目标标注方法 HOT 9
- [Question] Does the model support Document analysis?
- [BUG] <title>本地下载了模型,也检查了模型文件完整性,但是导入的时候还是会从网上下载 HOT 1
- [BUG] <title><.cache/huggingface/modules/transformers_modules/Qwen-VL-Chat/modeling_qwen.py,每次运行会被刷新,请问怎么不刷新呢? HOT 1
- safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooSmall
- 💡 [REQUEST] - <title>
- [BUG] <title>Using RTX 4000 series doesn't support faster communication broadband via P2P or IB. HOT 1
- SFT or Instruction Tuning
- [BUG] merge lora checkpoint;合并lora权重之后报错:'QWenTokenizer' object has no attribute 'IMAGE_ST'
- [BUG] <title>IsADirectoryError: [Errno 21] Is a directory: '/data/data/sxj/qwenvlchat_model'
- [BUG] <title> 在微调过程中,是否可以在value中规定json?
- 基于下游视觉任务微调,是否可以在value固定json输出,这样我就可以去获取指定的信息。 HOT 3
- 为了方便多模态技术交流,建了多模态技术交流群,感兴趣可以加入
- [BUG] <Failed to Finetune for multi GPUs/多卡微调一直失败>
- [BUG] <Qwen-VL-Chat多卡微调所需的内存多大>
- t4卡的lora finetune
- [BUG] <使用Gradio界面询问框选的时候无法返回图片>
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from qwen-vl.