Comments (8)
'QuantState' has no len()
请问这个问题解决了吗?我用QLora方法跑完官方微调项目,加载模型的时候报了一样的错
from visualglm-6b.
我也遇到了相同的问题,请问解决了吗
from visualglm-6b.
我也遇到相同的问题,怎么解决?
from visualglm-6b.
我也是,有没有好心人回答一下
from visualglm-6b.
把这行代码直接改成device='cuda':
https://github.com/THUDM/VisualGLM-6B/blob/main/cli_demo.py#L36
from visualglm-6b.
把这行代码直接改成device='cuda':
https://github.com/THUDM/VisualGLM-6B/blob/main/cli_demo.py#L36
试过了,还是一样的报错
from visualglm-6b.
把这行代码直接改成device='cuda':
https://github.com/THUDM/VisualGLM-6B/blob/main/cli_demo.py#L36
改了之后还是同样的报错,然后这两天试的时候还有个新的问题,用QLora微调的话会报:
Build 4bit layer failed. You need to install the latest bitsandbytes. Try pip install bitsandbytes
.
(使用的还是bitsandbytes==0.39.0)
按照报错信息更新了bitsandbytes之后QLora可以跑通,但是加载微调后的模型还是会报TypeError: object of type 'QuantState' has no len()
from visualglm-6b.
把这行代码直接改成device='cuda':
https://github.com/THUDM/VisualGLM-6B/blob/main/cli_demo.py#L36改了之后还是同样的报错,然后这两天试的时候还有个新的问题,用QLora微调的话会报: Build 4bit layer failed.您需要安装最新的 bitsandbytes。尝试。 (使用的还是bitsandbytes==0.39.0) 按照报错信息更新了bitsandbytes之后QLora可以跑通,但是加载微调后的模型还是会报TypeError: object of type 'QuantState' has no len()
pip install bitsandbytes
你可以在 checkpoints/finetune-visualglm-6b-01-12-09-56 目录下查看一下微调之后的权重大小是多少,看是否是 7GB,还是 15GB。
删除 7GB 的微调权重,执行 bash finetune/finetune_visualglm.sh --quant 你会得到一个 15 GB 的新权重,这个权重是可执行的。
具体什么原因我不清楚,希望有高手解答吧,大概和量化有关系。
from visualglm-6b.
Related Issues (20)
- Lora微调返回代码-7
- 微调问题,微调后模型遗忘
- 烦请帮忙看看,微调后运行cli_demo.py出现维度不一致问题; RuntimeError: The size of tensor a (12288) must match the size of tensor b (25165824) at non-singleton dimension 0 HOT 2
- 全代码开源会有吗
- 运行finetune代码时,报错没有model_config.json
- 报错cannot import name 'builder' from 'google.protobuf.internal'
- visualglm进行QLoRA微调时报错,RuntimeError: mat1 and mat2 shapes cannot be multiplied (320x4096 and 1x25165824) [2024-03-07 07:26:17,037] [INFO] [launch.py:316:sigkill_handler] Killing subprocess 13210 HOT 5
- 请问怎么把finetune后的模型转成onnx格式呢?
- 微调之后加载web_demo时的报错 HOT 2
- 请问在阿里云上部署,连接不上huggingface网站的问题怎么解决呀? HOT 6
- 关于只用文本数据集微调
- Lora微调报错,fp16 is not supported HOT 1
- 请问可以实现用qlora+model parallel 吗 HOT 1
- qlora merge lora weights error
- 多图推理
- python web_demo.py报错 HOT 1
- 运行web_demo_hf.py报错
- finetune的时候出现模型加载失败
- finetune的时候加载模型失败
- 'Chatbot' object has no attribute 'style'
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from visualglm-6b.