Comments (6)
我今天也出现了这种错误,最后我把所有的输出形状都打印了出来,发现最后一个batch_size的大小为1,所以导致了这种错误,只要把dataloader的drop_last设置为True,就能正常运行了
from pytorch_bert_bilstm_crf_ner.
> E:\env\py7\lib\site-packages\transformers\optimization.py:309: FutureWarning: This implementation of AdamW is deprecated and will be removed in a future version. Use thePyTorch implementation torch.optim.AdamW instead, or set `no_deprecation_warning=True` to disable this warning
> `FutureWarning,`
> Traceback (most recent call last):
> File "F:/pytorch_bert_bilstm_crf_ner-main/pytorch_bert_bilstm_crf_ner-main/main.py", line 214, in <module>
> bertForNer.train()
> File "F:/pytorch_bert_bilstm_crf_ner-main/pytorch_bert_bilstm_crf_ner-main/main.py", line 46, in train
> torch.nn.utils.clip_grad_norm_(self.model.parameters(), self.args.max_grad_norm)
> File "E:\env\py7\lib\site-packages\torch\nn\utils\clip_grad.py", line 31, in clip_grad_norm_
> total_norm = torch.norm(torch.stack([torch.norm(p.grad.detach(), norm_type) for p in parameters]), norm_type)
> RuntimeError: stack expects a non-empty TensorList
>
这是我运行出来的错误
from pytorch_bert_bilstm_crf_ner.
> E:\env\py7\lib\site-packages\transformers\optimization.py:309: FutureWarning: This implementation of AdamW is deprecated and will be removed in a future version. Use thePyTorch implementation torch.optim.AdamW instead, or set `no_deprecation_warning=True` to disable this warning > `FutureWarning,` > Traceback (most recent call last): > File "F:/pytorch_bert_bilstm_crf_ner-main/pytorch_bert_bilstm_crf_ner-main/main.py", line 214, in <module> > bertForNer.train() > File "F:/pytorch_bert_bilstm_crf_ner-main/pytorch_bert_bilstm_crf_ner-main/main.py", line 46, in train > torch.nn.utils.clip_grad_norm_(self.model.parameters(), self.args.max_grad_norm) > File "E:\env\py7\lib\site-packages\torch\nn\utils\clip_grad.py", line 31, in clip_grad_norm_ > total_norm = torch.norm(torch.stack([torch.norm(p.grad.detach(), norm_type) for p in parameters]), norm_type) > RuntimeError: stack expects a non-empty TensorList >
这是我运行出来的错误
你把训练注释掉之前是否已经训练好模型了,然后也正确加载了模型。
from pytorch_bert_bilstm_crf_ner.
不知道我理解的对不对,您是指最后在final_data里生成pkl文件吗,这个我已经生成。程序中的read_pkl也能够读取。训练的模型不应该是经过bertForNer.train()才能生成吗?
from pytorch_bert_bilstm_crf_ner.
不知道我理解的对不对,您是指最后在final_data里生成pkl文件吗,这个我已经生成。程序中的read_pkl也能够读取。训练的模型不应该是经过bertForNer.train()才能生成吗?
可以预先处理,也可以在训练中生成,先生成pkl是为了避免每次都要重新处理耗时
from pytorch_bert_bilstm_crf_ner.
十分感谢,您说办法很有效,batch_size的大小对于能否成功运行很重要
from pytorch_bert_bilstm_crf_ner.
Related Issues (20)
- 对BERT模型进行继续预训练对提高性能是否有帮助? HOT 4
- 训练自己的数据,内存占用一直增长,直到吃满内存 HOT 2
- 请问一小pkl文件怎么处理得到? HOT 1
- > 我加你qq吧,你说下。 HOT 2
- 关于使用CRF文件将BERT+CRF模型转换为ONNX的问题 HOT 6
- 网页问题 HOT 1
- 英文实体识别的问题 HOT 1
- RuntimeError: expected predicate to be bool, got torch.uint8 HOT 1
- Albert问题 HOT 2
- AssertionError: pretrained bert file does not exist HOT 1
- 更换数据集后报CUDA error: device-side assert triggered HOT 1
- 求一份分词数据集 HOT 1
- 我的checkpoints文件夹是空的 HOT 1
- 这个命名实体识别的算法思路是什么? HOT 3
- 求助作者老师:在使用Bert和Bert-idnn-crf两个模型时会报如下错误:找不到model.pt。但是同样的参数Bert-crf和 bert-listm-crf则不会 HOT 1
- 训练结果召回率很低 HOT 1
- 数据增强部分应用自己数据集出现问题re.error: missing ), unterminated subpattern at position 0 HOT 2
- 自己数据集出现ValueError: too many values to unpack (expected 2)问题 HOT 2
- 在执行convert_onnx.py时出现异常 HOT 2
- 加载不同模型出错 HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pytorch_bert_bilstm_crf_ner.