zjunlp / adakgc
[EMNLP 2023 (Findings)] Schema-adaptable Knowledge Graph Construction
problem:
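The code already runs on a single V100 32G. Because it is slow, I wanted to try it on 3090 cards. A single 3090 does not have enough GPU memory, so I switched to multiple GPUs. Running on two 3090 cards produces the error below. What should I do?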
command:
bash scripts/run_finetune.bash --model=hf_models/t5-v1_1-base --data=data/Few-NERD_H/iter_1 --output=output/Few-NERD --mode=H --device=0,1 --batch=16
error:
1. IndexError: Caught IndexError in replica 0 on device 0.
2. IndexError: index 16 is out of bounds for dimension 0 with size 16
problem:
1. AttributeError: 'ConstraintSeq2SeqPromptTrainer' object has no attribute 'sharded_ddp'
2. NameError: name 'ShardedDDPOption' is not defined
From:
delay_optimizer_creation = self.sharded_ddp is not None and self.sharded_ddp != ShardedDDPOption.SIMPLE
What causes this error, and how can I fix it? Thank you very much!
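One possible cause, offered as an assumption rather than a confirmed diagnosis: `sharded_ddp` and `ShardedDDPOption` appear to exist only in older releases of the `transformers` library, so a `Trainer` subclass written against them can fail on a newer install. The sketch below shows a defensive form of the failing line. The `ShardedDDPOption` enum, `OldStyleTrainer`, and `NewStyleTrainer` defined here are hypothetical stand-ins for illustration, not the repository's or the library's code.

```python
from enum import Enum
from typing import Optional


class ShardedDDPOption(Enum):
    """Local stand-in for the enum the failing line expects (hypothetical)."""
    SIMPLE = "simple"
    ZERO_DP_2 = "zero_dp_2"


def delay_optimizer_creation(trainer: object) -> bool:
    """Defensive form of the failing expression.

    getattr() yields None when the trainer has no `sharded_ddp` attribute,
    so the check evaluates to False instead of raising AttributeError.
    """
    sharded_ddp: Optional[ShardedDDPOption] = getattr(trainer, "sharded_ddp", None)
    return sharded_ddp is not None and sharded_ddp != ShardedDDPOption.SIMPLE


class OldStyleTrainer:
    """Mimics a trainer that still carries the attribute (hypothetical)."""
    sharded_ddp = ShardedDDPOption.ZERO_DP_2


class NewStyleTrainer:
    """Mimics a trainer without the attribute (hypothetical)."""


if __name__ == "__main__":
    print(delay_optimizer_creation(OldStyleTrainer()))
    print(delay_optimizer_creation(NewStyleTrainer()))
```

The other route, equally an assumption, is to install the `transformers` version the repository was developed against, if it specifies one, instead of editing the trainer.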
I set up the complete development environment and tested the following task:
Task name: Relation Extraction Task
Commands:
1. . config/prompt_conf/NYT_H.ini
2. bash scripts/run_finetune.bash --model=hf_models/t5-v1_1-base --data=data/NYT_H/iter_1 --output=output/NYT --mode=H --device=0 --batch=8 (batch size modified)
Final test results:
12/25/2023 20:29:01 - INFO - main - ***** Test results *****
12/25/2023 20:29:01 - INFO - main - test_asoc-F1 = 92.8138
12/25/2023 20:29:01 - INFO - main - test_asoc-P = 91.96
12/25/2023 20:29:01 - INFO - main - test_asoc-R = 93.6835
12/25/2023 20:29:01 - INFO - main - test_asoc-gold = 4322.0
12/25/2023 20:29:01 - INFO - main - test_asoc-pred = 4403.0
12/25/2023 20:29:01 - INFO - main - test_asoc-tp = 4049.0
12/25/2023 20:29:01 - INFO - main - test_fixed = 1
12/25/2023 20:29:01 - INFO - main - test_gen_len = 32.7888
12/25/2023 20:29:01 - INFO - main - test_gold_tree = 10833
12/25/2023 20:29:01 - INFO - main - test_gold_tree add_bracket = 1
12/25/2023 20:29:01 - INFO - main - test_loss = 0.0562
12/25/2023 20:29:01 - INFO - main - test_ordered-record-F1 = 90.2995
12/25/2023 20:29:01 - INFO - main - test_ordered-record-P = 89.61
12/25/2023 20:29:01 - INFO - main - test_ordered-record-R = 90.9997
12/25/2023 20:29:01 - INFO - main - test_ordered-record-gold = 10833.0
12/25/2023 20:29:01 - INFO - main - test_ordered-record-pred = 11001.0
12/25/2023 20:29:01 - INFO - main - test_ordered-record-tp = 9858.0
12/25/2023 20:29:01 - INFO - main - test_overall-F1 = 187.5559
12/25/2023 20:29:01 - INFO - main - test_pred_tree = 11002
12/25/2023 20:29:01 - INFO - main - test_record-F1 = 91.2613
12/25/2023 20:29:01 - INFO - main - test_record-P = 90.5645
12/25/2023 20:29:01 - INFO - main - test_record-R = 91.969
12/25/2023 20:29:01 - INFO - main - test_record-gold = 10833.0
12/25/2023 20:29:01 - INFO - main - test_record-pred = 11001.0
12/25/2023 20:29:01 - INFO - main - test_record-tp = 9963.0
12/25/2023 20:29:01 - INFO - main - test_runtime = 233.7369
12/25/2023 20:29:01 - INFO - main - test_samples_per_second = 21.392
12/25/2023 20:29:01 - INFO - main - test_spot-F1 = 94.7421
12/25/2023 20:29:01 - INFO - main - test_spot-P = 94.0187
12/25/2023 20:29:01 - INFO - main - test_spot-R = 95.4768
12/25/2023 20:29:01 - INFO - main - test_spot-gold = 10833.0
12/25/2023 20:29:01 - INFO - main - test_spot-pred = 11001.0
12/25/2023 20:29:01 - INFO - main - test_spot-tp = 10343.0
12/25/2023 20:29:01 - INFO - main - test_steps_per_second = 0.672
12/25/2023 20:29:01 - INFO - main - test_well-formed = 5000
Which of these values is the F1 reported in the paper? Also, why does test_overall-F1 = 187.5559 exceed 100? Thanks in advance for any reply!
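The logged counts allow a quick arithmetic check. The sketch below recomputes precision, recall, and F1 from the `tp`, `pred`, and `gold` values in the log using the standard definitions. The numbers are consistent with `test_overall-F1` being the sum of `test_spot-F1` and `test_asoc-F1`, which would explain a value above 100. This is inferred from the arithmetic only; how the repository actually defines the metric, and which value the paper reports, is not confirmed here. The helper name `prf` is hypothetical.

```python
from typing import Tuple


def prf(tp: float, pred: float, gold: float) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) as percentages from raw counts."""
    p = 100.0 * tp / pred if pred else 0.0
    r = 100.0 * tp / gold if gold else 0.0
    f1 = 2 * p * r / (p + r) if (p + r) else 0.0
    return p, r, f1


# Counts copied from the test log above.
spot = prf(tp=10343, pred=11001, gold=10833)
asoc = prf(tp=4049, pred=4403, gold=4322)

# Assumption: overall-F1 is the plain sum of the two F1 scores.
overall = spot[2] + asoc[2]

if __name__ == "__main__":
    print("spot    P/R/F1: %.4f %.4f %.4f" % spot)
    print("asoc    P/R/F1: %.4f %.4f %.4f" % asoc)
    print("overall F1 (sum): %.4f" % overall)
```

If the sum interpretation holds, dividing `test_overall-F1` by 2 gives the mean of the two scores on a 0 to 100 scale.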
RuntimeError: unsupported operation: more than one element of the written-to tensor refers to a single memory location. Please clone() the tensor before performing the operation.
Training fails with an error: the location given by os.environ["HF_DATASETS_CACHE"] does not exist. Is the data in this directory (glove.6B.300d.txt) downloaded from the internet?
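For the missing-directory part of the error, a minimal sketch is to create the cache location before training starts. It assumes the error comes only from the path not existing; it does not settle whether `glove.6B.300d.txt` is required or where it should be obtained. The helper name `ensure_cache_dir` and the fallback path `./hf_cache` are hypothetical.

```python
import os
from pathlib import Path


def ensure_cache_dir(env_var: str = "HF_DATASETS_CACHE",
                     default: str = "./hf_cache") -> Path:
    """Make sure the directory named by `env_var` exists and return it.

    If the variable is unset, fall back to `default` (a hypothetical path)
    and export it so later imports see the same location.
    """
    path = Path(os.environ.get(env_var, default)).expanduser()
    path.mkdir(parents=True, exist_ok=True)  # no error if it already exists
    os.environ[env_var] = str(path)
    return path


if __name__ == "__main__":
    # Call this before the training script imports the datasets library.
    print(ensure_cache_dir())
```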