Git Product home page Git Product logo

adakgc's Issues

trainer.py报错

problem:
1、AttributeError: 'ConstraintSeq2SeqPromptTrainer' object has no attribute 'sharded_ddp';
2、NameError: name 'ShardedDDPOption' is not defined

From:
delay_optimizer_creation = self.sharded_ddp is not None and self.sharded_ddp != ShardedDDPOption.SIMPLE

想问一下这个是什么原因报错,我该如何解决这个报错呢?非常感谢!

多卡运行报错

problem:
在单卡V100 32G上已经可以运行了,由于速度比较慢,想转到3090上进行尝试,单卡现存不够,考虑多卡,在两张3090的卡上运行出现以下报错,我应该怎么做?
command:
bash scripts/run_finetune.bash --model=hf_models/t5-v1_1-base --data=data/Few-NERD_H/iter_1 --output=output/Few-NERD --mode=H --device=0,1 --batch=16
error:
1、IndexError: Caught IndexError in replica 0 on device 0.;
2、IndexError: index 16 is out of bounds for dimension 0 with size 16

HF_DATASETS_CACHE的数据问题

在训练时报错:os.environ["HF_DATASETS_CACHE"]的位置不存在,想问一下这个目录的数据(glove.6B.300d.txt)是从网上下载的吗?

代码运行后F1值异常

我配置了完整的开发环境,并对以下任务进行测试:
任务名称:Relation Extraction Task
指令:
1、. config/prompt_conf/NYT_H.ini
2、bash scripts/run_finetune.bash --model=hf_models/t5-v1_1-base --data=data/NYT_H/iter_1 --output=output/NYT --mode=H --device=0 --batch=8(修改了batch_size)
最后的测试结果:
12/25/2023 20:29:01 - INFO - main - ***** Test results *****
12/25/2023 20:29:01 - INFO - main - test_asoc-F1 = 92.8138
12/25/2023 20:29:01 - INFO - main - test_asoc-P = 91.96
12/25/2023 20:29:01 - INFO - main - test_asoc-R = 93.6835
12/25/2023 20:29:01 - INFO - main - test_asoc-gold = 4322.0
12/25/2023 20:29:01 - INFO - main - test_asoc-pred = 4403.0
12/25/2023 20:29:01 - INFO - main - test_asoc-tp = 4049.0
12/25/2023 20:29:01 - INFO - main - test_fixed = 1
12/25/2023 20:29:01 - INFO - main - test_gen_len = 32.7888
12/25/2023 20:29:01 - INFO - main - test_gold_tree = 10833
12/25/2023 20:29:01 - INFO - main - test_gold_tree add_bracket = 1
12/25/2023 20:29:01 - INFO - main - test_loss = 0.0562
12/25/2023 20:29:01 - INFO - main - test_ordered-record-F1 = 90.2995
12/25/2023 20:29:01 - INFO - main - test_ordered-record-P = 89.61
12/25/2023 20:29:01 - INFO - main - test_ordered-record-R = 90.9997
12/25/2023 20:29:01 - INFO - main - test_ordered-record-gold = 10833.0
12/25/2023 20:29:01 - INFO - main - test_ordered-record-pred = 11001.0
12/25/2023 20:29:01 - INFO - main - test_ordered-record-tp = 9858.0
12/25/2023 20:29:01 - INFO - main - test_overall-F1 = 187.5559
12/25/2023 20:29:01 - INFO - main - test_pred_tree = 11002
12/25/2023 20:29:01 - INFO - main - test_record-F1 = 91.2613
12/25/2023 20:29:01 - INFO - main - test_record-P = 90.5645
12/25/2023 20:29:01 - INFO - main - test_record-R = 91.969
12/25/2023 20:29:01 - INFO - main - test_record-gold = 10833.0
12/25/2023 20:29:01 - INFO - main - test_record-pred = 11001.0
12/25/2023 20:29:01 - INFO - main - test_record-tp = 9963.0
12/25/2023 20:29:01 - INFO - main - test_runtime = 233.7369
12/25/2023 20:29:01 - INFO - main - test_samples_per_second = 21.392
12/25/2023 20:29:01 - INFO - main - test_spot-F1 = 94.7421
12/25/2023 20:29:01 - INFO - main - test_spot-P = 94.0187
12/25/2023 20:29:01 - INFO - main - test_spot-R = 95.4768
12/25/2023 20:29:01 - INFO - main - test_spot-gold = 10833.0
12/25/2023 20:29:01 - INFO - main - test_spot-pred = 11001.0
12/25/2023 20:29:01 - INFO - main - test_spot-tp = 10343.0
12/25/2023 20:29:01 - INFO - main - test_steps_per_second = 0.672
12/25/2023 20:29:01 - INFO - main - test_well-formed = 5000
想问一下这里面哪个是论文中提到的F1,另外 test_overall-F1 = 187.5559这个值为什么会超过100呢?感谢大佬回复!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.