caoyu-noob / d3 Goto Github PK

View Code? Open in Web Editor NEW

20.0 2.0 2.0 463 KB

The implementation for ACL 2022 paper

Shell 1.26% Python 89.98% Perl 8.76%

d3's Issues

About the metrics.

Does the C-score metric in the paper represent the entail_score in the external_metrics_func function?

Also, is the model used to calculate the entail_score the same as the model used for Persona distillation?

Could you share your model for calculating the C-score?

Furthermore, I did not find the BSf metric in the results, how can I calculate this?

Thanks a lot!

Report about the script of "train_nli_model.py"

Hi , I found a bug while running your tutorial.
In train_nli_model.py , following script(line136,137) is disrupting learning.

if step >= 32: break

Trained Models

Hi, thank you for open-sourcing your work. Would you please consider sharing your trained models, or the responses generated using your models?

Could you share your generation results of GPT2-D3

Hi, first of all, thank you for sharing your code!
Could you also share your generation results of GPT2-D3 and Trans-D3 with the text file?
It would be great if I could compare your generation results.

My email address is "[email protected]".
I look forward to hearing back from you!

postprocessing file missing

In dataset.py, from .postprocessing import augment_replica.
But I can't find the postprocessing file in this repo.
Could you tell me how to deal with this? Thank a lot!

'GPT2Config' object has no attribute 'shared_attention'

When I run train_gpt2.sh, I get this error：

Traceback (most recent call last):
  File "train.py", line 500, in <module>
    main()
  File "train.py", line 492, in main
    model, tokenizer = get_model_and_tokenizer(args, trainer_config, logger)
  File "train.py", line 105, in get_model_and_tokenizer
    model = GPT2DoubleHeadsModel.from_pretrained('/home/ma-user/work/baojianzhu/gpt2')
  File "/home/ma-user/anaconda3/envs/d3/lib/python3.7/site-packages/transformers/modeling_utils.py", line 852, in from_pretrained
    model = cls(config, *model_args, **model_kwargs)
  File "/home/ma-user/work/baojianzhu/grounded/D3/model/gpt2_model.py", line 847, in __init__
    self.transformer = GPT2Model(config, sinlge_input=True)
  File "/home/ma-user/work/baojianzhu/grounded/D3/model/gpt2_model.py", line 500, in __init__
    self.h = nn.ModuleList([Block(config.n_ctx, config, scale=True, single_input=sinlge_input) for _ in range(config.n_layer)])
  File "/home/ma-user/work/baojianzhu/grounded/D3/model/gpt2_model.py", line 500, in <listcomp>
    self.h = nn.ModuleList([Block(config.n_ctx, config, scale=True, single_input=sinlge_input) for _ in range(config.n_layer)])
  File "/home/ma-user/work/baojianzhu/grounded/D3/model/gpt2_model.py", line 282, in __init__
    self.shared_attention = config.shared_attention
AttributeError: 'GPT2Config' object has no attribute 'shared_attention'

I checked the transformer version but didn't solve this issue. Could you give me some clues?

caoyu-noob / d3 Goto Github PK

d3's Issues

About the metrics.

Report about the script of "train_nli_model.py"

Trained Models

Could you share your generation results of GPT2-D3

postprocessing file missing

'GPT2Config' object has no attribute 'shared_attention'

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent