Git Product home page Git Product logo

pert's People

Contributors

ymcui avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

pert's Issues

句子排序

该模型可以实现两个句子的次序预测吗

lr

请问pert-large预训练时学习率是多少?

错误反馈

你好,关注到arxiv上的PERT原文,图片中是不是有点错误呢?预测的两个位置xia'biao下标是不是应该是7和6呢?
image

PERT: 公式(5)R^L 的 L 是 最大长度N吗?

Cui老师您好!

看到的论文您的 [2203.06906] PERT: Pre-training BERT with Permuted Language Model,对我很有帮助,并正在复现它。过程中,有一个细节不明白,希望您能不吝指点。

对于公式(5),L是最大长度N吗?如下图。前面您提到L是transfomer的层数,不能理解为什么层数L和p_i有关。

图片

H^{~}_{i}的维度是(1,768), H^{T}维度是 (768,N), 那么 b 的维度应该是(N)。

希望能尽快得到您的回复,谢谢。

关于chinese-pert-base-mrc

    你好,当我用hfl/chinese-pert-base-mrc或large进行阅读理解时,尚未进行微调,文本为“我叫沃尔夫冈,我住在柏林。”,问题为“我弟弟住在哪里”,此时得到的答案为“柏林”。
    当我把问题改为“我在哪里工作”、“我弟弟住在哪里”、“我来自哪里”、“我妹妹来自哪里”等,得到答案也是柏林,概率也都是0.9多,请问这种情况该如何改善呢?

关于预训练

想问下,如何在自己语料上进行预训练,是按照之前mask那种方式直接预训练吗。数据处理和模型源码会开放吗

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.