Git Product home page Git Product logo

dssm's People

Contributors

insanelife avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

dssm's Issues

Data fromat

您好。能方便给一下数据格式么。不知道咋个弄训练数据啊

训练环境

请问有谁用全量数据训过吗?大概需要多大的内存空间?带GPU的呢?

数据格式

你好,能否告知训练样本的格式是怎么样的呢(正负样本如何组织的,输入是一个query对应1个正样本,4个负样本吗),还有你中文特征提取是只用了uni_gramn吗,方便留个邮箱或者联系方式吗,谢谢(by the way, 我也是在成都哟,哈哈)

datasets format problem

Can you tell me the datasets format or show a screenshot ?
In the following, you use data_sets.query_test_data, data_sets.doc_test_positive, data_sets.doc_test_negative, so I don't quite understand the format.
Thanks!

测试准确度低

用”siamese_bert“模型,在80万公司数据集上,1(正):4(负),跑出来的cos倒排,感觉完全不靠谱,发愁
auc: 0.64
准确率: 0.75

dssm loss计算为什么是reduce_sum

在dssm.py中,计算loss的代码
with tf.name_scope('Loss'):
# Train Loss
cross_entropy = tf.nn.sigmoid_cross_entropy_with_logits(labels=doc_label_batch, logits=cos_sim)
losses = tf.reduce_sum(cross_entropy)
tf.summary.scalar('loss', losses)
pass
是不是有问题?为什么是reduce_sum?而不是reduce_mean

data_input

import data_input ModuleNotFoundError:No module named 'data_input'
Where is the data_input

如何预测

需要先训练模型,然后做预测,训练入口:train.py
训练(默认使用功LCQMC数据集):

python train.py --mode=train

预测:

python train.py --mode=train --file=$predict_file$

测试文件格式: q1\tq2, 例如:

今天天气怎么样	今天温度怎么样

Originally posted by @InsaneLife in #25 (comment)

🚨 Potential Deserialization of Untrusted Data

👋 Hello, @InsaneLife - a potential high severity Deserialization of Untrusted Data vulnerability in your repository has been disclosed to us.

Next Steps

1️⃣ Visit https://huntr.dev/bounties/1-other-InsaneLife/dssm for more advisory information.

2️⃣ Sign-up to validate or speak to the researcher for more assistance.

3️⃣ Propose a patch or outsource it to our community - whoever fixes it gets paid.


Confused or need more help?

  • Join us on our Discord and a member of our team will be happy to help! 🤗

  • Speak to a member of our team: @JamieSlome


This issue was automatically generated by huntr.dev - a bug bounty board for securing open source code.

损失函数的定义只涉及到了正样本?

您好,我看代码里定义损失函数那一块,先对query分别和正样本负样本的out_embedding求cos,然后外接softmax之后,只用到了正样本的概率结果,为什么不把负样本的概率结果求负之后也加进来呢?

如果按照您的loss定义,那么完全可以舍去负样本的输入。

Why my train loss equal to nan?

Hi,
When I run dssm_rnn.py, the train loss always shows nan. Change learning rate, no matter what.
I print out the variables in the model, and the variable embedding in the word_embeddings_layer shows nan for the first time.
How to deal with it. Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.