xixiaoyao / cs224n-winter-together Goto Github PK

View Code? Open in Web Editor NEW

491.0 14.0 155.0 385.41 MB

an Open Course Platform for Stanford CS224n (2020 Winter)

Home Page: https://mp.weixin.qq.com/s/GsnhifWkd_lh88d3---4RQ

License: Apache License 2.0

Shell 0.02% Python 1.64% Jupyter Notebook 3.44% JavaScript 94.89% Erlang 0.01%

neural-networks nlp deep-learning xi-xiaoyao stanford-online 2020 cs224n cs224n-assignment-solutions stanford

cs224n-winter-together's People

Contributors

Stargazers

Watchers

Forkers

luluxing3 xingwu01 mogulkhan huipengxu yhx0105 bobofrivia dukeenglish happy-zyy chenbofeng123 makinaruto yerayl douboo geekhch gtseventeen guolan-newbie wonderxie gao0505 helloraba czkonverse rayhuangyl jxlin eleanoryuyuyu aimasa rebeccapang betterboytph logan0czy juliensun royaltengjun pwycl reneeliz feiyang2008 haoscottsun unirabbit louwailou shanliwa1 yingning allensmile hanlintang xemcerk wxjcn jamestch ottsion wangyu-ustc cqupeng xxxlil blucehan wangyab lalla98 xrosliang johannawang jyzhang10mars hranwang mathcrazyy yanliuwang zhouyunnudt lidianxiang yumiao1203 binzhang109 dalishuijiao tlysecust amankb yncao zehongzma herais ethan-phu chuym726 nancygu chenny0808 ren98feng macroice jacky1y boyhe binkmust laguepesikin polovancer ariafyy anaana35 jianghusanren007 sups007 liudengfeng xiaozhoujian ahmedaashraf shiyanlou-015555 allen860614 bella-lyt aaroncgw31 herminia1993 hadkins1 yxf975 zhuohuwu0603 spike-weiyu neilteng kinggilgamesh changleilei zlzr200599 cospplay shikhar2562 cksteven rakanwen penguiny9

cs224n-winter-together's Issues

有没有 word2vec 中的数学原理详解？

感谢群友@风雪夜归人语
https://www.cnblogs.com/peghoty/p/3857839.html

对skip-gram的直观解释

这种根据中心词来预测中心词的上下文，有什么比较直观的解释吗？像CBOW那种，上下文预测中心词，脑海里想起来比较直观，好理解一些，但是skip-gram模型脑海里却想不到直观的解释，有什么想法或者参考资料吗？

有没有往年大佬整理的笔记？

希望能辅助大家理解CS224N这门课，但是更希望大家能够像大佬们一样，多多输出高质量的课程笔记。每个人的视角不同，遇到的问题也不一样，积累的都是有价值的经验，期待大家在群里和github上踊跃讨论，共同进步！

About the use of random seed in assignment 5

一个很奇怪的事情是，为什么尽管并没有改变随机种子，A5中每一次从头开始训练都会收敛到不同的结果，讲道理当你对于torch、numpy和random都设好了相同的随机种子之后的答案应该是不变的才对啊，希望有大佬可以解答

Assignment 5中出现conv filter size是5但是输入尺度为4的问题

我在完成assignment 5的过程当中，遇上了如题所述的问题，主要的原因是因为在inference time，word-level LSTM预测得到之后是用character-level LSTM，但是它第一个词的预测结果就是，不知道大家有没有遇到过这个问题，不知道是我写错了还是其本来就是这样，因为一般来说设置成作业的例子应该不会这么涉及到这么多细节的才对

为什么小语料的情况下反而skip-gram表现更好一些呢？

因为skip-gram模型是根据中心词预测中心词的上下文，这直观上看来，应该“难度”会大于CBOW，那么按理来说会需要更多的语料才能比较好的收敛，那为什么小语料的情况下反而skip-gram表现更好一些呢？
参考资料：http://licstar.net/archives/620

Lecture videos link

Can u please share Stanford cs224n NLP lecture links if possible,Thanks @xixiaoyao

n-gram模型的讲义中提到了在处理每一个句子的时候都需要加一个首尾标志（<start>,<end>），比如如下的两个句子，bigram model为例：
(1). <start> I am Sam <end>
(2). <start> Sam I am <end>
具体我有三个疑惑：
(1). 对于结尾符<end>，文中的解释为"To make the bigram grammar a true probability distribution. Without an end-symbol, the sentence probabilities for all sentences of a given length would sum to one. This model would define an infinite set of probability distribution, with one distribution per sentence length."我不是很明白，请问有没有更直观的解释或者参考的资料呢？
(2).对于起始符<start>，文中解释是为了"to give us the bigram context of the first word."起始符没有像结尾符一样在概率分布方面的作用吗？
(3). 对于n-gram,是否需要在首尾加上n-1个起始和结尾符，还是仅仅只需要添加一个就行了呢？
跪求解惑。。。

xixiaoyao / cs224n-winter-together Goto Github PK

cs224n-winter-together's People

Contributors

Stargazers

Watchers

Forkers

cs224n-winter-together's Issues

有没有 word2vec 中的数学原理详解？

对skip-gram的直观解释

有没有往年大佬整理的笔记？

About the use of random seed in assignment 5

Assignment 5中出现conv filter size是5但是输入尺度为4的问题

为什么小语料的情况下反而skip-gram表现更好一些呢？

Lecture videos link

n-gram中句子加首尾标志符

有没有对HMM比较好的讲解？

assignment #4: Decode阶段softmax计算量很大,有解决办法吗？

gensim.downloader的下载速度不行，glove词向量下载不下来怎么办？

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent