nlp_training's Introduction

NLP_Training

NLP examples

Check the How to run the code.txt file on how to use these notebooks.

We have downloaded the word embeddings and datasets from these links:

Word Embeddings:
Glove: https://github.com/stanfordnlp/GloVe/blob/master/README.md
Fast Text: https://fasttext.cc/docs/en/english-vectors.html
LexVec: https://github.com/alexandres/lexvec
ConceptNet NumberBatch: https://github.com/commonsense/conceptnet-numberbatch
Google News: https://code.google.com/archive/p/word2vec/
PDC: http://ofey.me/projects/wordrep/
HDC: http://ofey.me/projects/wordrep/

Similarity datasets:

MTurk : Available on this repo
MEN : Available on this repo
WS353 : http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/ also available on this repo
RG65 : Available on this repo
RW : https://nlp.stanford.edu/~lmthang/morphoNLM/ also available on this Available on this repo
SimLex999 :https://fh295.github.io/simlex.html
TR9856 : Available on this repo

Analogy datasets MSR WordRep : Available on this repo
Google Analogy : Available on this repo
MSR : Available on this repo
SEMEVAL 2012 Task 2 : Available on this repo

Important references for Text classification with PyTorch https://www.analyticsvidhya.com/blog/2020/01/first-text-classification-in-pytorch/

https://github.com/pchampio/sentence-entailment

Description of Experiments file:

Model Description LSTIM(Bidirectional, Number of Layers, Num of Hidden Nodes, Dropout rate)

Recommend Projects