Python code for training all models from the ICLR paper "Towards Universal Paraphrastic Sentence Embeddings". These models achieve strong performance on semantic similarity tasks without any training or tuning on those tasks' training data. They also produce features that are at least as discriminative as skip-thought vectors for semantic similarity tasks. Moreover, this code can achieve state-of-the-art results on entailment and sentiment tasks.
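For context, one of the paper's simplest models scores a sentence pair by averaging trained word embeddings and comparing the averages with cosine similarity. The sketch below illustrates that scoring scheme only; the `vectors` table and the 300-dimensional size are hypothetical stand-ins for the trained embeddings this repo produces.

```python
import numpy as np

def embed(sentence, vectors, dim=300):
    """Average the word vectors of a sentence; zeros if no word is in the table."""
    vecs = [vectors[w] for w in sentence.lower().split() if w in vectors]
    if not vecs:
        return np.zeros(dim)
    return np.mean(vecs, axis=0)

def similarity(s1, s2, vectors):
    """Cosine similarity between two averaged sentence embeddings."""
    e1, e2 = embed(s1, vectors), embed(s2, vectors)
    denom = np.linalg.norm(e1) * np.linalg.norm(e2)
    return float(e1 @ e2 / denom) if denom else 0.0

# Toy embedding table for illustration; real use would load trained vectors.
rng = np.random.default_rng(0)
vectors = {w: rng.standard_normal(300) for w in ["a", "dog", "ran", "sprinted"]}
print(similarity("a dog ran", "a dog sprinted", vectors))
```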
I'm trying to access the STS 2012 files (and those from other years, if possible), and I was wondering where I could download them in the correct format, or find the script that preprocessed them. Unless I'm mistaken, preprocess.java only handles the SICK task.
I do have the original STS 2012 files, but I wanted to preprocess them the same way as was done for https://github.com/PrincetonML/SIF (which mentions that the data was preprocessed here). A sketch of the kind of step I mean is below.
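To pin down what "the same preprocessing" means, here is a minimal sketch of the kind of normalization typically applied to STS sentence pairs (lowercasing plus splitting punctuation off tokens). This is only an illustration of the question, not the actual behavior of preprocess.java or the SIF pipeline, and the tab-separated file layout is an assumption.

```python
import re

def tokenize(sentence):
    # Hypothetical normalization: lowercase, then pad punctuation with spaces
    # so it splits into separate tokens. preprocess.java may differ.
    sentence = sentence.lower()
    sentence = re.sub(r"([.,!?;:()\"'])", r" \1 ", sentence)
    return sentence.split()

def preprocess_sts(path):
    # Assumes one tab-separated sentence pair per line, as in raw STS files.
    pairs = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip("\n").split("\t")
            if len(parts) >= 2:
                s1, s2 = parts[0], parts[1]
                pairs.append((" ".join(tokenize(s1)), " ".join(tokenize(s2))))
    return pairs
```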
Does this code come with a license? If so, could you add a license file? If you have no strong preference and do intend to release it under an open-source license, may I suggest Apache 2.0?