
stasis's Introduction

Stasis - Python wrapper for Semantic Similarity datasets

Under the auspices of the EXPERT project (http://expert-itn.eu/), we have written a Python wrapper for the STS datasets, and we hope it helps anyone manipulate the datasets easily.

If you just need a tab-separated file, sts.csv is available in the same repository. The repo also contains other (maybe) useful datasets, manually compiled by the maintainer when they have free time.

Disclaimer: The repository comes as is. It should NOT be considered the official SemEval Semantic Textual Similarity (STS) data, and it is not affiliated with the STS organizers. We've created this so that people can easily do something like pandas.read_csv('sts.csv') or graphlab.SFrame('sts.csv') and work with the dataframes with little hassle.
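For readers who would rather not depend on pandas, here is a minimal sketch of reading a tab-separated STS-style file with only the Python standard library. The column layout (score in the first column, no header row), the function name read_sts_rows, and the sample sentences are all assumptions made for illustration; check them against the actual sts.csv before relying on this. Rows with a blank score are kept, with the score reported as None.

```python
import csv
import io


def read_sts_rows(handle, score_col=0):
    """Yield (score, other_fields) for each row of a tab-separated file.

    Assumption: the score sits in column `score_col` and there is no
    header row. A blank score is returned as None rather than raising.
    """
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row:
            continue  # skip completely empty lines
        raw = row[score_col].strip()
        score = float(raw) if raw else None
        rest = [field for i, field in enumerate(row) if i != score_col]
        yield score, rest


# Made-up sample rows (not taken from the real dataset).
sample = io.StringIO(
    "4.5\tA man is playing a guitar.\tA man plays the guitar.\n"
    "\tA dog runs.\tThe stock market fell.\n"
)
rows = list(read_sts_rows(sample))
```

To read a real file, pass an open handle instead of the in-memory sample, for example open('sts.csv', encoding='utf-8', newline='').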

Datasets

Below is a list of datasets/wrappers you can find here

Contribute

Please feel free to add datasets/wrappers to the repository, or post an issue to request wrappers for the repository.

Cite

Please cite the respective references for the datasets when using them in your publication!

If you want to cite this repository, you can cite this paper where we created and used the sts.csv in SemEval-2015.

stasis's People

Contributors

alvations


stasis's Issues

STS data set

For pairs of sentences with a blank score (e.g., 2015\belief.test.tvs), what are the similarity scores?

empty lines

There are a lot of empty lines in the gs files, /STS2015-gold/STS.gs.headlines.txt for example.
Do they mean something, or is the label just missing?
