Git Product home page Git Product logo

conll-workbench's Introduction

Open Portuguese WordNet (OWN-PT)

This repository hosts Portuguese WordNet data in textual format, this is an experimental branch of http://openwordnet-pt.org. It is linked to (but independent from) the Open English WordNet.

You can also get the data in JSON and RDF format.

See the Wiki for how the data was generated, how it compares to Princeton WordNet and what is the syntax of the text files. This data is validated and exported by the mill tool — see its repository for more information about validation, export formats, etc.

conll-workbench's People

Contributors

arademaker avatar fcbr avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar  avatar

conll-workbench's Issues

counting

the ability to count more in the Turku interface (instead of simply the 10, 50, 100, 250 and 1000), the ability to know how many exact matches in a file;

Better display CoNLL file with highlighted errors

Right now we are simply pasting the original CoNLL file inside a <pre></pre> block. We should format it better, to make it easier to distinguish the different columns. Also, ideally all errors found should be highlighted on the CoNLL displayed so that users can quickly locate the problem.

avoid root in containers

We need to avoid using root for the default build and execution user in the containers defined in this project.

Which rules do the validator satisfy really?

I assume a conll representation cannot be empty and it must have a root. But what else is already coded up in the rules? For UD we know it should be a dag, but this doesn't need to be the case for other versions of CoNLL. are we validating UD and SD? UD v1.0 or UD v2.0?

reduce size of dep-search image

We need to install a number of packages just for the compilation of dep_search, and these packages aren't needed for the execution afterwards. We should remove these packages after the compilation to reduce the image size (currently about 800 Mb).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.