Git Product home page Git Product logo

pie-models's Introduction

Pie Models

This repository contains pretrained models for Pie (A Framework for Joint Learning of Sequence Labeling Tasks).

More on Pie: https://github.com/emanjavacas/pie.

Find a model

Models are arranged by language. TODO: add a json documentation file per model.

German (de)

german-ren.model.tar: Lemmatizer pretrained on a subset of the Referenzkorpus Mittelniederdeutsch/Niederrheinisch: https://www.slm.uni-hamburg.de/ren.html

Spanish (es)

spanish-AnCora.model.tar: Lemmatizer pretrained on the AnCora corpus for Spanish (part of the Universal Dependencies)

Old French (fro)

french-Geste.model.tar: Lemmatizer pretrained on the Geste corpus

fro-poslemmes_cat-lemma-2019_01_22-02_34_11.tar: lemmatizer and POS-tagger trained on the Geste corpus, and other Old French data from the École des chartes.

Target task: lemma. 
Accuracy on test data
  lemma: 0.9383
  pos: 0.9473

fro-poslemmes_cat-lemma-2019_01_23-00_34_12: same as the previous one, but using pre-trained word embeddings from a large unlabelled corpus.

Target task: lemma. 
Accuracy on test data
  lemma: 0.9409
  pos: 0.9468

fro-poslemmes_cat-lemma-2019_01_24-00_05_57.tar: same as the previous one, but using convolutions (cnn) for the character embeddings.

Target task: lemma. 
Accuracy on test data
  lemma: 0.9462
  pos: 0.9509

model_fro_poslemmesmorph.tar: POS-tagger, lemmatizer and morphological analyzer trained on the Geste corpus

Latin (lat)

capitula.model.tar: Lemmatizer pretrained on a non-open source dataset of medieval latin

Turkish (tur)

turkish-IMST.model.tar: Lemmatizer pretrained on the IMST corpus for Turkish (part of the Universal Dependencies)

Example config file for training a lemmatizer

lemma.config.json is an example config file for training a lemmatizer to reasonable good accuracy.

PIE

Installation

For more information check the repo at , but in short:

virtualenv env -p python3.7
source env/bin/activate
pip3 install -r requirements.txt

pie-models's People

Contributors

jean-baptiste-camps avatar

Stargazers

Ryan Baumann avatar

Watchers

James Cloos avatar  avatar Thibault Clérice avatar Mike Kestemont avatar

pie-models's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.