tener's Introduction

TensorFlow-Keras port of https://github.com/fastnlp/TENER.

The motivation behind this port is:

  • To check the claim that this architecture can perform well on small datasets
  • To build the basic data preprocessing from scratch
  • To implement the Tener architecture in TensorFlow
  • To try out different position strategies after reproducing the basic metrics

Checklist

  • Dataset preparation and test cases
  • Gin-config
  • Plug and play trainer for datasets and model architecture
  • Vanilla transformer model integration
  • Tener transformer model
    • Embeddings
      • Sinusoidal
      • RelativeSinusoidal
      • Character Embedding
    • Attention
      • MultiHeadNaive
      • MultiHeadRelative
  • Tuning and Debugging
    • Vanilla Transformer Model
    • Tener Transformer Model
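
As a rough illustration of the Sinusoidal and RelativeSinusoidal items above, the sketch below computes the standard Transformer sinusoidal encoding for a single position and then indexes it by signed relative offset. It is not the code in this repository: the function names are hypothetical, the interleaved sin/cos layout is an assumption, and only the Python standard library is used.

```python
import math
from typing import Dict, List


def sinusoidal_embedding(position: int, d_model: int) -> List[float]:
    """Encode one (possibly negative) position; d_model is assumed even.

    Even indices hold sin and odd indices hold cos, following the
    standard Transformer formulation.
    """
    vec = []
    for i in range(0, d_model, 2):
        angle = position / (10000 ** (i / d_model))
        vec.append(math.sin(angle))
        vec.append(math.cos(angle))
    return vec


def relative_embeddings(max_len: int, d_model: int) -> Dict[int, List[float]]:
    """Embeddings for every relative offset in [-max_len, max_len]."""
    return {
        offset: sinusoidal_embedding(offset, d_model)
        for offset in range(-max_len, max_len + 1)
    }


rel = relative_embeddings(max_len=3, d_model=4)
```

Because sine is odd and cosine is even, the sine components of `rel[-2]` and `rel[2]` differ in sign while the cosine components match, so the sign of the offset stays recoverable from the embedding.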

Setup

pip install -r requirements.txt

Module Design

  • Google Gin-Config based configuration
  • Trainer script that reads the config and selects the dataset and model to be used
  • Model class that encapsulates the Keras model layers, loss, metrics and train step
gin config file ---> trainer ---> dataset and model ---> Keras Model 
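
The flow above can be sketched in plain Python. This is a stand-in that uses a dictionary instead of a gin file, and it does not use gin's API; the registry names, the `run_trainer` function, and the toy dataset are all hypothetical and only show how a trainer might pick a dataset and a model from configuration.

```python
from typing import Callable, Dict

# Hypothetical registries; the repository's real names may differ.
DATASETS: Dict[str, Callable] = {}
MODELS: Dict[str, Callable] = {}


def register(registry: Dict[str, Callable], name: str):
    """Decorator that stores a factory function under a config-visible name."""
    def decorator(fn):
        registry[name] = fn
        return fn
    return decorator


@register(DATASETS, "toy_ner")
def toy_ner_dataset():
    # One made-up sentence with its tags, standing in for a real dataset.
    return [(["John", "lives", "in", "Paris"], ["B-PER", "O", "O", "B-LOC"])]


@register(MODELS, "vanilla_transformer")
def vanilla_transformer(num_layers=2, d_model=128):
    # Stand-in for building a Keras model.
    return {"name": "vanilla_transformer", "num_layers": num_layers, "d_model": d_model}


@register(MODELS, "tener")
def tener(num_layers=2, d_model=128):
    # Stand-in for building a Keras model.
    return {"name": "tener", "num_layers": num_layers, "d_model": d_model}


def run_trainer(config: dict):
    """Read the config, then select and build the dataset and the model."""
    dataset = DATASETS[config["dataset"]]()
    model = MODELS[config["model"]](**config.get("model_params", {}))
    return dataset, model


config = {"dataset": "toy_ner", "model": "tener", "model_params": {"num_layers": 1}}
dataset, model = run_trainer(config)
```

Switching architectures then only requires changing the `"model"` entry in the configuration, which mirrors the choice between the two `.gin` files passed to the trainer.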

How to run

  • Train
cd tener/
# Vanilla transformer architecture from the TensorFlow tutorial
python bin/trainer.py --config_file=config/vanilla_transformer.gin
# Tener based architecture
python bin/trainer.py --config_file=config/tener.gin
  • Test
cd tener/src/

pytest -s

# Sometimes pytest is picked up from a global installation, so it cannot
# see the packages in your environment. In that case, call the
# environment's pytest directly:
/home/{user_home}/anaconda3/envs/{env}/bin/pytest -s
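
To check whether the wrong pytest is being picked up, a small helper like the one below compares the directory of the `pytest` found on `PATH` with the directory of the running interpreter. The function name is hypothetical, and the check assumes a POSIX-style environment layout where both executables live in the same `bin/` directory.

```python
import os
import shutil
import sys


def pytest_matches_interpreter(pytest_path=None, python_path=sys.executable):
    """Return True if pytest sits in the same directory as the interpreter."""
    if pytest_path is None:
        # Look up whichever pytest the shell would run.
        pytest_path = shutil.which("pytest")
    if pytest_path is None:
        return False  # no pytest on PATH at all
    pytest_dir = os.path.dirname(os.path.abspath(pytest_path))
    python_dir = os.path.dirname(os.path.abspath(python_path))
    return pytest_dir == python_dir


if __name__ == "__main__":
    print(pytest_matches_interpreter())
```

If this prints `False`, running `python -m pytest -s` from the activated environment is another way to make sure pytest runs under that environment's interpreter.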
