Git Product home page Git Product logo

zero-shot-translation-transformer's Introduction

Zero-Shot translation ( tensorflow / Transformer )

  • Google has released a zero-shot translation task through the GNMT model. The project will try to use the new model(Transformer) to perform translation tasks and experiment with the possibility of translating zero-shot through various languages.
  • First, You can train your translation model ( Transformer ) using vanilla_transformer
  • Second, You can have experiment of Zero-Shot translation using Transformer by transformer_zero_shot
  • Figure below this paragraph shows the concept of Zero-Shot translation image

Requirements

File description

  • preprocessor.py download & preprocess data ( you can use OpenSubtitle2018 / MultiUN dataset )
  • data_loader.py define class for data load ( Input pipeline developed by tf.dataset)
  • model_layer.py define all layer needed for construct transformer model
  • model.py define transformer model
  • train.py for training model
  • eval.py is for evaluation

Training & Evaluation Process

  • STEP 1. Set the configuration by editing hyperparams.py
  • STEP 2. Run preprocessor.py , it will preprocess and make training data
  • STEP 3. Run train.py to train your model
  • STEP 4. Run eval.py to save result and calculate BLEU score

Vanilla translation

Training process

image image

Zero-Shot translation

Result

image

Discussion

image

Reference

  • Ashish Vaswan, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser Illia Polosukhin Attention Is All You Need, arXiv:1706.03762v5, 2017
  • Melvin Johnson, Mike Schuster, Quoc V. Le, Maxim Krikun, Yonghui Wu, Zhifeng Chen, Nikhil Thorat , Fernanda Viégas, Martin Wattenberg, Greg Corrado, Macduff Hughes, Jeffrey Dean, Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation, arXiv:1611.04558v2 2017
  • Jörg Tiedemann, 2012, Parallel Data, Tools and Interfaces in OPUS. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012)
  • P. Lison and J. Tiedemann, 2016, OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
  • https://github.com/Kyubyong/transformer

zero-shot-translation-transformer's People

Contributors

joon-park92 avatar

Stargazers

Tom avatar

Watchers

James Cloos avatar Tom avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.