Git Product home page Git Product logo

s-align's Introduction

S-Align (Soft alignment for E2E Speech Translation)

The code is forked from Fairseq-v0.12.3. For more Installation details, please refer to Fairseq

Useage

Training scripts and configurations for the MuST-C dataset are as follows:

egs
|---machine_translation
|    |---train.sh
|    |---decode.sh
|    |---load_embedding.py
|---pretrain-all
|    |---joint_train_merge.sh
|    |---decode.sh
|    |---device_run.sh
|    |---conf

Step 1. MT Pretrain

• Prepare MT training data.

• Modify the necessary paths in machine_translation/train.sh, and run machine_translation/train.sh to pretrain MT model.

• Adjust all the required paths in the machine_translation/decode.sh to match those in machine_translation/train.sh, and run machine_translation/decode.sh to inference your pretrained MT model.

• Use machine_translation/load_embedding.py to fetch necessary word embeddings from pretrianed MT model.

Step 2. Multi-Task Fine-tuning

• Download the Hubert-base pretrained Model without fune-tuning.

• Prepare the MuST-C ST training data, please follow here.

• Modify the necessary paths in the pretrain-all/conf/train_soft_alignment.yaml, such as:

w2v-path=/your/path/to/hubert
mt-model-path=/your/path/to/mt/pretrain/model
decoder-embed-path=/your/path/to/mt/word/embedding

• Set data path and other required paths in the pretrain-all/joint_train_merge.sh, and run pretrain-all/joint_train_merge.sh to fune-tune your model.

• Use pretrain-all/decode.sh to inference your model

Citation

s-align's People

Contributors

mukai2000 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.