Git Product home page Git Product logo

reneeye / const Goto Github PK

View Code? Open in Web Editor NEW
58.0 58.0 6.0 3.7 MB

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

License: MIT License

Python 97.11% Shell 0.26% C++ 0.66% Cuda 1.53% Cython 0.44%
contrastive-learning machine-translation naacl2022 neural-machine-translation pytorch speec speech-translation spoken-language-processing transformer translation

const's People

Contributors

reneeye avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

const's Issues

freeze wav2vec

您好!在train_en2x.sh中没有看到--freeze-w2v的参数设置,请问wav2vec2在ST训练中需要freeze吗?

Extra MT Data

Hi, you have restricted in the script download_wmt.sh that only en-ro parallel data comes from wmt16 version, however you wrote in you paper that en-de and en-ru parallel data also come from wmt16. Is there something wrong in your paper or your script?

# download_wmt.sh
if [[ $version == "wmt16" && $target != "ro" ]] || [[ $version != "wmt16" && $target == "ro" ]]; then
    echo "--wmt16 if and only if target is ro"
    exit
fi

Reproduce experiments with external MT data

Hello,

Thank you very much for sharing your work!
I have followed your script to run the experiments on the En->De ST. I got a BLEU score of 25.31 without using external MT data. With external MT data used, I got 25.82, which is lower than 28.3 in your paper.
I'm wondering if a pre-trained MT model was perhaps utilized to achieve the 28.3 score? Or is there anything else that I might have missed in the process? Any suggestions or guidance you can offer would be greatly appreciated.

Thank you in advanced!

Fail to run the experiment

Hi, when I was using bash ConST/scripts/train_en2x.sh de checkpoint/model_saved. to train the model, I encountered some problem. It would be great help to me if you can help taking a look at my bug.
In the original ConST/scripts/train_en2x.sh file, language prefix token looks like this:
image
which brings me the following error:
image
After deleting the '<' and '>' outside lang:${TGT_LANG}, it can start training on train_st but when it began to validate on dev_st, another assertion error occured like this:
image
which comes from fairseq/tasks/speech_to_text_triplet_with_extra_mt.py, line 377.
I tried adding '<' and '>' in speech_to_text_triplet_with_extra_mt.py before assertion, but it didn't work. It would be of great help to me if you know some solutions. Thank you!

Following your “Training & Generation Instruction”, BLEU is 25.68.

Hello, I can't reproduce your results. The possible reasons are

  1. I used 1 rtx-3090(update-freq=2) instead of 8 v100.
  2. The script you provided does not seem to be pre-trained specifically on external translation data.

Can you give me some ideas about how to reach your performance? Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.