Implementation of the M-SpeechCLIP model, introduced in the paper "M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval" (https://arxiv.org/abs/2211.01180)
Hello, thank you very much for open-sourcing the code for M-SpeechCLIP. I have been trying to replicate the paper's results starting from the original SpeechCLIP implementation, so this will be an immense help in my research.
I wanted to ask: are there any plans to share the pre-trained checkpoints as well? If that is not possible, I will train from scratch following the guidelines. To clarify, the checkpoints would be used for non-commercial research purposes only.
Please let me know, and again thank you so much!