Comments (3)
Hey @deepseek,
These multilingual architectures are not really useful for pure machine translation, since a translation scenario also needs the decoder part. In our use case, we focus on the encoding part: a sentence in Arabic and its translation in Urdu should have the same encoding in the semantic space. The quality of that encoding is what is evaluated in this repo.
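To make the "same encoding in the semantic space" idea concrete, here is a minimal sketch of one common way to score it: for each source sentence, check whether the nearest target-language vector (by cosine similarity) is its own translation. This is not necessarily the metric used in this repo. The function names and the toy vectors are made up for illustration; in practice the vectors would come from an encoder model.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def retrieval_accuracy(source_vecs, target_vecs):
    """Fraction of source sentences whose most similar target vector
    is the one at the same index (assumed to be its translation)."""
    hits = 0
    for i, src in enumerate(source_vecs):
        scores = [cosine_similarity(src, tgt) for tgt in target_vecs]
        best = max(range(len(scores)), key=scores.__getitem__)
        if best == i:
            hits += 1
    return hits / len(source_vecs)


# Toy 3-dimensional vectors standing in for sentence encodings.
# These numbers are invented; real encodings would come from a model.
arabic_vecs = [[0.9, 0.1, 0.0], [0.0, 0.8, 0.2], [0.1, 0.0, 0.9]]
urdu_vecs = [[0.8, 0.2, 0.1], [0.1, 0.9, 0.1], [0.0, 0.1, 1.0]]

print(retrieval_accuracy(arabic_vecs, urdu_vecs))
```

A good multilingual encoder would place each sentence close to its translation, so this score would be high. Note that nothing here produces Urdu text from Arabic text, which is why an encoder alone is not enough for translation.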
Additional work has been done recently on languages such as Urdu with XLM-R.
What is your exact scenario? :)
N.B.: Sorry, I am currently taking some time off, so I may answer with some delay.
from multilingual_similarity_compare.
Take this example: I have multiple books by the same author in Arabic and Urdu.
A new book is published in Arabic, and I want to translate it to Urdu.
I am sure there must be a GitHub repository that can handle such a task.
Hey @deepseek,
Yes, there are, but you should look at architectures other than BERT-like ones, which essentially implement only the encoding part.
Did you check GPT-3, for example?