Comments (7)
Thanks @Maxwell-Lyu ! Fixed!
Let me know if you have any problems or questions.
from knn-transformers.
Hi @vhientran ,
Thank you for your interest in our work!
This issue is closed. Could you please open a new one and provide the details of exactly what you ran, along with the full error and stack trace?
Thanks,
Uri
Hi @vhientran ,
dstore_size is the total number of tokens in your training set. You can find this number by starting an evaluation of any model on the train split, using the command line here:
https://github.com/neulab/knn-transformers#step-1-evaluating-the-base-language-model
but with --eval_subset train instead of validation.
And the number of tokens will be printed by this line:
https://github.com/neulab/knn-transformers/blob/master/run_clm.py#L538
Once the number is printed, you can stop the run and pass it as dstore_size to a new run.
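To make the idea concrete, here is a minimal sketch of what "total number of tokens" means for a tokenized split. It is only an illustration, not the code in run_clm.py: the helper name count_datastore_tokens, the toy data, and the use of -100 as the masked-label value are assumptions. The script's own count may treat chunk boundaries differently, so the number it prints remains the one to use.

```python
from typing import Iterable, Sequence

# Assumption: masked (padded) label positions are marked with -100.
IGNORE_INDEX = -100


def count_datastore_tokens(label_chunks: Iterable[Sequence[int]],
                           ignore_index: int = IGNORE_INDEX) -> int:
    """Count label tokens that are not masked out, summed over all chunks."""
    return sum(
        1
        for chunk in label_chunks
        for token_id in chunk
        if token_id != ignore_index
    )


# Toy "training split": three tokenized chunks, the last one padded.
toy_labels = [
    [11, 12, 13, 14],
    [21, 22, 23, 24],
    [31, 32, -100, -100],
]

# Hypothetical stand-in for the value you would pass as dstore_size.
dstore_size = count_datastore_tokens(toy_labels)
print(dstore_size)  # 10
```

On a real dataset you would not need this helper, since the evaluation run on the train split already prints the count.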
Best,
Uri
Thanks! Closing this issue now. Glad I could help.
Sorry to disturb you.
I ran kNN-MT with the default hyperparameters, but got the error: OverflowError: out of range integral type conversion attempted. Perhaps the default max_length hyperparameter for decoding is too small, or there is another cause.
How can I fix this error? Thank you!
Hi @Maxwell-Lyu and @urialon ,
Sorry to disturb you. I was wondering how to choose a suitable value for the dstore_size hyperparameter.
Hi @urialon
Thank you so much for your detailed explanation. It helps me a lot.
I will follow your guidance to find dstore_size for my model.
Many thanks!