Comments (6)
Hi, I have included the available checkpoints, although not all of them are saved.
from dept.
Thank for your reply. I am having some difficulties reproducing the MRQA dataset. Does these experiments require prompt initialization from other datasets?
from dept.
No, there is no need to do any transfer learning for initialization. From my experience, just training longer typically helps.
from dept.
In this highlighted context, it is mentioned that prompt initialization is from other datasets. Could you please specify which experiments need this opereation?
from dept.
Thanks for your question. This is only for the few-shot learning experiments.
In general, transfer learning can improve the speed of the convergence and the model performance. However, it might take a while or require some tricks to select the best source tasks. Therefore, in our experiments, when there are enough training examples, we typically train the soft prompt from the random initialization. In these cases, we find that training longer typically leads to better performance.
from dept.
Does training require longer for other datasets? Could you please provide the checkpoints of the other datasets?
from dept.
Related Issues (8)
- LLaMA 2 finetuning HOT 2
- Runing Time HOT 2
- Some tensors share memory, this will lead to duplicate memory HOT 3
- About the hyperparameters HOT 4
- About the max_length of SuperGLUE-MultiRC dataset HOT 2
- General question about padding in the setting of soft-prompt tuning HOT 2
- About the hyperparameters of large models
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dept.