Comments (2)
Dear author,
Thank you for your excellent work!
I am a little curious about the pretrained model: it is trained on the entire ImageNet-21k dataset and is frozen during training. Could this lead to information leakage?
Take the class-incremental setting as an example. I think all 100 classes of CIFAR100 can be found in ImageNet-21k, so it is possible that the model has already learned all the features necessary for CIFAR100. In practice, however, the model is expected to learn new features. We cannot assume that the classes in new tasks have already been observed by the backbone, right?
Have you tried removing the CIFAR100 classes from ImageNet and pretraining a model on the rest, or evaluating the model on datasets disjoint from ImageNet?
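To make the suggestion concrete, here is a minimal sketch of the class-removal step, assuming both datasets expose plain class-name lists. The helper names (`normalize`, `split_by_overlap`) and the label lists are hypothetical and made up for illustration; they are not taken from the L2P code or from the real datasets.

```python
def normalize(name: str) -> str:
    """Lowercase a class name and unify separators so 'Pickup-Truck' matches 'pickup_truck'."""
    return name.lower().replace("_", " ").replace("-", " ").strip()


def split_by_overlap(pretrain_classes, downstream_classes):
    """Split pretraining classes into (kept, removed) by exact normalized-name match."""
    downstream = {normalize(c) for c in downstream_classes}
    kept, removed = [], []
    for c in pretrain_classes:
        # A class is removed only if its normalized name appears downstream.
        (removed if normalize(c) in downstream else kept).append(c)
    return kept, removed


# Toy, made-up label lists for illustration only (not the real datasets).
pretrain = ["tabby_cat", "apple", "Pickup-Truck", "violin", "oak tree"]
downstream = ["apple", "pickup_truck", "oak_tree", "rocket"]

kept, removed = split_by_overlap(pretrain, downstream)
print("kept:", kept)
print("removed:", removed)
```

Exact name matching is only a first pass. ImageNet-21k labels are WordNet synsets, so synonyms and parent or child categories would slip through this filter, and a real disjointness check would probably need to walk the WordNet hierarchy as well.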
Thank you very much!
I am also very curious about the pretraining part, as previous incremental baselines train from scratch by default.
from l2p.
Great insight!
Actually, we have not tried the experiments you suggest, but they are definitely worth trying. Regarding the "information leakage", we do assume a "well-pretrained" model, and we use the same pretrained model for all competing methods, so the comparison is fair. Another thing I would like to highlight is that the idea of prompting is to leverage the knowledge already learned by the model and to "instruct" the model to selectively use that knowledge for incoming tasks. Since large-scale pretrained models are prevalent these days, leveraging them is quite natural.
On the other hand, consider the extreme case where the pretrained model is totally off (e.g., trained on a totally different dataset, though we would not do that in practice): L2P will probably fail if the backbone is frozen. Thus, an interesting future direction is to study how and when to adapt the model backbone.
Thanks again for your question and suggestion!
Best,
Zifeng