Comments (4)
Due to limited storage, we could only upload ViT-B checkpoints using google drive. Here is the link: https://drive.google.com/drive/folders/1gDsfYoCllMHa7sYXEsrdOKk74WoT-XQR?usp=drive_link
from droppos.
It is trained for 800 epochs.
from droppos.
Hi, thanks for releasing them. Is DropPos_pretrain_vit_base_patch16.pth trained for 800 epochs or 200 epochs?
from droppos.
Are you also fine-tuning 800 epochs model for 100 epochs? I can only achieve ~83.2% with this setup
--batch_size 1024 \
--accum_iter 1 \
--model vit_base_patch16 \
--finetune DropPos_pretrain_vit_base_patch16.pth \
\
--epochs 100 \
--warmup_epochs 5 \
--blr 1e-3 --layer_decay 0.8 --weight_decay 0.05 \
--drop_path 0.1 --reprob 0.25 --mixup 0.8 --cutmix 1.0 \
--dist_eval \
--data_path /path \
--nb_classes 1000 \
But If I train DropPos_mae_vit_base_patch16_dec512d2b for 200 epochs and fine-tune for 100 I get 82.91% with the same setup, so it this really close to the paper. What could be the problem?
from droppos.
Related Issues (8)
- 与训练阶段loss的最终值 HOT 2
- Why does DropPos achieve exactly the same performance as HPM? HOT 3
- pretrain model HOT 2
- Cannot Reproduce the results on ViT-L HOT 12
- Position encoding for downsteam task when pos_mask_ratio=1 and other questions HOT 4
- A question about the strategy of DropPos HOT 1
- About detection task HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from droppos.