Comments (6)
You can refer to this repo to reproduce our results. https://github.com/pytorch/examples/blob/master/imagenet/main.py
from res2net-pretrainedmodels.
I train the res2net-v1b about 200 epochs,but only get top1-78.37%,(lr = 0.2 ,per_batch = 128 on 8 GPU), other hyperparameter is same,about 2% less,how can I adjustment?
from res2net-pretrainedmodels.
Excuse me,can I ask one more question?You paper say use a mini-batch of 256 on 4 Titan Xp GPUs,256 is total batch (means 64 per GPU)or 256 per GPU, and the Hyperparameter of res2net_v1b is the same as res2net?
from res2net-pretrainedmodels.
The v1b uses our designed trick that is similar to mixup. Sorry I cannot give it to you since we are still working on it. Using mixup and coslr with 300 epoch can get the similar result. If you are going to compare your method with res2net, you can just refer to the results in the paper.
from res2net-pretrainedmodels.
Thank you very much!So common version of res2net using trick? And do you train net using f16, because 256 per_batch can lead to OOM. I'm very interest in your work~
from res2net-pretrainedmodels.
The res2net_v1b is designed for better downstream task, not for comparison. I didn't use f16, and no OOM happens when using GPU with 11G memory.
from res2net-pretrainedmodels.
Related Issues (20)
- tensorflow,keras预训练版本 HOT 2
- width = int(math.floor(planes * (baseWidth/64.0)))
- Can res2net have a basic block structure? HOT 6
- Res2Net object detection HOT 2
- 提供的Res2Net-v1b-50预训练模型与mmdetection不完全匹配
- 维度不匹配 HOT 2
- What is baseWidth? HOT 3
- About stype.stage HOT 3
- How to determine the width parameter HOT 1
- About ResNet18? HOT 3
- 张量尺寸不匹配问题 HOT 1
- why don't have the Res2Net-v1b-200-SSLD model code ? HOT 1
- 26是否可以替换成其他更小数值 HOT 2
- res2net50-v1b的训练策略 HOT 2
- About SSLD pretrained model HOT 3
- 对于每一个layer的第一个block,计算方式不是层级的,和论文中描述有差别 HOT 3
- Problem of Replacement of "BatchNorm" to "GroupNorm"
- self-supervised training on Imagenet
- 代码中的残差结构和图像中不一致 HOT 1
- Code Copyright Issues HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from res2net-pretrainedmodels.