Comments (1)
Hi, @dhkim0225, thanks for your question! The "cooldown_epochs" is not the necessary setup for training MogaNet, and we also provided 300-epoch implementations and results in OpenMixup. Actually, the "cooldown_epochs" implemented in Timm is the default training setup as the image classification implementation was migrated from PoolFormer, which has little effect on the final performance. It might be useful to some Transformer architectures, e.g., Uniformer. To my knowledge, whether to use 300 or 310 epochs training has little to do with whether to post manuscripts on visual architectures on OpenReview,
from moganet.
Related Issues (17)
- About pretrain models. HOT 4
- Is channel aggregation block stronger than SE block? HOT 2
- About load pretrained models error HOT 3
- Could you share the code for drawing multi-order interactions that shown in Fig. 3 of your paper? HOT 2
- Code Issue about MultiOrderGatedAggregation HOT 3
- Some Questions about the paper HOT 2
- Some small issues about detection and segmentation HOT 1
- What is "trivial interactions" mentioned in the paper? HOT 7
- How do I create an interference code HOT 2
- network name
- Unable to train model HOT 2
- Inquiry for code to train baseline HOT 2
- Cascade Mask RCNN Configuration HOT 1
- Distributions of the interaction strength HOT 3
- Why use Hadamard product in Spatial Aggregation? HOT 2
- What do the two Subtract operations mean?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from moganet.