Comments (6)
Hi there,
Thanks for your interest in the work. First of all, I have to cross-check the differences but be assured that the performance does remain almost same with 1, 8 or even more than 8 attention heads. Same thing happens with the number of layers as well, doesn't really impact the ultimate outcome. If you explore more, you will find that these hyperparameters are taken casually most of the times. Nevertheless, I will update relevant parts if deemed necessary.
Thanks
from visualtransformers.
I see, thanks for reply
from visualtransformers.
from visualtransformers.
If you look closely, you will see that they referred the feature map dimension as HWxC, here C is simply the number of filters. but for tokens, they refer it as LxC, where C is the width of each token. You can think of it like, each token has 1 participant from each of the C feature maps and each token has C length. I hope that makes your concept clear.
from visualtransformers.
from visualtransformers.
from visualtransformers.
Related Issues (12)
- Static Tokenization HOT 3
- Hi, regarding the nn1 of ViTResNet: HOT 3
- Semantic Segmentation HOT 1
- Code for Visual Transformer or Vision Transformer? HOT 3
- No Documentation! HOT 3
- Question on token HOT 3
- Did u training a smeantic segmentation example using this model?aa
- Classification tokens HOT 2
- Performances
- Mask shape is not correct HOT 1
- Only 'BasicBlock',not have 'Bottleneck'
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from visualtransformers.