Comments (4)
It's the number of labels.
from attentionxml.
But the total label number is millions.
from attentionxml.
How do the model contain so many parameters?
from attentionxml.
Only parameters of attention is linear to the number of labels. The parameters of attention is hidden size of BiLSTM (1024) × the number of labels. So we can store the whole model on the memory of GPUs.
from attentionxml.
Related Issues (20)
- Some questions about dataset wiki10-31k and wiki-500k HOT 4
- Questions about trainning detail HOT 2
- Regarding execution time on Amazon-670k dataset HOT 4
- Troubleshooting with Amazon-670K HOT 2
- where is PLT compression performed? HOT 3
- Regarding max_leaf hyperparameter for Amazon-670K dataset HOT 3
- Transformers as encoder HOT 1
- .npy file HOT 3
- CUDA ERROR HOT 6
- 无法 加载'glove.840B.300d.gensim' model HOT 1
- License for code HOT 1
- 如何在自己的数据集上做cluster HOT 1
- Cuda Error : RuntimeError: CUDNN_STATUS_EXECUTION_FAILED HOT 4
- Where can I get the label name from the data link. HOT 1
- AttentionXML in production HOT 5
- 执行命令过程的报错问题 HOT 2
- Preprocess not working HOT 5
- 关于论文中PSP@K的评价指标
- What's the difference between AttentionXML and fastAttentionXML? HOT 4
- bert+label wise attention network? HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from attentionxml.