Comments (11)
You can compare the networks using Netscope here:
AlexNet, SqueezeNet v1.0, SqueezeNet v1.1
This table gives a summary:
Problem 2) is definitely because SqueezeNet uses more activation memory than AlexNet (5x as much in v1.0 and 3x as much in v1.1). The largest output feature map in SqNet v1.1 is 817k pixels, in AlexNet 290k pixels.
Problem 1) could have different reasons:
- SqueezeNet v1.0 has approximately the same computational complexity as AlexNet (860 million vs. 1140 million multiply-accumulate operations), SqueezeNet v1.1 should be less complex (390 million MACCs).
- SqueezeNet has many more layers than AlexNet (26 CONV layers versus 5 CONV layers), and switching the layers takes some time on the CPU and GPU
- SqueezeNet has larger output feature maps (compare Problem 2 above). Moving this data around between GPU and Memory takes some time, too.
So I don't think you're doing anything wrong, this is probably the price you have to pay for the increased network complexity. I you haven't done already, definitely give SqueezeNet v1.1 a try!
from squeezenet.
You can use the Netscope CNN Analyzer tool: https://dgschwend.github.io/netscope/
The actual code for the calculation is here:
https://github.com/dgschwend/netscope/blob/gh-pages/src/analyzer.coffee
The formulas are pretty self-explanatory.
from squeezenet.
Maybe it is caused because of concat layers?
from squeezenet.
@dgschwend Thank u so much for so many details. The Netscope is such a cool tool! I was using v1.1. I guess the problem comes from the increased activations and layers as you pointed and the concat layers as @Grabber mentioned.
from squeezenet.
You are welcome, glad I could help! ;)
from squeezenet.
@dgschwend , is there any document about how to compute the macc, memory of a network? tks
from squeezenet.
@dgschwend Nice work on that table. Is "Table 4.1" part of a longer paper that I could read (and cite)?
from squeezenet.
@forresti Yes, I just published my Master Thesis.
The project report and all code for "ZynqNet: An FPGA-Accelerated Embedded Convolutional Neural Network" are public. The whole work is based on SqueezeNet, and the report contains a detailed analysis + comparison of prior CNN topologies.
from squeezenet.
@dgschwend hi, can you share the reference to "SqueezeNet++" in your table? couldn't find it
from squeezenet.
That's the work-in-progress codename of ZynqNet CNN from my Master Thesis (see post above). See https://github.com/dgschwend/zynqnet
from squeezenet.
Have the same issue that SqeezeNet v1.1
is slower then AlexNet
, i.e. 3.91 ms vs 3.01 ms.
https://github.com/mrgloom/kaggle-dogs-vs-cats-solution
from squeezenet.
Related Issues (20)
- Has anyone successfully trained Squeezenet with residual connections?
- model convert HOT 1
- SqueezeNet v1.1 with Residual Connections with Dense→Sparse→Dense (DSD) Training
- Top-1 Acc=61.0% on ImageNet, without any sacrificing compared with SqueezeNet v1.1. HOT 4
- Image Width Issue HOT 1
- tensorflow- After hundreds of epochs, my total_loss stay around 0.6~0.7, and not decreased HOT 2
- 1.1 deploy.prototxt HOT 1
- SqueezeNet is slower when using GPU than when using CPU? HOT 2
- training from scratch, random seed HOT 1
- why can not get the output of the prob layer? HOT 1
- SqueezeNet training on cifar HOT 3
- The SqueezeNet deploy.caffemodel files have all 0.0 weight and bias data HOT 1
- Fine-tuning SqueezeNet HOT 2
- which label list you used HOT 1
- why not use lr_mult, decay_mult like {1, 1, 2, 0}? HOT 3
- optimization and compression
- Image normalization values HOT 1
- squeezenet v1_1 for facedetector , possible , feasable ? HOT 1
- squeezenet for speech HOT 3
- Some minor mistakes in the paper HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from squeezenet.