The speed of the pruned model did not improve. How does it work for you?

How to speed up the pruned model? (pytorch-pruning issue, open)

jacobgil commented on July 25, 2024

How can I speed up the pruned model?

from pytorch-pruning.

Comments (7)

eeric commented on July 25, 2024

My pruned ResNet implementation is only slightly faster, by about 10%. The code is at
https://github.com/eeric/channel_prune

The reason is that non-tensor layers (e.g., batch normalization
and pooling layers) took up more than 40% of the inference
time on GPU.
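
To make the arithmetic behind this observation concrete, here is a minimal sketch based on Amdahl's law. It assumes, as a simplification, that the time spent in non-tensor layers does not shrink at all after pruning; the function name and parameters are hypothetical and not part of pytorch-pruning or channel_prune.

```python
def overall_speedup(fixed_fraction: float, conv_speedup: float) -> float:
    """Amdahl's-law estimate of the end-to-end speedup.

    fixed_fraction: share of the original inference time spent in layers
        that pruning is assumed not to accelerate (e.g. 0.4 for 40%).
    conv_speedup: factor by which the remaining (convolution) time shrinks.
    """
    if not 0.0 <= fixed_fraction <= 1.0:
        raise ValueError("fixed_fraction must be in [0, 1]")
    if conv_speedup <= 0.0:
        raise ValueError("conv_speedup must be positive")
    accelerated_fraction = 1.0 - fixed_fraction
    return 1.0 / (fixed_fraction + accelerated_fraction / conv_speedup)


if __name__ == "__main__":
    # With 40% of the time fixed, doubling convolution speed gives well
    # under a 2x end-to-end gain, and no amount of pruning exceeds 2.5x.
    for s in (1.5, 2.0, 4.0, 1000.0):
        print(f"conv speedup {s:>6}: overall {overall_speedup(0.4, s):.2f}x")
```

Under this assumption the overall gain is capped at 1 / 0.4 = 2.5x no matter how aggressively the convolutions are pruned.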

guoxiaolu commented on July 25, 2024

How did you arrive at this: "the reason that non-tensor layers (e.g., batch normalization
and pooling layers) took up more than 40% of the inference
time on GPU"? Are there papers or other sources for it?
@eeric

eeric commented on July 25, 2024

I measured it myself.

Kuldeep-Attri commented on July 25, 2024

Hey guys, I pruned the SqueezeNet model, and when I measure the inference time it is the same on the GPU. I can see a difference on CPU (after 67% filter pruning, inference time is almost halved), but on GPU it is unchanged. Any idea why this is happening?
After pruning:

Model size is reduced.
I can see the difference in FLOPs.
The pruned model is faster on CPU but not on GPU. :(

Thank you!!
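
One thing worth ruling out when GPU timings look identical is the measurement itself: CUDA kernels are launched asynchronously, so a timer that stops before the device has finished can report misleading numbers. The following is a minimal sketch of a timing helper with warm-up runs and an optional synchronization hook. The helper name and its parameters are hypothetical; it is not how anyone in this thread measured.

```python
import time
from statistics import median
from typing import Callable, Optional


def time_inference(
    run_once: Callable[[], object],
    sync: Optional[Callable[[], None]] = None,
    warmup: int = 10,
    iters: int = 50,
) -> float:
    """Return the median wall-clock seconds per call of run_once.

    sync, if given, is called before starting and after each timed call so
    that asynchronous device work is included in the measurement.
    """
    # Warm-up calls absorb one-time costs (allocation, algorithm selection).
    for _ in range(warmup):
        run_once()
    if sync is not None:
        sync()

    samples = []
    for _ in range(iters):
        start = time.perf_counter()
        run_once()
        if sync is not None:
            sync()  # wait for queued device work before stopping the clock
        samples.append(time.perf_counter() - start)
    return median(samples)


if __name__ == "__main__":
    # CPU-only stand-in workload so the sketch runs without a GPU.
    print(time_inference(lambda: sum(range(10_000)), warmup=2, iters=5))
```

With PyTorch on a GPU, one would pass `torch.cuda.synchronize` as `sync` and a `run_once` that calls the model on a fixed input under `torch.no_grad()`, then compare the original and pruned models at the same batch size.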

zxduan90 commented on July 25, 2024

@eeric Actually, after pruning, the numbers of input and output channels are no longer integer multiples of 32. When CUDA computes a convolution, it actually transforms it into a matrix multiplication, and a CUDA warp is 32 threads wide.
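
If the channel-alignment explanation above holds, one way to test it is to prune each layer only down to a multiple of 32 filters and re-measure. The sketch below computes how many filters to remove under that constraint. Both function names are hypothetical, and whether alignment actually helps depends on the GPU and the convolution backend, which this thread does not establish.

```python
def round_up_to_multiple(channels: int, multiple: int = 32) -> int:
    """Round a positive channel count up to the nearest multiple."""
    if channels <= 0 or multiple <= 0:
        raise ValueError("channels and multiple must be positive")
    return ((channels + multiple - 1) // multiple) * multiple


def filters_to_prune(current: int, requested_prune: int, multiple: int = 32) -> int:
    """Number of filters to remove so the remaining count is aligned.

    The remaining count is rounded up to a multiple, so this never prunes
    more than requested; if no aligned count fits below `current`, nothing
    is pruned.
    """
    if requested_prune < 0 or requested_prune > current:
        raise ValueError("requested_prune must be in [0, current]")
    remaining = max(current - requested_prune, 1)
    aligned = min(round_up_to_multiple(remaining, multiple), current)
    return current - aligned


if __name__ == "__main__":
    # Pruning about 67% of 256 filters would leave 85; aligning keeps 96.
    print(filters_to_prune(256, 171))
```

Rounding up rather than down is a deliberate choice here: it trades a little compression for alignment without removing more filters than the pruning criterion asked for.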

eeric commented on July 25, 2024

@buaaJeremyduan, thanks!

kssk16 commented on July 25, 2024

@Kuldeep-Attri Did you find out the reason?
My observation is the same. Let me know if you have any leads.
