shoutoutyangjie / mobileone



An Improved One millisecond Mobile Backbone

License: MIT License

Python 100.00%

mobileone's Issues

repvgg_model_convert function shouldn't have the "x", please confirm

o = deploy_model(x)
This "x" shouldn't be there.

Question about merging conv and BN

Hi, while reading your code I ran into a question about merging conv and BN.
When merging conv and BN:

Why does your code use:
std = (running_var + eps).sqrt()
t = (gamma / std).reshape(-1, 1, 1, 1)
return kernel * t, beta - running_mean * gamma / std

Why don't you use the conv's bias? I think the result should be:
weight = conv_weight * bn_weight.view(out_channels, 1, 1, 1) / bn_std.view(out_channels, 1, 1, 1)
bias = bn_weight * (conv_bias - bn_mean) / bn_std + bn_bias
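
For reference, the two expressions can be compared by writing out BatchNorm applied to a convolution output. This is only a sketch of the standard algebra, with symbols chosen here for illustration: W and b are the conv weight and bias, and γ, β, μ, σ² are the BN weight, bias, running mean and running variance.

```latex
\begin{aligned}
y &= \gamma \cdot \frac{(W * x + b) - \mu}{\sqrt{\sigma^2 + \epsilon}} + \beta \\
  &= \underbrace{\frac{\gamma}{\sqrt{\sigma^2 + \epsilon}}\, W}_{W'} * x
   \;+\; \underbrace{\beta + \frac{\gamma\,(b - \mu)}{\sqrt{\sigma^2 + \epsilon}}}_{b'}
\end{aligned}
```

Setting b = 0 gives b' = β − μγ/√(σ² + ε), which is the expression in the quoted code. So the two versions agree whenever the conv has no bias, which would be the case if the conv layers are created with bias=False (an assumption about the repository code, though common practice for a conv followed directly by BN).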

How did you check that the model and the deployed model give consistent results?

Hi, I find that after using the repvgg_model_convert function, the outputs of the model and the deployed model are very different for the same input tensor. How did you check that the results of the model and the deployed model are consistent?
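
One possible cause, offered as an assumption because the issue does not show the comparison script, is running the comparison while the original model is still in training mode. BatchNorm then normalizes with batch statistics, while the fused weights are built from the running statistics. The sketch below uses a toy conv + BN block, not the repository's model, to show the difference between the two modes; `max_abs_diff` is a hypothetical helper.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy training-time block: conv (no bias) followed by BatchNorm.
conv = nn.Conv2d(3, 4, kernel_size=3, padding=1, bias=False)
bn = nn.BatchNorm2d(4)
with torch.no_grad():
    # Non-trivial running statistics, as after real training.
    bn.running_mean.uniform_(-1.0, 1.0)
    bn.running_var.uniform_(0.5, 2.0)
train_time = nn.Sequential(conv, bn)

# Deployed block: one conv with BN folded in using the running statistics.
scale = bn.weight / (bn.running_var + bn.eps).sqrt()
deployed = nn.Conv2d(3, 4, kernel_size=3, padding=1, bias=True)
with torch.no_grad():
    deployed.weight.copy_(conv.weight * scale.reshape(-1, 1, 1, 1))
    deployed.bias.copy_(bn.bias - bn.running_mean * scale)


def max_abs_diff(model_a, model_b, x):
    """Largest element-wise output difference on the same input."""
    with torch.no_grad():
        return (model_a(x) - model_b(x)).abs().max().item()


x = torch.randn(8, 3, 32, 32)

train_time.eval()    # BN uses running statistics: outputs should match
diff_eval = max_abs_diff(train_time, deployed, x)

train_time.train()   # BN uses batch statistics: outputs diverge
diff_train = max_abs_diff(train_time, deployed, x)

print(f"eval mode:  {diff_eval:.2e}")
print(f"train mode: {diff_train:.2e}")
```

If the eval-mode difference on the real model is still large, the mismatch is more likely in the conversion itself than in the comparison.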

Hi, how about quantization? Does quantization work well?

Why use a normal stem for self.stage0?

self.stage0 looks like a normal stem rather than the MobileOne block from the paper. Is there a reason for this?

By the way, using a MobileOne block as the stem might cause a huge accuracy drop on CIFAR-100 (50%+).

Why is the accuracy different before and after merging blocks?

[BUG] Skip connection should always be added to DW branches.

Hi @shoutOutYangJie, I've noticed that you have a bug in your MobileOne implementation:

MobileOne/mobileone.py

Line 77 in 48ca6c9

 self.dw_bn_layer = nn.BatchNorm2d(in_channels) if out_channels == in_channels and stride == 1 else None 

You add the DW BN layer only when in_channels == out_channels, but it should always be added, because for the DW part the input channels are always equal to the output channels. This condition should only be checked for the PW part, where the number of channels may change.
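
A minimal sketch of the proposed conditions, not the repository's code. `make_skip_bn_layers` is a hypothetical helper, and it assumes that the stride is applied in the DW conv, that the PW conv always uses stride 1, and that the stride == 1 check from the quoted line is kept for the DW skip so that spatial sizes match.

```python
import torch.nn as nn


def make_skip_bn_layers(in_channels: int, out_channels: int, stride: int):
    """Return (dw_skip_bn, pw_skip_bn); None means no skip branch."""
    # DW conv keeps the channel count, so only the stride can break the skip.
    dw_skip_bn = nn.BatchNorm2d(in_channels) if stride == 1 else None
    # PW conv may change the channel count, so the skip needs equal channels.
    # Assumption: the PW conv has stride 1, so stride is not checked here.
    pw_skip_bn = nn.BatchNorm2d(out_channels) if in_channels == out_channels else None
    return dw_skip_bn, pw_skip_bn


# Channel expansion at stride 1: DW skip exists, PW skip does not.
dw_a, pw_a = make_skip_bn_layers(32, 64, stride=1)
# Same channels at stride 2: DW skip is dropped, PW skip exists.
dw_b, pw_b = make_skip_bn_layers(32, 32, stride=2)
```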

Can you provide the checkpoint for make_mobileone_s0?

About mobileone_s2

I don't know the parameters of mobileone_s2. Could you share them with me?

Reparameterization question

I like your work very much; it gave me a lot of inspiration. I am a beginner and there is some code I do not understand. In this part of the code, I commented out "model = copy.deepcopy(model)". Does this line affect the final result? Why can't reparameterization be done directly?
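
A toy illustration of what the copy changes, assuming the convert function fuses each block in place through a switch_to_deploy-style method, as in RepVGG-style code. `ToyBlock` and `convert` are hypothetical stand-ins, not the repository's classes.

```python
import copy


class ToyBlock:
    """Hypothetical stand-in for a re-parameterizable block."""

    def __init__(self):
        self.branches = [1.0, 2.0, 3.0]  # training-time branches
        self.fused = None                # single deploy-time kernel

    def switch_to_deploy(self):
        # Fuse the branches, then discard them (in-place mutation).
        self.fused = sum(self.branches)
        self.branches = None


def convert(block, do_copy=True):
    if do_copy:
        block = copy.deepcopy(block)  # leave the caller's object untouched
    block.switch_to_deploy()
    return block


original = ToyBlock()
deployed = convert(original, do_copy=True)

in_place = ToyBlock()
same_obj = convert(in_place, do_copy=False)

print(original.branches)                # training branches still available
print(in_place.branches)                # None: training structure is gone
print(deployed.fused == same_obj.fused) # fused result is identical
```

Under this assumption the deployed result is the same with or without the copy. The copy only keeps the original multi-branch model usable afterwards, for example to continue training or to compare outputs.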

Could you please provide the dataset format?

Hello, I want to train my model, but I don't know the format of the dataset.
Could you please provide it? Thank you.
This project has helped me a lot and I really need this.
Please!

Adding 'SeBlock' doesn't work?

Validation loss: deployed vs full model

Hi,
Thank you very much for the implementation.
I would like to know if you still have the graphs of the full model validation loss and the inference (ready to deploy) model validation loss.
I would like to know how both models behaved in the very early stage of training (first 20 epochs).
Thank you very much for considering my request :)

Inference speed

Have you measured the inference speed on an actual iPhone?

Could not get your pretrained weights from Google Drive!

Could you share the weights you trained on BaiduyunDisk, or change the access permissions of the file shared on Google Drive?
