Git Product home page Git Product logo

mobileone's Issues

question about merge conv and bn

hi,when i read your code,i find that a question about merge conv and bn.
when merge conv and bn:
image
why your code :
std = (running_var + eps).sqrt()
t = (gamma / std).reshape(-1, 1, 1, 1)
return kernel * t, beta - running_mean * gamma / std

why you not use conv's bias, i think the result should be:
weight = conv_weight * bn_weight.view(out_channels, 1, 1, 1) / bn_std.view(out_channels, 1, 1, 1)

     bias = bn_weight * (conv_bias - bn_mean) / bn_std + bn_bias

why use normal stem for self.stage0?

self.stage0 look like normal stem, rather than mobileOne block in paper. Any reason why?

By the way, use mobileOne block as stem might cause huge acc drop in cifar100 (50%+)

[BUG] Skip connection should be always added to DW branches.

Hi @shoutOutYangJie, I've noticed that you have a bug in your MobileOne implementation:

self.dw_bn_layer = nn.BatchNorm2d(in_channels) if out_channels == in_channels and stride == 1 else None

You are adding DW BN layer based on in_channels == out_channels condition but it should be always added because for DW part input channels are always equal to output channels. This condition should be only checked for PW part as there might be channel change.

About mobileone_s2

I have no idea about the parameters of mobileone_s2. Could you help offer the parameters to me?

reparameterize requestion

image

I like your work very much, it gave me a lot of inspiration, because I am a beginner, some codes do not understand, I would like to ask, in this part of the code, I commented "model = copy.deepcopy(model)" Does this line of code affect the final result? Why can't reparameterization be done directly

Could you please provide dataset format?

Hello,I wanna train my model,but I don't know the format of the dataset.
Could you please provide it?Thank you.
The project brought me a lot of help,I really need it.
Please!

Validation loss : deployed vs full model

Hi,
Thank you very much for the implementation.
I would like to know if you still have the graphs of the full model validation loss and the inference (ready to deploy) model validation loss.
I would like to know how both models behaved in the very early stage of training (first 20 epochs).
Thank you very much for considering my request :)

推理速度

大佬有在iphone上实测推理速度吗?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.